Hi! I am a research scientist at Shanghai AI Lab. I got my PhD degree from the University of Hong Kong at the end of 2021, affiliated with the HKU database group and NLP group. I am advised by Prof. Ben Kao. I am also working closely with Lingpeng Kong. Before that, I received my B.E. degree from Wuhan University in 2017. Throughout my graduate studies, I had great internships in Tencent AI Lab and Huawei Noah’s Ark Lab.
My research centers around large language models (LLMs) with a special focus on building the next generation of natural language interfaces that can interact with and learn from real-world environments. You can find a prototype of my ambitious goal at OS-Copilot.
I have multiple internship positions available (OS-Copilot related and language agent in general) , please feel free to hit me up with your CV or questions if interested.
🔥 News
- 2024.04.02: 🎉 New homepage!
📝 Selected Publications
OS-Copilot: Towards Generalist Computer Agents with Self-Improvement
Zhiyong Wu, Chengcheng Han, Zichen Ding, Zhenmin Weng, Zhoumianze Liu, Shunyu Yao, Tao Yu, Lingpeng Kong.
- Check demos at Our Website
- Build your personal agents at Code .
- Join our Discord to have fun, or follow us on Twitter
arXiv 2024
A Survey of Neural Code Intelligence: Paradigms, Advances and Beyond Qiushi Sun, Zhirui Chen, Fangzhi Xu, …, Pengcheng Yin, Qipeng Guo, Xipeng Qiu, Xiaoli Li, Fei Yuan, Lingpeng Kong, Xiang Li, Zhiyong Wu.ICLR 2024
EMO: Earth Mover Distance Optimization for Auto-Regressive Language Modeling, Siyu Ren, Zhiyong Wu, Kenny Q Zhu.arXiv 2024
SeeClick: Harnessing GUI Grounding for Advanced Visual GUI Agents, Kanzhi Cheng, Qiushi Sun, Yougang Chu, Fangzhi Xu, Yantao Li, Jianbing Zhang, Zhiyong Wu.arXiv 2024
Symbol-LLM: Towards Foundational Symbol-centric Interface For Large Language Models, Fangzhi Xu, Zhiyong Wu, et al.ACL 2023
Self-adaptive In-context Learning, Zhiyong Wu, Yaoxiang Wang, Jiacheng Ye, Lingpeng Kong.ICLR 2023
DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models, Shansan Gong, Mukai Li, Jiangtao Feng, Zhiyong Wu, Lingpeng KongACL 2020
Perturbed Masking: Parameter-free Probing for Analyzing and Interpreting BERT, Zhiyong Wu, Yun Chen, et al.
🤖 Interns
- Jiacheng Ye (2022.1-2023.5) EMNLP’22a EMNLP’22b ICML’23
- Sijie Cheng (2022.3-2022.8) AAAI’23
- Yaoxiang Wang (2022.10-2024.4) ACL’23a ACL’23b Under review at ACL’24
- Zhenyu Wu (2022.10-) ACL’23b Under review at ACL’24
- Siyu Ren (2023.8-2024.2) ICLR’24
- Qiushi Sun (2023.6-) Under review at COLM’24, Survey Paper
- Fangzhi Xu (2023.8-) Under review at ACL’24
- Kanzhi Cheng (2023.8-) Under review at ACL’24
📖 Educations
- 2017.09 - 2021.11, PhD, University of Hong Kong.
- 2013.09 - 2017.06, Undergraduate, Wuhan Univeristy.
📚 Full Publication List
🤖 Agents
arXiv 2024
OS-Copilot: Towards Generalist Computer Agents with Self-Improvement Zhiyong Wu, Chengcheng Han, Zichen Ding, Zhenmin Weng, Zhoumianze Liu, Shunyu Yao, Tao Yu, Lingpeng Kong.arXiv 2024
SeeClick: Harnessing GUI Grounding for Advanced Visual GUI Agents, Kanzhi Cheng, Qiushi Sun, Yougang Chu, Fangzhi Xu, Yantao Li, Jianbing Zhang, Zhiyong Wu.arXiv 2024
Symbol-LLM: Towards Foundational Symbol-centric Interface For Large Language Models, Fangzhi Xu, Zhiyong Wu, et al.arXiv 2024
TDAG: A Multi-Agent Framework based on Dynamic Task Decomposition and Agent Generation Yaoxiang Wang, Zhiyong Wu, Junfeng Yao, Jinsong SuarXiv 2023
Corex: Pushing the Boundaries of Complex Reasoning through Multi-Model Collaboration Qiushi Sun, Zhangyue Yin, Xiang Li, Zhiyong Wu, Xipeng Qiu, Lingpeng Kong.
❓ In-Context Learning
arXiv 2023
A Survey on In-context Learning, Qingxiu Dong, Lei Li, Damai Dai, Ce Zheng, Zhiyong Wu, et al.ACL 2023
Self-adaptive In-context Learning, Zhiyong Wu, Yaoxiang Wang, Jiacheng Ye, Lingpeng Kong.ACL 2023 (demo)
OpenICL: An Open-Source Framework for In-context Learning, Zhenyu Wu, YaoXiang Wang, Zhiyong Wu et al.ICML 2023
Compositional Exemplars for In-context Learning, Jiacheng Ye, Zhiyong Wu, Jiangtao Feng, Tao Yu, Lingpeng Kong.EMNLP 2023
Can We Edit Factual Knowledge by In-Context Learning? Ce Zheng, Lei Li, Qingxiu Dong, Yuxuan Fan, Zhiyong Wu, Jingjing Xu, Baobao Chang
📃 Data Augmentation using LLMs
ICLR 2023
Self-Guided High-Quality Data Generation in Efficient Zero-Shot Learning Jiahui Gao, Renjie Pi, Yong Lin, Hang Xu, Jiacheng Ye, Zhiyong Wu, et al.EMNLP 2022
ZeroGen: Efficient Zero-shot Learning via Dataset Generation, Jiacheng Ye, Jiahui Gao, Qintong Li, Hang Xu, Jiangtao Feng, Zhiyong Wu, Tao Yu and Lingpeng Kong.EMNLP 2022
ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback, Jiacheng Ye, Jiahui Gao, Zhiyong Wu, Jiangtao Feng, Tao Yu, and Lingpeng Kong.
🎼 Interpretability
ACL 2023 (findings)
Explanation Regeneration via Information Bottleneck Qintong Li, Zhiyong Wu, Lingpeng Kong, Wei Bi.AAAI 2023
Unsupervised Explanation Generation via Correct Instantiations Sijie Chen, Zhiyong Wu, Jiangjie Chen, Zhixing Li, Yang Liu, and Lingpeng KongACL 2021
Good for Misconceived Reasons: An Empirical Revisiting on the Need for Visual Context in Multimodal Machine Translation, Zhiyong Wu, Lingpeng Kong, Wei Bi, Xiang Li, Ben Kao.ACL 2020
Perturbed Masking: Parameter-free Probing for Analyzing and Interpreting BERT, Zhiyong Wu, Yun Chen, et al.WSDM 2020
PERQ: Predicting, Explaining, and Rectifying Failed Questions in KB-QA Systems Zhiyong Wu, Ben Kao, Tien-Hsuan Wu, Pengcheng Yin, Qun Liu.
🧑🎨 Generative Model
ICLR 2024
EMO: Earth Mover Distance Optimization for Auto-Regressive Language Modeling, Siyu Ren, Zhiyong Wu, Kenny Q Zhu.EMNLP 2023 (findings)
DiffuSeq-v2: Bridging Discrete and Continuous Text Spaces for Accelerated Seq2Seq Diffusion ModelsShansan Gong, Mukai Li, Jiangtao Feng, Zhiyong Wu, Lingpeng Kong.ICLR 2023
DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models Shansan Gong, Mukai Li, Jiangtao Feng, Zhiyong Wu, Lingpeng Kong.ACL 2021
Lexical Knowledge Internalization for Neural Conversational Models Zhiyong Wu, Wei Bi, Xiang Li, Lingpeng Kong, Ben Kao.ACL 2021
Cascaded Head-colliding Attention Lin Zheng, Zhiyong Wu, Lingpeng Kong