Hi! I am exploring something new now!

I am a research scientist at Shanghai AI Lab. I got my PhD degree from the University of Hong Kong at the end of 2021, affiliated with the HKU database group and NLP group. Before that, I received my B.E. degree from Wuhan University in 2017.

I am building general OS agents: OS-Copilot, OS-Atlas, OS-Genesis, SeeClick.

I am looking for talented interns to work with me on OS agents and RL. If you are interested, feel free to reach out with your CV or questions.

🔥 News

  • 2025.02: OS-Atlas is accepted as a spotlight paper at ICLR 2025. See you in Singapore!
  • 2024.05: SeeClick and Symbol-LLM are accepted to the ACL main conference! See you in Bangkok!
  • 2024.04: 🎉 New homepage!

📝 Selected Publications

ICLR'25 Spotlight

OS-ATLAS: A Foundation Action Model For Generalist GUI Agents
Zhiyong Wu, Zhenyu Wu, Fangzhi Xu, Yian Wang*, Qiushi Sun, Chengyou Jia, Kanzhi Cheng, Zichen Ding, Liheng Chen, Paul Pu Liang, Yu Qiao.

  • Check out the demos on our website.
  • A state-of-the-art GUI grounding and action model on which you can easily build your own agent. Code.
  • Repost and like us on Twitter

LLMAgents@ICLR 2024

OS-Copilot: Towards Generalist Computer Agents with Self-Improvement
Zhiyong Wu, Chengcheng Han*, Zichen Ding, Zhenmin Weng, Zhoumianze Liu, Shunyu Yao, Tao Yu, Lingpeng Kong.

🤖 Interns

📚 Full Publication List

🤖 Agents

❓ In-Context Learning

📃 Data Augmentation using LLMs

🎼 Interpretability

🧑‍🎨 Generative Model