Building OS agents at ByteDance Seed.

I am proceeding steadily towards a general self-improving OS agents: OS-Copilot, SeeClick, OS-Atlas, OS-Genesis.

I am looking for interns to work with me on OS agents and RL , please feel free to hit me up with your CV or questions if interested.

🔥 News

📝 Selected Publications

ICLR'25 Spotlight
sym

OS-ATLAS: A Foundation Action Model For Generalist GUI Agents
Zhiyong Wu, Zhenyu Wu, Fangzhi Xu, Yian Wang*, Qiushi Sun, Chengyou Jia, Kanzhi Cheng, Zichen Ding, Liheng Chen, Paul Pu Liang, Yu Qiao.

  • Check demos at Our Website
  • SOTA GUI grounding and action model upon which you can easily build your own agent. Code .
  • Repost and like us on Twitter
LLMAgents@ICLR 2024
sym

OS-Copilot: Towards Generalist Computer Agents with Self-Improvement
Zhiyong Wu, Chengcheng Han*, Zichen Ding, Zhenmin Weng, Zhoumianze Liu, Shunyu Yao, Tao Yu, Lingpeng Kong.

🤖 Interns

📚 Full Publication List

🤖 Agents

❓ In-Context Learning

📃 Data Augmentation using LLMs

🎼 Interpretability

🧑‍🎨 Generative Model