
Xiao Yu
I am a third year Ph.D. student in Computer Science at Columbia University advised by Zhou Yu. Before joining the Ph.D. program, I was an undergrad also at Columbia University, majoring in Computer Science and minoring in Applied Physics.
🌟 Currently I am interested in improving the environment understanding and planning capabilities of AI agents, especially for browser/computer/phone-use.
Scalable Reinforcement Learning algorithms
World Model training methods such as Dyna
Planning Algorithms such as MCTS
🚀 My most recent work include:
arXiv
Dyna-Mind: Learning to Simulate from Experience for Better AI Agents
Xiao Yu, Baolin Peng, Michel Galley, Hao Cheng, Qianhui Wu, Janardhan Kulkarni, Suman Nath, Zhou Yu, Jianfeng Gao
arXiv
Dyna-Think: Synergizing Reasoning, Acting, and World Model Simulation in AI Agents
Xiao Yu, Baolin Peng, Ruize Xu, Michel Galley, Hao Cheng, Suman Nath, Jianfeng Gao, Zhou Yu