Action Images: End-to-End Policy Learning via Multiview Video Generation
Haoyu Zhen, Zixian Gao, Qiao Sun, Yilin Zhao, Yuncong Yang, Yilun Du, Pengsheng Guo, Tsun-Hsuan Wang, Yi-Ling Qiao, Chuang Gan
Xiaomi Robotics · Robot Foundation Model Team
Towards Building Embodied Intelligence that Can Understand & Interact with the Physical World.
I am a Researcher with the Robot Foundation Model Team at Xiaomi Robotics, while continuing my visiting research at UMass Amherst & MIT-IBM Watson AI Lab under Prof. Chuang Gan and Dr. Yilun Du. My academic journey took an unconventional path — from Civil Engineering and Financial Management at Tianjin University, to Electrical and Computer Engineering at Fudan University, before fully committing to AI research.
My research centers on embodied AGI — building robots that can perceive, reason about, and physically interact with the world. I work at the intersection of world models, vision-language-action models, and scalable robotic learning, aiming to translate the recent breakthroughs in foundation models into the physical world.
Excited to announce that Action Images: End-to-End Policy Learning via Multiview Video Generation is now on arXiv, in collaboration with the team at MIT-IBM and Stanford! Check out the paper 🚀
Excited to share Xiaomi-Robotics-0: An Open-Sourced Vision-Language-Action Model with Real-Time Execution. Read the paper 🤖
TesserAct is Accepted at ICCV 2025!
Looking forward to the next trip to Hawaii 🏝️!!!
Excited to announce that the Paper, Code, Model, and Website of TesserAct: Learning 4D Embodied World Models are publicly available!
Explore and enjoy our Model right now 🤩
Thrilled to be selected as an on-site Volunteer 🎉 Cannot wait to meet old and new friends in Abu Dhabi!!
One of my first-authored papers has been Accepted at COLING 2025! I am thrilled to set off on an exciting journey to the United Arab Emirates 🇦🇪!!!
I am honored to be a reviewer for two papers. I will do my best to review them thoroughly!
I delivered a presentation on MiniConGTS at EMNLP 2024 in Miami 🏝️. Thanks to all attendees for the engaging discussions!
Haoyu Zhen, Zixian Gao, Qiao Sun, Yilin Zhao, Yuncong Yang, Yilun Du, Pengsheng Guo, Tsun-Hsuan Wang, Yi-Ling Qiao, Chuang Gan
Rui Cai, Jun Guo, Xinze He, Piaopiao Jin, Jie Li, Bingxuan Lin, Futeng Liu, Wei Liu, Fei Ma, Kun Ma, Feng Qiu, Heng Qu, Yifei Su, Qiao Sun, Dong Wang, Donghao Wang, Yunhong Wang, Rujie Wu, Diyun Xiang, Yu Yang, Hangjun Ye, Yuan Zhang, Quanyun Zhou
Under review at ICLR 2026
Leiyu Wang, Jun Lv, Qiao Sun, Yifei Wu, Ao-Bo Wang, Cewu Lu, Qinying Gu, Nanyang Ye
Weihan Yin, Qinying Gu, Yaoyun Zhang, Lin Zhu, Qiao Sun, Liujia Yang, Xinbing Wang, Nanyang Ye
Accepted at COLING 2025
Qiao Sun, Jiexin Xie, Nanyang Ye, Qinying Gu, Shijie Guo
Accepted at Main Conference of EMNLP 2024
Qiao Sun, Liujia Yang, Minghao Ma, Nanyang Ye, Qinying Gu
Qi Fan*, Qiao Sun*, Nanyang Ye, Qinying Gu
Nanyang Ye*, Qiao Sun*, Yifei Wang, Liujia Yang, Jundong Zhou, Lei Wang, Guang-Zhong Yang, Xinbing Wang, Chenghu Zhou, Wei Ren, Leilei Gu, Huaqiang Wu, Qinying Gu
* Denotes Equal Contributions
Email: qiaosun22@m.fudan.edu.cn
GitHub: github.com/qiaosun22
Location: 220 Handan Rd., Yangpu, Shanghai, China
WeChat: Please scan this QR Code.