I am a CS Ph.D. student at UNC Chapel Hill, supervised by Prof. Mingyu Ding and Prof. Huaxiu Yao. I received both my Bachelor’s and Master’s degrees from Tongji University. I interned at UC Berkeley as a visiting student researcher, where I collaborated with Prof. Masayoshi Tomizuka, Prof. Mingyu Ding, and Dr. Wei Zhan.

My research interests include LLM/VLM, RL , and embodied AI. And my research goal is to improve the performance of LLMs and VLMs through reinforcement learning and other post-training techniques, and to leverage these models to build efficient and intelligent agents capable of interacting with the physical world (embodied intelligence) and driving scientific exploration (AI for Science).

Besides, I am deeply interested in modeling and analyzing LLM reasoning from a geometric and dynamical systems perspective. Recently, I have been actively exploring how post-training signals, such as RL, reshape the geometry, stability of these reasoning dynamics. If you are interested in related questions, I am always happy to connect and exchange ideas.

I am actively seeking 2026 summer internships. Let’s connect!

🔥 News

2025.11: Agent0, Agent0-VL are released.
2025.11: 🎉🎉 One paper is accepted by AAAI 2026.
2025.10: Alignment Tipping Process: How Self-Evolution Pushes LLM Agents Off the Rails is preprinted.
2025.09: We released SciReasoner, a scientific reasoning foundation model covering 103 tasks. It is pretrained on a 206B-token corpus and further enhanced through supervised fine-tuning and reinforcement learning to continually improve its reasoning ability. The model achieves state-of-the-art performance on more than 50 tasks. Paper, Github, Hugging Face
2025.08: We released A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers, providing a comprehensive, data-centric roadmap of Sci-LLMs from foundational models to agentic frontiers. Paper, Github
2025.08: We released VIPER-R1, a multimodal model for Visual Induction for Physics-based Equation Reasoning to discover fundamental symbolic formulas. Paper, Project.
2025.06: PhysUniBench is released, which is a large-scale multimodal benchmark for evaluating undergraduate-level physics reasoning in AI models. Paper, Project.
2025.06: 🎉🎉 I received my Master degree from Tongji University, and got the honor of Shanghai Outstanding Graduate(Top 1%)!
2025.03: 🎉🎉 One paper is accepted by IEEE Transactions on Vehicular Technology.
2025.02: 🎉🎉 One paper (first author) is accepted by IEEE Robotics and Automation Letters (RA-L).
2024.11: 🎉🎉 One paper (first author) is accepted by IEEE Transactions on Intelligent Transportation Systems.
2024.11: 🎉🎉 One paper (first author) is accepted by IEEE Transactions on Artificial Intelligence.
2024.10: 🎉🎉 One paper (first author) is accepted by IEEE Transactions on Intelligent Vehicles.
2024.08: 🎉🎉 One paper (Co-first author) is accepted by ECCV 2024(Oral)
2024.07: 🎉🎉 One paper (first author) is accepted by ITSC 2024.
2024.06: 🎉🎉 One paper (first author) is accepted by Transportmetrica B: Transport Dynamics.
2024.04: 🎉🎉 One paper (first author) is accepted by IEEE Transactions on Vehicular Technology.
2023.11: 🎉🎉 One paper (first author) is accepted by IEEE Transactions on Intelligent Vehicles.
2023.07: 🎉🎉 Three papers were accepted by ITSC 2023
2022.09: 🎉🎉 I was bestowed with the Nominee Award for Shanghai University Student Annual Character, a recognition of paramount importance in evaluating the comprehensive aptitudes and societal influence of college students within the Shanghai municipality. Only 20 people in Shanghai receive the award each year (including nominees)!
2022.06: 🎉🎉 I received my Bachelor degree from Tongji University with the first overall ranking in my major (1/163), and I got the honor of Shanghai Outstanding Graduate(Top 1%)!
2021.05: 🎉🎉 I received the Tongji University Pursuit of Excellence Award, the highest honor for undergraduates at Tongji University! Only three undergraduates are able to receive this honor each year!
2020.10: 🎉🎉 I won the Best Paper Award in CUMCM! CUMCM is the largest basic academic competition for undergraduate students in China, and 3 papers were awarded from 42,000+ candidates!

📝 Publications

Mimicking the Physicist’s Eye:A VLM-centric Approach for Physics Formula Discovery (VIPER-R1)

Jiaqi Liu, Songning Lai, Pengze Li, Di Yu, Wenjie Zhou, Yiyang Zhou, Peng Xia, Zijun Wang, Xi Chen, Shixiang Tang, Lei Bai, Wanli Ouyang, Mingyu Ding, Huaxiu Yao, Aoran Wang, accepted by NeurIPS 2025 Efficient Reasoning Workshop, submitted to ICLR 2026.

Agent0-VL: Exploring Self-Evolving Agent for Tool-Integrated Vision-Language Reasoning

Jiaqi Liu, Kaiwen Xiong, Peng Xia, Yiyang Zhou, Haonian Ji, Lu Feng, Siwei Han, Mingyu Ding, Huaxiu Yao, arxiv:2511.19900.

Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

Peng Xia, Kaide Zeng, Jiaqi Liu, Can Qin, Fang Wu, Yiyang Zhou, Caiming Xiong, Huaxiu Yao, arxiv:2511.16043.

Mixture of Horizons in Action Chunking

Dong Jing, Gang Wang, Jiaqi Liu, Weiliang Tang, Zelong Sun, Yunchao Yao, Zhenyu Wei, Yunhui Liu, Zhiwu Lu, Mingyu Ding, arxiv:2511.19433.

ARCHE: A Novel Task to Evaluate LLMs on Latent Reasoning Chain Extraction

Pengze Li, Jiaqi Liu, Junchi Yu, Lihao Liu, Mingyu Ding, Wanli Ouyang, Shixiang Tang, Xi Chen, AAAI 2026.

SciReasoner: Laying the Scientific Reasoning Ground Across Disciplines

Yizhou Wang$^\star$, Chen Tang$^\star$, Han Deng$^\star$, Jiabei Xiao$^\star$, Jiaqi Liu$^\star$, Jianyu Wu$^\star$, …, Philip Torr, Shixiang Tang, Xinzhu Ma, Wanli Ouyang, Lei Bai et al. . arXiv:2509.21320.

Alignment Tipping Process: How Self-Evolution Pushes LLM Agents Off the Rails

Siwei Han, Jiaqi Liu, Yaofeng Su, Wenbo Duan, Xinyuan Liu, Cihang Xie, Mohit Bansal, Mingyu Ding, Linjun Zhang, Huaxiu Yao, arxiv:2510.04860.

A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers

Ming Hu, Chenglong Ma, Wei Li, Wanghan Xu, Jiamin Wu, Jucheng Hu, Tianbin Li, Guohang Zhuang, Jiaqi Liu, Yingzhou Lu, …, Wanli Ouyang, Yu Qiao, Zongyuan Ge, Shixiang Tang, Junjun He et al. . arxiv:2508.21148.

Language-Driven Policy Distillation for Cooperative Driving in Multi-Agent Reinforcement Learning

Jiaqi Liu, Chengkai Xu, Peng Hang, Jian Sun, Mingyu Ding, Wei Zhan, Masayoshi Tomizuka, IEEE Robotics and Automation Letters (RA-L), DOI: 10.1109/LRA.2025.3551098.

DDM-Lag : A Diffusion-based Decision-making Model for Autonomous Vehicles with Lagrangian Safety Enhancement

Jiaqi Liu, Peng Hang, Xiaocong Zhao, Jianqiang Wang, Jian Sun, IEEE Transactions on Artificial Intelligence(TAI), DOI: 10.1109/TAI.2024.3497918.

Cooperative Decision-Making for CAVs at Unsignalized Intersections: A MARL Approach with Attention and Hierarchical Game Priors

Jiaqi Liu, Peng Hang, Xiaoxiang Na, Chao Huang, Jian Sun, IEEE Transactions on Intelligent Transportation Systems (TITS), DOI: 10.1109/TITS.2024.3503092, 2024.

MAPPO-PIS: A Multi-Agent Proximal Policy Optimization Method with Prior Intent Sharing for CAVs’ Cooperative Decision-Making

Yicheng Guo$^\star$, Jiaqi Liu$^\star$, Rongjie Yu, Peng Hang, Jian Sun, accepted by ECCV 2024 MAAS Workshop(Oral).

🎖 Honors and Awards

2025.06 Outstanding Graduates from Shanghai(Top 1%).
2024.10 National Scholarship, Ministry of Education of China
2022.10 Shanghai University Student Annual Character (Nomination Award)!
2022.06 Outstanding Graduates from Shanghai(Top 1%).
2021.12 National Scholarship, Ministry of Education of China
2021.05 Tongji University Pursuit of Excellence Award (3/4450) !
2020.10 Best paper award of Contemporary Undergraduate Mathematical Contest in Modeling (CUMCM) (3/ 42000+) !
2020.10 First Prize in the Shanghai College Students Transportation Innovation Competition
2020.5 First Prize in Tongji University Mathematical Modeling Contest
2019.12 National Scholarship, Ministry of Education of China

📖 Educations

2025.08 - now, PhD, UNC Chapel Hill, NC, US
2022.09 - 2025.06, Master, Tongji University, Shanghai, China
2018.09 - 2022.06, Bachelor, Tongji University, Shanghai, China (GPA: 4.89/5, rank: 1/163)

📚 Academic Services

Reviews

Conference Reviewer: NeurIPS, ICML, CVPR, ICCV, AAAI, ICRA, ITSC
Journal Reviewer: IEEE Transactions on Intelligent Vehicles (TIV), IEEE Transactions on Intelligent Transportation Systems (TITS), IEEE Transactions on Neural Networks and Learning(TNNLS),IEEE Robotics and Automation Letters (RA-L), Journal of Field Robotics, IEEE Transactions on Industrial Informatics (TII), IEEE Transactions on Vehicular Technology (TVT), IEEE Transactions on Automation Science and Engineering (TASE), IEEE Internet of Things Journal, Nonlinear Dynamics, Journal of Advanced Transportation, Scientific Reports

Mentoring

Yaofeng Su: Fudan University
Kaiwen Xiong: Shanghai Jiao Tong University
Carsen Sharkey: UNC Chapel Hill
Yicheng Guo: Tongji University
Chengkai Xu: Tongji University
Yuhang Zhang: Tongji University

Teaching Assistant

CS 790-183: Transfer Learning, UNC, Fall 2025