Jiawei He

About Me

I am Jiawei He, a visiting scholar at the Beijing Academy of Artificial Intelligence (BAAI). I work with Prof. He Wang focusing on generalizable embodied AI with multimodal large language models. Before that, I received my PhD degree in June 2024 from Institute of Automation, Chinese Academy of Sciences under the advisory of Prof. Zhaoxiang Zhang. From 2020 to 2022, I was a research intern at TuSimple, mentored by Zehao Huang and Naiyan Wang, focusing on Multiple Object Tracking and 3D Object Detection. Before this, I got my BS degree from Xi'an Jiaotong University in 2019. During my undergraduate years, I joined X-Plan project and was a research intern at Institute of Artificial Intelligence and Robotics (IAIR) in XJTU from 2017 to 2018.

CV / Google Scholar / GitHub / PhD Research Statement

Currently recruiting on-site/remote research interns, focusing on embodied AI, VLM/VLA, autonomous driving, and 3D perception. Interested graduate and senior undergraduate students can contact me via email [email protected].

Research Interests

I am interested in computer vision, embodied AI, deep learning, multiple object tracking, 3D perception and reconstruction, learning-based combinatorial optimization, video analysis, image and video generation, etc.

Publications

  • Jiawei He, Danshi Li, XinQiang Yu, Zekun Qi, Wenyao Zhang, Jiayi Chen, Zhaoxiang Zhang, Zhizheng Zhang, Li Yi, He Wang. DexVLG: Dexterous Vision-Language-Grasp Model at Scale.
  • Zekun Qi, Wenyao Zhang, Yufei Ding, Runpei Dong, XinQiang Yu, Jingwen Li, Lingyun Xu, Baoyu Li, Xialin He, Guofan Fan, Jiazhao Zhang, Jiawei He, Jiayuan Gu, Xin Jin, Kaisheng Ma, Zhizheng Zhang, He Wang, Li Yi. SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation. [arXiv]
  • Yuqi Wang*, Ke Cheng*, Jiawei He*, Qitai Wang*, Hengchen Dai, Yuntao Chen, Fei Xia, Zhaoxiang Zhang. DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model. In NeurIPS 2024 D&B track. [arXiv] [project page] [机器之心 (In Chinese)]
  • Qitai Wang, Jiawei He, Yuntao Chen, Zhaoxiang Zhang. OneTrack: Demystifying the Conflict Between Detection and Tracking in End-to-End 3D Trackers. In European Conference on Computer Vision (ECCV) 2024. [pdf]
  • Yingyan Li, Lue Fan, Jiawei He, Yuqi Wang, Yuntao Chen, Zhaoxiang Zhang, Tieniu Tan. Enhancing End-to-End Autonomous Driving with Latent World Model. In International Conference on Learning Representations (ICLR) 2025. [arXiv]
  • Jiawei He, Zehao Huang, Naiyan Wang, Zhaoxiang Zhang. Learnable Graph Matching: A Practical Paradigm for Data Association. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024. [paper] [arXiv] [code][bibtex]
  • Yuqi Wang*, Jiawei He*, Lue Fan*, Hongxin Li*, Yuntao Chen, Zhaoxiang Zhang. Driving into the Future: Multiview Visual Forecasting and Planning with World Model for Autonomous Driving. In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024. [arXiv] [project page] [code] [机器之心 (In Chinese)]
  • Jiawei He, Yuqi Wang, Yuntao Chen, Zhaoxiang Zhang. Weakly Supervised 3D Object Detection with Multi-Stage Generalization. TPAMI (major revision)[arXiv] [project page]
  • Jiawei He, Lue Fan, Yuqi Wang, Yuntao Chen, Zehao Huang, Naiyan Wang, Zhaoxiang Zhang. Tracking Objects with 3D Representation from Videos. [arXiv]
  • Jiawei He, Yuntao Chen, Naiyan Wang, Zhaoxiang Zhang. 3D Video Object Detection with Learnable Object-Centric Global Optimization. In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023. [pdf] [arXiv] [code][bibtex]
  • Yingyan Li, Yuntao Chen, Jiawei He, Zhaoxiang Zhang. Densely Constrained Depth Estimator for Monocular 3D Object Detection. In European Conference on Computer Vision (ECCV) 2022. [pdf] [code][bibtex]
  • Jiawei He, Zehao Huang, Naiyan Wang, Zhaoxiang Zhang. Learnable Graph Matching: Incorporating Graph Partitioning with Deep Feature Learning for Multiple Object Tracking. In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021. [pdf] [code] [bibtex] [poster] [YouTube] [bilibili]
  • Zhixiong Nan, Yang Feng, Jiawei He, Ping Wei, Linhai Xu, Hongbin Sun, Nanning Zheng. Scene-Guided Region Proposal Re-ranking Method for On-road Vehicle Candidate Generation. In IEEE Intelligent Vehicles Symposium (IV) 2019. [paper] [bibtex]

Presentations

  • Reconstruction-based 3D Perception. In CRIPAC Summer Symposium. July 16, 2023. [Slides]

Professional Services

  • Conference reviewer: ICLR, ICML, CVPR, ECCV, ICCV, ACCV
  • Journal reviewer: IJCV, TIP, PR, SCIS, TNNLS, TCSVT, Information Fusion, TBIOM

Contact Details

Beijing Academy of Artificial Intelligence
150 Chengfu Road, BEIJING, CHINA

北京智源人工智能研究院
北京市海淀区成府路150号

E-mail: [email protected]; [email protected]

Last update: Mar. 20, 2025.