Gangwei Xu

Gangwei Xu | 许刚伟

I am a final-year Ph.D. student at Huazhong University of Science and Technology, advised by Prof. Xin Yang (expected to graduate in June 2027). As first author (including co-first author), I have published 11 papers in top-tier journals and conferences such as T-PAMI, NeurIPS, CVPR, and ICCV. Two of my first-author papers have each received over 400 citations on Google Scholar. I was awarded funding from the National Natural Science Foundation of China PhD Student Research Program and was honored with the title of “Academic Star” at HUST.

Currently, I am a research intern at Ant Group (RobbyAnt), mentored by Yinghao Xu and Qihang Zhang, focusing on video world models and robot learning.

I am actively looking for full-time positions. I’m always open to collaborations. Feel free to reach out!

News
[2025-09]Pixel-Perfect Depth accepted by NeurIPS 2025.
[2025-05]IGEV++ accepted by TPAMI 2025.
[2025-03]MonSter accepted by CVPR 2025.
[2024-10]Honored as “Academic Star” (学术新星) at HUST.
[2024-05]Granted the National Natural Science Foundation of China (NSFC) PhD Student Research Program.
Selected Publications
PPD
Pixel-Perfect Depth with Semantics-Prompted Diffusion Transformers
Gangwei Xu, Haotong Lin, Hongcheng Luo, Xianqi Wang, Jingfeng Yao, Lianghui Zhu, Yuechuan Pu, Cheng Chi, Haiyang Sun, Bing Wang, Guang Chen, Hangjun Ye, Sida Peng, Xin Yang
NeurIPS 2025
Paper Code

A monocular depth foundation model with pixel-space diffusion transformers; estimated depth maps recover high-quality, flying-pixel-free point clouds.

IGEV
Iterative Geometry Encoding Volume for Stereo Matching
Gangwei Xu, Xianqi Wang, Xiaohuan Ding, Xin Yang
CVPR 2023
Paper Code

A new architecture combining the complementary advantages of filtering-based and optimization-based methods.

IGEV++
IGEV++: Iterative Multi-range Geometry Encoding Volumes for Stereo Matching
Gangwei Xu, Xianqi Wang, Zhaoxing Zhang, Junda Cheng, Chunyuan Liao, Xin Yang
TPAMI 2025
Paper Code

An effective and efficient method for handling large disparities and extensive ill-posed regions.

ACVNet
Attention Concatenation Volume for Accurate and Efficient Stereo Matching
Gangwei Xu, Junda Cheng, Peng Guo, Xin Yang
CVPR 2022
Paper Code

A novel cost volume representation (Attention Concatenation Volume) for stereo matching.

Fast-ACVNet
Accurate and Efficient Stereo Matching via Attention Concatenation Volume
Gangwei Xu, Yun Wang, Junda Cheng, Jinhui Tang, Xin Yang
TPAMI 2024
Paper Code

A novel cost volume representation (Fast-ACV) for real-time stereo matching.

All Publications
Experience
[01/2026 - now]Research Intern, Ant Group (RobbyAnt) — video world models / embodied AI
[02/2025 - 12/2025]Research Intern, Xiaomi EV — depth foundation models
[05/2023 - 12/2023]Research Intern, Shanghai AI Laboratory — computational photography
Honors & Awards
Academic Services

Conference Reviewer: CVPR 2023, ICCV 2023, SIGGRAPH Asia 2023, CVPR 2024, ECCV 2024, ICRA 2024, ICLR 2025, AAAI 2025, CVPR 2025, ICCV 2025, NeurIPS 2025, AAAI 2026, ICLR 2026, CVPR 2026.

Journal Reviewer: TPAMI, IJCV, TIP, RA-L.