Gangwei Xu | 许刚伟

I am a final-year Ph.D. student at Huazhong University of Science and Technology, advised by Prof. Xin Yang (expected to graduate in June 2027). As first author (including co-first author), I have published 11 papers in top-tier journals and conferences such as T-PAMI, NeurIPS, CVPR, and ICCV. My most-cited first-author paper alone has received over 500 citations on Google Scholar. I was awarded funding from the National Natural Science Foundation of China PhD Student Research Program and was honored with the title of “Academic Star” at HUST.

Currently, I am a research intern at Robbyant, Ant Group, mentored by Yinghao Xu and Qihang Zhang, focusing on video world models and robot learning.

I am actively looking for full-time positions. I’m always open to collaborations. Feel free to reach out!

Email: gwxu [at] hust.edu.cn

News

[2026-06]	Our WAM paper Next Forcing is now available as an arXiv preprint.
[2025-09]	Pixel-Perfect Depth accepted by NeurIPS 2025.
[2025-05]	IGEV++ accepted by TPAMI 2025.
[2025-03]	MonSter accepted by CVPR 2025.
[2024-10]	Honored as “Academic Star” (学术新星) at HUST.
[2024-05]	Granted the National Natural Science Foundation of China (NSFC) PhD Student Research Program.

Selected Publications

Next Forcing: World Action Modeling with Multi-Chunk Prediction

Gangwei Xu, Qihang Zhang, Jiaming Zhou, Xing Zhu, Yujun Shen, Xin Yang, Yinghao Xu

arXiv Preprint, 2026

Paper Project

A multi-chunk prediction (MCP) framework for causal WAMs that improves training convergence speed, final accuracy, and inference efficiency via MCP-accelerated parallel chunk generation.

Pixel-Perfect Depth with Semantics-Prompted Diffusion Transformers

Gangwei Xu, Haotong Lin, Hongcheng Luo, Xianqi Wang, Jingfeng Yao, Lianghui Zhu, Yuechuan Pu, Cheng Chi, Haiyang Sun, Bing Wang, Guang Chen, Hangjun Ye, Sida Peng, Xin Yang

NeurIPS 2025

Paper Code

A monocular depth foundation model with pixel-space diffusion transformers; estimated depth maps recover high-quality, flying-pixel-free point clouds.

Iterative Geometry Encoding Volume for Stereo Matching

Gangwei Xu, Xianqi Wang, Xiaohuan Ding, Xin Yang

CVPR 2023

Paper Code

A new architecture combining the complementary advantages of filtering-based and optimization-based methods.

IGEV++: Iterative Multi-range Geometry Encoding Volumes for Stereo Matching

Gangwei Xu, Xianqi Wang, Zhaoxing Zhang, Junda Cheng, Chunyuan Liao, Xin Yang

TPAMI 2025

Paper Code

An effective and efficient method for handling large disparities and extensive ill-posed regions.

Attention Concatenation Volume for Accurate and Efficient Stereo Matching

Gangwei Xu, Junda Cheng, Peng Guo, Xin Yang

CVPR 2022

Paper Code

A novel cost volume representation (Attention Concatenation Volume) for stereo matching.

Accurate and Efficient Stereo Matching via Attention Concatenation Volume

Gangwei Xu, Yun Wang, Junda Cheng, Jinhui Tang, Xin Yang

TPAMI 2024

Paper Code

A novel cost volume representation (Fast-ACV) for real-time stereo matching.

All Publications

2026

Next Forcing: World Action Modeling with Multi-Chunk Prediction

Gangwei Xu, Qihang Zhang, Jiaming Zhou, Xing Zhu, Yujun Shen, Xin Yang, Yinghao Xu

arXiv Preprint, 2026

Paper Project

2025

Pixel-Perfect Depth with Semantics-Prompted Diffusion Transformers

Gangwei Xu, Haotong Lin, Hongcheng Luo, Xianqi Wang, Jingfeng Yao, Lianghui Zhu, Yuechuan Pu, Cheng Chi, Haiyang Sun, Bing Wang, Guang Chen, Hangjun Ye, Sida Peng, Xin Yang

NeurIPS 2025 (ratings: 5,5,5,5)

Paper Code

BANet: Bilateral Aggregation Network for Mobile Stereo Matching

Gangwei Xu, Jiaxin Liu, Xianqi Wang, Junda Cheng, Yong Deng, Jinliang Zang, Yurui Chen, Xin Yang

ICCV 2025

Paper Code

IGEV++: Iterative Multi-range Geometry Encoding Volumes for Stereo Matching

Gangwei Xu, Xianqi Wang, Zhaoxing Zhang, Junda Cheng, Chunyuan Liao, Xin Yang

TPAMI 2025

Paper Code

2024

HDRFlow: Real-Time HDR Video Reconstruction with Large Motions

Gangwei Xu, Yujin Wang, Jinwei Gu, Tianfan Xue, Xin Yang

CVPR 2024

Paper Code

Selective-Stereo: Adaptive Frequency Information Selection for Stereo Matching

Xianqi Wang, Gangwei Xu, Hao Jia, Xin Yang

CVPR 2024 (Highlight)

Paper Code

Accurate and Efficient Stereo Matching via Attention Concatenation Volume

Gangwei Xu, Yun Wang, Junda Cheng, Jinhui Tang, Xin Yang

TPAMI 2024

Paper Code

2023

Iterative Geometry Encoding Volume for Stereo Matching

Gangwei Xu, Xianqi Wang, Xiaohuan Ding, Xin Yang

CVPR 2023

Paper Code

2022

Attention Concatenation Volume for Accurate and Efficient Stereo Matching

Gangwei Xu, Junda Cheng, Peng Guo, Xin Yang

CVPR 2022

Paper Code

Experience

[01/2026 - now]	Research Intern, Robbyant, Ant Group — video world models / embodied AI
[02/2025 - 12/2025]	Research Intern, Xiaomi EV — depth foundation models
[05/2023 - 12/2023]	Research Intern, Shanghai AI Laboratory — computational photography

Honors & Awards

入选中国科协青年科技人才培育工程博士生专项计划, 2025
National Scholarship (国家奖学金), PhD, 2025
挑战杯全国赛特等奖, 2025
获批首届国家自然科学基金博士研究生项目, 2024
HUST Academic Star (华中科技大学“学术新星”), 2024
National Scholarship (国家奖学金), PhD, 2024
National Scholarship (国家奖学金), Master, 2023
中国研究生 AI 大赛一等奖, 2023
Xiaomi Scholarship, 2022

Academic Services

Conference Reviewer: CVPR 2023, ICCV 2023, SIGGRAPH Asia 2023, CVPR 2024, ECCV 2024, ICRA 2024, ICLR 2025, AAAI 2025, CVPR 2025, ICCV 2025, NeurIPS 2025, AAAI 2026, ICLR 2026, CVPR 2026.

Journal Reviewer: TPAMI, IJCV, TIP, RA-L.