
Xu Zheng

AI Thrust,
Hong Kong University of Science and Technology (HKUST)
zhengxu128@gmail.com



👋 I am a Ph.D. candidate in the AI Thrust at HKUST(GZ). I am fortunate to be advised by Prof. Xuming Hu @ HKUST and Prof. Raymond Chi-Wing Wong @ HKUST. I have served as a Resident Doctoral Researcher at INSAIT supervised by Prof. Luc Van Gool from 2025.02 to 2026.02. Recently, I have also been collaborating with Prof. Philip S. Yu @ UIC, Prof. Nicu Sebe @ UNITN, Linfeng Zhang @ SJTU, and Kailun Yang @ HNU.

My doctoral research develops robust and interpretable multi-modal learning algorithms spanning perception, understanding, reasoning, and generation. My two main doctoral research directions are:

Omnidirectional Vision
DPPASS CVPR 2023
DATR ICCV 2023
360SFUDA CVPR 2024
360SFUDA++ TPAMI 2025
OmniSAM ICCV 2025 Highlight
UNLOCK ICCV 2025
Pano-R1 ACM MM Asia 2025
Multi-modal Visual Understanding
ExACT CVPR 2024 Highlight
EventDance CVPR 2024
EventBind ECCV 2024
MAGIC ECCV 2024
Any2Seg ECCV 2024 Oral
MMSS-Bench CVPRW 2025 Best Paper
MFEnR ICCV 2025

My recent research interest lies in:

Artificial Intelligence Generated Content (AIGC)
RealRAG ICML 2025
TransDiff arXiv 2025
Multimodal Foundation Models
UniBind CVPR 2024
UiG arXiv 2025
Scene Understanding & Spatial Intelligence
Pano-R1 ACM MM Asia 2025
EgoNight ICLR 2026
Novel / Omnidirectional Sensors
360SFUDA++ TPAMI 2024
OmniSAM ICCV 2025
Robustness & Security
CIARD ICCV 2025
MRPD AAAI 2026

I have also written survey papers on cutting-edge topics.

🔥 I am actively seeking job opportunities (academia & industry) for Fall 2026!

News

  • 2026.01: Five papers accepted to ICLR 2026.
  • 2026.01: One paper accepted to ACM ToMM.
  • 2025.11: Selected for the MBZUAI Machine Learning Winter School 2026 in Abu Dhabi.
  • 2025.11: Two papers accepted to AAAI 2026.
  • 2025.10: The first Multi-modal Spatial Reasoning survey released: Paper.
  • 2025.10: One paper accepted to IJCV.
  • 2025.10: One paper accepted to IEEE TCSVT: CLIP-to-Seg.
  • 2025.09: Two papers accepted to NeurIPS 2025: Domain-RAG and HoloV.
  • 2025.06: One paper accepted to BMVC 2025: Split Matching.
  • 2025.06: Four papers (one Highlight (2.8%)) accepted to ICCV 2025: OmniSAM (Highlight), CIARD, UNLOCK, and Unimodal Bias.
  • 2025.06: Our paper was selected as Best Paper at CVPR 2025 @ TMM Open-World! Paper.
  • 2025.06: One paper accepted to IROS 2025: SHIFTNet.
  • 2025.05: Two papers accepted to ACL 2025 Findings: MMUNLearner and Mathematical Reasoning Survey.
  • 2025.05: One paper accepted to ICML 2025: RealRAG.
  • 2025.04: The first RAG in CV survey released: Paper.
  • 2025.04: Our paper was accepted to CVPR 2025 @ TMM Open-World as an Oral Presentation: MMSS-Bench.
  • 2025.02: Visiting INSAIT as a Resident Doctoral Researcher! LinkedIn.
  • 2025.01: Successfully passed the PhD Qualifying Examination!
  • 2024.12: Invited as an Area Chair of PDLM @ AAAI 2025.
  • 2024.10: One paper accepted to IEEE TPAMI: 360SFUDA++.
  • 2024.10: Oral presentation @ ECCV 2024 Oral Session 5A: Segmentation Video.
  • 2024.09: One paper accepted to Pattern Recognition.
  • 2024.07: Three papers (one Oral (1.5%)) accepted to ECCV 2024.
  • 2024.03: One paper accepted to IEEE CAI 2024.
  • 2024.03: One paper accepted to Pattern Recognition.
  • 2024.03: Five papers (one Highlight (2.8%)) accepted to CVPR 2024.
  • 2024.02: Two papers accepted to ICRA 2024.
  • 2023.07: Two papers accepted to ICCV 2023.
  • 2023.03: One paper accepted to CVPR 2023.

Invited Talks

  • “Omnidirectional Vision: From Scene Understanding, Spatial Intelligence to Industrial Applications”
    SPIC Energy Science and Technology Research Institute, Shanghai, China, August 2025

  • “PANORAMA: Exploring the Industrial Potentials of Omnidirectional Vision”
    Yangtze River Delta International Talent Port, Wuxi, China, August 2025

  • “Retrieval-augmented Realistic Image Generation via Self-reflective Contrastive Learning”
    VIVO, Shenzhen, China, August 2025. Invited by Dr. Kanzhi Wu

Mentorship

Current: Chenfei Liao (MPhil, HKUST-GZ); Zihao Dongfang (RA, HKUST-GZ); Ziqiao Weng (MPhil, HKUST-GZ)

Past: Yuanhuiyi Lyu (PhD, HKUST-GZ); Lutao Jiang (PhD, HKUST-GZ); Jialei Chen (PhD, Nagoya); Mengzhen Chi (PhD, NEU); Junha Moon (MPhil, HKUST-GZ); Kaiyu Lei (MPhil, HKUST-GZ); Leyi Sheng (UG, HKUST-GZ); Ding Zhong (MS, Michigan); Yunhao Luo (PhD, Umich); Tianbo Pan (PhD, NUS); Zhenquan Zhang (MPhil, SCUT); Boyuan Zheng (MPhil, Tongji)

✉️ Feel free to contact me for discussion and collaboration!

Academic Services

  • Area Chair: PDLM Workshop @ AAAI 2025
  • Reviewer: IJCV, TIP, TNNLS, TMM, TCI, Neurocomputing, etc.
  • PC Member: ICLR (2024, 2025, 2026), CVPR (2025, 2026), ICML (2025), ICCV (2025), ECCV (2024), NeurIPS (2024, 2025), AAAI (2026), ACM MM (2025), ICRA (2025), ICME (2025), WACV (2026)