About Me

I am a first-year Ph.D. student at MMLAB@HKUST, under the supervision of Prof. Anyi Rao and Prof. Huamin Qu. I obtained my masterโ€™s degree at Zhejiang University, where I collaborated with Prof. Wei-Wei Xu and Prof. Changqing Zou. Previously, I received my Bachelorโ€™s degree from Hunan University.

My research focuses on 3D/4D content generation and world model:

  • Generalizable 3D Foundation Models: Generative 3D reconstruction from sparse, multi-view, or in-the-wild data.
  • Interactive World Models: Real-time inference and control with Long-term memory.

๐Ÿ”ฅ News

  • 2025.06: ย ๐ŸŽ‰๐ŸŽ‰ One paper accepted by ICCV 2025.
  • 2024.12: ย ๐ŸŽ‰๐ŸŽ‰ One paper accepted by AAAI 2025.
  • 2024.02: ย ๐ŸŽ‰๐ŸŽ‰ One paper accepted by CVPR 2024.
  • 2023.12: ย ๐ŸŽ‰๐ŸŽ‰ One paper accepted by TCSVT.

๐Ÿ“ Publications

ICCV 2025
sym

[ICCV 2025] SpatialCrafter: Unleashing the Imagination of Video Diffusion Models for Scene Reconstruction from Limited Observations
Songchun Zhang, Huiyao Xu, Sitong Guo, Zhongwei Xie, Pengwei Liu, Hujun Bao, Weiwei Xu, Changqing Zou.
[Project page] [paper]

  • This paper presents a 3D scene reconstruction method from sparse inputs.
AAAI 2025
sym

[AAAI 2025] Pragmatist: Multiview Conditional Diffusion Models for High-Fidelity 3D Reconstruction from Unposed Sparse Views
Songchun Zhang, Chunhui Zhao.
[Project page] [paper]

  • This paper presents a 3D object reconstruction method from sparse and unposed inputs.
CVPR 2024
sym

[CVPR 2024] 3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation
Songchun Zhang, Yibo Zhang, Quan Zheng, Rui Ma, Wei Hua, Hujun Bao, Weiwei Xu, Changqing Zou.
[Project page] [paper]

  • This paper presents a novel text-driven 3D scene generation method that improves visual quality and 3D consistency.
TCSVT 2023
sym

[TCSVT 2023] Cross-Video Contextual Knowledge Exploration and Exploitation for Ambiguity Reduction in Weakly Supervised Temporal Action Localization
Songchun Zhang, Chunhui Zhao.
[paper] [code]

  • This paper presents a weakly-supervised action localization framework leveraging cross-video information.

๐Ÿ–ฅ๏ธ Experience

  • March 2024 - Sept. 2024
    Research Intern - Anti-Entropy Research Group, miHoYo
    Research included: 3D Scene Generation, Video Diffusion

  • Sept. 2023 - Feb. 2024
    Research Intern - Taobao and Tmall Group, Alibaba
    Research included: Sparse View Object Reconstruction

  • April 2023 - Dec. 2023
    Research Assistant - State Key Lab of CAD&CG, Zhejiang University
    Advisor: Prof. Changqing Zou and Prof. Weiwei Xu
    Research included: Text-Guided 3D Generation

  • Sept. 2021 - April 2022
    Research Intern - OpenDriveLab of Shanghai AI Laboratory
    Advisor: Prof. Hongyang Li and Xiangwei Geng
    Research included: Self-supervised Depth Estimation

๐ŸŽ“ Academic Service

  • Reviewing
    • Conferences: CVPR, ICRA, NeurIPS, AAAI, Siggraph Asia
    • Journals: TCSVT, TMM, KBS

๐ŸŽ– Honors and Awards

  • 2024.12 ย ย  Outstanding Graduate of Zhejiang University
  • 2021.12 ย ย  National Scholarship (Top 1% among all undergraduates)
  • 2020.12 ย ย  First Prize in China Undergraduate Mathematical Contest in Model (Top 0.1% among all undergraduates)
  • 2020.12 ย ย  National Scholarship (Top 1% among all undergraduates)
  • 2019.12 ย ย  National Scholarship (Top 1% among all undergraduates)