About Me
I am an incoming Ph.D. student at MMLAB@HKUST, under the supervision of Prof. Anyi Rao and Prof. Huamin Qu. Currently, I am pursuing a master’s degree in Control Science and Engineering at Zhejiang University under the supervision of Prof. Chunhui Zhao and in collaboration with Prof. Wei-Wei Xu and Prof. Changqing Zou. Previously, I obtained my Bachelor’s degree in the College of Electrical and Information Engineering from Hunan University. My research experience focuses on controllable multimodal 3D/4D content generation.
🔥 News
- 2024.12: 🎉🎉 One paper accepted by AAAI 2025.
- 2024.02: 🎉🎉 One paper accepted by CVPR 2024.
- 2023.12: 🎉🎉 One paper accepted by TCSVT.
📝 Publications

[arXiv] SpatialCrafter: Unleashing the Imagination of Video Diffusion Models for Scene Reconstruction from Limited Observations
Songchun Zhang, Huiyao Xu, Sitong Guo, Zhongwei Xie, Pengwei Liu, Hujun Bao, Weiwei Xu, Changqing Zou.
[Project page]
[paper]
- This paper presents a 3D scene reconstruction method from sparse inputs.

[AAAI 2025] Pragmatist: Multiview Conditional Diffusion Models for High-Fidelity 3D Reconstruction from Unposed Sparse Views
Songchun Zhang, Chunhui Zhao.
[Project page]
[paper]
- This paper presents a 3D object reconstruction method from sparse and unposed inputs.

[CVPR 2024] 3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation
Songchun Zhang, Yibo Zhang, Quan Zheng, Rui Ma, Wei Hua, Hujun Bao, Weiwei Xu, Changqing Zou.
[Project page]
[paper]
- This paper presents a novel text-driven 3D scene generation method that improves visual quality and 3D consistency.

[TCSVT 2023] Cross-Video Contextual Knowledge Exploration and Exploitation for Ambiguity Reduction in Weakly Supervised Temporal Action Localization
Songchun Zhang, Chunhui Zhao.
[paper]
[code]
- This paper presents a weakly-supervised action localization framework leveraging cross-video information.
- [AAAI 2025] Pragmatist: Multiview Conditional Diffusion Models for High-Fidelity 3D Reconstruction from Unposed Sparse Views, Songchun Zhang, Chunhui Zhao.
- [CVPR 2024] 3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation, Songchun Zhang, Yibo Zhang, Quan Zheng, Rui Ma, Wei Hua, Hujun Bao, Weiwei Xu, Changqing Zou
- [TCSVT 2023] Cross-Video Exploration and Exploitation for Ambiguity Reduction in Weakly Supervised Temporal Action Localization, Songchun Zhang, Chunhui Zhao.
- [ICRA 2023] Dyna-DepthFormer: Multi-frame Transformer for Self-Supervised Depth Estimation in Dynamic Scenes, Songchun Zhang, Chunhui Zhao.
🖥️ Experience
-
March 2024 - Sept. 2024
Research Intern - Anti-Entropy Research Group, miHoYo
Advisor: Cheng Lin
Research included: 3D Scene Generation, Video Diffusion -
Sept. 2023 - Feb. 2024
Research Intern - Taobao and Tmall Group, Alibaba
Research included: Sparse View Object Reconstruction -
April 2023 - Dec. 2023
Research Assistant - State Key Lab of CAD&CG, Zhejiang University
Advisor: Prof. Changqing Zou and Prof. Weiwei Xu
Research included: Text-Guided 3D Generation -
Sept. 2021 - April 2022
Research Intern - OpenDriveLab of Shanghai AI Laboratory
Advisor: Prof. Hongyang Li and Xiangwei Geng
Research included: Self-supervised Depth Estimation
🎓 Academic Service
- Reviewing
- Conferences: CVPR, ICRA, NeurIPS, AAAI
- Journals: TCSVT, TMM, KBS
🎖 Honors and Awards
- 2024.12 Outstanding Graduate of Zhejiang University
- 2021.12 National Scholarship (Top 1% among all undergraduates)
- 2020.12 First Prize in China Undergraduate Mathematical Contest in Model (Top 0.1% among all undergraduates)
- 2020.12 National Scholarship (Top 1% among all undergraduates)
- 2019.12 National Scholarship (Top 1% among all undergraduates)