I am currently an M.Phil. student in the AI Thrust at HKUST(GZ). My supervisor is Prof. Yingcong Chen, a super nice mentor.

Before that, I received my Bachelor’s degree in Computer Science and Technology from the School of Computer Science and Technology, Harbin Institute of Technology, under the supervision of Prof. Shaohui Liu, who has been tremendously supportive.

At present, I am working closely with Luozhou Wang and Guibao Shen, both of whom are very kind and supportive. My current research focuses on controllable generation, and I aim to make rapid progress and contributions in this area.

😍😍😍 Please feel free to contact me at duyihua0130@gmail.com 😍😍😍

🔥 News

  • 2026.01:  🔥 We released the project page and paper of our Video-World-Models survey.
  • 2025.12:  🎉 StereoPilot was featured by JiQiZhiXin.
  • 2025.12:  ❤️ We released the project page, code, and paper of StereoPilot.
  • 2025.12:  ❤️ We released the project page and paper of VideoMemory.
  • 2025.12:  🏎️ See you at the 2026 Formula 1 Chinese Grand Prix in Shanghai!!!

📝 Publications

*denotes equal contribution, denotes corresponding author

arXiv 2025

StereoPilot: Learning Unified and Efficient Stereo Conversion via Generative Priors

Guibao Shen*, Yihua Du*, Wenhang Ge*, Jing He, Chirui Chang, Donghao Zhou, Zhen Yang, Luozhou Wang, Xin Tao, Ying-Cong Chen

Project Page | Code | 🤗 Daily Paper | 📰 JiQiZhiXin (机器之心) | arXiv

  • Converts a 5-second monocular video into a stereoscopic video in only 11 seconds. Built a 3D dataset covering both parallel and converged formats.
arXiv 2026

VideoMemory: Toward Consistent Video Generation via Memory Integration

Jinsong Zhou*, Yihua Du*, Xinli Xu*, Luozhou Wang, Zijie Zhuang, Yehang Zhang, Shuaibo Li, Xiaojun Hu, Bolan Su, Ying-Cong Chen

Project Page | arXiv

  • Propose an entity-centric, dynamic memory bank framework that enables coherent long-form, multi-shot narrative video generation by retrieving and updating character/prop/background states to preserve identity across temporal gaps.
arXiv 2026

A Mechanistic View on Video Generation as World Models: State and Dynamics

Luozhou Wang*, Zhifei Chen*, Yihua Du, Dongyu Yan, Wenhang Ge, Guibao Shen, Xinli Xu, Leyi Wu, Man Chen, Tianshuo Xu, Peiran Ren, Xin Tao, Pengfei Wan, Ying-Cong Chen

Code | 🤗 Daily Paper | arXiv

  • Bridges video generation and world models along two axes: (1) Taxonomy: unifying implicit context vs. explicit latent compression; (2) Evolution: shifting benchmarks from “Visual Fidelity” to “Physical Persistence” and causality.
Undergraduate Project

Liveness detection method based on multi-angle forensics

Yihua Du, Puchao Zhou, Yu Li, Shaohui Liu

  • This work was completed under the guidance of my undergraduate thesis supervisor and a senior lab member.
  • Won the Excellent and Innovative Comprehensive Graduation Design award.

🎖 Honors and Awards

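  • 2025.06 Liveness detection method based on multi-angle forensics. Excellent and Innovative Comprehensive Graduation Design (Outstanding Innovation Comprehensive Design Award, conferred by the Undergraduate School of Harbin Institute of Technology).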
  • 2024.11 HKUST(GZ) RBM Postgraduate Scholarship.
  • 2024.11 The Sixth Global Campus Artificial Intelligence Algorithm Elite Competition. Ranked second on the accuracy leaderboard and won the national second prize.
  • 2024.09 Virtual Authoring Communication Community Based on Diffusion Model. National-level College Student Innovation and Entrepreneurship Training Program project.
  • 2022, 2023 University Outstanding Student.
  • 2022.09 HIT Third-Class People's Scholarship.

📖 Education

  • 2021.08 - 2025.09, Bachelor, Harbin Institute of Technology, Harbin.
  • 2023.07 - 2023.08, Summer Camp Student (Workshop), National University of Singapore, Singapore.
  • 2025.09 - now, M.Phil., Hong Kong University of Science and Technology (Guangzhou), Guangzhou.

💻 Internships

  • 2026.01 - now, Tencent, IEG, Computer Vision, Shenzhen.
  • 2025.02 - 2025.08, ENVISION Lab, HKUST(GZ), Research Assistant, Guangzhou.
  • 2024.03 - 2024.07, Meituan, Algorithm Strategy, Beijing.

📄 My Resume

🍭 Habits


Last Updated: January 22, 2026