Ph.D. Student
Wangxuan Institute of Computer Technology (WICT) [Google Scholar] [GitHub] [Twitter] [WeChat] |
![]() |
Hi there👋, my name is Chenguo Lin (in Chinese: 林琛果). I am a third-year Ph.D. student of computer science and artificial intelligence at Peking University, China, supervised by Prof. Yadong Mu. Currently, I'm working as a research intern at ByteDance, focusing on spatial intelligence and 3D/4D AIGC. I've spent wonderful time as a research assistant/intern with Dr. Zhirong Wu at Microsoft Research Asia, Prof. Ping Luo at HKU, and Dr. Chaoning Zhang at KAIST. My research focuses on advancing the frontier of Multi-modal World Models: AI systems to 👀perceive, 🧠reason about, and 🦾act within the physical world. I pursue this vision through three tightly connected directions: (1) Multi-modality Learning for Visual Reasoning, (2) Geometry-grounded Perception for Embodied Agents, and (3) Generative Models for Controllable 3D/4D Content. By integrating these three directions, my overarching goal is to develop general-purpose intelligence systems: AI that can see, understand, generate, and interact with complex physical environments. I'm always open to research discussions and collaborations. Feel free to contact me if you are interested.
*: equal contribution; †: project lead
arXiv 2025
![]() |
|
arXiv 2025
|
|
ICLR 2025
|
|
ICLR 2025
|
|
NeurIPS 2024
|
|
arXiv 2024
![]() |
|
ICLR 2024
|
|
TMLR 2024
![]() |
|
© Chenguo Lin