Biography
I am a Student Researcher on the
Gemini Team at Google DeepMind, working on multimodal agentic
search and orchestration for the Gemini model family. I am also a
first-year PhD student at Rutgers University - New Brunswick, advised
by Prof. Chengzhi Mao. My research studies latent-space reasoning in
multimodal foundation models, with a focus on search, parallel
reasoning, and 3D/spatial understanding.
I am particularly interested in when continuous internal representations can support visual and geometric reasoning better than text-only reasoning traces.
Before my PhD, I worked on generative AI for 3D vision, spanning generation, inverse rendering, and geometric reconstruction. I interned at Tencent LightSpeed Studio, Microsoft Research Asia, and SmartMore, working closely with Jinglu Wang, Shuai Yang, and Jinnan Chen.
I received my M.Phil. in Artificial Intelligence from HKUST, advised by Prof. Yingcong Chen and Prof. Dan Xu, and my B.S. in Mathematics from Sun Yat-sen University.
News
- • 2025.12: We announced UltraShape 1.0, an open-source Large 3D Shape Generation Model. Technical Report and Code are available on the project page.
- • 2025.3: I started my research intern at Tencent LIGHTSPEED STUDIOS, working on Large 3D Models. Collaborations are highly welcomed.
- • 2024.12: I am looking for PhD position in 2025. Please contact me if you are interested.
Experience
- Jun 2026 — Present
Google DeepMind
Student Researcher · Gemini Team · New York, NY, USWorking on searching agent orchestration.
- Mar 2025 — Sep 2025
Tencent IEG
Research Intern · LightSpeed Studios Research · Shenzhen, CNWorking on Large 3D Models for game asset generation.
- Jun 2024 — Feb 2025
Microsoft Research Asia Stars of Tomorrow
Research Intern · Media Computing Group · Beijing, CNWorked on generalizable 3D Gaussian Splatting from unposed videos.
-
Jun 2022 — May 2024SmartMore
Research Intern · Optical Imaging Research Group · Shenzhen, CNWorked on neural 3D reconstruction with polarization cues.
-
Mar 2021 — Nov 2021Sun Yat-sen University
Research Assistant · BME AI Lab · Guangzhou, CNWorked on biomedical image segmentation.
Publications [Google Scholar]
*: Equal Contribution
Honors & Awards
| Ph.D. Fellowship, Department of Computer Science, Rutgers University | 2025 |
| Ph.D. Fellowship, Department of Computer Science, Dartmouth College | 2025 |
| Stars of Tomorrow (Award of Excellent Intern), Microsoft Research Asia | 2025 |
| McGill & Mila Quebec Ph.D. Fellowship, McGill University | 2024 |
| Postgraduate Scholarship, HKUST, GZ | 2022 |
| Undergraduate Excellent Scholarship, Sun Yat-sen University | 2019 |
Professional Services
Conference Reviewer
NeurIPS, ICLR, CVPR, AISTATS, ICML · 2025–2026
Teaching Assistant
- CS 439 Introduction to Data Science · Spring 2026
- CS 205 Discrete Mathematics · Fall 2025
Research Interests
I study latent-space reasoning in multimodal foundation models, especially for search, parallel reasoning, and 3D/spatial understanding.