night.yuchen [at] gmail [dot] com
Hello World! My name is Yuchen Zhuang. I am a Research Scientist at Google DeepMind and a core contributor on the Gemini Thinking / Reasoning team, contributing to the Gemini 3.0 and Gemini 3.1 models. I obtained my Ph.D. in Machine Learning from the Georgia Institute of Technology, advised by Prof. Chao Zhang. I have also been very fortunate to work closely with Prof. Le Song and Prof. Bo Dai. My research focuses on building large language models (LLMs) and LLM-based agents with reasoning and planning capabilities for challenging real-world problems, e.g., math, coding, and gaming. My recent research covers the following directions:
- Post-training and alignment: LLM post-training over different stages, including SFT, RLHF, preference optimization, and reward modeling for improving LLM capabilities in coding, thinking, and instruction following;
- Advanced agentic coding: Enhancing model capabilities in complex agentic coding tasks, e.g., software engineering (SWE) and machine learning engineering (MLE), via reinforcement learning and reward modeling;
- Search- and tool-integrated learning: Effective and efficient frameworks that integrate external knowledge and tools into LLM reasoning, including retrieval-augmented generation (RAG), tool use, and personalization.
News
- [01/2026] Two papers accepted at ICLR'26: MLE-Smith for scaling MLE tasks with an automated multi-agent pipeline, and MedAgentGym (Oral) for agentic training in biomedical data science!
- [09/2025] Two papers accepted at NeurIPS'25: MLE-Dojo for interactive environments empowering LLM agents in MLE, and Matryoshka Pilot for learning to drive black-box LLMs!
- [05/2025] Joined Google DeepMind as a Research Scientist working on Gemini for reasoning and coding!
- [01/2025] One paper accepted at NAACL'25 (Oral), on enhancing LLMs' fundamental agentic capabilities through continual pre-training.
- [11/2024] Awarded the 2024 J.P. Morgan Chase AI PhD Fellowship.
- [09/2024] Two papers accepted at NeurIPS'24, on LLM personalization and on LLM alignment with representation editing.
- [06/2024] Our ICML'24 paper has been selected as a Spotlight!
- [05/2024] One paper accepted at ICML'24, introducing a novel method for black-box LLM adaptation.
- [01/2024] One paper accepted at ICLR'24, on efficient action-space navigation for LLM agents.
Selected Publications
Please refer to my Google Scholar for the full list. (* = equal contribution)
MLE-Smith: Scaling MLE Tasks with Automated Multi-Agent Pipeline
ICLR'26
MedAgentGym: A Scalable Agentic Training Environment for Code-Centric Reasoning in Biomedical Data Science
ICLR'26 (Oral)
MLE-Dojo: Interactive Environments for Empowering LLM Agents in Machine Learning Engineering
NeurIPS'25
Matryoshka Pilot: Learning to Drive Black-Box LLMs with LLMs
NeurIPS'25
Hephaestus: Improving Fundamental Agent Capabilities of Large Language Models through Continual Pre-Training
NAACL'25 (Oral)
BBox-Adapter: Lightweight Adapting for Black-Box Large Language Models
ICML'24 (Spotlight)
ToolChain*: Efficient Action Space Navigation in Large Language Models with A* Search
ICLR'24
Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias
Experience
Google DeepMind, Gemini Thinking
Topic: Gemini for reasoning and coding
Amazon, Rufus Team
Topic: Pre-Training Agent LLM to Enhance Fundamental Agentic Capabilities
Adobe Research
Topic: ToolChain* - Efficient Action Space Navigation with A* Search [ICLR'24]
Amazon, Personalization Team
Topic: G-STO - Sequential Shopping Intention Detection [CIKM'23]
Academic Services
- Area Chair: NeurIPS 2026-, ICML 2026-, COLM 2026-, ACL 2024-, EMNLP 2024-, NAACL 2025-.
- Conference Program Committee: NeurIPS 2023-2025; ICLR 2023-2026; ICML 2023-2025; COLM 2024-2025; KDD 2021-2023; ACL 2021-2024; AAAI 2023-2024; AISTATS 2024-2025; SDM 2024.
Selected Awards
- [2024] J.P. Morgan Chase AI PhD Fellowship
- [2023] NeurIPS Scholar Award
- [2023] Best Paper Award, ACM BCB
- [2023] ACM SIGKDD Student Travel Grant
- [2020] Second Prize, Excellent Undergraduate Graduation Thesis, Jiangsu Province
- [2019] Most Influential Graduate Award Nomination (Top 20/4,000), Southeast University
- [2018-2019] Qingyun Sun Innovation Scholarship, Southeast University