Atlas / Reports / Detail
DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search
Mathematical Reasoning Models
Connected researchers
Runxin Xu
DeepSeek
Researcher at DeepSeek whose public homepage describes work on DeepSeek R1, V1, V2, V3, Math, Coder, and mixture-of-experts systems.
Renqi Xu
DeepSeek
PhD student at Tsinghua University and visiting student at MIT, focused on theorem proving and formal verification in Lean.
Daya Guo
DeepSeek / Moonshot AI
DeepSeek researcher focused on NLP, code intelligence, and LLM reasoning, with public work spanning DeepSeek-Coder, DeepSeekMath, DeepSeek-V2, DeepSeek-V3, and DeepSeek-R1.
Xinyu Zheng
DeepSeek
PhD student at Tsinghua University working on formal theorem proving, machine learning, formal methods, and programming languages.
Qihao Zhu
DeepSeek
Research scientist focused on foundation models and multimodal large language models; his homepage notes earlier work at DeepSeek AI and current research at the University of Southern California.
Zhiyuan Gou
DeepSeek
Researcher working on theorem proving and reinforcement learning.
Y. Wu
DeepSeek
Yu Wu is a researcher at DeepSeek AI and head of its LLM Alignment Team. His public homepage highlights work on reinforcement learning and alignment for the DeepSeek model family, including DeepSeek-V3, DeepSeek-R1, and DeepSeekMath, and notes prior work at Microsoft Research Asia.
Chong Ruan
DeepSeek
Researcher at DeepSeek and former master's student in EECS at Peking University. Public profiles list work on LLMs, multimodal systems, transformers, BERT, and machine translation.