DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

Canonical link

Connected researchers

Researcher at DeepSeek and former master's student in EECS at Peking University. Public profiles list work on LLMs, multimodal systems, transformers, BERT, and machine translation.

AI researcher at DeepSeek working on natural language processing, code intelligence, and large language model reasoning.

Research scientist focused on foundation models and multimodal large language models; his homepage notes earlier work at DeepSeek AI and current research at the University of Southern California.

Researcher at DeepSeek whose public homepage describes work on DeepSeek R1, V1, V2, V3, Math, Coder, and mixture-of-experts systems.

Yu Wu's public homepage says he is a technical staff member at DeepSeek AI who leads the LLM Alignment Team and works on writing, QA, AI search, reasoning, and safety.

PhD student at Tsinghua University working on formal theorem proving, machine learning, formal methods, and programming languages.

Researcher working on theorem proving and reinforcement learning.

PhD student at Tsinghua University and visiting student at MIT, focused on theorem proving and formal verification in Lean.

Canonical link

Chong Ruan

Daya Guo

Qihao Zhu

Runxin Xu

Y. Wu

Xinyu Zheng

Zhiyuan Gou

Renqi Xu