
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

A report in the Mixture-of-Experts Language Models field from DeepSeek, with 17 connected researchers in the LLMpeople atlas.

DeepSeek · 2024-01-11 · 17 researchers
Field: Mixture-of-Experts Language Models
Organization: DeepSeek
arXiv: 2401.06066

Canonical link

https://arxiv.org/abs/2401.06066

Connected researchers

Researcher · 4 reports

Damai Dai

DeepSeek

DeepSeek report author whose DBLP publication record includes DeepSeek LLM, DeepSeekMoE, DeepSeek-V2, DeepSeek-V3, and DeepSeek-R1 work.

DeepSeek
Researcher · 3 reports

Chengqi Deng

DeepSeek

Research scientist at DeepSeek with public GitHub projects spanning language models and AI systems.

DeepSeek
China
Researcher · 4 reports

Chenggang Zhao

DeepSeek

Research engineer at DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.

DeepSeek
China
Researcher · 2 reports

R. X. Xu

DeepSeek

R. X. Xu is a research scientist at DeepSeek AI. His homepage says he works on trustworthy and efficient large language models, open-ended reasoning, and AI for healthcare.

DeepSeek
Researcher · 7 reports

Huazuo Gao

DeepSeek

Researcher at DeepSeek AI working on decision-making and post-training for large language models.

DeepSeek
Researcher · 4 reports

Deli Chen

DeepSeek

DeepSeek report author whose DBLP record includes DeepSeek LLM, DeepSeekMoE, DeepSeek-Coder-V2, DeepSeek-V3, and DeepSeek-R1 work.

DeepSeek
Researcher · 4 reports

Jiashi Li

DeepSeek

DeepSeek team member and co-author of the DeepSeek-V3, DeepSeek-V2, and DeepSeek LLM technical reports.

DeepSeek
Researcher · 3 reports

Wangding Zeng

DeepSeek

Wangding Zeng is a researcher at DeepSeek. His OpenReview profile also lists graduate and undergraduate study at Beijing University of Posts and Telecommunications.

DeepSeek
Researcher · 4 reports

Xingkai Yu

DeepSeek

Xingkai Yu is a report-backed author in the LLMpeople atlas, connected through DeepSeek technical reports including DeepSeek-V3, DeepSeek-V2, DeepSeek LLM, and DeepSeekMoE; his public GitHub profile lists DeepSeek affiliation.

DeepSeek
China
Researcher · 8 reports

Y. Wu

DeepSeek

Yu Wu's public homepage says he is a technical staff member at DeepSeek AI who leads the LLM Alignment Team and works on writing, QA, AI search, reasoning, and safety.

DeepSeek
Researcher · 5 reports

Zhenda Xie

DeepSeek

DeepSeek report author listed on DeepSeek LLM, DeepSeekMoE, DeepSeek-Coder, DeepSeek-V2, and DeepSeek-V3 reports, with report-backed work on large language models, mixture-of-experts systems, and code models.

DeepSeek
United States
Researcher · 2 reports

Y. K. Li

DeepSeek

Y. K. Li is a report-backed author in the LLMpeople atlas, connected through 2 technical reports.

DeepSeek
Researcher · 4 reports

Panpan Huang

DeepSeek

DeepSeek team member and co-author of the DeepSeek-V3, DeepSeek-V2, and DeepSeek LLM technical reports.

DeepSeek
Researcher · 5 reports

Fuli Luo

DeepSeek

Research scientist working on large language models and retrieval-augmented generation; creator of the open-source project tiny-universe.

DeepSeek
Researcher · 6 reports

Chong Ruan

DeepSeek

Researcher at DeepSeek and former master's student in EECS at Peking University. Public profiles list work on LLMs, multimodal systems, transformers, BERT, and machine translation.

DeepSeek
China
Researcher · 1 report

Zhifang Sui

DeepSeek

Zhifang Sui is a report-backed author in the LLMpeople atlas, connected through DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models.

DeepSeek
Researcher · 8 reports

Wenfeng Liang

DeepSeek

Wenfeng Liang, also known as Liang Wenfeng, is linked to DeepSeek technical reports in LLMpeople and is identified in public references as the founder and CEO of DeepSeek.

DeepSeek
China

LLMpeople is a public atlas for discovering frontier AI researchers with context, provenance, and respect.
