LLMpeople
Public Atlas: people first, reports as evidence, organizations as context.


Yuntao Bai

Anthropic researcher whose work includes reinforcement learning from human feedback and Constitutional AI; previously a Sherman Fairchild Postdoctoral Scholar in theoretical high-energy physics at Caltech.

Researcher · 1 organization · 4 reports

Profile status: updated


Trust signals

Profile completeness: 55%
Public sources: 3
Official sources: 1
Last reviewed: Jun 8, 2026
Scholar profile · Structured work
report_author · Anthropic

Work

Anthropic · Role not listed

Public links

openreview · OpenReview profile

Organizations

core · Anthropic

Reports

Alignment and RLHF · Collective Constitutional AI: Aligning a Language Model with Public Input
Alignment and RLHF · Constitutional AI: Harmlessness from AI Feedback
Alignment and Safety · Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training
Alignment and RLHF · Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback

Official and primary sources

Yuntao Bai OpenReview profile · Official source · openreview · openreview.net

Supporting sources

Mistral 7B · Supporting source · report · arXiv
Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training · Supporting source · report · arXiv

LLMpeople is a public atlas for discovering frontier AI researchers with context, provenance, and respect.
