Home People Organizations Reports Fields Schools

Public Atlas People first, reports as evidence, organizations as context.

Atlas / People / Detail

Jesse Mu

Jesse Mu is a Research Scientist at Anthropic and a visiting researcher at Stanford University. His work spans machine learning, AI safety, reinforcement learning, and deep learning theory.

Researcher1 organizations1 reports

Profile status: updated

Suggest a correction

Suggest a source

Trust signals

Profile completeness58%

Public sources2

Official sources2

Last reviewedMar 13, 2026

Official homepage Scholar profile

updated 2 public sources

Public links

website Personal homepage google_scholar Google Scholar profile

Organizations

core Anthropic

Reports

Alignment and Safety Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training

Official and primary sources

https://jessemu.com/ Official source · homepage https://scholar.google.com/citations?user=JQ7zzFEAAAAJ&hl=en&oi=ao Official source · scholar

LLMpeople is a public atlas for discovering frontier AI researchers with context, provenance, and respect.

Privacy · Terms