Public Atlas People first, reports as evidence, organizations as context.

Buck Shlegeris

Buck Shlegeris is a Member of Technical Staff at Anthropic whose public homepage focuses on AI safety, model evaluations, and alignment.

Researcher · 1 organization · 3 reports

Profile status: updated

Trust signals

Profile completeness: 39%

Public sources: 1

Official sources: 1

Last reviewed: Mar 13, 2026

Official homepage

updated · 1 public source

Public links

Website: Personal homepage

Organizations

Anthropic (core)

Reports

Alignment and Safety · Alignment faking in large language models

Alignment and Safety · Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training

Interpretability · Tracing the thoughts of a large language model

Official and primary sources

https://buckslager.com/ Official source · homepage

LLMpeople is a public atlas for discovering frontier AI researchers with context, provenance, and respect.