LLMpeople
Home People Organizations Reports Fields Schools
Public Atlas People first, reports as evidence, organizations as context.

Atlas / People / Detail

Buck Shlegeris

Buck Shlegeris is a Member of Technical Staff at Anthropic whose public homepage focuses on AI safety, model evaluations, and alignment.

Researcher1 organizations3 reports

Profile status: updated

Buck Shlegeris portrait
Suggest a correction
Suggest a source

Trust signals

Profile completeness39%
Public sources1
Official sources1
Last reviewedMar 13, 2026
Official homepage
updated 1 public sources

Public links

website Personal homepage

Organizations

core Anthropic

Reports

Alignment and Safety Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training Alignment and Safety Alignment faking in large language models Interpretability Tracing the thoughts of a large language model

Official and primary sources

https://buckslager.com/ Official source · homepage

LLMpeople is a public atlas for discovering frontier AI researchers with context, provenance, and respect.

Privacy · Terms