LLMpeople
Home People Organizations Reports Fields Schools
Public Atlas People first, reports as evidence, organizations as context.

Atlas / People / Detail

Deep Ganguli

Co-founder and head of alignment science at Anthropic.

Researcher1 organizations6 reports

Profile status: updated

Deep Ganguli portrait
Suggest a correction
Suggest a source

Trust signals

Profile completeness41%
Public sources1
Official sources1
Last reviewedMar 13, 2026
Official homepage
updated 1 public sources

Public links

website Anthropic team profile

Organizations

core Anthropic

Reports

Alignment and RLHF Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback Alignment and RLHF Constitutional AI: Harmlessness from AI Feedback Alignment and Safety Many-shot Jailbreaking Alignment and Safety Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training Alignment and Safety Constitutional Classifiers++: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming Interpretability Tracing the thoughts of a large language model

Official and primary sources

https://www.anthropic.com/team/deep-ganguli Official source · homepage

LLMpeople is a public atlas for discovering frontier AI researchers with context, provenance, and respect.

Privacy · Terms