LLMpeople
Home People Organizations Reports Fields Schools
Public Atlas People first, reports as evidence, organizations as context.

Atlas / Reports / Detail

On the Biology of a Large Language Model

Interpretability

AnthropicUndated13 researchers
Field
Interpretability
Organization
Anthropic
arXiv
2504.19173

Canonical link

https://arxiv.org/abs/2504.19173

Connected researchers

Profile Reports

Samuel Marks

Anthropic

Senior research engineer at Anthropic interested in agent foundations, model organisms of misalignment, and human-computer interaction.

Anthropic
Unknown 6
Profile Reports

David Duvenaud

Anthropic

Associate Professor at the University of Toronto whose research spans deep learning, probabilistic modeling, and machine learning methods for science and AI safety.

Anthropic
Canada 4
Profile Reports

Nora Belrose

Anthropic

Nora Belrose is an AI researcher whose work studies neural language models, latent structure, and cognition. She has contributed to Anthropic research on tracing and interpreting reasoning in large language models.

Anthropic
Unknown 2
Profile Reports

David Bau

Anthropic

Research scientist at Anthropic and assistant professor of computer science at Northeastern University working on interpretability and model understanding.

Anthropic
United States 3
Profile Reports

Ethan Perez

Anthropic

Research scientist at Anthropic focused on scalable oversight, AI safety, and language model evaluation; previously worked at New York University and Google.

Anthropic
Unknown 8
Profile Reports

Stephen Casper

Anthropic

Alignment science researcher at Anthropic whose work focuses on black-box evaluations, white-box evaluations, and AI risk.

Anthropic
Unknown 1
Profile Reports

Yonatan Belinkov

Anthropic

Associate Professor in the Technion Faculty of Data and Decision Sciences and a visiting research professor at Google working on natural language processing and machine learning.

Anthropic
Israel 1
Profile Reports

Nikhil Prakash

Anthropic

Profile still being enriched.

Anthropic
Unknown 2
Profile Reports

Benjamin Crouzier

Anthropic

Profile still being enriched.

Anthropic
Unknown 1
Profile Reports

Can Rager

Anthropic

Profile still being enriched.

Anthropic
Unknown 1
Profile Reports

David Krueger

Anthropic

Profile still being enriched.

Anthropic
Unknown 1
Profile Reports

Eric J. Michaud

Anthropic

Profile still being enriched.

Anthropic
Unknown 1
Profile Reports

Max Tegmark

Anthropic

Profile still being enriched.

Anthropic
Unknown 1

LLMpeople is a public atlas for discovering frontier AI researchers with context, provenance, and respect.