Latest review note
Added personal homepage and a source-grounded Anthropic alignment science bio tied to Sleeper Agents.
Atlas / People / Detail
Member of technical staff at Anthropic working on alignment science and the evaluation of hidden objectives in language models.
Profile status: updated
Added personal homepage and a source-grounded Anthropic alignment science bio tied to Sleeper Agents.