Latest review note
Added personal homepage, Google Scholar, and concise bio verified against public homepage with Anthropic affiliation.
Atlas / People / Detail
Research scientist at Anthropic working on trustworthy AI and deceptive alignment.
Profile status: updated
Added personal homepage, Google Scholar, and concise bio verified against public homepage with Anthropic affiliation.