Latest review note
Added Anthropic profile and interview links plus a source-grounded AI safety and alignment bio tied to hidden-objectives auditing.
Anthropic researcher focused on AI safety, alignment, and auditing hidden objectives in language models.
Profile status: updated