person
Isabelle Frances-Wright
person · active · provisional
isabelle-frances-wright-e2228ac3 · 1 event · first seen 13d ago
Aliases: Isabelle Frances-Wright
Recent events (1)
Anthropic publishes elections-risk testing methodology and releases automated evaluation tools
Anthropic describes its two-stage process for identifying and mitigating elections-related risks in Claude: qualitative 'Policy Vulnerability Testing' (PVT) conducted with external subject matter experts, followed by large-scale automated evaluations. The post details how findings from PVT inform mitigation strategies such as policy updates, model fine-tuning, and response behavior changes, with a case study on election administration accuracy. Anthropic is also releasing some of its automated evaluation tools publicly to help other organizations improve election integrity efforts.