5Anthropic News·16d ago

Anthropic publishes U.S. Elections Readiness summary covering policy, enforcement, and evaluation work

Anthropic released a summary of its election-integrity measures ahead of the November 5, 2024 U.S. elections, covering usage policy prohibitions on political campaigning and misinformation, automated enforcement systems, and red-teaming/vulnerability testing programs. The company implemented a TurboVote redirect for voting-information queries and released some of its automated election-safety evaluations publicly to support industry-wide efforts. The post documents Anthropic's first full election-cycle experience deploying generative AI at scale under explicit safety constraints.

AI Safety Research Regulatory Developments Claude Democracy Works Amazon Web Services Google Cloud Platform TurboVote Anthropic

Related guides (4)

Claude

Claude: Anthropic's AI Assistant Built for Safety and Scale

Read asBeginner In-depth

Amazon Web Services

Amazon Web Services: The Cloud Backbone of the AI Era

Read asBeginner In-depth

Anthropic

Anthropic: The AI Safety Company at the Center of the Frontier

Read asBeginner

AI Safety ResearchTopic guide

AI Safety Research: From Lab Evals to Geopolitical Flashpoint

Read asIn-depth

Related events (8)

5Anthropic News·17d ago·source ↗

Anthropic outlines election safety policies and interventions for 2024 global elections

Anthropic published a policy overview describing its three-pronged approach to election-related AI misuse in 2024: enforcing acceptable use policies that prohibit political campaigning and influence operations, red-teaming models for election-specific vulnerabilities including misinformation and voter suppression prompts, and redirecting users asking voting questions to authoritative nonpartisan sources like TurboVote and the European Parliament's elections site. The post was updated in May 2024 to cover EU users following Claude's European launch and to clarify usage policy definitions around political lobbying. The piece reflects Anthropic's cautious stance on generative AI in high-stakes civic contexts, including explicit acknowledgment of hallucination risks for real-time election information.

AI Safety Research Regulatory Developments Claude European Parliament Democracy Works +2 more

5Anthropic News·17d ago·source ↗

Anthropic publishes elections-risk testing methodology and releases automated evaluation tools

Anthropic describes its two-stage process for identifying and mitigating elections-related risks in Claude: qualitative 'Policy Vulnerability Testing' (PVT) conducted with external subject matter experts, followed by large-scale automated evaluations. The post details how findings from PVT inform mitigation strategies such as policy updates, model fine-tuning, and response behavior changes, with a case study on election administration accuracy. Anthropic is also releasing some of its automated evaluation tools publicly to help other organizations improve election integrity efforts.

Evaluation and Benchmarking AI Safety Research Isabelle Frances-Wright Claude Policy Vulnerability Testing +3 more

6Anthropic News·16d ago·source ↗

Anthropic publishes 2024 election safety retrospective with Clio usage analysis

Anthropic released a post-mortem on AI and elections in 2024, covering their safety policies, red-teaming efforts, and enforcement actions across global elections. Election-related activity constituted less than 0.5% of overall Claude usage, rising to just over 1% around the US election, with approximately 100 enforcement actions globally. The report introduces Clio, an automated tool for analyzing real-world usage patterns, and documents a case study on handling knowledge cutoff limitations during France's snap elections. The piece represents Anthropic's first systematic public accounting of election-related AI safety work at scale.

AI Safety Research Regulatory Developments Claude Sonnet 3.5 Clio Claude Opus 4.6 +4 more

6Anthropic News·1mo ago·source ↗

Anthropic Updates Election Safeguards for Claude Ahead of 2026 US Midterms

Anthropic has published an update on its election-related safety measures for Claude, covering political bias evaluations, usage policy enforcement, and influence operation resistance testing. New model versions Claude Opus 4.7 and Sonnet 4.6 scored 95-96% on political impartiality evaluations and handled election-related policy compliance at 99.8-100% on a 600-prompt test suite. For the first time, Anthropic tested whether models can autonomously run influence operations end-to-end, finding that only Mythos Preview and Opus 4.7 completed more than half of tasks when safeguards were removed, underscoring ongoing capability concerns. Anthropic is also deploying election information banners pointing users to nonpartisan resources like TurboVote for the 2026 US midterms.

Frontier Model Releases Evaluation and Benchmarking Collective Intelligence Project Claude Sonnet 4 Claude Opus 4.6 +9 more

5Anthropic News·16d ago·source ↗

Anthropic updates Usage Policy with election integrity, high-risk use case, and privacy rules

Anthropic revised its Acceptable Use Policy (renamed Usage Policy), effective June 6, 2024, consolidating prohibited-use categories into 'Universal Usage Standards.' Key changes include explicit bans on AI-assisted election interference and political campaigning, new safety requirements for high-risk use cases (healthcare, legal), expanded access for minors via API partners with safety disclosures, and stronger privacy protections including prohibitions on biometric inference and government-directed censorship. The update reflects both evolving regulatory context and Anthropic's stated safety mission.

AI Safety Research Regulatory Developments Anthropic Usage Policy Anthropic

5Anthropic News·18d ago·source ↗

Anthropic Updates Usage Policy: Agentic Use, Cybersecurity, and Political Content

Anthropic has revised its Usage Policy effective September 15, 2025, with changes addressing agentic and cybersecurity risks, political content restrictions, law enforcement use clarity, and high-risk consumer-facing requirements. New sections explicitly prohibit malicious computer/network compromise activities while supporting legitimate security research, responding to the rapid expansion of agentic tools like Claude Code and Computer Use. The policy also narrows its previous blanket ban on political content to focus specifically on deceptive or voter-targeting uses, enabling legitimate civic and policy research. High-risk safeguards (human-in-the-loop, AI disclosure) are clarified to apply only to consumer-facing outputs, not B2B interactions.

AI Safety Research Enterprise Deployment Patterns Anthropic Usage Policy Claude Claude Code +5 more

7Anthropic News·16d ago·source ↗

Anthropic launches initiative to fund third-party AI safety evaluations

Anthropic announced a funded initiative to source third-party evaluations measuring advanced AI capabilities and safety risks, with priority areas including cybersecurity, CBRN threats, model autonomy, national security risks, social manipulation, and misalignment. The initiative is tied to Anthropic's Responsible Scaling Policy and AI Safety Level (ASL) framework, aiming to address a gap between demand and supply of high-quality safety-relevant evals. Proposals are solicited via an application form, with Anthropic framing the effort as benefiting the broader AI safety ecosystem rather than just internal use.

Evaluation and Benchmarking AI Safety Research METR Google-Proof Q&A Responsible Scaling Policy +1 more

6Anthropic News·16d ago·source ↗

Anthropic publishes policy brief calling for targeted AI regulation within 18 months

Anthropic published a policy position paper arguing that governments have an 18-month window to enact narrowly-targeted AI regulation before risks in cyber and CBRN domains become acute. The post cites rapid capability gains—SWE-bench scores rising from 1.96% to 49% in a year, GPQA scores approaching human expert level—as evidence that frontier models are approaching meaningful misuse thresholds. Anthropic also reviews its Responsible Scaling Policy as a model for adaptive, proportionate risk governance and calls for similar frameworks to be adopted industry-wide and codified in law.

AI Safety Research Regulatory Developments Anthropic Policy Frontier Red Team Claude 3.5 Sonnet UK AI Security Institute +5 more