Anthropic Usage Policy
anthropic-usage-policy-0a916e61·3 events·first seen 15d agoAliases: Anthropic Usage Policy
Co-occurring entities
More like this (12)
Recent events (3)
Anthropic Updates Usage Policy: Agentic Use, Cybersecurity, and Political Content
Anthropic has revised its Usage Policy effective September 15, 2025, with changes addressing agentic and cybersecurity risks, political content restrictions, law enforcement use clarity, and high-risk consumer-facing requirements. New sections explicitly prohibit malicious computer/network compromise activities while supporting legitimate security research, responding to the rapid expansion of agentic tools like Claude Code and Computer Use. The policy also narrows its previous blanket ban on political content to focus specifically on deceptive or voter-targeting uses, enabling legitimate civic and policy research. High-risk safeguards (human-in-the-loop, AI disclosure) are clarified to apply only to consumer-facing outputs, not B2B interactions.
Anthropic updates Usage Policy with election integrity, high-risk use case, and privacy rules
Anthropic revised its Acceptable Use Policy (renamed Usage Policy), effective June 6, 2024, consolidating prohibited-use categories into 'Universal Usage Standards.' Key changes include explicit bans on AI-assisted election interference and political campaigning, new safety requirements for high-risk use cases (healthcare, legal), expanded access for minors via API partners with safety disclosures, and stronger privacy protections including prohibitions on biometric inference and government-directed censorship. The update reflects both evolving regulatory context and Anthropic's stated safety mission.
Anthropic Details Claude Safeguards Team Structure and Multi-Layer Safety Approach
Anthropic has published a detailed overview of its internal Safeguards team, describing a multi-layer approach to preventing Claude misuse that spans policy development, model training influence, pre-deployment evaluation, and real-time enforcement. The team uses a Unified Harm Framework covering five dimensions (physical, psychological, economic, societal, autonomy) and conducts Policy Vulnerability Testing with external domain experts in areas like terrorism, child safety, and mental health. Pre-deployment evaluations include safety assessments, CBRNE-focused AI capability uplift testing with government partners, and bias evaluations. The post describes specific partnerships with organizations like the Institute for Strategic Dialogue and ThroughLine to inform election integrity and mental health response policies.