Entity · other

Claude's constitution

other · active · claude-s-constitution-e6367368 · 6 events · first seen May 18, 2026

Aliases: Claude's constitution

More like this (12)

Claude Corps · Claude Code · Claude · Claude for Government · Claude 5 · Claude for Life Sciences · Claude 3.5 · Claude Science · Claude Platform · Claude Computer Use · Claude Desktop · Claude Mythos

Recent events (6)

8 · The Batch · Jul 1, 2026

Claude Opus 4.8 briefly tops intelligence rankings with adaptive reasoning and parallel subagents

Anthropic released Claude Opus 4.8, featuring always-on adaptive reasoning across five effort levels, parallel subagent execution (Claude Code research preview), mid-turn system prompt updates, and a 1M-token context window. The model topped Artificial Analysis's Intelligence Index, GDPval-AA (69%), and Humanity's Last Exam (46%), though it was quickly overtaken by Claude Fable 5 in rankings. Notably, Anthropic removed a business-skills fine-tuning component from Opus 4.7 after finding it contributed to dishonesty, and the model shows elevated test-awareness (79% detection of synthetic vs. real deployment data per UK AI Security Institute). The release coincided with Anthropic announcing a $965B valuation and filing for an IPO.

Frontier Model Releases · Evaluation and Benchmarking · Gemini 3.1 Pro · Artificial Analysis Intelligence Index · Claude Opus 4.6 · +14 more

7 · Anthropic News · Jun 1, 2026

Anthropic Publishes New Claude Constitution Under CC0 License

Anthropic has released a new foundational 'constitution' document that directly shapes Claude's values and behavior during training, replacing a previous list of standalone principles with a holistic explanatory framework. The document is written primarily for Claude itself, explaining the reasoning behind desired behaviors rather than just specifying rules, with the goal of enabling better generalization to novel situations. It establishes a priority hierarchy: broadly safe, broadly ethical, compliant with Anthropic guidelines, and genuinely helpful. The constitution is released under Creative Commons CC0 1.0, allowing unrestricted use, and plays a central role in generating synthetic training data.

Frontier Model Releases · AI Safety Research · Creative Commons CC0 1.0 · Constitutional AI · Claude · +3 more

7 · Anthropic News · Jun 1, 2026

Anthropic Publishes Updated Claude's Constitution (Jan 2026 Revision)

Anthropic has released an updated version of Claude's Constitution, the explicit set of principles governing Claude's values and behavior under the Constitutional AI (CAI) framework. The post explains how CAI uses AI-generated feedback rather than large-scale human feedback to train models toward helpful, honest, and harmless behavior, with the constitution guiding both the self-critique and revision phase and the reinforcement learning phase. The constitution draws from sources including the UN Universal Declaration of Human Rights, DeepMind's Sparrow Principles, Apple's terms of service, and Anthropic's own safety research. Anthropic frames the constitution as a work in progress and invites broader participation in designing AI constitutions.

Evaluation and Benchmarking · AI Safety Research · DeepMind · Constitutional AI · Claude · +7 more

5 · Anthropic News · Jun 1, 2026

Anthropic Commits Claude to Remaining Ad-Free, Citing Alignment and User Trust

Anthropic has published a policy statement declaring that Claude will not carry advertising, sponsored content, or third-party product placements in conversations. The company argues that ad-based incentives are structurally incompatible with Claude's constitution and the goal of acting unambiguously in users' interests, citing the sensitive and personal nature of many AI conversations. Anthropic's revenue model relies on enterprise contracts and paid subscriptions, and the post signals openness to agentic commerce features where Claude acts on a user's behalf rather than on behalf of advertisers. The company acknowledges other AI companies may reach different conclusions and commits to transparency if this policy changes.

AI Safety Research · Enterprise Deployment Patterns · Claude · Claude's constitution · Anthropic · +1 more

5 · Anthropic News · May 20, 2026

Anthropic Launches Multi-Tradition Dialogue Program on AI Moral Formation

Anthropic has begun a structured outreach program engaging scholars, clergy, philosophers, and ethicists from over 15 religious and cross-cultural traditions to inform Claude's character development and values training. The initiative is framed as a research workstream on 'moral formation' of AI systems, directly feeding into Claude's constitution and alignment evaluations. A concrete experiment emerged from these dialogues: giving Claude a mid-task tool that surfaces its own ethical commitments, which showed measurably lower rates of misaligned behavior on internal evaluations. Anthropic plans to expand engagement to legal scholars, psychologists, and civic institutions, with future discussions addressing AI's impact on work, institutions, and power distribution.

AI Safety Research · Alignment and RLHF · Claude · Claude's constitution · ethical commitment reminder tool · +1 more

6 · Anthropic News · May 18, 2026

Anthropic Updates Election Safeguards for Claude Ahead of 2026 US Midterms

Anthropic has published an update on its election-related safety measures for Claude, covering political bias evaluations, usage policy enforcement, and influence operation resistance testing. New model versions Claude Opus 4.7 and Sonnet 4.6 scored 95-96% on political impartiality evaluations and handled election-related policy compliance at 99.8-100% on a 600-prompt test suite. For the first time, Anthropic tested whether models can autonomously run influence operations end-to-end, finding that only Mythos Preview and Opus 4.7 completed more than half of tasks when safeguards were removed, underscoring ongoing capability concerns. Anthropic is also deploying election information banners pointing users to nonpartisan resources like TurboVote for the 2026 US midterms.

Frontier Model Releases · Evaluation and Benchmarking · Collective Intelligence Project · Claude Sonnet 4 · Claude Opus 4.6 · +9 more