Almanac
other

Sparrow Principles

otheractiveprovisionalsparrow-principles-6fe87eb9·1 events·first seen 15d ago

Aliases: Sparrow Principles

Co-occurring entities

More like this (12)

Recent events (1)

7Anthropic News·15d ago·source ↗

Anthropic Publishes Updated Claude's Constitution (Jan 2026 Revision)

Anthropic has released an updated version of Claude's Constitution, the explicit set of principles governing Claude's values and behavior under the Constitutional AI (CAI) framework. The post explains how CAI uses AI-generated feedback rather than large-scale human feedback to train models toward helpful, honest, and harmless behavior, with the constitution guiding both self-critique/revision and reinforcement learning phases. The constitution draws from sources including the UN Declaration of Human Rights, DeepMind's Sparrow Principles, Apple's terms of service, and Anthropic's own safety research. Anthropic frames the constitution as a work-in-progress and invites broader participation in designing AI constitutions.