Entity · other

Sparrow Principles

otheractivesparrow-principles-6fe87eb9·1 events·first seen Jun 1, 2026

Aliases: Sparrow Principles

Co-occurring entities

DeepMind Constitutional AI Claude Reinforcement Learning from Human Feedback UN Declaration of Human Rights Claude's constitution scalable oversight Anthropic

More like this (12)

FAIR Principles Pang Principle Under-18 Principles SPEAR Matching Principle DRY principle SPEARBench PRISMA DSR Foundation Model ClinPRISM Moral Foundations Theory Schwartz's Theory of Basic Human Values

Recent events (1)

7Anthropic News·Jun 1, 2026·source ↗

Anthropic Publishes Updated Claude's Constitution (Jan 2026 Revision)

Anthropic has released an updated version of Claude's Constitution, the explicit set of principles governing Claude's values and behavior under the Constitutional AI (CAI) framework. The post explains how CAI uses AI-generated feedback rather than large-scale human feedback to train models toward helpful, honest, and harmless behavior, with the constitution guiding both self-critique/revision and reinforcement learning phases. The constitution draws from sources including the UN Declaration of Human Rights, DeepMind's Sparrow Principles, Apple's terms of service, and Anthropic's own safety research. Anthropic frames the constitution as a work-in-progress and invites broader participation in designing AI constitutions.

Evaluation and Benchmarking AI Safety Research DeepMind Constitutional AI Claude +7 more