6Anthropic News·26d ago

Anthropic co-founder Chris Olah speaks at Vatican on Pope Leo XIV's AI encyclical 'Magnifica humanitas'

Pope Leo XIV released an encyclical titled 'Magnifica humanitas: On safeguarding the human person in the time of artificial intelligence' on May 25, 2026, and Anthropic co-founder Chris Olah was invited to speak at its presentation in Vatican City. Olah acknowledged that frontier AI labs operate under incentives that can conflict with doing the right thing, and called for external moral voices—including religious institutions—to serve as informed critics of AI development. He highlighted three areas requiring discernment: AI's impact on the global poor and labor displacement, the conditions for human flourishing in an AI-saturated world, and the uncertain nature of AI models themselves, noting that his interpretability research has found internal states that functionally mirror emotions. The remarks represent Anthropic's effort to broaden the AI governance conversation beyond the technical community.

AI Safety Research Regulatory Developments Alignment and RLHF mechanistic interpretability Magnifica humanitas Vatican City Catholic Church Chris Olah Pope Leo XIV Anthropic

Related guides (4)

mechanistic interpretabilityConcept

Mechanistic Interpretability: Looking Inside the AI Black Box

Read asBeginner In-depth

Anthropic

Anthropic: The AI Safety Company at the Center of the Frontier

Read asBeginner

AI Safety ResearchTopic guide

AI Safety Research: From Lab Policies to Real-World Flashpoints

Read asBeginner In-depth

Regulatory DevelopmentsTopic guide

AI Regulatory Developments: From Voluntary Frameworks to Government Enforcement

Read asIn-depth

Related events (8)

4Don'T Worry About The Vase·25d ago·source ↗

RTMH: Pope Leo's Magnifica Humanitas on AI

Zvi Mowshowitz's commentary covers Pope Leo's papal document 'Magnifica Humanitas' addressing AI. The piece analyzes the Catholic Church's formal position on artificial intelligence as expressed through a significant ecclesiastical document. This represents a notable religious institution staking out a substantive stance on AI development and ethics.

AI Safety Research Regulatory Developments Pope Leo Magnifica humanitas Zvi Mowshowitz +1 more

5Simon Willison'S Weblog·25d ago·source ↗

Notes on Pope Leo XIV's Encyclical on AI

Simon Willison comments on Pope Leo XIV's encyclical addressing artificial intelligence, marking a significant moment in which the Catholic Church formally engages with AI ethics and governance at the highest doctrinal level. An encyclical represents one of the most authoritative forms of papal teaching, suggesting the Church is staking out a formal position on AI's societal and moral implications. The body of the source item appears empty, so the specific content of Willison's notes is unavailable, but the event itself signals growing institutional and religious engagement with AI policy.

AI Safety Research Regulatory Developments Simon Willison Catholic Church Pope Leo XIV

5Anthropic News·1mo ago·source ↗

Anthropic Launches Multi-Tradition Dialogue Program on AI Moral Formation

Anthropic has begun a structured outreach program engaging scholars, clergy, philosophers, and ethicists from over 15 religious and cross-cultural traditions to inform Claude's character development and values training. The initiative is framed as a research workstream on 'moral formation' of AI systems, directly feeding into Claude's constitution and alignment evaluations. A concrete experiment emerged from these dialogues: giving Claude a mid-task tool that surfaces its own ethical commitments, which showed measurably lower rates of misaligned behavior on internal evaluations. Anthropic plans to expand engagement to legal scholars, psychologists, and civic institutions, with future discussions addressing AI's impact on work, institutions, and power distribution.

AI Safety Research Alignment and RLHF Claude Claude's constitution ethical commitment reminder tool +1 more

6Anthropic News·17d ago·source ↗

Dario Amodei calls for stronger AI safety focus at Paris AI Action Summit

Anthropic CEO Dario Amodei issued a statement following the Paris AI Action Summit, expressing concern that the event underweighted critical issues including democratic leadership in AI, CBRN and autonomous-risk governance, and labor market disruption. Amodei forecasts that by 2026-2027 AI capabilities may be equivalent to 'a country of geniuses in a datacenter,' framing this as both an opportunity and an urgent governance challenge. He called for governments to enforce transparency of frontier lab safety plans, fund third-party evaluations, and monitor economic impacts—pointing to Anthropic's newly released Economic Index as a model. The statement also reaffirmed Anthropic's Responsible Scaling Policy as the first of its kind among frontier labs.

AI Safety Research Regulatory Developments Dario Amodei French Government Responsible Scaling Policy +3 more

6Anthropic News·19d ago·source ↗

Anthropic Launches The Anthropic Institute for AI Societal Impact Research

Anthropic is establishing The Anthropic Institute, a new interdisciplinary research body led by co-founder Jack Clark in his new role as Head of Public Benefit. The Institute consolidates and expands three existing Anthropic teams—Frontier Red Team, Societal Impacts, and Economic Research—to study AI's effects on economies, jobs, governance, and legal systems. Notable founding hires include Matt Botvinick (AI and rule of law), Anton Korinek (transformative AI economics), and Zoë Hitzig (AI social/economic impacts). Anthropic is simultaneously expanding its Public Policy organization and opening a Washington DC office.

Evaluation and Benchmarking AI Safety Research Dario Amodei Frontier Red Team Zoë Hitzig +12 more

6Anthropic News·17d ago·source ↗

Anthropic policy recap: US Executive Order, G7 Code of Conduct, and Bletchley Park AI Safety Summit

Anthropic published a policy commentary summarizing three major AI governance events from late October/early November 2023: the US Executive Order on AI, the G7 International Code of Conduct for advanced AI developers, and the UK-hosted Bletchley Park AI Safety Summit. The post covers Anthropic's positions on each, including support for NIST capacity-building, the G7 Code of Conduct, and the newly announced UK and US AI Safety Institutes. Dario Amodei presented Anthropic's Responsible Scaling Policy at Bletchley as a potential regulatory prototype, and the 28-country Bletchley Declaration notably included China among its signatories.

AI Safety Research Regulatory Developments Dario Amodei Trump Administration Executive Order on AI Bletchley Declaration +6 more

5Anthropic News·18d ago·source ↗

Anthropic responds to California Governor Newsom's AI working group draft report

Anthropic published a formal response to the California Governor's Working Group on AI Frontier Models draft report, endorsing its emphasis on transparency and evidence-based policy. Anthropic argues that light-touch mandatory disclosure of safety and security practices would be beneficial without impeding innovation, noting that current voluntary practices are uneven across frontier labs. The response also references Anthropic's Responsible Scaling Policy and Economic Index as examples of existing transparency efforts, and signals urgency given Anthropic's view that powerful AI systems may arrive as early as end of 2026.

AI Safety Research Regulatory Developments Gavin Newsom California Working Group on AI Frontier Models Responsible Scaling Policy +2 more

6Anthropic News·17d ago·source ↗

Anthropic commits to signing the EU General-Purpose AI Code of Practice

Anthropic announced its intention to sign the EU's General-Purpose AI Code of Practice, citing alignment with its existing Responsible Scaling Policy on transparency, safety, and accountability. The company frames the Code's mandatory Safety and Security Frameworks—including CBRN risk assessment—as complementary to its own internal standards. Anthropic also signals continued collaboration with the EU AI Office and third-party bodies like the Frontier Model Forum to keep standards adaptive as the technology evolves.

AI Safety Research Regulatory Developments EU AI Act EU General-Purpose AI Code of Practice Frontier Model Forum +3 more