Almanac
Topic guide · Beginner

AI Regulatory Developments: From Voluntary Frameworks to Government Enforcement

Regulatory DevelopmentsBeginneractive·v1 · live·generated 6d ago
TL;DRAI regulation has shifted from voluntary industry commitments to active government enforcement — with the U.S. government banning one major lab from defense contracts, ordering access suspensions over jailbreak concerns, and standing up a new pre-deployment testing regime. The central tension is no longer whether AI should be regulated, but who sets the rules, how fast, and whether safety guardrails are a feature or a liability in the eyes of government customers.

Key takeaways

  • The U.S. Department of War used a national-security supply-chain designation — previously reserved for foreign companies — against Anthropic, a U.S. lab, over its refusal to enable autonomous weapons and mass domestic surveillance.
  • NIST's new TRAINS task force (announced May 2026) requires Google, Microsoft, xAI, Anthropic, and OpenAI to submit frontier models for pre-deployment national-security evaluation — a sharp reversal from earlier deregulatory posture.
  • A Trump executive order signed June 2026 mandates AI testing before frontier model releases, and the White House separately considered an FDA-style pre-approval regime.
  • California's SB 53, effective January 1 2026, became the first mandatory frontier AI safety and transparency law in the U.S., prompting Anthropic to publish a formal Frontier Compliance Framework.
  • The EU AI Act's high-risk compliance deadline was pushed from August 2026 to December 2027 following industry lobbying, while a new ban on AI-generated child sexual abuse material was added.
  • China blocked Meta's $2.5B acquisition of Singapore-based Manus, asserting jurisdiction over AI technology built by Chinese engineers regardless of corporate domicile — effectively killing the 'Singapore strategy' for Chinese AI startups.

What this area covers

AI regulatory developments span the rules, enforcement actions, and governance frameworks that governments and companies are building around artificial intelligence — covering everything from voluntary safety policies that labs publish themselves, to executive orders, national security designations, state laws, and international frameworks like the EU AI Act.

This is no longer a quiet policy backwater. Within roughly eighteen months, the U.S. government has banned a major AI lab from defense contracts, ordered model access suspended over security concerns, and stood up a new pre-deployment testing regime. The story is moving fast, and the stakes are high.

Why it matters

If you work with AI — building it, buying it, or deploying it — the regulatory environment now directly shapes what you can do. Labs are being told which customers they can serve, which use cases are off-limits, and in some cases whether they can release a model at all. For businesses, the compliance picture is fragmenting: California has its own law, the EU has its own timeline, and the U.S. federal government is improvising in real time.

The voluntary era: labs set their own rules

The story starts with AI labs writing their own rulebooks. Anthropic published its Responsible Scaling Policy (RSP), a framework that assigns AI Safety Levels (ASL-1 through ASL-5+) to models based on how dangerous their capabilities are — borrowed conceptually from biosafety level standards. The idea: as models get more powerful, they face stricter deployment constraints. In May 2025, Anthropic activated ASL-3 protections for Claude Opus 4 for the first time, adding "Constitutional Classifiers" to block end-to-end workflows for developing chemical, biological, radiological, or nuclear weapons.

OpenAI followed a parallel path, publishing its own Frontier Governance Framework explicitly aligned to EU and California regulatory requirements. Both companies framed these as good-faith efforts to get ahead of government mandates — and as templates they hoped governments would adopt.

The first mandatory law: California SB 53

California moved first among U.S. jurisdictions. SB 53 — the Transparency in Frontier AI Act — took effect January 1, 2026, requiring frontier AI developers to conduct formal risk assessments, protect model weights, and report incidents. Anthropic published a detailed Frontier Compliance Framework in December 2025 to meet its requirements, framing it as an extension of the RSP. The company also called on the federal government to create an analogous national framework — a sign that labs, at least publicly, prefer one clear set of rules over a patchwork of state laws.

The military standoff: when safety policies collide with government demands

The most dramatic regulatory episode in this period began quietly: the U.S. Department of War (the renamed Department of Defense) wanted Anthropic to agree to "any lawful use" of Claude and remove two specific safeguards — one covering mass domestic surveillance of Americans, one covering fully autonomous weapons.

Anthropic refused. CEO Dario Amodei published a public statement explaining that Claude was already deployed across defense and intelligence systems for intelligence analysis, operational planning, and cyber operations — but that these two uses crossed a line the company would not cross, citing both democratic values and the current reliability limitations of AI systems.

The government's response was unprecedented: it invoked a supply-chain risk designation under 10 USC 3252 — a legal tool previously used only against foreign companies — against a U.S. AI lab. A Trump Truth Social post threatened civil and criminal consequences. Secretary of War Pete Hegseth formalized the designation. Anthropic announced it would challenge the designation in court while continuing to serve the government at nominal cost during any transition.

OpenAI, meanwhile, signed a contract with the Department of War allowing use of its models "for all lawful purposes," with carve-outs for surveillance and autonomous weapons that CEO Sam Altman later described as rushed and subsequently renegotiated. OpenAI also deployed a custom ChatGPT instance on GenAI.mil and, through a GSA partnership, made ChatGPT Enterprise available to the entire U.S. federal executive branch workforce at essentially no cost for one year.

Export controls and jailbreak disputes

The government's reach extended further in June 2026, when a U.S. export control directive ordered Anthropic to immediately disable access to its Fable 5 and Mythos 5 models for all foreign nationals — effectively forcing a full customer suspension. The government cited awareness of a jailbreak method. Anthropic disputed the severity, arguing the technique was narrow, non-universal, and produced results already achievable by other publicly available models. The company complied while publicly disagreeing, and warned that a standard of perfect jailbreak resistance would halt all frontier model deployments industry-wide.

Reports also surfaced that conversations between Amazon CEO Andy Jassy and U.S. officials had contributed to the crackdown — highlighting how the relationships between major cloud providers, AI labs, and government are deeply entangled.

Pre-deployment testing: the TRAINS task force

In May 2026, NIST announced the TRAINS (Testing Risks of AI for National Security) task force — a multi-agency group designed to evaluate frontier models for cybersecurity, biosecurity, and chemical weapons risks before public deployment. Google, Microsoft, xAI, Anthropic, and OpenAI all agreed to submit models, including versions with limited guardrails. The White House simultaneously considered an executive order making pre-deployment approval mandatory — an FDA-style regime for AI.

In June 2026, President Trump signed an executive order requiring AI testing prior to frontier model releases, marking a significant reversal from the administration's earlier deregulatory stance.

The EU AI Act: delays and additions

Across the Atlantic, the EU AI Act — the world's most comprehensive AI law — hit implementation turbulence. The European Parliament and member states agreed to push the high-risk AI system compliance deadline from August 2026 to December 2027, and extended other deadlines for watermarking and sandbox environments. The changes followed sustained lobbying from European industry concerned about competitive disadvantage. One provision was strengthened rather than weakened: a new ban on AI-generated sexually explicit images of children and non-consensual nude images.

China's approach: asserting control across borders

China's regulatory posture is distinct. When Meta attempted to acquire Manus — a Singapore-based AI agent startup originally founded in China — China's National Development and Reform Commission blocked the $2.5 billion deal, asserting jurisdiction over technology built by Chinese engineers regardless of corporate domicile. The ruling effectively ended the "Singapore strategy" used by Chinese AI startups to attract Western capital, and caused founders and investors to cancel plans to relocate abroad.

Separately, Anthropic publicly identified three Chinese AI laboratories — DeepSeek, Moonshot AI, and MiniMax — as conducting coordinated large-scale "distillation attacks," generating over 16 million exchanges through approximately 24,000 fraudulent accounts to harvest Claude's capabilities. The White House acknowledged the distillation threat in an April 2026 memo, framing it as an adversarial national security concern.

The emerging picture

Several fault lines are now visible:

Safety guardrails as a liability. The Anthropic-DoW standoff established a precedent: a lab's own safety policies can become grounds for government exclusion. Labs that refuse certain uses face contract loss; labs that comply face reputational and legal risk from the uses they enable.

Fragmented compliance. California has mandatory rules. The EU has a delayed but comprehensive framework. The U.S. federal government is layering executive orders, export controls, and task forces without a unified statute. Labs are navigating all of these simultaneously.

Pre-deployment approval is on the table. The shift from "labs self-certify" to "government tests before release" is underway. How fast it moves, and how prescriptive it becomes, will define the next phase of AI development.

Geopolitical dimensions are inseparable. Export controls, distillation attacks, and China's blocking of foreign acquisitions mean that AI regulation is now as much about great-power competition as it is about consumer protection or safety.

The regulatory pressure map: who is pushing on whom

How major labs are positioned under the new regulatory landscape

LabU.S. Defense statusPre-deployment testing (TRAINS)Key voluntary frameworkNotable regulatory event
AnthropicBanned from DoW contracts; supply-chain risk designationAgreed to submit modelsResponsible Scaling Policy v3.0; Frontier Compliance Framework (CA SB 53)Export control order suspending Fable 5 & Mythos 5 (Jun 2026)
OpenAIActive DoW contract; ChatGPT on GenAI.mil; GSA federal workforce dealAgreed to submit modelsFrontier Governance Framework (aligned to EU & CA rules)Musk lawsuit dismissed; PBC restructuring completed
GoogleAgreed to TRAINS submissionAgreed to submit modelsDeep Anthropic investment; EU AI Act compliance planning

Synthesized from the events bundle; unknown cells render —.

Timeline

  1. Anthropic publishes Responsible Scaling Policy with AI Safety Level framework

  2. Anthropic activates ASL-3 safety protections for Claude Opus 4 — first model deployed at that level

  3. Anthropic publishes Frontier Compliance Framework ahead of California SB 53 taking effect

  4. Anthropic refuses DoD demand to remove safeguards on autonomous weapons and mass surveillance

  5. DoW formally designates Anthropic a supply-chain risk; OpenAI signs DoW contract

  6. NIST announces TRAINS task force for pre-deployment national-security AI testing

  7. EU AI Act high-risk compliance deadline delayed from Aug 2026 to Dec 2027

  8. Trump signs executive order requiring AI testing before frontier model releases

  9. U.S. government orders Anthropic to suspend Fable 5 and Mythos 5 access citing jailbreak concerns

Related topics

FAQ

What is the TRAINS task force and why does it matter?

TRAINS (Testing Risks of AI for National Security) is a NIST-led multi-agency group that evaluates frontier AI models for cybersecurity, biosecurity, and chemical weapons risks before public deployment. It matters because it marks the U.S. government's first structured move toward mandatory pre-release testing — a significant shift from the earlier hands-off approach.

Why was Anthropic banned from U.S. military contracts?

The Department of War designated Anthropic a supply-chain risk after the company refused to remove two usage restrictions from Claude: enabling fully autonomous weapons and mass domestic surveillance of Americans. OpenAI signed a contract without those restrictions and took over the work.

What is California's SB 53?

SB 53 is the first mandatory frontier AI safety and transparency law in the United States, effective January 1, 2026. It requires frontier AI developers to conduct risk assessments, protect model weights, and report incidents — Anthropic published a Frontier Compliance Framework specifically to meet its requirements.

What happened to the EU AI Act timeline?

The EU Parliament and member states agreed to delay the high-risk AI system compliance deadline from August 2026 to December 2027, and extended other deadlines for watermarking and sandbox environments, following lobbying from European industry. A new ban on AI-generated child sexual abuse material was added at the same time.

What does the export control order on Anthropic's models mean in practice?

The U.S. government ordered Anthropic to immediately disable access to Fable 5 and Mythos 5 for all foreign nationals, citing awareness of a jailbreak method. Anthropic is complying while publicly disputing the severity of the jailbreak, arguing that requiring perfect jailbreak resistance would halt all frontier model deployments industry-wide.

How is China regulating AI differently from the West?

China is asserting control over AI technology built by Chinese engineers regardless of where the company is incorporated — blocking Meta's acquisition of Singapore-based Manus on national security grounds and effectively ending the practice of Chinese AI startups relocating abroad to attract Western investment.

Stay current

Call Me Almanac pairs the week's AI news with guides like this one — Midweek & Sunday.

Versions

  • v1live6d ago

Related guides (4)

More on Regulatory Developments (6)

7The Batch·1mo ago·source ↗

U.S. Government to Pre-Release Test AI Models for National Security Risks via NIST TRAINS Task Force

NIST announced a new multi-agency task force called TRAINS (Testing Risks of AI for National Security), overseen by its Center for AI Standards and Innovation, to evaluate frontier AI models for cybersecurity, biosecurity, and chemical weapons risks before public deployment. Google, Microsoft, xAI, Anthropic, and OpenAI have voluntarily agreed to submit models with limited guardrails for evaluation. The policy shift follows Anthropic's announcement that Claude Mythos Preview can autonomously exploit software vulnerabilities, and marks a sharp reversal from the Trump Administration's earlier deregulatory stance. The White House is also considering an executive order that would make pre-release government testing mandatory.

7The Batch·22d ago·source ↗

European Union Regulators Delay Some AI Act Provisions, Delete Others

The European Parliament and member states agreed to amend the EU AI Act, delaying high-risk AI system compliance deadlines from August 2026 to December 2027 and extending other deadlines for watermarking, sandbox environments, and AI-driven products. The amendments also simplify compliance burdens for smaller companies, adjust personal data usage rules, and carve out exemptions for industrial machinery already covered by product-safety law. One area was strengthened: a new ban on AI-generated sexually explicit images of children and non-consensual nude images. The changes await formal adoption and follow sustained lobbying from European industry and two influential competitiveness reports.

9Anthropic News·7d ago·source ↗

US government orders Anthropic to suspend access to Fable 5 and Mythos 5 citing national security jailbreak concerns

The US government issued an export control directive requiring Anthropic to immediately disable Fable 5 and Mythos 5 for all foreign nationals, effectively forcing a full customer suspension to ensure compliance. The government cited awareness of a jailbreak method, but Anthropic disputes the severity, stating the demonstrated technique is a narrow, non-universal jailbreak that produces results already achievable by other publicly available models including GPT-5.5. Anthropic is complying with the directive while publicly disagreeing with the standard applied, arguing that requiring perfect jailbreak resistance would halt all frontier model deployments industry-wide. This is a significant regulatory and safety governance flashpoint involving government authority over commercial AI model access.

9Anthropic News·19d ago·source ↗

Anthropic Responds to Department of War Supply Chain Risk Designation

Anthropic CEO Dario Amodei issued a public statement after the U.S. Department of War formally designated Anthropic as a supply chain risk to national security, confirming the company will challenge the designation in court. Amodei clarified that the designation under 10 USC 3252 has narrow scope, affecting only direct use of Claude within Department of War contracts rather than all customers with such contracts. Anthropic committed to continuing to provide models to the Department of War and national security community at nominal cost during any transition period, while reiterating its two narrow usage exceptions: fully autonomous weapons and mass domestic surveillance. Amodei also apologized for a leaked internal post written on a difficult day, characterizing it as out-of-date and not reflecting his considered views.

7Anthropic News·19d ago·source ↗

Anthropic Publishes Frontier Compliance Framework for California's SB 53 Transparency in Frontier AI Act

Anthropic has released its Frontier Compliance Framework (FCF) in advance of California's SB 53 taking effect on January 1, 2026, which establishes the first mandatory frontier AI safety and transparency requirements in the US. The FCF covers risk assessment and mitigation for cyber, CBRN, and AI autonomy/control risks, tiered capability evaluation, model weight protection, and incident response. Anthropic frames the FCF as an evolution of its existing Responsible Scaling Policy, which will continue as a voluntary safety policy beyond regulatory minimums. The company also calls for a federal AI transparency framework with analogous requirements applied only to the largest frontier developers.

7The Batch·1mo ago·source ↗

China's Regulators Block Meta's Acquisition of Manus, an Agentic Startup Headquartered in Singapore

China's National Development and Reform Commission (NDRC) blocked Meta's proposed $2.5 billion acquisition of Manus, a Singapore-based AI agent startup originally founded in China by Butterfly Effect. The NDRC cited concerns over data transfers and foreign ownership of technology developed by Chinese engineers, asserting jurisdiction despite Manus having relocated to Singapore. The ruling has effectively killed the 'Singapore strategy' used by Chinese AI startups to attract Western capital, causing founders and investors to cancel plans to move abroad or pursue foreign partnerships. The episode marks a significant escalation in China's assertion of control over strategically important AI technology regardless of corporate domicile.