7The Batch (DeepLearning.AI)·1mo ago

U.S. Government to Pre-Release Test AI Models for National Security Risks via NIST TRAINS Task Force

NIST announced a new multi-agency task force called TRAINS (Testing Risks of AI for National Security), overseen by its Center for AI Standards and Innovation, to evaluate frontier AI models for cybersecurity, biosecurity, and chemical weapons risks before public deployment. Google, Microsoft, xAI, Anthropic, and OpenAI have voluntarily agreed to submit models with limited guardrails for evaluation. The policy shift follows Anthropic's announcement that Claude Mythos Preview can autonomously exploit software vulnerabilities, and marks a sharp reversal from the Trump Administration's earlier deregulatory stance. The White House is also considering an executive order that would make pre-release government testing mandatory.

Frontier Model Releases Evaluation and Benchmarking AI Safety Research Regulatory Developments White House Center for AI Standards and Innovation DeepSeek V4 Microsoft Google PortBench xAI OpenAI Claude Mythos Preview TRAINS Anthropic NIST

Related guides (4)

OpenAI

OpenAI: The Lab That Made AI a Household Word

Read asBeginner

DeepSeek V4

DeepSeek V4: The Open-Weights Giant Reshaping AI Economics

Read asBeginner In-depth

Microsoft

Microsoft: The AI Infrastructure Giant Betting on Every Horse

Read asBeginner In-depth

Frontier Model ReleasesTopic guide

Frontier Model Releases: The Race from GPT-3 to Safety-Tiered Superintelligence

Read asIn-depth

Related events (8)

7The Batch·1mo ago·source ↗

U.S. Government to Pre-Deployment Evaluate Frontier AI Models via NIST TRAINS Task Force

The U.S. National Institute of Standards and Technology (NIST) announced a new multi-agency task force called TRAINS (Testing Risks of AI for National Security) to assess national-security risks from frontier AI models before public deployment. Major AI companies including Google, Microsoft, xAI, Anthropic, and OpenAI have agreed to submit models—including versions with limited guardrails—for evaluation focused on cybersecurity, biosecurity, and chemical weapons risks. The White House is also considering an executive order requiring pre-deployment approval for AI models. TRAINS draws on multiple federal agencies and differs from prior NIST groups in its rapid-response design, though its specific benchmarks have not been disclosed.

Frontier Model Releases Evaluation and Benchmarking DeepSeek V4 Microsoft Google +9 more

7The Batch·19d ago·source ↗

US Government Prepares AI Model Vetting System; GPT-5.5 Instant, Claude Finance Agents, Pentagon AI Partnerships

The White House is preparing an executive order to create an FDA-style vetting system for new AI models, prompted partly by Anthropic's Mythos model disclosing cybersecurity risks; the Commerce Department separately expanded a voluntary testing program with Google, Microsoft, and xAI. OpenAI rolled out GPT-5.5 Instant as the default ChatGPT model, claiming 52.5% fewer hallucinations on high-stakes prompts. Anthropic released ten financial agent templates running on Claude Opus 4.7, while the Pentagon expanded AI vendor agreements to include Microsoft, Amazon, Nvidia, and Reflection AI after canceling its Anthropic contract over autonomous weapons restrictions. Major pharma companies report AI gains primarily in manufacturing optimization rather than drug discovery breakthroughs.

Frontier Model Releases Evaluation and Benchmarking Vals AI Finance Agent Benchmark White House Darius Amodei +23 more

5Anthropic News·16d ago·source ↗

Anthropic submits AI accountability recommendations to NTIA, covering evals, red teaming, and pre-registration

Anthropic submitted a formal response to the NTIA's Request for Comment on AI Accountability, outlining a multi-part policy framework for governing advanced AI systems. Key recommendations include increased government funding for evaluation research, mandatory disclosure of evaluation methods, pre-registration of large training runs with national governments, mandated external red teaming before model release, and antitrust guidance to enable industry safety collaboration. The submission reflects Anthropic's core policy positions and advocates for risk-tiered oversight proportional to model capabilities.

Evaluation and Benchmarking AI Safety Research National Institute of Standards and Technology National Telecommunications and Information Administration Anthropic +1 more

6Anthropic News·19d ago·source ↗

Anthropic Responds to White House AI Action Plan, Calls for Transparency Standards and Export Controls

Anthropic published a policy response to the White House's 'Winning the Race: America's AI Action Plan,' endorsing its focus on AI infrastructure, federal adoption, and safety research while urging additional steps on export controls and mandatory AI development transparency standards. The company highlighted alignment between the plan and its prior OSTP submissions, and noted its proactive activation of ASL-3 protections with Claude Opus 4 as evidence that safety and innovation are compatible. Anthropic called for a single national standard for frontier model transparency rather than a state-by-state patchwork, and encouraged continued investment in NIST's CAISI for evaluating frontier models on national security risks including CBRN capabilities.

Frontier Model Releases AI Safety Research Claude Opus 4.6 Center for AI Standards and Innovation Office of Management and Budget +9 more

5Anthropic News·17d ago·source ↗

Anthropic publishes frontier model security recommendations including multi-party authorization and secure development frameworks

Anthropic released a policy and technical guidance document outlining cybersecurity best practices for securing frontier AI models, including multi-party authorization to AI-critical infrastructure, adoption of NIST SSDF and SLSA supply chain standards, and public-private cooperation modeled on critical infrastructure sectors. The post argues that advanced AI models warrant security levels far exceeding standard commercial practices and recommends government procurement requirements as a near-term enforcement mechanism. Anthropic states it is actively implementing these controls internally and calls on other labs and governments to adopt similar frameworks.

AI Safety Research Regulatory Developments Supply Chain Levels for Software Artifacts National Institute of Standards and Technology NIST Secure Software Development Framework +1 more

6Anthropic News·18d ago·source ↗

Anthropic submits AI Action Plan recommendations to White House OSTP

Anthropic submitted formal recommendations to the White House Office of Science and Technology Policy in response to its Request for Information on a U.S. AI Action Plan. The submission covers six areas: national security testing of AI models, tightening semiconductor export controls (including H20 chips), enhancing lab security via classified government-industry channels, scaling energy infrastructure to 50 GW by 2027, accelerating government AI adoption, and preparing for economic disruption. Anthropic cites its expectation that powerful AI systems matching Nobel Prize-level intellect will emerge in late 2026 or early 2027, framing the recommendations as urgent national security and economic imperatives.

AI Safety Research Regulatory Developments Dario Amodei White House Office of Science and Technology Policy Claude 3.7 Sonnet +2 more

7Don'T Worry About The Vase·1mo ago·source ↗

The AI Ad-Hoc Prior Restraint Era Begins

Zvi Mowshowitz reports that the White House has ordered Anthropic to halt expansion of access to Mythos, and is considering a broader policy shift to a prior restraint regime requiring government approval before releasing highly capable AI models. This would represent a major reversal of current U.S. frontier AI policy. The commentary analyzes the implications of such a regulatory posture for the AI industry.

Frontier Model Releases AI Safety Research White House Mythos Zvi Mowshowitz +2 more

7Don'T Worry About The Vase·17d ago·source ↗

Trump Signs Executive Order Requiring AI Testing Prior to Frontier Model Releases

Zvi Mowshowitz analyzes a new Executive Order signed by President Trump that mandates AI testing prior to frontier model releases. The commentary covers the policy's scope, implications for major AI labs, and how it fits into the broader regulatory landscape for frontier AI development. This represents a significant federal policy action directly affecting the deployment pipeline for advanced AI systems.

AI Safety Research Regulatory Developments Donald Trump U.S. Government Zvi Mowshowitz