7The Batch (DeepLearning.AI)·1mo ago

U.S. Government to Pre-Deployment Evaluate Frontier AI Models via NIST TRAINS Task Force

The U.S. National Institute of Standards and Technology (NIST) announced a new multi-agency task force called TRAINS (Testing Risks of AI for National Security) to assess national-security risks from frontier AI models before public deployment. Major AI companies including Google, Microsoft, xAI, Anthropic, and OpenAI have agreed to submit models—including versions with limited guardrails—for evaluation focused on cybersecurity, biosecurity, and chemical weapons risks. The White House is also considering an executive order requiring pre-deployment approval for AI models. TRAINS draws on multiple federal agencies and differs from prior NIST groups in its rapid-response design, though its specific benchmarks have not been disclosed.

Frontier Model Releases Evaluation and Benchmarking AI Safety Research Regulatory Developments DeepSeek V4 Microsoft Google PortBench CAISI xAI OpenAI TRAINS Anthropic NIST

Related guides (4)

OpenAI

OpenAI: The Lab That Made AI a Household Word

Read asBeginner

DeepSeek V4

DeepSeek V4: The Open-Weights Giant Reshaping AI Economics

Read asBeginner In-depth

Microsoft

Microsoft: The AI Infrastructure Giant Betting on Every Horse

Read asBeginner In-depth

Frontier Model ReleasesTopic guide

Frontier Model Releases: The Race from GPT-3 to Safety-Tiered Superintelligence

Read asIn-depth

Related events (8)

7The Batch·1mo ago·source ↗

U.S. Government to Pre-Release Test AI Models for National Security Risks via NIST TRAINS Task Force

NIST announced a new multi-agency task force called TRAINS (Testing Risks of AI for National Security), overseen by its Center for AI Standards and Innovation, to evaluate frontier AI models for cybersecurity, biosecurity, and chemical weapons risks before public deployment. Google, Microsoft, xAI, Anthropic, and OpenAI have voluntarily agreed to submit models with limited guardrails for evaluation. The policy shift follows Anthropic's announcement that Claude Mythos Preview can autonomously exploit software vulnerabilities, and marks a sharp reversal from the Trump Administration's earlier deregulatory stance. The White House is also considering an executive order that would make pre-release government testing mandatory.

Frontier Model Releases Evaluation and Benchmarking White House Center for AI Standards and Innovation DeepSeek V4 +11 more

7The Batch·19d ago·source ↗

US Government Prepares AI Model Vetting System; GPT-5.5 Instant, Claude Finance Agents, Pentagon AI Partnerships

The White House is preparing an executive order to create an FDA-style vetting system for new AI models, prompted partly by Anthropic's Mythos model disclosing cybersecurity risks; the Commerce Department separately expanded a voluntary testing program with Google, Microsoft, and xAI. OpenAI rolled out GPT-5.5 Instant as the default ChatGPT model, claiming 52.5% fewer hallucinations on high-stakes prompts. Anthropic released ten financial agent templates running on Claude Opus 4.7, while the Pentagon expanded AI vendor agreements to include Microsoft, Amazon, Nvidia, and Reflection AI after canceling its Anthropic contract over autonomous weapons restrictions. Major pharma companies report AI gains primarily in manufacturing optimization rather than drug discovery breakthroughs.

Frontier Model Releases Evaluation and Benchmarking Vals AI Finance Agent Benchmark White House Darius Amodei +23 more

7Don'T Worry About The Vase·17d ago·source ↗

Trump Signs Executive Order Requiring AI Testing Prior to Frontier Model Releases

Zvi Mowshowitz analyzes a new Executive Order signed by President Trump that mandates AI testing prior to frontier model releases. The commentary covers the policy's scope, implications for major AI labs, and how it fits into the broader regulatory landscape for frontier AI development. This represents a significant federal policy action directly affecting the deployment pipeline for advanced AI systems.

AI Safety Research Regulatory Developments Donald Trump U.S. Government Zvi Mowshowitz

5Anthropic News·17d ago·source ↗

Anthropic publishes frontier model security recommendations including multi-party authorization and secure development frameworks

Anthropic released a policy and technical guidance document outlining cybersecurity best practices for securing frontier AI models, including multi-party authorization to AI-critical infrastructure, adoption of NIST SSDF and SLSA supply chain standards, and public-private cooperation modeled on critical infrastructure sectors. The post argues that advanced AI models warrant security levels far exceeding standard commercial practices and recommends government procurement requirements as a near-term enforcement mechanism. Anthropic states it is actively implementing these controls internally and calls on other labs and governments to adopt similar frameworks.

AI Safety Research Regulatory Developments Supply Chain Levels for Software Artifacts National Institute of Standards and Technology NIST Secure Software Development Framework +1 more

6Openai Blog·17d ago·source ↗

OpenAI proposes federal governance blueprint for frontier AI safety and national security

OpenAI published a policy blueprint calling for a U.S. federal framework to govern frontier AI, covering safety, resilience, and national security dimensions. The proposal outlines OpenAI's vision for democratic oversight of the most capable AI systems. As a tier-1 primary source from a leading lab, this represents a significant public policy position that will likely influence regulatory discussions.

AI Safety Research Regulatory Developments OpenAI

5Openai Blog·1mo ago·source ↗

Frontier AI regulation: Managing emerging risks to public safety

OpenAI published a policy position on regulating frontier AI systems, focusing on managing emerging risks to public safety. The piece outlines OpenAI's perspective on how governments and regulatory bodies should approach oversight of the most capable AI models. This represents a formal public stance from a leading AI lab on the shape of future AI governance frameworks.

AI Safety Research Regulatory Developments OpenAI

6Anthropic News·19d ago·source ↗

Anthropic Responds to White House AI Action Plan, Calls for Transparency Standards and Export Controls

Anthropic published a policy response to the White House's 'Winning the Race: America's AI Action Plan,' endorsing its focus on AI infrastructure, federal adoption, and safety research while urging additional steps on export controls and mandatory AI development transparency standards. The company highlighted alignment between the plan and its prior OSTP submissions, and noted its proactive activation of ASL-3 protections with Claude Opus 4 as evidence that safety and innovation are compatible. Anthropic called for a single national standard for frontier model transparency rather than a state-by-state patchwork, and encouraged continued investment in NIST's CAISI for evaluating frontier models on national security risks including CBRN capabilities.

Frontier Model Releases AI Safety Research Claude Opus 4.6 Center for AI Standards and Innovation Office of Management and Budget +9 more

5Anthropic News·16d ago·source ↗

Anthropic submits AI accountability recommendations to NTIA, covering evals, red teaming, and pre-registration

Anthropic submitted a formal response to the NTIA's Request for Comment on AI Accountability, outlining a multi-part policy framework for governing advanced AI systems. Key recommendations include increased government funding for evaluation research, mandatory disclosure of evaluation methods, pre-registration of large training runs with national governments, mandated external red teaming before model release, and antitrust guidance to enable industry safety collaboration. The submission reflects Anthropic's core policy positions and advocates for risk-tiered oversight proportional to model capabilities.

Evaluation and Benchmarking AI Safety Research National Institute of Standards and Technology National Telecommunications and Information Administration Anthropic +1 more