6 · OpenAI Blog · 1mo ago

OpenAI Launches Preparedness Team and Challenge for Catastrophic Risk

OpenAI announced the formation of a dedicated Preparedness team focused on evaluating and mitigating catastrophic risks from highly capable AI systems. The initiative includes a challenge to solicit external input on frontier risk scenarios. This represents a formal organizational commitment to tracking and preparing for severe AI safety risks beyond existing red-teaming efforts.

Frontier Model Releases · AI Safety Research · Preparedness Challenge · OpenAI · OpenAI Preparedness Team

Related guides (3)

OpenAI

OpenAI: The Lab That Made AI a Household Word

Frontier Model Releases · Topic guide

Frontier Model Releases: The Race From Language to Action

AI Safety Research · Topic guide

AI Safety Research: From Lab Policies to Real-World Flashpoints

Related events (8)

6 · OpenAI Blog · 1mo ago

OpenAI Updates Its Preparedness Framework

OpenAI has published an updated version of its Preparedness Framework, which governs how the company measures and mitigates severe risks from frontier AI capabilities. The framework sets thresholds and protocols for evaluating dangerous capability levels across domains such as CBRN, cybersecurity, and persuasion. This update reflects ongoing evolution in OpenAI's internal safety governance as frontier models grow more capable.

Frontier Model Releases · Evaluation and Benchmarking · Preparedness Framework · OpenAI

4 · OpenAI Blog · 1mo ago

Strengthening cyber resilience as AI capabilities advance

OpenAI published a post outlining its approach to cybersecurity risk as its models grow more capable, covering risk assessment frameworks, misuse mitigation, and collaboration with the security community. The piece addresses both offensive risk (AI-enabled attacks) and defensive applications. It represents OpenAI's public positioning on responsible deployment in a high-stakes domain.

AI Safety Research · Enterprise Deployment Patterns · OpenAI

6 · OpenAI Blog · 1mo ago

Preparing for future AI risks in biology

OpenAI has published a post outlining its proactive approach to assessing and mitigating biosecurity risks from advanced AI systems capable of biological applications. The piece describes capability evaluations and safeguards designed to prevent misuse of AI in biology and medicine. This reflects OpenAI's ongoing effort to get ahead of dual-use risks before capabilities reach dangerous thresholds.

Evaluation and Benchmarking · AI Safety Research · OpenAI · biology/medicine dual-use AI · AI biosecurity risk assessment

5 · OpenAI Blog · 1mo ago

Preparing for malicious uses of AI

OpenAI co-authored a multi-institutional paper forecasting how malicious actors could misuse AI technology, produced in collaboration with FHI, CSER, CNAS, EFF, and others over nearly a year. The paper outlines potential threat vectors and proposes prevention and mitigation strategies. This represents an early coordinated effort among AI safety and policy organizations to systematically address AI misuse risks.

AI Safety Research · Regulatory Developments · Center for a New American Security · Centre for the Study of Existential Risk · Electronic Frontier Foundation

5 · OpenAI Blog · 1mo ago

OpenAI Red Teaming Network

OpenAI is launching an open call for a Red Teaming Network, inviting domain experts to participate in ongoing safety evaluations of its models. The initiative aims to build a structured community of external red teamers who can help identify risks and failure modes across OpenAI's model releases. This represents a formalization of OpenAI's external adversarial testing program beyond one-off pre-release red teaming exercises.

Evaluation and Benchmarking · AI Safety Research · OpenAI Red Teaming Network · OpenAI

5 · OpenAI Blog · 1mo ago

OpenAI Expands External Safety Testing Ecosystem

OpenAI published a post describing its use of independent experts to evaluate frontier AI systems through third-party testing. The initiative aims to strengthen safety validation, verify safeguards, and increase transparency around capability and risk assessments. The announcement signals a continued push toward external accountability mechanisms for frontier model evaluation.

Evaluation and Benchmarking · AI Safety Research · OpenAI

7 · Anthropic News · 16d ago

Anthropic launches initiative to fund third-party AI safety evaluations

Anthropic announced a funded initiative to source third-party evaluations measuring advanced AI capabilities and safety risks, with priority areas including cybersecurity, CBRN threats, model autonomy, national security risks, social manipulation, and misalignment. The initiative is tied to Anthropic's Responsible Scaling Policy and AI Safety Level (ASL) framework, aiming to address a gap between demand and supply of high-quality safety-relevant evals. Proposals are solicited via an application form, with Anthropic framing the effort as benefiting the broader AI safety ecosystem rather than just internal use.

Evaluation and Benchmarking · AI Safety Research · METR · Google-Proof Q&A · Responsible Scaling Policy

6 · OpenAI Blog · 1mo ago

OpenAI Board Forms Safety and Security Committee

OpenAI's board of directors has established a new Safety and Security Committee, signaling a formal governance response to ongoing concerns about AI safety oversight. The committee is intended to advise the board on critical safety and security decisions as OpenAI continues to develop frontier models. This move follows scrutiny of OpenAI's internal safety culture and the dissolution of its prior Superalignment team.

AI Safety Research · Regulatory Developments · OpenAI Safety & Security Committee · OpenAI Board of Directors · OpenAI