6OpenAI Blog·1mo ago

Operator System Card

OpenAI published a system card for Operator, its autonomous web-browsing agent, detailing the multi-layered safety mitigations deployed. The document covers protections against prompt injection and jailbreaks, privacy and security measures, external red teaming results, and safety evaluations. It reflects OpenAI's established safety frameworks applied to an agentic product capable of taking real-world actions on behalf of users.

Evaluation and Benchmarking AI Safety Research Agent and Tool Ecosystem OpenAI Operator Operator System Card

Related guides (4)

OpenAI

OpenAI: The Lab That Made AI a Household Word

Read asBeginner In-depth

AI Safety ResearchTopic guide

AI Safety Research: From Lab Policies to Real-World Flashpoints

Read asBeginner In-depth

Agent and Tool EcosystemTopic guide

Agent and Tool Ecosystem: How the Infrastructure Layer Around LLMs Is Consolidating

Read asIn-depth

Evaluation and BenchmarkingTopic guide

Evaluation and Benchmarking: The Shifting Yardstick of AI Capability

Read asIn-depth

Related events (8)

8Openai Blog·1mo ago·source ↗

OpenAI o1 System Card

OpenAI has published the system card for its o1 and o1-mini models, documenting safety evaluations conducted prior to release. The report covers external red teaming exercises and frontier risk assessments performed under OpenAI's Preparedness Framework. This represents the formal safety disclosure accompanying the o1 model family launch.

Frontier Model Releases Evaluation and Benchmarking Preparedness Framework o1-mini OpenAI +3 more

8Openai Blog·1mo ago·source ↗

ChatGPT Agent System Card

OpenAI has published a system card for its ChatGPT agent, an agentic model that integrates research, browser automation, and code execution tools into a unified system. The release is accompanied by safety documentation under OpenAI's Preparedness Framework. The system card details the safeguards and evaluations applied to the agent prior to deployment. This represents OpenAI's formal safety disclosure for a production agentic product.

Frontier Model Releases Evaluation and Benchmarking ChatGPT agent Preparedness Framework OpenAI +3 more

6Openai Blog·1mo ago·source ↗

OpenAI o3-mini System Card

OpenAI has published the system card for its o3-mini model, detailing safety evaluations, external red teaming efforts, and assessments conducted under the Preparedness Framework. The document covers the safety work performed prior to deployment of the o3-mini reasoning model. This is a standard pre-release safety disclosure accompanying the model launch.

Frontier Model Releases Evaluation and Benchmarking Preparedness Framework o3-mini OpenAI +1 more

8Openai Blog·1mo ago·source ↗

OpenAI o3 and o4-mini System Card

OpenAI has published the system card for its o3 and o4-mini models, which combine advanced reasoning capabilities with a full suite of integrated tools including web browsing, Python execution, image and file analysis, image generation, canvas, automations, file search, and memory. The system card documents safety evaluations and deployment considerations for these frontier reasoning models. This represents a significant capability expansion over prior o-series models by natively integrating tool use alongside chain-of-thought reasoning.

Frontier Model Releases Evaluation and Benchmarking o3 and o4-mini system card OpenAI o3-mini OpenAI +2 more

6Openai Blog·1mo ago·source ↗

Deep Research System Card

OpenAI has published the system card for its Deep Research capability, detailing pre-release safety work including external red teaming and frontier risk evaluations conducted under the Preparedness Framework. The document outlines identified risk areas and the mitigations implemented before deployment. This is the formal safety disclosure accompanying the Deep Research product launch.

Frontier Model Releases AI Safety Research Deep Research Preparedness Framework OpenAI +1 more

7Openai Blog·1mo ago·source ↗

GPT-5.1-Codex-Max System Card

OpenAI has published the system card for GPT-5.1-Codex-Max, a coding-focused model variant. The card details model-level safety mitigations including specialized safety training against harmful tasks and prompt injection attacks, as well as product-level controls such as agent sandboxing and configurable network access. This represents OpenAI's formal safety documentation for an agentic coding model deployment.

Frontier Model Releases AI Safety Research prompt injection GPT-5.1-Codex-Max OpenAI +2 more

8Openai Blog·1mo ago·source ↗

Introducing Operator

OpenAI has announced Operator, a new AI agent product capable of taking actions on the web on behalf of users. The announcement comes from OpenAI's official blog, signaling a major step toward autonomous web-based task execution. Operator represents OpenAI's entry into the agentic AI product space, where models can browse, interact with, and complete tasks across websites without direct user intervention.

Frontier Model Releases Enterprise Deployment Patterns OpenAI Operator +1 more

6Openai Blog·1mo ago·source ↗

OpenAI Publishes System Card Addendum for Codex Agent and codex-1 Model

OpenAI released an addendum to the o3 and o4-mini system cards covering Codex, a cloud-based coding agent powered by codex-1—a variant of o3 fine-tuned for software engineering via reinforcement learning on real-world coding tasks. codex-1 is designed to produce code matching human style and PR conventions, follow instructions precisely, and iterate on tests until they pass. The addendum provides safety and capability documentation for this specialized agentic deployment.

Frontier Model Releases AI Safety Research o3 and o4-mini system card o4-mini OpenAI +4 more