Almanac
← Events
6OpenAI Blog·1mo ago

Operator System Card

OpenAI published a system card for Operator, its autonomous web-browsing agent, detailing the multi-layered safety mitigations deployed. The document covers protections against prompt injection and jailbreaks, privacy and security measures, external red teaming results, and safety evaluations. It reflects OpenAI's established safety frameworks applied to an agentic product capable of taking real-world actions on behalf of users.

Related guides (4)

Related events (8)

8Openai Blog·1mo ago·source ↗

OpenAI o1 System Card

OpenAI has published the system card for its o1 and o1-mini models, documenting safety evaluations conducted prior to release. The report covers external red teaming exercises and frontier risk assessments performed under OpenAI's Preparedness Framework. This represents the formal safety disclosure accompanying the o1 model family launch.

8Openai Blog·1mo ago·source ↗

ChatGPT Agent System Card

OpenAI has published a system card for its ChatGPT agent, an agentic model that integrates research, browser automation, and code execution tools into a unified system. The release is accompanied by safety documentation under OpenAI's Preparedness Framework. The system card details the safeguards and evaluations applied to the agent prior to deployment. This represents OpenAI's formal safety disclosure for a production agentic product.

6Openai Blog·1mo ago·source ↗

OpenAI o3-mini System Card

OpenAI has published the system card for its o3-mini model, detailing safety evaluations, external red teaming efforts, and assessments conducted under the Preparedness Framework. The document covers the safety work performed prior to deployment of the o3-mini reasoning model. This is a standard pre-release safety disclosure accompanying the model launch.

8Openai Blog·1mo ago·source ↗

OpenAI o3 and o4-mini System Card

OpenAI has published the system card for its o3 and o4-mini models, which combine advanced reasoning capabilities with a full suite of integrated tools including web browsing, Python execution, image and file analysis, image generation, canvas, automations, file search, and memory. The system card documents safety evaluations and deployment considerations for these frontier reasoning models. This represents a significant capability expansion over prior o-series models by natively integrating tool use alongside chain-of-thought reasoning.

6Openai Blog·1mo ago·source ↗

Deep Research System Card

OpenAI has published the system card for its Deep Research capability, detailing pre-release safety work including external red teaming and frontier risk evaluations conducted under the Preparedness Framework. The document outlines identified risk areas and the mitigations implemented before deployment. This is the formal safety disclosure accompanying the Deep Research product launch.

7Openai Blog·1mo ago·source ↗

GPT-5.1-Codex-Max System Card

OpenAI has published the system card for GPT-5.1-Codex-Max, a coding-focused model variant. The card details model-level safety mitigations including specialized safety training against harmful tasks and prompt injection attacks, as well as product-level controls such as agent sandboxing and configurable network access. This represents OpenAI's formal safety documentation for an agentic coding model deployment.

8Openai Blog·1mo ago·source ↗

Introducing Operator

OpenAI has announced Operator, a new AI agent product capable of taking actions on the web on behalf of users. The announcement comes from OpenAI's official blog, signaling a major step toward autonomous web-based task execution. Operator represents OpenAI's entry into the agentic AI product space, where models can browse, interact with, and complete tasks across websites without direct user intervention.

6Openai Blog·1mo ago·source ↗

OpenAI Publishes System Card Addendum for Codex Agent and codex-1 Model

OpenAI released an addendum to the o3 and o4-mini system cards covering Codex, a cloud-based coding agent powered by codex-1—a variant of o3 fine-tuned for software engineering via reinforcement learning on real-world coding tasks. codex-1 is designed to produce code matching human style and PR conventions, follow instructions precisely, and iterate on tests until they pass. The addendum provides safety and capability documentation for this specialized agentic deployment.