Almanac
← Events
5Interconnects (Nathan Lambert)·1mo ago

GPT 5.4 is a big step for Codex

A Tier 2 commentary piece from Interconnects evaluates GPT 5.4 in the context of OpenAI's Codex agent ecosystem, examining what the model release means for the frontier of AI agents. The author reflects on the current state of agent evaluation and notes a continued preference for Claude in practice. The piece offers analysis of how GPT 5.4 advances coding-agent capabilities relative to competing offerings.

Related guides (4)

Related events (8)

8Openai Blog·1mo ago·source ↗

Introducing GPT-5.3-Codex

OpenAI has announced GPT-5.3-Codex, described as a Codex-native agent combining frontier coding performance with general reasoning capabilities. The model is designed to support long-horizon, real-world technical work. The announcement positions it as an agentic coding system rather than a standalone language model.

8Openai Blog·1mo ago·source ↗

GPT-5.3-Codex System Card

OpenAI has released the system card for GPT-5.3-Codex, described as the most capable agentic coding model to date. It combines the frontier coding performance of GPT-5.2-Codex with the reasoning and professional knowledge capabilities of GPT-5.2. The release represents a continuation of OpenAI's Codex line of specialized coding models within the GPT-5 family.

7Openai Blog·1mo ago·source ↗

OpenAI releases GPT-5-Codex: GPT-5 variant optimized for agentic coding

OpenAI has published an addendum to the GPT-5 system card introducing GPT-5-Codex, a version of GPT-5 specifically optimized for agentic coding within the Codex environment. The model features dynamic thinking-effort adjustment, scaling compute based on task complexity—responding quickly to simple queries while sustaining longer independent work on complex coding tasks. This represents a specialized derivative of GPT-5 targeting software engineering agents rather than general-purpose use.

8Openai Blog·1mo ago·source ↗

Introducing GPT-5.2-Codex

OpenAI has released GPT-5.2-Codex, described as their most advanced coding model. The model features long-horizon reasoning, large-scale code transformation capabilities, and enhanced cybersecurity features. This represents a specialized coding-focused model in the GPT-5 family.

7Openai Blog·1mo ago·source ↗

OpenAI Introduces GPT-5.1-Codex-Max for Agentic Coding

OpenAI has released GPT-5.1-Codex-Max, a new model optimized for agentic coding tasks within the Codex platform. The model targets long-running, project-scale software development work with improvements in reasoning and token efficiency. It is positioned as a faster and more capable successor for autonomous coding workflows.

8Openai Blog·1mo ago·source ↗

Addendum to GPT-5.2 System Card: GPT-5.2-Codex

OpenAI published a system card addendum for GPT-5.2-Codex, a specialized variant of GPT-5.2 focused on coding capabilities. The document provides safety evaluations, capability assessments, and deployment considerations specific to this coding-oriented model. As a Tier 1 source system card, it represents official documentation of a frontier coding model's properties and risk profile.

4One Useful Thing·1mo ago·source ↗

Sign of the Future: GPT-5.5 Commentary

A tier-2 commentary piece from One Useful Thing discusses GPT-5.5 as a notable step in the AI capability curve. The piece frames the release as a signal of future AI development trajectories. As a commentary source, it likely offers analysis of what GPT-5.5's capabilities imply rather than primary technical reporting.

8The Batch·17d ago·source ↗

GPT-5.4 released with tool search, computer use, and frontier benchmark performance

OpenAI released GPT-5.4 in Thinking and Pro variants, featuring an expanded context window (up to 1.05M input tokens), native computer use, tool search capabilities, and adjustable reasoning levels. In independent testing by Artificial Analysis, GPT-5.4 Pro at xhigh reasoning achieved state-of-the-art on GDP-Val-AA, BrowseComp, Terminal-Bench-Hard, SWE-Bench-Pro, and MCP Atlas, while trailing Gemini 3.1 Pro Preview on MMMU-Pro and Humanity's Last Exam. Pricing is set at the top of the market ($30/$180 per million input/output tokens for Pro), and the release also powers Codex, OpenAI's competitor to Claude Code. The item is reported via The Batch (tier 2 commentary) and includes additional context on Andrew Ng's chub CLI tool for agent documentation sharing.