5·Interconnects (Nathan Lambert)·1mo ago

GPT 5.4 is a big step for Codex

A Tier 2 commentary piece from Interconnects evaluates GPT 5.4 within OpenAI's Codex agent ecosystem and considers what the release means for frontier AI agents. The author reflects on the current state of agent evaluation and notes that, in practice, Claude remains the preferred choice. The piece also analyzes how GPT 5.4 advances coding-agent capabilities relative to competing offerings.

Frontier Model Releases·Evaluation and Benchmarking·Agent and Tool Ecosystem·Interconnects·Claude·OpenAI·OpenAI Codex·GPT-5.5·Anthropic

Related guides (4)

OpenAI

OpenAI: The Lab That Made AI a Household Word

Claude

Claude: Anthropic's AI Assistant Built for Safety and Scale

GPT-5.5

GPT-5.5: OpenAI's Benchmark-Leading Agentic Model with a Hallucination Problem

Frontier Model Releases·Topic guide

Frontier Model Releases: The Race from GPT-3 to Safety-Tiered Superintelligence

Related events (8)

8·OpenAI Blog·1mo ago

Introducing GPT-5.3-Codex

OpenAI has announced GPT-5.3-Codex, described as a Codex-native agent combining frontier coding performance with general reasoning capabilities. The model is designed to support long-horizon, real-world technical work. The announcement positions it as an agentic coding system rather than a standalone language model.

Frontier Model Releases Inference Economics GPT-5.3-Codex OpenAI Codex +1 more

8·OpenAI Blog·1mo ago

GPT-5.3-Codex System Card

OpenAI has released the system card for GPT-5.3-Codex, described as the most capable agentic coding model to date. It combines the frontier coding performance of GPT-5.2-Codex with the reasoning and professional knowledge capabilities of GPT-5.2. The release represents a continuation of OpenAI's Codex line of specialized coding models within the GPT-5 family.

Frontier Model Releases Evaluation and Benchmarking GPT-5.3-Codex GPT-5.2 OpenAI +1 more

7·OpenAI Blog·1mo ago

OpenAI releases GPT-5-Codex: GPT-5 variant optimized for agentic coding

OpenAI has published an addendum to the GPT-5 system card introducing GPT-5-Codex, a version of GPT-5 specifically optimized for agentic coding within the Codex environment. The model adjusts its thinking effort dynamically, scaling compute with task complexity: it responds quickly to simple queries and sustains longer independent work on complex coding tasks. This makes it a specialized derivative of GPT-5 that targets software engineering agents rather than general-purpose use.

Frontier Model Releases Inference Economics GPT-5.3-Codex OpenAI GPT-5.5 System Card +3 more

8·OpenAI Blog·1mo ago

Introducing GPT-5.2-Codex

OpenAI has released GPT-5.2-Codex, described as its most advanced coding model. The model features long-horizon reasoning, large-scale code transformation capabilities, and enhanced cybersecurity features. It is a specialized, coding-focused model in the GPT-5 family.

Long Context Evolution Frontier Model Releases GPT-5.3-Codex OpenAI +2 more

7·OpenAI Blog·1mo ago

OpenAI Introduces GPT-5.1-Codex-Max for Agentic Coding

OpenAI has released GPT-5.1-Codex-Max, a new model optimized for agentic coding tasks within the Codex platform. The model targets long-running, project-scale software development work with improvements in reasoning and token efficiency. It is positioned as a faster and more capable successor for autonomous coding workflows.

Frontier Model Releases Inference Economics GPT-5.1-Codex-Max OpenAI Codex +1 more

8·OpenAI Blog·1mo ago

Addendum to GPT-5.2 System Card: GPT-5.2-Codex

OpenAI published a system card addendum for GPT-5.2-Codex, a specialized variant of GPT-5.2 focused on coding capabilities. The document provides safety evaluations, capability assessments, and deployment considerations specific to this coding-oriented model. As a Tier 1 source system card, it represents official documentation of a frontier coding model's properties and risk profile.

Frontier Model Releases AI Safety Research GPT-5.3-Codex GPT-5.2 OpenAI +1 more

4·One Useful Thing·1mo ago

Sign of the Future: GPT-5.5 Commentary

A Tier 2 commentary piece from One Useful Thing discusses GPT-5.5 as a notable step on the AI capability curve and frames the release as a signal of where AI development is heading. As a commentary source, it likely offers analysis of what GPT-5.5's capabilities imply rather than primary technical reporting.

Frontier Model Releases·One Useful Thing·OpenAI·GPT-5.5

8·The Batch·17d ago

GPT-5.4 released with tool search, computer use, and frontier benchmark performance

OpenAI released GPT-5.4 in Thinking and Pro variants, featuring an expanded context window (up to 1.05M input tokens), native computer use, tool search capabilities, and adjustable reasoning levels. In independent testing by Artificial Analysis, GPT-5.4 Pro at xhigh reasoning achieved state-of-the-art results on GDPval-AA, BrowseComp, Terminal-Bench-Hard, SWE-Bench-Pro, and MCP Atlas, while trailing Gemini 3.1 Pro Preview on MMMU-Pro and Humanity's Last Exam. Pricing sits at the top of the market ($30/$180 per million input/output tokens for Pro), and the release also powers Codex, OpenAI's competitor to Claude Code. The item is reported via The Batch (Tier 2 commentary) and includes additional context on Andrew Ng's chub CLI tool for sharing documentation with agents.

Frontier Model Releases Inference Economics DeepLearning.AI Artificial Analysis Intelligence Index Claude Opus 4.6 +14 more
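
To put the Pro pricing quoted in the summary above ($30/$180 per million input/output tokens) in concrete terms, here is a minimal sketch of the arithmetic. It assumes a flat per-token rate with no caching discounts, long-context surcharges, or billed reasoning tokens beyond the output count, none of which the summary addresses. The function name and the example token counts are hypothetical.

```python
# Prices quoted in the summary: USD per one million tokens for GPT-5.4 Pro.
INPUT_USD_PER_MILLION = 30.0
OUTPUT_USD_PER_MILLION = 180.0


def request_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost in USD of one request at the quoted Pro rates.

    Assumption: flat pricing, i.e. every token is billed at the same rate.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total_micro_usd = (
        input_tokens * INPUT_USD_PER_MILLION
        + output_tokens * OUTPUT_USD_PER_MILLION
    )
    return total_micro_usd / 1_000_000


# Hypothetical request: 200,000 input tokens and 10,000 output tokens.
# Input:  200,000 * $30  / 1,000,000 = $6.00
# Output:  10,000 * $180 / 1,000,000 = $1.80
example_cost = request_cost_usd(200_000, 10_000)
print(f"${example_cost:.2f}")  # prints "$7.80"
```

At these rates output tokens cost six times as much as input tokens, so long agentic runs that generate a lot of text are likely to be dominated by output cost.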