Almanac
model

GPT-5.4 mini

modelactiveprovisionalgpt-5-4-mini-fafb0963·4 events·first seen 28d ago

Aliases: GPT-5.4 mini

Co-occurring entities

More like this (12)

Recent events (4)

8Openai Blog·28d ago·source ↗

Introducing GPT-5.4 mini and nano

OpenAI has released GPT-5.4 mini and nano, smaller and faster variants of GPT-5.4 optimized for coding, tool use, multimodal reasoning, and high-volume API and sub-agent workloads. These models are positioned for efficiency-sensitive deployment scenarios including agentic pipelines. The release extends the GPT-5.4 family with tiered model options targeting different cost and latency tradeoffs.

6The Batch·14d ago·source ↗

Data Points: NemoClaw enterprise stack, GPT-5.4 mini/nano, Nemotron 3 Nano 4B, Midjourney V8, and Mamba-3

A multi-item roundup covers several AI developments: Nvidia unveiled NemoClaw at GTC 2026, an enterprise software stack integrating with OpenClaw to add security and governance for agentic deployments, with launch partners including Salesforce, Cisco, and CrowdStrike. OpenAI released GPT-5.4 mini and nano, smaller variants optimized for speed with benchmark results on SWE-Bench Pro and OSWorld-Verified, priced at $0.75 and $0.20 per million input tokens respectively. Nvidia also released Nemotron 3 Nano 4B, a hybrid Mamba-Transformer 4B parameter on-device model. Additional items cover Midjourney V8 alpha (5x faster, diffusion-only) and Mamba-3, a 1.5B state space model from CMU and Together.AI with improved accuracy over Mamba-2.

5Openai Blog·28d ago·source ↗

Gradient Labs gives every bank customer an AI account manager

Gradient Labs is deploying AI agents for banking support workflows, powered by OpenAI's GPT-4.1 and GPT-5.4 mini and nano models. The system targets low latency and high reliability for automating customer-facing banking operations. This represents a concrete enterprise deployment of frontier OpenAI models in a regulated financial services context.

6arXiv · cs.AI·7d ago·source ↗

Frontier coding agents use metaprogramming to handle esoteric programming languages

A new arXiv paper evaluates six LLM-based coding agents on four esoteric programming languages (including Brainfuck and Befunge-98), finding that the strongest agents—Claude Opus 4.6 and GPT-5.4 xhigh—often avoid writing the target language directly, instead generating it via Python metaprograms. Forbidding this strategy causes large performance drops, and text guidance alone does not transfer the capability to weaker models, though sharing Opus-derived Python helper code does sharply improve mid-tier agents. The study reveals capability stratification that mainstream benchmarks like SWE-Bench Verified compress into narrow bands, suggesting frontier agents succeed by constructing and debugging working models of unfamiliar environments rather than pattern-matching to training data.