OpenAI o3-mini

model · active · openai-o3-mini-b07a21db · 6 events · first seen 1mo ago

Aliases: OpenAI o3-mini, OpenAI o1-mini, OpenAI o4-mini

Recent events (6)

8 · OpenAI Blog · 28d ago

OpenAI o3 and o4-mini System Card

OpenAI has published the system card for its o3 and o4-mini models, which combine advanced reasoning capabilities with a full suite of integrated tools including web browsing, Python execution, image and file analysis, image generation, canvas, automations, file search, and memory. The system card documents safety evaluations and deployment considerations for these frontier reasoning models. This represents a significant capability expansion over prior o-series models by natively integrating tool use alongside chain-of-thought reasoning.

9 · OpenAI Blog · 28d ago

Introducing OpenAI o1

OpenAI announced o1, a new series of AI models designed to spend more time 'thinking' before responding, using chain-of-thought reasoning to tackle complex problems in science, coding, and mathematics. The o1-preview and o1-mini models are being released, with o1-preview representing the most capable version and o1-mini offering a faster, cheaper alternative optimized for coding and reasoning tasks. OpenAI claims o1-preview ranks in the 89th percentile on competitive programming problems and performs at a PhD level on science benchmarks. This release marks a significant shift in OpenAI's approach to scaling, moving from purely training-time compute to inference-time compute as a new axis of capability improvement.

4 · OpenAI Blog · 27d ago

Building an autonomous financial analyst with o1 and o3-mini

OpenAI highlights Endex, a company building an autonomous financial analyst product powered by OpenAI's o1 and o3-mini reasoning models. The post is a brief partner case study of an enterprise deployment of these models in financial analysis, illustrating how frontier reasoning models are being applied to specialized professional workflows.

9 · DeepSeek News · 1mo ago

DeepSeek-R1 Release: Open-Source Reasoning Model on Par with OpenAI o1

DeepSeek has released DeepSeek-R1, a reasoning-focused large language model claiming performance parity with OpenAI o1 on math, code, and reasoning benchmarks. The model is fully open-source under the MIT License, including weights and outputs, enabling distillation and commercial use. Six smaller distilled models, the largest at 32B and 70B parameters, are also released, with the 32B and 70B variants reportedly matching OpenAI o1-mini. API access is live at significantly lower pricing than comparable frontier models ($0.55/M input tokens, $2.19/M output tokens).

8 · Qwen Research · 1mo ago

Qwen3 Release: Flagship 235B MoE and Full Model Family Announced

Alibaba's Qwen team has released Qwen3, a new family of large language models including the flagship Qwen3-235B-A22B mixture-of-experts model. The flagship model claims competitive benchmark performance against DeepSeek-R1, OpenAI o1/o3-mini, Grok-3, and Gemini-2.5-Pro on coding, math, and general capabilities. A smaller MoE variant, Qwen3-30B-A3B, reportedly outperforms QwQ-32B despite using only one-tenth the activated parameters, and the 4B model is said to match Qwen2.5's larger models. Models are available across Hugging Face, ModelScope, and Kaggle.

6 · OpenAI Blog · 28d ago

OpenAI Upgrades Operator Agent to o3 Model

OpenAI is replacing the GPT-4o-based model powering its Operator agent with a version based on o3, while the API version of Operator remains on GPT-4o. This update is accompanied by a system card addendum documenting the change. The move brings o3's reasoning capabilities to Operator's browser-based task automation.