
o1
o1-a8a81b78 · 5 events · first seen 28d ago
Aliases: o1
Recent events (5)
Deliberative Alignment: Reasoning Enables Safer Language Models
OpenAI introduces deliberative alignment, a new alignment strategy applied to o1 models in which the model is directly taught safety specifications and trained to reason over them at inference time. Unlike prior approaches that embed safety implicitly through RLHF, this method makes safety reasoning explicit and inspectable. The announcement positions deliberative alignment as a meaningful advance in scalable oversight and safe deployment of frontier reasoning models.
OpenAI o1 and New Developer Tools Announced
OpenAI has announced the full release of the o1 model alongside a set of developer-facing updates, including Realtime API improvements and a new fine-tuning method. The announcement targets developers building on the OpenAI platform. The source does not elaborate on specific capabilities or pricing.
OpenAI o1 System Card
OpenAI has published the system card for its o1 and o1-mini models, documenting safety evaluations conducted prior to release. The report covers external red teaming exercises and frontier risk assessments performed under OpenAI's Preparedness Framework. This represents the formal safety disclosure accompanying the o1 model family launch.
Learning to Reason with LLMs
On September 12, 2024, OpenAI published a blog post describing advances in training LLMs to perform complex multi-step reasoning. The post likely corresponds to the initial release of the o1 model series (reportedly code-named 'Strawberry'), which uses chain-of-thought reasoning trained via reinforcement learning to achieve significantly improved performance on math, science, and coding benchmarks.
OpenAI o1-mini: Cost-Efficient Reasoning Model
OpenAI announced o1-mini, a smaller and more cost-efficient variant in its o1 reasoning model series. The release targets use cases that need reasoning capability at lower inference cost. It accompanies the broader o1 launch and reflects OpenAI's effort to make chain-of-thought reasoning models available at different price points.