Hello GPT-4o
OpenAI announces GPT-4o (Omni), a new flagship multimodal model capable of reasoning across audio, vision, and text in real time. The model represents a significant step toward natively multimodal AI, processing and generating across modalities without separate pipeline stages. It is positioned as OpenAI's primary production model going forward.
Related guides (4)
Related events (8)
OpenAI Spring Update: GPT-4o Announced, Expanded Free ChatGPT Capabilities
OpenAI announced GPT-4o, a new flagship model, alongside an expansion of capabilities available to free-tier ChatGPT users. GPT-4o represents a new omnimodal architecture capable of handling text, audio, and vision in a unified model. The announcement was made via a live demo event and marks a significant shift in OpenAI's product and model strategy.
GPT-4 Release
OpenAI released GPT-4, a large multimodal model accepting image and text inputs and producing text outputs. The model demonstrates human-level performance on various professional and academic benchmarks. It represents OpenAI's latest milestone in scaling deep learning.
GPT-4o System Card
OpenAI published the system card for GPT-4o, its flagship multimodal model. The document covers safety evaluations, capability assessments, and risk mitigations conducted prior to deployment. It provides transparency into the model's performance across modalities including text, audio, and vision, as well as alignment and red-teaming findings.
Introducing 4o Image Generation
OpenAI has integrated a native image generation capability directly into GPT-4o, positioning it as a primary model capability rather than a separate system. The announcement frames this as their most advanced image generator to date, emphasizing both aesthetic quality and practical utility. This represents a shift toward unified multimodal models that generate images natively rather than relying on separate diffusion-based pipelines.
Introducing GPT-4o and More Tools to ChatGPT Free Users
OpenAI is launching GPT-4o, its newest flagship model, and expanding access to additional capabilities for free-tier ChatGPT users. This represents a significant democratization move, bringing frontier model capabilities to users without a paid subscription. The announcement signals OpenAI's strategy to broaden its user base while maintaining competitive pressure on rivals.
Fine-tuning now available for GPT-4o
OpenAI has launched fine-tuning support for GPT-4o, its flagship multimodal model, as of August 20, 2024. This allows developers to customize GPT-4o on their own datasets via the OpenAI API. The release extends the fine-tuning capability previously available on GPT-3.5 and GPT-4 to the most capable model in OpenAI's lineup, enabling task-specific optimization at the frontier.
OpenAI Upgrades Moderation API with GPT-4o-Based Multimodal Model
OpenAI has released an updated Moderation API powered by a new model built on GPT-4o, extending content moderation capabilities to both text and images. The update aims to improve accuracy in detecting harmful content, giving developers better tools for building moderation systems. This represents an expansion of OpenAI's safety infrastructure into multimodal domains.
Introducing GPT-5.2
OpenAI has released GPT-5.2, described as their most advanced frontier model for professional use, featuring state-of-the-art reasoning, long-context understanding, coding, and vision capabilities. The model is available through ChatGPT and the OpenAI API. It is positioned to support faster and more reliable agentic workflows.



