ChatGPT can now see, hear, and speak
OpenAI announced multimodal capabilities for ChatGPT, enabling the model to process images (vision), listen to voice input, and respond with synthesized speech. These features expand ChatGPT beyond text-only interaction into a multimodal assistant experience. The rollout was announced for Plus and Enterprise users first, with broader availability to follow.
Related guides (3)
Related events (8)
Introducing ChatGPT
OpenAI announced ChatGPT, a conversational model trained to engage in dialogue, answer follow-up questions, acknowledge errors, challenge incorrect premises, and decline inappropriate requests. The model's dialogue format represented a significant step in making large language models accessible and interactive for general users. This November 2022 launch marked a pivotal moment in public AI adoption.
Introducing ChatGPT Images 2.0
OpenAI has launched ChatGPT Images 2.0, a new image generation model integrated into ChatGPT. The release highlights improved text rendering, multilingual support, and advanced visual reasoning capabilities. This represents an upgrade to OpenAI's consumer-facing image generation offering.
Introducing ChatGPT Agent
OpenAI has launched ChatGPT agent, a new capability that combines reasoning with tool use to autonomously complete multi-step tasks such as research, bookings, and presentation creation. The agent operates under user guidance, integrating thinking and acting in a unified workflow. This represents OpenAI's move to bring agentic capabilities directly into the ChatGPT product for general consumers.
ChatGPT Plugins: Initial Support Announced
OpenAI announced initial support for plugins in ChatGPT, enabling the model to access up-to-date information, run computations, and interact with third-party services. Plugins are described as tools designed specifically for language models with safety as a core principle. This marks a significant expansion of ChatGPT's capabilities beyond its base language model functionality, introducing a structured ecosystem for external tool integration.
GPT-5.1: A smarter, more conversational ChatGPT
OpenAI is rolling out GPT-5.1, an upgrade to the GPT-5 series, beginning with paid users on November 12, 2025. The update emphasizes warmer conversational tone, improved capabilities, and new options for customizing ChatGPT's tone and style. No specific benchmark results or architectural details are provided in the announcement.
OpenAI Spring Update: GPT-4o Announced, Expanded Free ChatGPT Capabilities
OpenAI announced GPT-4o, a new flagship model, alongside an expansion of capabilities available to free-tier ChatGPT users. GPT-4o represents a new omnimodal architecture capable of handling text, audio, and vision in a unified model. The announcement was made via a live demo event and marks a significant shift in OpenAI's product and model strategy.
Introducing GPTs: Custom Versions of ChatGPT
OpenAI announced GPTs, a feature allowing users to create customized versions of ChatGPT by combining custom instructions, additional knowledge, and selectable capabilities. GPTs can be built without coding and are designed for specific use cases, ranging from personal productivity to enterprise deployment. OpenAI also announced a forthcoming GPT Store where creators can share and potentially monetize their GPTs.
Introducing ChatGPT Go, now available worldwide
OpenAI has launched ChatGPT Go as a new globally available subscription tier, providing access to GPT-5.2 Instant with higher usage limits and extended memory capabilities. The offering is positioned as a more affordable entry point to advanced AI features for users worldwide. This represents a new pricing and access tier in OpenAI's consumer product lineup.


