Introducing vision to the fine-tuning API
OpenAI has extended its fine-tuning API to support multimodal inputs, allowing developers to fine-tune GPT-4o using both images and text. This enables customization of vision capabilities for domain-specific tasks. The update expands the existing text-only fine-tuning pipeline to handle image-text pairs.
Related guides (4)
Related events (8)
Fine-tuning now available for GPT-4o
OpenAI has launched fine-tuning support for GPT-4o, its flagship multimodal model, as of August 20, 2024. This allows developers to customize GPT-4o on their own datasets via the OpenAI API. The release extends the fine-tuning capability previously available on GPT-3.5 and GPT-4 to the most capable model in OpenAI's lineup, enabling task-specific optimization at the frontier.
GPT-3.5 Turbo fine-tuning and API updates
OpenAI has opened fine-tuning access for GPT-3.5 Turbo, allowing developers to customize the model with their own data for specific use cases. This extends fine-tuning capabilities previously available on older GPT-3 models to the more capable Turbo variant. The announcement also includes associated API updates to support this functionality.
OpenAI Upgrades Moderation API with GPT-4o-Based Multimodal Model
OpenAI has released an updated Moderation API powered by a new model built on GPT-4o, extending content moderation capabilities to both text and images. The update aims to improve accuracy in detecting harmful content, giving developers better tools for building moderation systems. This represents an expansion of OpenAI's safety infrastructure into multimodal domains.
Customizing GPT-3 for your application
OpenAI announced fine-tuning capabilities for GPT-3, enabling developers to customize the model for specific applications via a single command. This feature allows users to adapt GPT-3's behavior to their use case by training on domain-specific data. The announcement marks an early milestone in making large language model customization accessible through an API.
OpenAI Improves Fine-Tuning API and Expands Custom Models Program
OpenAI announced enhancements to its fine-tuning API giving developers greater control over the training process, alongside an expansion of its custom models program. The updates aim to provide more flexibility for enterprise and developer use cases requiring tailored model behavior. Specific new features include additional hyperparameter controls and tooling improvements, while the custom models program expansion opens new pathways for organizations to build bespoke models with OpenAI's assistance.
Building smarter maps with GPT-4o vision fine-tuning
OpenAI published a case study on Grab using GPT-4o vision fine-tuning to improve map intelligence. The deployment demonstrates a real-world enterprise application of fine-tuned multimodal models for geospatial data processing. This represents a concrete example of GPT-4o's vision capabilities being adapted for domain-specific tasks in Southeast Asian markets.
GPT-4V(ision) System Card
OpenAI published the system card for GPT-4V(ision), the multimodal extension of GPT-4 that accepts image inputs alongside text. The document covers capability evaluations, safety assessments, and known limitations of the vision-enabled model. It represents OpenAI's formal safety and transparency disclosure accompanying the GPT-4V release.
Introducing 4o Image Generation
OpenAI has integrated a native image generation capability directly into GPT-4o, positioning it as a primary model capability rather than a separate system. The announcement frames this as their most advanced image generator to date, emphasizing both aesthetic quality and practical utility. This represents a shift toward unified multimodal models that generate images natively rather than relying on separate diffusion-based pipelines.



