6OpenAI Blog·1mo ago

Introducing vision to the fine-tuning API

OpenAI has extended its fine-tuning API to support multimodal inputs, allowing developers to fine-tune GPT-4o using both images and text. This enables customization of vision capabilities for domain-specific tasks. The update expands the existing text-only fine-tuning pipeline to handle image-text pairs.

Frontier Model Releases Enterprise Deployment Patterns Multimodal Progress GPT-4o OpenAI Fine-Tuning OpenAI

Related guides (4)

OpenAI

OpenAI: The Lab That Made AI a Household Word

Read asBeginner In-depth

Frontier Model ReleasesTopic guide

Frontier Model Releases: The Race From Language to Action

Read asBeginner In-depth

Multimodal ProgressTopic guide

Multimodal Progress: How AI Learned to See, Hear, and Act

Read asBeginner

Enterprise Deployment PatternsTopic guide

Enterprise Deployment Patterns: From LLM Demo to Production Reality

Read asIn-depth

Related events (8)

7Openai Blog·1mo ago·source ↗

Fine-tuning now available for GPT-4o

OpenAI has launched fine-tuning support for GPT-4o, its flagship multimodal model, as of August 20, 2024. This allows developers to customize GPT-4o on their own datasets via the OpenAI API. The release extends the fine-tuning capability previously available on GPT-3.5 and GPT-4 to the most capable model in OpenAI's lineup, enabling task-specific optimization at the frontier.

Frontier Model Releases Inference Economics GPT-4o OpenAI Fine-Tuning OpenAI +1 more

7Openai Blog·1mo ago·source ↗

GPT-3.5 Turbo fine-tuning and API updates

OpenAI has opened fine-tuning access for GPT-3.5 Turbo, allowing developers to customize the model with their own data for specific use cases. This extends fine-tuning capabilities previously available on older GPT-3 models to the more capable Turbo variant. The announcement also includes associated API updates to support this functionality.

Frontier Model Releases Enterprise Deployment Patterns GPT-3.5 Turbo OpenAI API OpenAI +2 more

5Openai Blog·1mo ago·source ↗

OpenAI Upgrades Moderation API with GPT-4o-Based Multimodal Model

OpenAI has released an updated Moderation API powered by a new model built on GPT-4o, extending content moderation capabilities to both text and images. The update aims to improve accuracy in detecting harmful content, giving developers better tools for building moderation systems. This represents an expansion of OpenAI's safety infrastructure into multimodal domains.

AI Safety Research Enterprise Deployment Patterns GPT-4o OpenAI Moderation API OpenAI +1 more

5Openai Blog·1mo ago·source ↗

Customizing GPT-3 for your application

OpenAI announced fine-tuning capabilities for GPT-3, enabling developers to customize the model for specific applications via a single command. This feature allows users to adapt GPT-3's behavior to their use case by training on domain-specific data. The announcement marks an early milestone in making large language model customization accessible through an API.

Enterprise Deployment Patterns Alignment and RLHF GPT-3 OpenAI Fine-Tuning OpenAI

5Openai Blog·1mo ago·source ↗

OpenAI Improves Fine-Tuning API and Expands Custom Models Program

OpenAI announced enhancements to its fine-tuning API giving developers greater control over the training process, alongside an expansion of its custom models program. The updates aim to provide more flexibility for enterprise and developer use cases requiring tailored model behavior. Specific new features include additional hyperparameter controls and tooling improvements, while the custom models program expansion opens new pathways for organizations to build bespoke models with OpenAI's assistance.

Enterprise Deployment Patterns Agent and Tool Ecosystem OpenAI Fine-Tuning OpenAI OpenAI Custom Models Program

4Openai Blog·1mo ago·source ↗

Building smarter maps with GPT-4o vision fine-tuning

OpenAI published a case study on Grab using GPT-4o vision fine-tuning to improve map intelligence. The deployment demonstrates a real-world enterprise application of fine-tuned multimodal models for geospatial data processing. This represents a concrete example of GPT-4o's vision capabilities being adapted for domain-specific tasks in Southeast Asian markets.

Enterprise Deployment Patterns Multimodal Progress Grab GPT-4o OpenAI +1 more

7Openai Blog·1mo ago·source ↗

GPT-4V(ision) System Card

OpenAI published the system card for GPT-4V(ision), the multimodal extension of GPT-4 that accepts image inputs alongside text. The document covers capability evaluations, safety assessments, and known limitations of the vision-enabled model. It represents OpenAI's formal safety and transparency disclosure accompanying the GPT-4V release.

Frontier Model Releases Evaluation and Benchmarking GPT-4V OpenAI GPT-4 +2 more

8Openai Blog·1mo ago·source ↗

Introducing 4o Image Generation

OpenAI has integrated a native image generation capability directly into GPT-4o, positioning it as a primary model capability rather than a separate system. The announcement frames this as their most advanced image generator to date, emphasizing both aesthetic quality and practical utility. This represents a shift toward unified multimodal models that generate images natively rather than relying on separate diffusion-based pipelines.

Frontier Model Releases Inference Economics GPT-4o GPT-4o Image Generation OpenAI +1 more