4OpenAI Blog·1mo ago

Building smarter maps with GPT-4o vision fine-tuning

OpenAI published a case study on Grab using GPT-4o vision fine-tuning to improve map intelligence. The deployment demonstrates a real-world enterprise application of fine-tuned multimodal models for geospatial data processing. This represents a concrete example of GPT-4o's vision capabilities being adapted for domain-specific tasks in Southeast Asian markets.

Enterprise Deployment Patterns Multimodal Progress Grab GPT-4o OpenAI GPT-4o vision fine-tuning

Related guides (3)

OpenAI

OpenAI: The Lab That Made AI a Household Word

Read asBeginner In-depth

Multimodal ProgressTopic guide

Multimodal Progress: How AI Learned to See, Hear, and Act

Read asBeginner In-depth

Enterprise Deployment PatternsTopic guide

Enterprise Deployment Patterns: From AI Demo to Production Reality

Read asBeginner In-depth

Related events (8)

7Openai Blog·1mo ago·source ↗

Fine-tuning now available for GPT-4o

OpenAI has launched fine-tuning support for GPT-4o, its flagship multimodal model, as of August 20, 2024. This allows developers to customize GPT-4o on their own datasets via the OpenAI API. The release extends the fine-tuning capability previously available on GPT-3.5 and GPT-4 to the most capable model in OpenAI's lineup, enabling task-specific optimization at the frontier.

Frontier Model Releases Inference Economics GPT-4o OpenAI Fine-Tuning OpenAI +1 more

6Openai Blog·1mo ago·source ↗

Introducing vision to the fine-tuning API

OpenAI has extended its fine-tuning API to support multimodal inputs, allowing developers to fine-tune GPT-4o using both images and text. This enables customization of vision capabilities for domain-specific tasks. The update expands the existing text-only fine-tuning pipeline to handle image-text pairs.

Frontier Model Releases Enterprise Deployment Patterns GPT-4o OpenAI Fine-Tuning OpenAI +1 more

7Openai Blog·1mo ago·source ↗

GPT-4o System Card

OpenAI published the system card for GPT-4o, its flagship multimodal model. The document covers safety evaluations, capability assessments, and risk mitigations conducted prior to deployment. It provides transparency into the model's performance across modalities including text, audio, and vision, as well as alignment and red-teaming findings.

Frontier Model Releases Evaluation and Benchmarking GPT-4o OpenAI +3 more

7Openai Blog·1mo ago·source ↗

GPT-4V(ision) System Card

OpenAI published the system card for GPT-4V(ision), the multimodal extension of GPT-4 that accepts image inputs alongside text. The document covers capability evaluations, safety assessments, and known limitations of the vision-enabled model. It represents OpenAI's formal safety and transparency disclosure accompanying the GPT-4V release.

Frontier Model Releases Evaluation and Benchmarking GPT-4V OpenAI GPT-4 +2 more

9Openai Blog·1mo ago·source ↗

GPT-4 Release

OpenAI released GPT-4, a large multimodal model accepting image and text inputs and producing text outputs. The model demonstrates human-level performance on various professional and academic benchmarks. It represents OpenAI's latest milestone in scaling deep learning.

Frontier Model Releases Evaluation and Benchmarking OpenAI GPT-4 +1 more

9Openai Blog·1mo ago·source ↗

Hello GPT-4o

OpenAI announces GPT-4o (Omni), a new flagship multimodal model capable of reasoning across audio, vision, and text in real time. The model represents a significant step toward natively multimodal AI, processing and generating across modalities without separate pipeline stages. It is positioned as OpenAI's primary production model going forward.

Frontier Model Releases Inference Economics GPT-4o OpenAI GPT-4 +1 more

8Openai Blog·1mo ago·source ↗

Introducing 4o Image Generation

OpenAI has integrated a native image generation capability directly into GPT-4o, positioning it as a primary model capability rather than a separate system. The announcement frames this as their most advanced image generator to date, emphasizing both aesthetic quality and practical utility. This represents a shift toward unified multimodal models that generate images natively rather than relying on separate diffusion-based pipelines.

Frontier Model Releases Inference Economics GPT-4o GPT-4o Image Generation OpenAI +1 more

4Openai Blog·1mo ago·source ↗

New in ChatGPT for Business: April 2025 Updates

OpenAI published an April 2025 update for ChatGPT's business tier, highlighting four capability areas: the o3 reasoning model, image generation, enhanced memory, and internal knowledge retrieval. The announcement is framed around hands-on demos for enterprise users. This represents an incremental rollout of recently released capabilities into the business product line rather than a new model launch.

Frontier Model Releases Enterprise Deployment Patterns ChatGPT Memory ChatGPT OpenAI +2 more