Almanac
← Events
5OpenAI Blog·1mo ago

Be My Eyes Integrates GPT-4 for Visual Accessibility

Be My Eyes, a visual assistance app for blind and low-vision users, has integrated GPT-4 to enhance its accessibility capabilities. The partnership leverages GPT-4's multimodal vision features to provide richer, AI-powered visual interpretation for users. This represents an early real-world deployment of GPT-4's vision capabilities in an assistive technology context.

Related guides (3)

Related events (8)

7Openai Blog·1mo ago·source ↗

GPT-4V(ision) System Card

OpenAI published the system card for GPT-4V(ision), the multimodal extension of GPT-4 that accepts image inputs alongside text. The document covers capability evaluations, safety assessments, and known limitations of the vision-enabled model. It represents OpenAI's formal safety and transparency disclosure accompanying the GPT-4V release.

8Openai Blog·1mo ago·source ↗

ChatGPT can now see, hear, and speak

OpenAI announced multimodal capabilities for ChatGPT, enabling the model to process images (vision), listen to voice input, and respond with synthesized speech. These features expand ChatGPT beyond text-only interaction into a multimodal assistant experience. The rollout was announced for Plus and Enterprise users first, with broader availability to follow.

4The Batch·19d ago·source ↗

Blind Users Can Use AI Models As Virtual Mirrors, But Don't Always Like What They See

Blind and visually impaired users are increasingly relying on vision-language models (notably GPT-4 Vision via Be My Eyes) to assess their own appearance, gaining independence but also encountering AI outputs that reflect conventional beauty standards and may be factually inaccurate. A BBC article by blind journalist Milagros Costabel documents cases where AI feedback was psychologically harmful, including unsolicited critical commentary on facial features. Psychologists warn that blind users are especially vulnerable because they cannot independently verify AI visual judgments. The piece raises broader questions about accuracy, trust calibration, and empathy in AI products designed for accessibility.

9Openai Blog·1mo ago·source ↗

GPT-4 Release

OpenAI released GPT-4, a large multimodal model accepting image and text inputs and producing text outputs. The model demonstrates human-level performance on various professional and academic benchmarks. It represents OpenAI's latest milestone in scaling deep learning.

4Openai Blog·1mo ago·source ↗

Building smarter maps with GPT-4o vision fine-tuning

OpenAI published a case study on Grab using GPT-4o vision fine-tuning to improve map intelligence. The deployment demonstrates a real-world enterprise application of fine-tuned multimodal models for geospatial data processing. This represents a concrete example of GPT-4o's vision capabilities being adapted for domain-specific tasks in Southeast Asian markets.

4Openai Blog·1mo ago·source ↗

Ada Uses GPT-4 to Deliver a New Customer Service Standard

Ada, a customer service platform, has integrated GPT-4 to power its AI-driven support capabilities. The announcement, published on OpenAI's blog, highlights the deployment of GPT-4 in an enterprise customer service context. This represents a concrete enterprise deployment case study for GPT-4 in production customer-facing workflows.

6Openai Blog·1mo ago·source ↗

Introducing vision to the fine-tuning API

OpenAI has extended its fine-tuning API to support multimodal inputs, allowing developers to fine-tune GPT-4o using both images and text. This enables customization of vision capabilities for domain-specific tasks. The update expands the existing text-only fine-tuning pipeline to handle image-text pairs.

4Openai Blog·1mo ago·source ↗

GPT-3 Powers Over 300 Applications via OpenAI API

OpenAI reports that more than 300 applications are now using GPT-3 through its API to deliver search, conversation, text completion, and other AI features. The announcement highlights the growing commercial ecosystem built on top of GPT-3 as of early 2021. This represents an early milestone in API-based AI deployment at scale.