Be My Eyes Integrates GPT-4 for Visual Accessibility
Be My Eyes, a visual assistance app for blind and low-vision users, has integrated GPT-4 to enhance its accessibility capabilities. The partnership leverages GPT-4's multimodal vision features to provide richer, AI-powered visual interpretation for users. This represents an early real-world deployment of GPT-4's vision capabilities in an assistive technology context.
Related guides (3)
Related events (8)
GPT-4V(ision) System Card
OpenAI published the system card for GPT-4V(ision), the multimodal extension of GPT-4 that accepts image inputs alongside text. The document covers capability evaluations, safety assessments, and known limitations of the vision-enabled model. It represents OpenAI's formal safety and transparency disclosure accompanying the GPT-4V release.
ChatGPT can now see, hear, and speak
OpenAI announced multimodal capabilities for ChatGPT, enabling the model to process images (vision), listen to voice input, and respond with synthesized speech. These features expand ChatGPT beyond text-only interaction into a multimodal assistant experience. The rollout was announced for Plus and Enterprise users first, with broader availability to follow.
Blind Users Can Use AI Models As Virtual Mirrors, But Don't Always Like What They See
Blind and visually impaired users are increasingly relying on vision-language models (notably GPT-4 Vision via Be My Eyes) to assess their own appearance, gaining independence but also encountering AI outputs that reflect conventional beauty standards and may be factually inaccurate. A BBC article by blind journalist Milagros Costabel documents cases where AI feedback was psychologically harmful, including unsolicited critical commentary on facial features. Psychologists warn that blind users are especially vulnerable because they cannot independently verify AI visual judgments. The piece raises broader questions about accuracy, trust calibration, and empathy in AI products designed for accessibility.
GPT-4 Release
OpenAI released GPT-4, a large multimodal model accepting image and text inputs and producing text outputs. The model demonstrates human-level performance on various professional and academic benchmarks. It represents OpenAI's latest milestone in scaling deep learning.
Building smarter maps with GPT-4o vision fine-tuning
OpenAI published a case study on Grab using GPT-4o vision fine-tuning to improve map intelligence. The deployment demonstrates a real-world enterprise application of fine-tuned multimodal models for geospatial data processing. This represents a concrete example of GPT-4o's vision capabilities being adapted for domain-specific tasks in Southeast Asian markets.
Ada Uses GPT-4 to Deliver a New Customer Service Standard
Ada, a customer service platform, has integrated GPT-4 to power its AI-driven support capabilities. The announcement, published on OpenAI's blog, highlights the deployment of GPT-4 in an enterprise customer service context. This represents a concrete enterprise deployment case study for GPT-4 in production customer-facing workflows.
Introducing vision to the fine-tuning API
OpenAI has extended its fine-tuning API to support multimodal inputs, allowing developers to fine-tune GPT-4o using both images and text. This enables customization of vision capabilities for domain-specific tasks. The update expands the existing text-only fine-tuning pipeline to handle image-text pairs.
GPT-3 Powers Over 300 Applications via OpenAI API
OpenAI reports that more than 300 applications are now using GPT-3 through its API to deliver search, conversation, text completion, and other AI features. The announcement highlights the growing commercial ecosystem built on top of GPT-3 as of early 2021. This represents an early milestone in API-based AI deployment at scale.


