Almanac
← Events
4The Batch (DeepLearning.AI)·19d ago

Blind Users Can Use AI Models As Virtual Mirrors, But Don't Always Like What They See

Blind and visually impaired users are increasingly relying on vision-language models (notably GPT-4 Vision via Be My Eyes) to assess their own appearance, gaining independence but also encountering AI outputs that reflect conventional beauty standards and may be factually inaccurate. A BBC article by blind journalist Milagros Costabel documents cases where AI feedback was psychologically harmful, including unsolicited critical commentary on facial features. Psychologists warn that blind users are especially vulnerable because they cannot independently verify AI visual judgments. The piece raises broader questions about accuracy, trust calibration, and empathy in AI products designed for accessibility.

Related guides (3)

Related events (8)

5Openai Blog·1mo ago·source ↗

Be My Eyes Integrates GPT-4 for Visual Accessibility

Be My Eyes, a visual assistance app for blind and low-vision users, has integrated GPT-4 to enhance its accessibility capabilities. The partnership leverages GPT-4's multimodal vision features to provide richer, AI-powered visual interpretation for users. This represents an early real-world deployment of GPT-4's vision capabilities in an assistive technology context.

4Hugging Face Blog·1mo ago·source ↗

Ethics and Society Newsletter #4: Bias in Text-to-Image Models

Hugging Face's Ethics and Society team publishes their fourth newsletter focusing on bias in text-to-image generative models. The piece examines how these models encode and reproduce societal biases in visual outputs, likely covering evaluation methods, documented failure modes, and mitigation approaches. As a Tier 2 commentary piece from a major ML platform, it contributes to ongoing discourse around fairness and safety in multimodal AI systems.

3Simon Willison'S Weblog·1mo ago·source ↗

Your AI Use Is Breaking My Brain

Simon Willison comments on the phenomenon of AI-generated or AI-assisted content degrading the quality of online discourse and information environments. The piece reflects on how widespread AI use is affecting the experience of consuming internet content. This is a commentary piece from a prominent developer/blogger on the social and epistemic effects of AI proliferation.

4One Useful Thing·1mo ago·source ↗

Giving your AI a Job Interview

This commentary piece argues that as AI-generated advice becomes more consequential, users need systematic methods to evaluate AI reliability and quality—analogous to a job interview process. The author proposes frameworks for assessing AI outputs before trusting them for important decisions. The piece addresses the practical challenge of calibrating trust in AI systems across different use cases.

5Google Deepmind Blog·1mo ago·source ↗

Teaching AI to See the World More Like We Do

DeepMind has published a new research paper analyzing how AI systems organize and perceive the visual world differently from humans. The work examines the gap between human visual cognition and current AI visual representations. The research aims to understand and potentially close the perceptual alignment gap between human and machine vision.

4One Useful Thing·1mo ago·source ↗

Against "Brain Damage": AI's Effect on Human Thinking

This commentary from One Useful Thing examines whether AI use helps or harms human cognitive capabilities. The piece engages with the ongoing debate about whether reliance on AI tools degrades or augments human thinking. It likely addresses concerns about cognitive offloading and the conditions under which AI assistance is beneficial versus detrimental.

6arXiv · cs.CL·19d ago·source ↗

Vision-Language Models Suppress Female Representations Under Ambiguous Input

This paper investigates gender bias in vision-language models (VLMs) when inputs are ambiguous (e.g., workers in full gear or seen from behind), finding that models default to male outputs even for strongly female-stereotyped occupations. The authors introduce LALS (Latent Association Leaning Score), a zero-shot metric that probes internal visual-token activations to measure concept associations across layers. Across 15 occupations, 800+ ambiguous images, and four VLMs, they find a systematic decoupling: models internally encode female associations but suppress them before generation, with male signals amplifying end-to-end while female signals peak mid-network and are filtered out. Cultural visual cues like clothing color further modulate these internal associations.

5Openai Blog·1mo ago·source ↗

Lessons learned on language model safety and misuse

OpenAI published a post summarizing their evolving thinking on language model safety and misuse in deployed systems. The piece is intended to share lessons with other AI developers facing similar challenges. It covers OpenAI's internal approaches to mitigating harmful outputs and misuse patterns observed in production.