4Hugging Face Blog·1mo ago

Ethics and Society Newsletter #4: Bias in Text-to-Image Models

Hugging Face's Ethics and Society team publishes their fourth newsletter focusing on bias in text-to-image generative models. The piece examines how these models encode and reproduce societal biases in visual outputs, likely covering evaluation methods, documented failure modes, and mitigation approaches. As a Tier 2 commentary piece from a major ML platform, it contributes to ongoing discourse around fairness and safety in multimodal AI systems.

Evaluation and Benchmarking AI Safety Research Multimodal Progress Hugging Face Ethics and Society Team text-to-image models Hugging Face

Related guides (3)

Hugging Face

Hugging Face: The Home of Open-Source AI

Read asBeginner In-depth

AI Safety ResearchTopic guide

AI Safety Research: From Lab Policies to Real-World Flashpoints

Read asBeginner In-depth

Multimodal ProgressTopic guide

Multimodal Progress: How AI Learned to See, Hear, and Act

Read asBeginner In-depth

Related events (8)

3Hugging Face Blog·1mo ago·source ↗

Hugging Face Ethics and Society Newsletter #1

Hugging Face launched its first Ethics and Society Newsletter, signaling an institutional commitment to addressing ethical dimensions of AI/ML development. The newsletter likely covers topics such as bias, fairness, transparency, and responsible deployment of machine learning models. As a tier-2 source from a major open-weights platform, it reflects growing industry attention to AI ethics as a structured practice rather than an afterthought.

AI Safety Research Hugging Face Hugging Face Ethics and Society Newsletter

3Hugging Face Blog·1mo ago·source ↗

Ethics and Society Newsletter #3: Ethical Openness at Hugging Face

Hugging Face's Ethics and Society team publishes their third newsletter focusing on the concept of 'ethical openness' — the tension between open-source AI development and potential harms. The piece examines how openness in AI models and datasets intersects with safety, accountability, and responsible deployment. It reflects ongoing internal and community discourse at Hugging Face about balancing accessibility with risk mitigation.

Open Weights Progress AI Safety Research Hugging Face Ethics and Society Team Hugging Face

4Hugging Face Blog·1mo ago·source ↗

Ethics and Society Newsletter #6: Building Better AI: The Importance of Data Quality

Hugging Face's Ethics and Society team publishes their sixth newsletter focusing on data quality as a foundational concern for AI development. The piece addresses how training data composition, curation practices, and quality standards affect model behavior, safety, and societal impact. It situates data quality within broader responsible AI development frameworks.

Evaluation and Benchmarking AI Safety Research Hugging Face Ethics and Society Team Hugging Face

3Hugging Face Blog·1mo ago·source ↗

Ethics and Society Newsletter #5: Hugging Face Goes To Washington and Other Summer 2023 Musings

Hugging Face's Ethics and Society team reflects on their summer 2023 policy and advocacy activities, including engagement with Washington policymakers. The newsletter covers regulatory developments, AI ethics considerations, and the organization's positioning on AI governance. As a tier-2 source commentary piece, it offers perspective on how a major open-weights platform is engaging with the regulatory landscape.

Open Weights Progress Regulatory Developments Hugging Face Ethics and Society Team Hugging Face

4The Batch·1mo ago·source ↗

Abeba Birhane on Bias in Web-Scraped Training Datasets

Researcher Abeba Birhane examines how large-scale web-scraped datasets used to train trillion-parameter NLP and vision models propagate bias and antisocial content. The commentary highlights that performance gains in deep neural networks come alongside inherited societal biases from web training data. Two posts from The Batch summarize her work on cleaning up web datasets and the specific mechanisms by which NLP models absorb web-sourced biases.

Evaluation and Benchmarking AI Safety Research DeepLearning.AI Abeba Birhane The Batch

4Hugging Face Blog·1mo ago·source ↗

Evaluating Language Model Bias with 🤗 Evaluate

This Hugging Face blog post introduces tooling and methodology for evaluating bias in language models using the Evaluate library. It covers bias measurement approaches and how practitioners can apply them to assess fairness properties of LLMs. The post is oriented toward applied practitioners working with open-source models.

Evaluation and Benchmarking AI Safety Research Hugging Face Evaluate Hugging Face

5arXiv · cs.CL·46h ago·source ↗

StylisticBias benchmark reveals a small set of visual cues drives most social bias in MLLMs

Researchers introduce StylisticBias, a controlled benchmark of ~25K photorealistic face images with single-attribute variations designed to isolate how specific visual cues shift social judgments in multimodal LLMs. Evaluating six MLLMs across 25 binary social judgment scenarios, they find that age and body type dominate identity-level effects, while fashion style drives the largest attribute-level shifts, with ~15 attributes accounting for ~80% of total bias variation. The benchmark is released publicly on GitHub and Hugging Face, enabling fine-grained bias auditing of multimodal models.

Evaluation and Benchmarking AI Safety Research StylisticBias StylisticBias: A Few Human Visual Cues Drive Most Social Biases in MLLMs +1 more

4The Batch·19d ago·source ↗

Blind Users Can Use AI Models As Virtual Mirrors, But Don't Always Like What They See

Blind and visually impaired users are increasingly relying on vision-language models (notably GPT-4 Vision via Be My Eyes) to assess their own appearance, gaining independence but also encountering AI outputs that reflect conventional beauty standards and may be factually inaccurate. A BBC article by blind journalist Milagros Costabel documents cases where AI feedback was psychologically harmful, including unsolicited critical commentary on facial features. Psychologists warn that blind users are especially vulnerable because they cannot independently verify AI visual judgments. The piece raises broader questions about accuracy, trust calibration, and empathy in AI products designed for accessibility.

AI Safety Research Enterprise Deployment Patterns Envision AI University of Bristol GPT-4 Vision +9 more