Entity · benchmark

Waterbirds

benchmarkactivewaterbirds-c2311810·1 events·first seen May 28, 2026

Aliases: Waterbirds

Co-occurring entities

Colored MNIST CelebA Bias Leaves a Gradient Trail Non-Negative Matrix Factorization

More like this (12)

BIRD Glasswing Ornith-1.0 BIRDNet goose BigBird PipeDream-2BW Gray Swan Nature Portfolio Canary Windsurf WhaleSpotter

Recent events (1)

6arXiv · cs.LG·May 28, 2026·source ↗

Label-Free Bias Identification in Vision Models via Gradient Probes on Concept Decompositions

This paper introduces a post-hoc, label-free method for identifying spurious correlations in frozen vision classifiers without requiring bias annotations, group labels, or retraining. The approach applies non-negative matrix factorization to intermediate activations to extract interpretable concept vectors, then ranks them using a gradient-based bias estimator derived from misclassified examples. On Colored MNIST, Waterbirds, and CelebA benchmarks, the method recovers known spurious cues and improves worst-group accuracy by up to 17.9 percentage points on Waterbirds by suppressing top-ranked concepts at inference time. Notably, the method surfaces decision-relevant directions that do not always coincide with annotated attributes, offering both an auditing tool and a debiasing handle for deployed models.

Evaluation and Benchmarking AI Safety Research Colored MNIST Waterbirds CelebA +2 more