Almanac
benchmark

Waterbirds

benchmarkactiveprovisionalwaterbirds-c2311810·1 events·first seen 20d ago

Aliases: Waterbirds

Co-occurring entities

More like this (12)

Recent events (1)

6arXiv · cs.LG·20d ago·source ↗

Label-Free Bias Identification in Vision Models via Gradient Probes on Concept Decompositions

This paper introduces a post-hoc, label-free method for identifying spurious correlations in frozen vision classifiers without requiring bias annotations, group labels, or retraining. The approach applies non-negative matrix factorization to intermediate activations to extract interpretable concept vectors, then ranks them using a gradient-based bias estimator derived from misclassified examples. On Colored MNIST, Waterbirds, and CelebA benchmarks, the method recovers known spurious cues and improves worst-group accuracy by up to 17.9 percentage points on Waterbirds by suppressing top-ranked concepts at inference time. Notably, the method surfaces decision-relevant directions that do not always coincide with annotated attributes, offering both an auditing tool and a debiasing handle for deployed models.