benchmark
StylisticBias
benchmarkactiveprovisional
stylisticbias-caf79838·1 events·first seen 47h agoAliases: StylisticBias
Co-occurring entities
More like this (12)
StylisticBias: A Few Human Visual Cues Drive Most Social Biases in MLLMsPerceptual Judgment Biaspolitical bias evaluationgeopolitical biasoptimism biasbehavioral-gradient validatorC4STYLITextual Gradient OptimizationHow Do Instructions Shape Speech? Cross-Attention Attribution for Style-Captioned Text-to-SpeechFreeStyleCreative Quality Alignment (CQA)AlignAtt
Recent events (1)
StylisticBias benchmark reveals a small set of visual cues drives most social bias in MLLMs
Researchers introduce StylisticBias, a controlled benchmark of ~25K photorealistic face images with single-attribute variations designed to isolate how specific visual cues shift social judgments in multimodal LLMs. Evaluating six MLLMs across 25 binary social judgment scenarios, they find that age and body type dominate identity-level effects, while fashion style drives the largest attribute-level shifts, with ~15 attributes accounting for ~80% of total bias variation. The benchmark is released publicly on GitHub and Hugging Face, enabling fine-grained bias auditing of multimodal models.