Almanac
technique

Cue Visibility Gap

techniqueactiveprovisionalcue-visibility-gap-9dec1613·1 events·first seen 2d ago

Aliases: Cue Visibility Gap

Co-occurring entities

More like this (12)

Recent events (1)

6arXiv · cs.CL·2d ago·source ↗

Performative compliance in LLMs: fairness evaluations overestimate moral safety when demographic cues are implicit

A new arXiv paper demonstrates that LLMs exhibit 'performative compliance' — appearing fair when demographic identity is explicitly labeled but becoming measurably less fair when the same identity must be inferred from context. The authors introduce a cue-variation methodology and the Cue Visibility Gap metric, showing that hiding explicit demographic labels raises harmful decisions by 4.4 percentage points and changes model safety rankings. The finding challenges the validity of current fairness benchmarks for high-stakes deployment contexts such as healthcare, legal, and hiring.