Almanac
technique

Human Label Variation (HLV)

technique · active · provisional · human-label-variation-hlv--89d782ce · 1 event · first seen 20d ago

Aliases: Human Label Variation (HLV)

Recent events (1)

5 · arXiv · cs.CL · 20d ago

Cross-Annotator Preference Optimization (CAPO) for Learning Annotator-Specific Explanation Behavior

This paper investigates whether LLMs can learn and reproduce individual annotator-specific reasoning patterns, not just label choices, using two sentence-pair tasks (NLI and paraphrase judgment) with four annotators each. The authors find that annotator-specific patterns are weak at the single-annotation level but detectable after aggregation, and propose CAPO—a preference optimization method that contrasts a target annotator's response against other valid but less target-specific annotations. CAPO outperforms prompting and supervised fine-tuning baselines in capturing annotator-specific label-explanation behavior. The work suggests a path toward scalable annotation pipelines grounded in annotator histories rather than labels alone.
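
The contrast described above can be sketched as a preference-pair construction step. This is a minimal illustration rather than the authors' implementation: the `Annotation` record, the `build_preference_pairs` helper, and the "label: explanation" response format are all hypothetical, and the summary does not specify how CAPO selects or weights the contrasting annotations or which preference objective it optimizes. Here every other annotator's response to the same item is simply treated as the less target-specific alternative.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Annotation:
    """One annotator's label and free-text explanation for one sentence pair (hypothetical schema)."""
    item_id: str
    annotator: str
    label: str
    explanation: str


def build_preference_pairs(annotations, target):
    """Pair the target annotator's response (chosen) with every other
    annotator's response to the same item (rejected).

    Assumption: all non-target annotations count as valid but less
    target-specific; the paper may filter or weight them differently.
    """
    # Group annotations by the item they describe.
    by_item = {}
    for ann in annotations:
        by_item.setdefault(ann.item_id, []).append(ann)

    pairs = []
    for item_id, anns in by_item.items():
        chosen = [a for a in anns if a.annotator == target]
        rejected = [a for a in anns if a.annotator != target]
        # Items the target never annotated yield no pairs.
        for c in chosen:
            for r in rejected:
                pairs.append({
                    "item_id": item_id,
                    "chosen": f"{c.label}: {c.explanation}",
                    "rejected": f"{r.label}: {r.explanation}",
                })
    return pairs
```

Pairs in this chosen/rejected shape could then be passed to a preference-optimization trainer. Note that a rejected response may carry the same label as the chosen one and differ only in its explanation, which matches the summary's emphasis on label-explanation behavior rather than labels alone.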