Entity · technique

Cross-Annotator Preference Optimization (CAPO)

techniqueactivecross-annotator-preference-optimization-capo--f4757030·1 events·first seen May 28, 2026

Aliases: Cross-Annotator Preference Optimization (CAPO)

Co-occurring entities

Human Label Variation (HLV)Natural Language Inference Paraphrase Judgment supervised fine-tuning

More like this (12)

Inter-Annotator Agreement annotator disagreement Task Decomposition for Efficient Annotation human-LLM collaborative annotation Adaptive Clip Policy Optimization Constraint-Aware Counterfactual Editing for Aspect-Based Sentiment Analysis Automatic Post-Editing (APE)APPO: Agentic Procedural Policy Optimization cross-attention adapter plannotator Observe-and-Act Adaptive Context Selection Identity Preference Optimization

Recent events (1)

5arXiv · cs.CL·May 28, 2026·source ↗

Cross-Annotator Preference Optimization (CAPO) for Learning Annotator-Specific Explanation Behavior

This paper investigates whether LLMs can learn and reproduce individual annotator-specific reasoning patterns, not just label choices, using two sentence-pair tasks (NLI and paraphrase judgment) with four annotators each. The authors find that annotator-specific patterns are weak at the single-annotation level but detectable after aggregation, and propose CAPO—a preference optimization method that contrasts a target annotator's response against other valid but less target-specific annotations. CAPO outperforms prompting and supervised fine-tuning baselines in capturing annotator-specific label-explanation behavior. The work suggests a path toward scalable annotation pipelines grounded in annotator histories rather than labels alone.

Evaluation and Benchmarking Alignment and RLHF Cross-Annotator Preference Optimization (CAPO)Human Label Variation (HLV)Natural Language Inference +2 more