Almanac
model

PARL

modelactiveprovisionalparl-372f1daf·1 events·first seen 16d ago

Aliases: PARL

Co-occurring entities

More like this (12)

Recent events (1)

5arXiv · cs.CL·16d ago·source ↗

PARL: Preference-Aware Rubric Learning for Personalized LLM Evaluation

This paper introduces PARL (Preference-Aware Rubric Learning), a framework that reframes personalized LLM evaluation as a learning problem rather than static judgment. PARL induces preference-aware evaluation rubrics from raw user interaction histories and uses a discriminative reinforcement learning objective to contrast user-authored responses against model outputs, capturing user-specific decision boundaries. Experiments on personalized text generation tasks show PARL produces high-fidelity rubrics that generalize across users and tasks, outperforming existing LLM-as-a-judge and automatic metric approaches.