Almanac
product

TuneJury

productactiveprovisionaltunejury-a1e274d3·1 events·first seen 33h ago

Aliases: TuneJury

Co-occurring entities

More like this (12)

Recent events (1)

4arXiv · cs.AI·33h ago·source ↗

TuneJury: Open pairwise reward model for text-to-music preference alignment

Researchers introduce TuneJury, an open-source instance-level pairwise reward model for text-to-music generation that predicts preference scores from text prompts and audio clips. The model is trained on publicly available human-preference labels spanning arena votes, crowdsourced comparisons, and expert ratings. A post-hoc anchor calibration method enables efficient adaptation to new generators without full retraining. The reward model drives gains across best-of-N selection, latent optimization, and expert-iteration post-training.