Entity · technique

DITTO

technique · active · ditto-fcadc5c0 · 1 event · first seen Jun 16, 2026

Aliases: DITTO

Co-occurring entities

Bradley-Terry · TuneJury

More like this (12)

DiT DOCCI DONDO DIVE DiT-XL DeiT DiT-Reward TunerDiT DINO Q-DIBA DART DAgger

Recent events (1)

4 · arXiv · cs.AI · Jun 16, 2026

TuneJury: Open pairwise reward model for text-to-music preference alignment

Researchers introduce TuneJury, an open-source instance-level pairwise reward model for text-to-music generation that predicts preference scores from text prompts and audio clips. The model is trained on publicly available human-preference labels spanning arena votes, crowdsourced comparisons, and expert ratings. A post-hoc anchor calibration method enables efficient adaptation to new generators without full retraining. The reward model drives gains across best-of-N selection, latent optimization, and expert-iteration post-training.

Alignment and RLHF · Multimodal Progress · DITTO · Bradley-Terry · TuneJury