Pair Opt-dist

technique · active · provisional · pair-opt-dist-a5062e7f · 1 event · first seen 22d ago

Aliases: Pair Opt-dist

Recent events (1)

5 · arXiv · cs.LG · 22d ago

Active Query Synthesis for Preference Learning via Mutual Information Maximization

This paper introduces Info-Synth, an active query synthesis framework for preference learning that generates optimal pairwise queries by maximizing a mutual information objective in continuous space, bypassing the computational cost of pool-based evaluation. A confidence-aware response model is proposed to handle ambiguous comparisons between nearly identical or highly dissimilar items. Two finite-pool extensions (Pair M-dist and Pair Opt-dist) are also introduced. The framework is validated on synthetic preference tasks, text summarization datasets, and robotic controller tuning.
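The summary does not spell out how a mutual information objective scores a pairwise query, so a minimal generic sketch may help. It is not the paper's Info-Synth, Pair M-dist, or Pair Opt-dist procedure. It assumes a linear utility with a logistic (Bradley-Terry style) response model and a set of posterior samples over the utility weights, and it searches a small finite pool by brute force. The names pair_mutual_information, best_pair, and w_samples are hypothetical.

```python
import numpy as np

def binary_entropy(p):
    """Entropy (in nats) of a Bernoulli variable with success probability p."""
    p = np.clip(p, 1e-12, 1 - 1e-12)  # avoid log(0)
    return -(p * np.log(p) + (1 - p) * np.log(1 - p))

def pair_mutual_information(x_a, x_b, w_samples):
    """Estimate the mutual information between the answer to the query
    "is a preferred over b?" and the utility weights w.

    Assumption (not from the source): P(a preferred | w) = sigmoid(w . (x_a - x_b)),
    with w_samples an (S, d) array of posterior samples of w.
    """
    logits = w_samples @ (np.asarray(x_a) - np.asarray(x_b))
    p = 1.0 / (1.0 + np.exp(-logits))
    # MI = entropy of the averaged prediction minus the average per-sample entropy.
    return binary_entropy(p.mean()) - binary_entropy(p).mean()

def best_pair(items, w_samples):
    """Brute-force search over all pairs in a finite pool; returns ((i, j), score)."""
    items = np.asarray(items, dtype=float)
    best, best_score = None, -np.inf
    for i in range(len(items)):
        for j in range(i + 1, len(items)):
            score = pair_mutual_information(items[i], items[j], w_samples)
            if score > best_score:
                best, best_score = (i, j), score
    return best, best_score

# Two posterior samples that disagree about the sign of the first feature.
w_samples = np.array([[1.0, 0.0], [-1.0, 0.0]])
items = [[0.0, 0.0], [1.0, 0.0], [10.0, 0.0]]
pair, score = best_pair(items, w_samples)
print(pair, score)
```

Under these assumptions, a pair scores highly when the posterior samples confidently disagree about the answer, and it scores near zero when the two items are identical. The brute-force loop is quadratic in the pool size, which is the pool-based evaluation cost the summary says continuous-space synthesis avoids.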