technique
In-Context Reward Adaptation
techniqueactiveprovisional
in-context-reward-adaptation-44407f1e·1 events·first seen 19d agoAliases: In-Context Reward Adaptation
Co-occurring entities
More like this (12)
Gradient-Guided Reward OptimizationObserve-and-Act Adaptive Context SelectionReward Learning from ComparisonsIn-Context Multiple Instance Learningin-context learningUsing Reward Uncertainty to Induce Diverse Behaviour in Reinforcement LearningHybrid Reward Advantage SplittingLoRA (Low-Rank Adaptation)Multi-Agent Reinforcement Learning from Delayed Marketplace Feedback for Objective-Weight Adaptation in Three-Sided DispatchReinforcement Learning Elicits Contextual Learning of Unseen Language TranslationScaling Laws for Reward Model Overoptimizationreward model
Recent events (1)
In-Context Reward Adaptation for Robust Preference Modeling
This paper proposes In-Context Reward Adaptation (ICRA), a transformer-based framework that infers reward structures from small sets of preference demonstrations at inference time, without retraining. The key finding is that standard transformers exhibit asymptotic bias toward ground-truth rewards, but incorporating human response time as an auxiliary signal resolves this limitation and enables generalization to unseen preference domains. The approach addresses a core limitation of static RLHF reward models, which fail to handle heterogeneous or shifting human value distributions.