Entity · technique

In-Context Reward Adaptation

techniqueactivein-context-reward-adaptation-44407f1e·1 events·first seen May 29, 2026

Aliases: In-Context Reward Adaptation

Co-occurring entities

Transformers Reinforcement Learning from Human Feedback human response time in-context learning

More like this (12)

Gradient-Guided Reward Optimization Multi-Task Bayesian In-Context Learning Observe-and-Act Adaptive Context Selection Reward Learning from Comparisons Improving LLM-Generated Process Model Quality Through Reinforcement Learning: The Role of Reward Function Design REAR: Test-time Preference Realignment through Reward Decomposition In-Context Multiple Instance Learning in-context learning REAlignment Reward Using Reward Uncertainty to Induce Diverse Behaviour in Reinforcement Learning Hybrid Reward Advantage Splitting LoRA (Low-Rank Adaptation)

Recent events (1)

6arXiv · cs.LG·May 29, 2026·source ↗

In-Context Reward Adaptation for Robust Preference Modeling

This paper proposes In-Context Reward Adaptation (ICRA), a transformer-based framework that infers reward structures from small sets of preference demonstrations at inference time, without retraining. The key finding is that standard transformers exhibit asymptotic bias toward ground-truth rewards, but incorporating human response time as an auxiliary signal resolves this limitation and enables generalization to unseen preference domains. The approach addresses a core limitation of static RLHF reward models, which fail to handle heterogeneous or shifting human value distributions.

Evaluation and Benchmarking Alignment and RLHF Transformers In-Context Reward Adaptation Reinforcement Learning from Human Feedback +2 more