CharacterEval
charactereval-22e19fc2 · 1 event · first seen 2d ago
Aliases: CharacterEval
Recent events (1)
Psy-CoT and RAPO: Psychology-grounded reasoning and role-aware RL for character-faithful role-playing agents
Researchers propose Psy-CoT, a chain-of-thought framework that decomposes role-playing reasoning into three psychology-grounded steps (Interaction Perception, Psychological Empathy, Logical Construction) so that agents generalize out of distribution rather than relying on surface mimicry. They also introduce Role-Aware Policy Optimization (RAPO), a reinforcement learning method that uses profile–token mutual information to weight gradients asymmetrically. RAPO targets a form of reward hacking in which generic phrases receive the same reward signal as role-specific ones. In experiments on CoSER, CharacterBench, and CharacterEval, Psy-CoT outperforms existing role-playing CoT methods and RAPO consistently beats GRPO across model scales. Together, the two contributions address a known failure mode of SFT-based role-playing agents and offer a targeted RL fix for reward model exploitation.
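The summary does not give RAPO's actual formulation, so the following is only a rough sketch of the general idea of weighting a policy-gradient signal by how strongly each token depends on the character profile. Everything in it is an assumption made for illustration: the function names, the use of pointwise mutual information computed as a difference of log-probabilities with and without the profile in context, the `tanh` squashing, the `alpha` parameter, and the choice to apply the extra weight only when the advantage is positive.

```python
import math
from typing import List


def profile_token_pmi(logp_with_profile: List[float],
                      logp_without_profile: List[float]) -> List[float]:
    """Hypothetical per-token score: log p(token | context, profile)
    minus log p(token | context). Higher means more profile-dependent."""
    return [a - b for a, b in zip(logp_with_profile, logp_without_profile)]


def asymmetric_weights(pmi: List[float], advantage: float,
                       alpha: float = 1.0) -> List[float]:
    """Assumed asymmetry: only positively rewarded responses get extra
    weight on profile-dependent tokens; otherwise weights stay uniform."""
    if advantage <= 0:
        return [1.0] * len(pmi)
    # Generic tokens (PMI <= 0) keep weight 1.0; role-specific ones get more.
    return [1.0 + alpha * math.tanh(max(score, 0.0)) for score in pmi]


def weighted_pg_loss(logp_tokens: List[float], weights: List[float],
                     advantage: float) -> float:
    """Token-weighted policy-gradient surrogate loss, averaged over tokens."""
    n = len(logp_tokens)
    return -sum(w * advantage * lp for w, lp in zip(weights, logp_tokens)) / n


# Toy example: the first token is generic, the second is role-specific.
with_profile = [-1.0, -0.5]
without_profile = [-1.0, -2.5]
pmi = profile_token_pmi(with_profile, without_profile)
weights = asymmetric_weights(pmi, advantage=1.0)
loss = weighted_pg_loss(with_profile, weights, advantage=1.0)
```

In this toy case the generic token keeps a weight of 1.0 while the role-specific token is upweighted, so a reward earned by the response is credited mostly to the tokens that actually reflect the profile. A uniform scheme such as plain GRPO would credit both tokens equally.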