Exploring Extrinsic and Intrinsic Properties for Effective Reasoning with Code Interpreter
exploring-extrinsic-and-intrinsic-properties-for-effective-reasoning-with-code-interpreter-c776e222·1 events·first seen 32h agoAliases: Exploring Extrinsic and Intrinsic Properties for Effective Reasoning with Code Interpreter
More like this (12)
Recent events (1)
Systematic study of extrinsic and intrinsic properties for effective code interpreter reasoning in LLMs
Researchers investigate what behavioral properties make LLMs effective at reasoning with a Code Interpreter (CI), identifying two axes: extrinsic 'crucial tokens' and intrinsic 'cognitive behaviors' such as verification, backtracking, and backward chaining. Stronger CI reasoning models consistently exhibit higher prevalence of these properties. The paper shows that appending code-specific crucial tokens at inference time improves performance on mathematical, ordering, and optimization tasks, while augmenting training with cognitive behaviors improves SFT and RL performance in two of three evaluated models. The work also finds these behaviors reduce overthinking in incorrect responses and improve token efficiency.