Almanac
paper

Exploring Extrinsic and Intrinsic Properties for Effective Reasoning with Code Interpreter

paperactiveprovisionalexploring-extrinsic-and-intrinsic-properties-for-effective-reasoning-with-code-interpreter-c776e222·1 events·first seen 32h ago

Aliases: Exploring Extrinsic and Intrinsic Properties for Effective Reasoning with Code Interpreter

More like this (12)

Recent events (1)

5arXiv · cs.CL·32h ago·source ↗

Systematic study of extrinsic and intrinsic properties for effective code interpreter reasoning in LLMs

Researchers investigate what behavioral properties make LLMs effective at reasoning with a Code Interpreter (CI), identifying two axes: extrinsic 'crucial tokens' and intrinsic 'cognitive behaviors' such as verification, backtracking, and backward chaining. Stronger CI reasoning models consistently exhibit higher prevalence of these properties. The paper shows that appending code-specific crucial tokens at inference time improves performance on mathematical, ordering, and optimization tasks, while augmenting training with cognitive behaviors improves SFT and RL performance in two of three evaluated models. The work also finds these behaviors reduce overthinking in incorrect responses and improve token efficiency.