Entity · benchmark

multi-hop reasoning

benchmark · active · multi-hop-reasoning-f30191a3 · 1 event · first seen Jun 1, 2026

Aliases: multi-hop reasoning

Co-occurring entities

Transformers · Rotary Position Embedding (RoPE) · symbolic attention heads · GPT-J · discrepancy metric · positional attention heads

More like this (12)

Multi-hop Question Answering · Multi-hop Graph Retrieval · hybrid reasoning · Modality-Informed Reciprocal Reasoning Optimization · Message Passing Enables Efficient Reasoning · Multilingual Reasoning Cascades Need More Context · latent reasoning · spatio-temporal dynamic reasoning · Native Active Perception as Reasoning for Omni-Modal Understanding · 2WikiMultiHopQA · Reasoning as Pattern Matching: Shared Mechanisms in Human and LLM Everyday Reasoning · Reasoning Enhancement

Recent events (1)

6 · arXiv · cs.LG · Jun 1, 2026

Positional vs. Symbolic Attention Heads: Learning Dynamics, RoPE Geometry, and Length Generalization

Researchers train a decoder-only Transformer (GPT-J) on two structurally equivalent multi-hop reasoning tasks to study how attention heads specialize into positional or symbolic roles during learning. Successful task learning correlates with the emergence of 'pure' heads, which are exclusively positional or exclusively symbolic. The authors give theoretical constructions showing how single-layer RoPE-based attention realizes both functions geometrically. A novel 'discrepancy' metric formalizes the difference in robustness between the two head types: symbolic mechanisms extrapolate to longer sequences more reliably than positional ones. The findings bear on length generalization failures in RoPE-based models.
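
For readers unfamiliar with the geometry mentioned in the summary, the sketch below illustrates the generic, well-known property of RoPE that makes offset-based (positional) attention possible: after rotation, the query-key logit depends only on the relative offset between the two positions. This is not the paper's construction. The helper names `rope` and `score`, the 4-dimensional vectors, and the base of 10000 are assumptions chosen for illustration.

```python
import math

def rope(vec, pos, base=10000.0):
    """Rotate consecutive pairs of `vec` by position-dependent angles (standard RoPE)."""
    d = len(vec)
    out = []
    for i in range(0, d, 2):
        # Pair j = i // 2 uses frequency base^(-2j/d), which equals base^(-i/d).
        theta = pos * base ** (-i / d)
        c, s = math.cos(theta), math.sin(theta)
        x, y = vec[i], vec[i + 1]
        out.extend([x * c - y * s, x * s + y * c])
    return out

def score(q, k, m, n):
    """Pre-softmax attention logit for a query at position m and a key at position n."""
    return sum(a * b for a, b in zip(rope(q, m), rope(k, n)))

# Hypothetical query and key vectors, fixed for the demonstration.
q = [0.3, -1.2, 0.7, 0.5]
k = [1.0, 0.4, -0.6, 0.9]

# Same offset (m - n = 3) at very different absolute positions.
near = score(q, k, 5, 2)
far = score(q, k, 105, 102)

# A different offset (m - n = 1) from the same query position.
other = score(q, k, 5, 4)

print(near, far, other)
```

Here `near` and `far` agree up to floating-point error, while `other` differs. The logit is a function of the offset alone, so a head can in principle select tokens by relative position regardless of content. Whether that behaviour holds up on sequences longer than those seen in training is the question the summarized paper examines.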

Long Context Evolution · Evaluation and Benchmarking · Transformers · multi-hop reasoning · Rotary Position Embedding (RoPE) · +5 more