Many Circuits, One Mechanism: Input Variation and Evaluation Granularity in Circuit Discovery
many-circuits-one-mechanism-input-variation-and-evaluation-granularity-in-circuit-discovery-f21170d1·1 events·first seen 12d agoAliases: Many Circuits, One Mechanism: Input Variation and Evaluation Granularity in Circuit Discovery
Co-occurring entities
More like this (12)
Recent events (1)
Phantom specialization in circuit discovery: structural differences don't imply distinct mechanisms
A new arXiv preprint challenges a core assumption in mechanistic interpretability: that structurally different circuits discovered for the same task imply distinct computational mechanisms. Using Literal Sequence Copying across token-frequency bands in five Pythia models (70M–1.4B), the authors extract 75 circuits and show that structurally distinct circuits implement the same computation, with band-specific edges transferring broadly and a shared core recovering ≥99% of circuit performance. The paper introduces the term 'phantom specialization' for this pattern and argues that standard source-level evaluation inflates apparent faithfulness, while edge-level evaluation and cross-condition transfer tests are needed to detect the many-to-one mapping from structure to function.