Entity · paper

Expert-Aware Causal Tracing of Factual Recall in Sparse MoE Language Models

paperactiveexpert-aware-causal-tracing-of-factual-recall-in-sparse-moe-language-models-255ffff8·1 events·first seen Jun 3, 2026

Aliases: Expert-Aware Causal Tracing of Factual Recall in Sparse MoE Language Models

Co-occurring entities

Qwen3-30B-A3B-Base Mixtral-8x7B-v0.1 CounterFact

More like this (12)

Attention Amnesia in Hybrid LLMs: When CoT Fine-Tuning Breaks Long-Range Recall, and How to Fix It Knowledgeless Language Models: Suppressing Parametric Recall for Evidence-Grounded Language Modeling Tying the Loop -- Tied Expert Layers in Mixture-of-Experts Language Models Same Evidence, Different Target: Decoding How Diagnostic Evidence Bears on Causal Questions from Language-Model States Understanding the Impact of Linguistic Realization Choices on LLM Stance with Causal Tracing Evaluation Awareness Is Not One Capability: Evidence from Open Language Models Towards Mechanistically Understanding Why Memorized Knowledge Fails to Generalize in Large Language Model Finetuning Recalling Too Well: Sycophancy Evaluation and Mitigation in Memory-Augmented Models Reasoning Language Models Cognitive Episodes in LLM Reasoning Traces Enable Interpretable Human Item Difficulty Prediction Exposure is Optional: Learning Unlike Coordination in Language Models From Observation to Intervention: A Causal Audit of Expert Importance in Mixture-of-Experts Models

Recent events (1)

5arXiv · cs.CL·Jun 3, 2026·source ↗

Expert-aware causal tracing of factual recall in sparse MoE language models

A new arXiv preprint extends causal tracing methodology to sparse mixture-of-experts (MoE) language models, asking which routed experts mediate factual recall rather than just which layers or feed-forward modules. Using CounterFact facts, the authors apply noise-corruption and clean-patch interventions to Qwen3-30B-A3B-Base and Mixtral-8x7B-v0.1, finding that expert-level localization is possible in the former (a single expert at layer 44) but requires multi-expert coalition recovery in the latter. The results indicate that factual localization in MoE models is model- and protocol-dependent rather than universal.

Evaluation and Benchmarking AI Safety Research Qwen3-30B-A3B-Base Mixtral-8x7B-v0.1 Expert-Aware Causal Tracing of Factual Recall in Sparse MoE Language Models +1 more