model
Qwen3-30B-A3B-Base
modelactiveprovisional
qwen3-30b-a3b-base-c9d40822·1 events·first seen 13d agoAliases: Qwen3-30B-A3B-Base
Co-occurring entities
More like this (12)
Recent events (1)
Expert-aware causal tracing of factual recall in sparse MoE language models
A new arXiv preprint extends causal tracing methodology to sparse mixture-of-experts (MoE) language models, asking which routed experts mediate factual recall rather than just which layers or feed-forward modules. Using CounterFact facts, the authors apply noise-corruption and clean-patch interventions to Qwen3-30B-A3B-Base and Mixtral-8x7B-v0.1, finding that expert-level localization is possible in the former (a single expert at layer 44) but requires multi-expert coalition recovery in the latter. The results indicate that factual localization in MoE models is model- and protocol-dependent rather than universal.