Entity · model

Qwen3-30B-A3B-Base

model · active · qwen3-30b-a3b-base-c9d40822 · 1 event · first seen Jun 3, 2026

Aliases: Qwen3-30B-A3B-Base

Co-occurring entities

Mixtral-8x7B-v0.1 · Expert-Aware Causal Tracing of Factual Recall in Sparse MoE Language Models · CounterFact

More like this (12)

Qwen3.5-35B-A3B-Base · Qwen3-30B · Qwen3-8B-Base · Qwen3-30B-A3B · Qwen3.6-35B-A3B · Qwen3.5-35B-A3B · Qwen3-30B-A3B-Instruct · Qwen3-14B-Base · Qwen3-4B-Base · Qwen3.5-2B-Base · Qwen3.6-27B · Qwen3.5-122B-A10B

Recent events (1)

5 · arXiv · cs.CL · Jun 3, 2026

Expert-aware causal tracing of factual recall in sparse MoE language models

A new arXiv preprint extends causal tracing to sparse mixture-of-experts (MoE) language models, asking which routed experts mediate factual recall rather than only which layers or feed-forward modules do. Using CounterFact facts, the authors apply noise-corruption and clean-patch interventions to Qwen3-30B-A3B-Base and Mixtral-8x7B-v0.1. In Qwen3-30B-A3B-Base, recall can be localized to a single expert at layer 44; in Mixtral-8x7B-v0.1, recovery requires a coalition of multiple experts. The results indicate that factual localization in MoE models depends on the model and the tracing protocol rather than being universal.
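To make the noise-corruption and clean-patch procedure concrete, here is a minimal sketch on a toy top-k MoE stack. It is an illustration under stated assumptions, not the preprint's implementation: `ToyMoE`, `trace_experts`, the random weights, and all sizes are hypothetical, and neither Qwen3-30B-A3B-Base nor Mixtral-8x7B-v0.1 is loaded. The sketch records each expert's contribution on a clean input, corrupts the input with Gaussian noise, then restores one clean expert contribution at a time and measures how much the target probability recovers.

```python
import math
import random

def softmax(z):
    m = max(z)
    exps = [math.exp(v - m) for v in z]
    total = sum(exps)
    return [v / total for v in exps]

def matvec(mat, vec):
    return [sum(w * v for w, v in zip(row, vec)) for row in mat]

class ToyMoE:
    """Hypothetical toy stack of top-k MoE layers with random weights."""

    def __init__(self, n_layers=3, n_experts=4, d=8, vocab=5, top_k=2, seed=0):
        rng = random.Random(seed)

        def rand(rows, cols, scale=1.0):
            return [[rng.gauss(0, scale) for _ in range(cols)] for _ in range(rows)]

        self.n_layers, self.n_experts, self.top_k = n_layers, n_experts, top_k
        self.router = [rand(n_experts, d) for _ in range(n_layers)]
        self.experts = [[rand(d, d, d ** -0.5) for _ in range(n_experts)]
                        for _ in range(n_layers)]
        self.readout = rand(vocab, d)

    def forward(self, x, patch=None):
        """Return (token probabilities, per-layer per-expert contributions).

        patch = (layer, expert, contribution) overrides that one expert's
        gated output with the supplied vector.
        """
        h = list(x)
        cache = []
        for l in range(self.n_layers):
            scores = matvec(self.router[l], h)
            chosen = sorted(range(self.n_experts), key=lambda e: scores[e])[-self.top_k:]
            gates = softmax([scores[e] for e in chosen])
            # Experts that are not routed to contribute a zero vector.
            contrib = [[0.0] * len(h) for _ in range(self.n_experts)]
            for g, e in zip(gates, chosen):
                contrib[e] = [g * math.tanh(v) for v in matvec(self.experts[l][e], h)]
            if patch is not None and patch[0] == l:
                contrib[patch[1]] = list(patch[2])
            # Residual update: add the summed expert contributions.
            h = [hv + sum(c[i] for c in contrib) for i, hv in enumerate(h)]
            cache.append(contrib)
        return softmax(matvec(self.readout, h)), cache

def trace_experts(model, x, target, noise_std=1.0, seed=1):
    """Per-(layer, expert) recovery of the target probability after patching."""
    rng = random.Random(seed)
    p_clean, clean_cache = model.forward(x)                  # clean run
    x_noisy = [v + rng.gauss(0, noise_std) for v in x]       # noise corruption
    p_corrupt, _ = model.forward(x_noisy)                    # corrupted run
    effect = [[0.0] * model.n_experts for _ in range(model.n_layers)]
    for l in range(model.n_layers):
        for e in range(model.n_experts):
            # Clean patch: restore a single expert's clean contribution.
            p_patched, _ = model.forward(x_noisy, patch=(l, e, clean_cache[l][e]))
            effect[l][e] = p_patched[target] - p_corrupt[target]
    return p_clean[target], p_corrupt[target], effect

model = ToyMoE()
p_clean, p_corrupt, effect = trace_experts(model, [1.0] * 8, target=0)
for layer, row in enumerate(effect):
    print(layer, [round(v, 4) for v in row])
```

In this toy setting, a single large entry in `effect` would correspond to single-expert localization, while a case where no single patch recovers much would suggest patching several experts jointly. That extension (coalition patching) is not shown here.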

Evaluation and Benchmarking · AI Safety Research · Qwen3-30B-A3B-Base · Mixtral-8x7B-v0.1 · Expert-Aware Causal Tracing of Factual Recall in Sparse MoE Language Models