Entity · dataset

CounterFact

datasetactivecounterfact-7e610b9a·2 events·first seen Jun 3, 2026

Aliases: CounterFact

Co-occurring entities

BGE Llama3-8B-Instruct Qwen3-4B MQuAKE When to Write and When to Suppress: Route-Specialized Dual Adapters for Memory-Assisted Knowledge Editing zsRE Qwen3-30B-A3B-Base Mixtral-8x7B-v0.1 Expert-Aware Causal Tracing of Factual Recall in Sparse MoE Language Models

More like this (12)

Counterfactual Editing counterfactual reasoning PolitiFact CommunityFact SciFact CORE (Contrastive Reflection)Counterfactual Report Coordinates CompVis ReContext ConTextual FACTR 2 Kontrast

Recent events (2)

4arXiv · cs.LG·Jun 15, 2026·source ↗

Dual-adapter routing system improves knowledge editing precision in LLMs

A new arXiv paper introduces a route-specialized dual-adapter architecture for knowledge editing in LLMs, separating the concerns of writing edits (edit adapter) and suppressing them when irrelevant (locality adapter). A relevance router gates which adapter is applied, addressing the locality problem in memory-assisted editing. Evaluated on CounterFact, zsRE, and MQuAKE benchmarks using Llama-3.1-8B-Instruct and Qwen3-8B, the method achieves best-in-class probability-preference accuracy across all three datasets. Ablations show the gain comes from the architectural separation rather than increased parameter capacity.

Evaluation and Benchmarking Alignment and RLHF BGE Llama3-8B-Instruct Qwen3-4B +4 more

5arXiv · cs.CL·Jun 3, 2026·source ↗

Expert-aware causal tracing of factual recall in sparse MoE language models

A new arXiv preprint extends causal tracing methodology to sparse mixture-of-experts (MoE) language models, asking which routed experts mediate factual recall rather than just which layers or feed-forward modules. Using CounterFact facts, the authors apply noise-corruption and clean-patch interventions to Qwen3-30B-A3B-Base and Mixtral-8x7B-v0.1, finding that expert-level localization is possible in the former (a single expert at layer 44) but requires multi-expert coalition recovery in the latter. The results indicate that factual localization in MoE models is model- and protocol-dependent rather than universal.

Evaluation and Benchmarking AI Safety Research Qwen3-30B-A3B-Base Mixtral-8x7B-v0.1 Expert-Aware Causal Tracing of Factual Recall in Sparse MoE Language Models +1 more