technique
Alternating Token-Weighted Unlearning
techniqueactiveprovisional
alternating-token-weighted-unlearning-a17257f8·1 events·first seen 12d agoAliases: Alternating Token-Weighted Unlearning
Co-occurring entities
More like this (12)
Backdoor Unlearning Generalization: A Path Toward the Removal of Unknown Triggers in LLMsinference-time behavioural unlearningHierarchical Advantage Weighting for Online RL Fine-Tuning of VLAs from Sparse Episode OutcomesEntropy-Regularized Reinforcement LearningOperator LearningLanguage Modeling LossSparse Subspace-to-Expert Sharing for Task-Agnostic Continual LearningCLP: Collocation-Length Prediction for Zero-Loss Adaptive Multi-Token InferenceKL-regularized RLClass-Incremental LearningMulti-Task Learningmasked-token modeling
Recent events (1)
ATWU: Token-level importance learning improves LLM unlearning via retain-conflict criterion
This paper introduces Alternating Token-Weighted Unlearning (ATWU), a framework that learns which tokens in a forget sample are most relevant to unlearning by characterizing their conflict with the retain objective. Rather than relying on auxiliary models or heuristics, ATWU jointly learns token forget-specificity and model parameters using a lightweight linear scorer over hidden states. Evaluated on TOFU and RWKU benchmarks, ATWU achieves state-of-the-art forget-retain trade-offs and produces token-level scores that align with ground-truth forget-specific spans.