technique

MAST

techniqueactiveprovisionalmast-4eda12c4·1 events·first seen 2d ago

Aliases: MAST

Co-occurring entities

Qwen3-1.7B-Base MATH Qwen2.5-Math-PRM GSM8K

More like this (12)

MATS MAMS MIST StreamMA MMAE MOSS MAI MAML MIT MA²P MCP MaFI

Recent events (1)

5arXiv · cs.AI·2d ago·source ↗

MAST: Mechanism-guided selective unlearning for RLVR-trained reasoning models

Researchers introduce MAST (Mechanism-Aligned Selective Targeting), a method for selectively unlearning capabilities induced by reinforcement learning from verifiable rewards (RLVR) in language models while minimizing collateral damage to retained knowledge. The approach ranks attention-projection tensors by off-principal energy and gradient coupling to identify a targeted subset for update, rather than applying full-parameter gradient ascent. Evaluated on Qwen2.5-Math-1.5B and Qwen3-1.7B-Base, MAST achieves statistically significant forgetting on target MATH problems while preserving GSM8K performance, whereas full-parameter unlearning collapses retained capabilities. The method generalizes across seeds and unlearning objectives (NPO/SimNPO).

AI Safety Research Alignment and RLHF Qwen3-1.7B-Base MATH MAST +2 more