Almanac
product

KVEraser

productactiveprovisionalkveraser-3b1d5b1e·1 events·first seen 30h ago

Aliases: KVEraser

More like this (12)

Recent events (1)

5arXiv · cs.CL·30h ago·source ↗

KVEraser: Learned KV cache editing for efficient localized context erasing in LLMs

KVEraser is a learned method for efficiently erasing specific spans from an LLM's KV cache without full recomputation of subsequent tokens. The approach replaces only the KV states of the erased interval with learned steering states, using a two-stage training pipeline of generic pre-training followed by task-specific fine-tuning. On contexts from 1K–32K tokens, KVEraser nearly matches full recomputation quality while incurring only 24% latency overhead versus a 17.6x increase for exact recomputation, with demonstrated generalization to long-document QA with harmful factual distractors.