benchmark

LACUNA

benchmarkactiveprovisionallacuna-03fe131d·1 events·first seen 18h ago

Aliases: LACUNA

Co-occurring entities

OLMo LACUNA Allen Institute for AI

More like this (12)

LACUNA LLUMI LAVE Luna LIMA LAMBADA LLaDA LUCID LOCUS LASH LAMBDA LMCache

Recent events (1)

6arXiv · cs.LG·18h ago·source ↗

LACUNA testbed introduces ground-truth parameter-level evaluation for LLM unlearning

Researchers introduce LACUNA, the first unlearning testbed with ground-truth parameter-level localization, designed to evaluate whether LLM unlearning methods truly erase knowledge from model weights or merely suppress it at the output level. The testbed injects PII of synthetic individuals into predefined parameters of 1B and 7B OLMo-based models via masked continual pretraining, enabling direct measurement of localization precision. Benchmarking current SOTA unlearning methods reveals they are highly imprecise and vulnerable to resurfacing attacks despite strong output-level performance, while successful localization enables even simple gradient-based methods to achieve robust erasure. The work addresses a critical gap in unlearning evaluation methodology relevant to privacy compliance and AI safety.

Evaluation and Benchmarking AI Safety Research OLMo LACUNA LACUNA +1 more