Almanac
technique

CHAIR

techniqueactiveprovisionalchair-b98b7eca·1 events·first seen 6d ago

Aliases: CHAIR

Co-occurring entities

More like this (12)

Recent events (1)

4arXiv · cs.CL·6d ago·source ↗

CHAIR: Supervised hallucination detection via internal logit analysis across LLM layers

A new arXiv preprint introduces CHAIR (Classifier of Hallucination As ImproveR), a supervised framework that detects hallucinations by extracting statistical features (max, min, mean, std, slope) from token logits across all layers of an LLM. Evaluated on TruthfulQA and MMLU, CHAIR shows improved detection accuracy especially in zero-shot settings. The authors argue the approach also points toward richer internal representations for designing adaptive decoding strategies that reduce hallucinations.