Entity · paper

AMEL

paperactiveamel-0241e494·1 events·first seen May 22, 2026

Aliases: AMEL

Co-occurring entities

Claude Opus 4.6 Google Claude Haiku 4.5 LLM-as-a-Judge accumulated message effect OpenAI GPT-5.5 Anthropic

More like this (12)

AMIA AMALIA ADEME AMRS ASAM ELMOD MERL AMP AIME ALX MMAE AMARIS

Recent events (1)

7arXiv · cs.CL·May 22, 2026·source ↗

AMEL: Accumulated Message Effects Bias LLM Judgments in Multi-Turn Evaluation Pipelines

This paper introduces AMEL (Accumulated Message Effect on LLM Judgments), documenting that prior conversation history with predominantly positive or negative evaluations systematically biases subsequent LLM judgments toward the prevailing polarity. Across 75,898 API calls to 11 models from 4 providers, the effect is statistically robust (d = -0.17, p < 10^-46), concentrates on high-uncertainty items, and shows a negativity asymmetry where negative histories induce 1.62x more bias than positive ones. Critically, the bias does not grow with context length, scaling reduces but does not eliminate it, and the simplest mitigation is using a fresh context per evaluation item.

Evaluation and Benchmarking AI Safety Research Claude Opus 4.6 Google Claude Haiku 4.5 +7 more