Entity · benchmark

eating disorder safety evaluation

benchmarkactiveeating-disorder-safety-evaluation-66513209·1 events·first seen Jun 2, 2026

Aliases: eating disorder safety evaluation

Co-occurring entities

clinical ED experts large language models

More like this (12)

MedDDC-Eval clinical ED experts G-Eval PsychoSafe STAGE-Eval Behavioral-SafetyBench AIriskEval-edu Demo L-Eval joint safety evaluation proactive assistance evaluation ARC Evals Cranfield evaluation paradigm

Recent events (1)

5arXiv · cs.CL·Jun 2, 2026·source ↗

Systematic Evaluation of LLM Safety Failures on Eating Disorder Queries with Clinician Feedback

This paper investigates how LLMs respond to queries from users with eating disorders, finding that specific linguistic cues in prompts increase the likelihood of unsafe model responses. Working with clinical ED experts, the authors systematically vary risk levels in user prompts to measure the extent to which LLMs uncritically adapt to potentially dangerous inputs. The study highlights a gap between perceived model safety and actual harm facilitation in sensitive health contexts.

Evaluation and Benchmarking AI Safety Research clinical ED experts large language models eating disorder safety evaluation