Entity · technique

Plausibility Evaluation

technique · active · plausibility-evaluation-92633037 · 1 event · first seen Jun 1, 2026

Aliases: Plausibility Evaluation

Co-occurring entities

Token-level Rationales · Faithfulness Evaluation · hate speech detection · Soft Label Supervision

More like this (12)

From Plausible to Actionable: A Position on LLM Self-Explanations
Faithfulness Evaluation
Cranfield evaluation paradigm
Can LLMs Judge Better Than They Generate? Evaluating Task Asymmetry, Mechanistic Interpretability and Transferability for In-Context QA
Does Bielik Know What It Doesn't Know? Activation Dispersion Separates Entity Familiarity from Factual Reliability Across Model Scale
Grounding LLM Reasoning under Incomplete Graph Evidence
CheckRLM: Effective Knowledge-Thought Coherence Checking in Retrieval-Augmented Reasoning
PoPE (Popperian Placebo-controlled Evaluation)
ProvenanceGuard: Source-Aware Factuality Verification for MCP-Based LLM Agents
Pragmatic Reasoning
Evidence-Grounded F1
false-premise detection

Recent events (1)

arXiv · cs.CL · Jun 1, 2026

Disagreeing Rationales: Rethinking Classification and Explainability Evaluation in Hate Speech Detection

This paper investigates human disagreement in token-level rationale annotations for hate speech detection, a dimension less studied than label disagreement. The authors unify diverse models, training strategies, loss functions, and evaluation metrics under a single protocol, systematically comparing hard and soft label/rationale representation spaces. Results show that both hard and soft metrics favor softer representations, suggesting that soft supervision better captures human reasoning variation in subjective NLP tasks. The work calls for rethinking evaluation frameworks for classification and explainability in subjective NLP.
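The contrast between hard and soft rationale representations can be illustrated with a minimal sketch. It is not the paper's protocol: the function names, the 0.5 threshold, the toy annotations, and the choice of token-level F1 (hard) and mean absolute error (soft) as plausibility scores are all assumptions made for illustration. Plausibility here means agreement between a model's rationale and human rationale annotations.

```python
from typing import List


def soft_rationale(annotations: List[List[int]]) -> List[float]:
    """Average binary token masks from several annotators.

    Each inner list is one annotator's mask over the same tokens
    (1 = token marked as part of the rationale). The result is the
    per-token proportion of annotators who marked it.
    """
    n = len(annotations)
    return [sum(column) / n for column in zip(*annotations)]


def hard_rationale(annotations: List[List[int]], threshold: float = 0.5) -> List[int]:
    """Collapse annotator masks into one binary mask.

    A token is kept when at least `threshold` of annotators marked it
    (the threshold value is an illustrative assumption).
    """
    return [1 if p >= threshold else 0 for p in soft_rationale(annotations)]


def token_f1(pred: List[int], gold: List[int]) -> float:
    """Token-level F1 between a predicted and a gold binary mask."""
    tp = sum(1 for p, g in zip(pred, gold) if p == 1 and g == 1)
    if tp == 0:
        return 0.0
    precision = tp / sum(pred)
    recall = tp / sum(gold)
    return 2 * precision * recall / (precision + recall)


def mean_abs_error(pred_scores: List[float], soft_gold: List[float]) -> float:
    """Mean absolute difference between model scores and soft gold proportions."""
    return sum(abs(p - g) for p, g in zip(pred_scores, soft_gold)) / len(soft_gold)


# Toy data (invented): three annotators, four tokens.
annotations = [
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [1, 1, 1, 0],
]
# Hypothetical per-token importance scores from a model.
model_scores = [0.9, 0.4, 0.3, 0.1]
model_mask = [1 if s >= 0.5 else 0 for s in model_scores]

hard_plausibility = token_f1(model_mask, hard_rationale(annotations))
soft_plausibility_error = mean_abs_error(model_scores, soft_rationale(annotations))
```

In this toy case the hard score penalizes the model for missing the second token entirely, while the soft score credits it for assigning that token a middling weight, which is the kind of difference a hard versus soft comparison is meant to expose.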

Evaluation and Benchmarking · Alignment and RLHF · Token-level Rationales · Faithfulness Evaluation · Plausibility Evaluation