Entity · benchmark

hate-based rhetoric

benchmark · active · hate-based-rhetoric-33d5e552 · 1 event · first seen May 26, 2026

Aliases: hate-based rhetoric

Co-occurring entities

concept spec · digital empathy · multi-agent systematizer · AI-Assisted Systematization for Evaluating GenAI Systems · zero-shot systematizer

More like this (12)

hate speech detection · From Self to Other: Evaluating Demographic Perspective-Taking in LLM Hate Speech Annotation · UC Berkeley Measuring Hate Speech Corpus · Global Project Against Hate and Extremism · adversarial pragmatics · Beyond Benchmarks: Exposing the Hidden Crisis in Bangla Hate Speech Detection · How Temperature Shapes Ideological Discourse in Retrieval-Augmented Generation? · A Resource for Enthymeme Detection in Controversial Political Discourse · geopolitical bias · Rhetor · Innocuous-Seeming Data, Latent Ideology: Ideological Generalisation in Finetuned LLMs · adversarial examples

Recent events (1)

6 · arXiv · cs.CL · May 26, 2026

AI-Assisted Systematization for Evaluating GenAI Systems

This paper addresses a foundational gap in GenAI evaluation: the underspecification of broad, contested concepts like 'reasoning,' 'fairness,' or 'creativity.' The authors introduce a structured artifact called a 'concept spec' and a validation worksheet, then build two AI-assisted systematizers—a zero-shot approach and a multi-agent approach—to convert vague evaluation targets into measurable, structured accounts. They apply these tools to hate-based rhetoric and digital empathy, assessing the resulting specs on content validity and information recoverability. The work positions AI assistance as a scalable aid for the cognitively demanding process of evaluation design.

Evaluation and Benchmarking · AI Safety Research · hate-based rhetoric · concept spec · digital empathy