benchmark
hate-based rhetoric
benchmark · active · provisional
hate-based-rhetoric-33d5e552 · 1 event · first seen 22d ago
Aliases: hate-based rhetoric
More like this (12)
hate speech detection
From Self to Other: Evaluating Demographic Perspective-Taking in LLM Hate Speech Annotation
UC Berkeley Measuring Hate Speech Corpus
Global Project Against Hate and Extremism
A Resource for Enthymeme Detection in Controversial Political Discourse
geopolitical bias
adversarial examples
SkillHarm
disagreement-focused sampling
adversarial training
political bias evaluation
negative transfer
Recent events (1)
AI-Assisted Systematization for Evaluating GenAI Systems
This paper addresses a foundational gap in GenAI evaluation: the underspecification of broad, contested concepts like 'reasoning,' 'fairness,' or 'creativity.' The authors introduce a structured artifact called a 'concept spec' and a validation worksheet, then build two AI-assisted systematizers—a zero-shot approach and a multi-agent approach—to convert vague evaluation targets into measurable, structured accounts. They apply these tools to hate-based rhetoric and digital empathy, assessing the resulting specs on content validity and information recoverability. The work positions AI assistance as a scalable aid for the cognitively demanding process of evaluation design.