Almanac
other

speaker-attribute classification

otheractiveprovisionalspeaker-attribute-classification-2fada4c2·1 events·first seen 22d ago

Aliases: speaker-attribute classification

Co-occurring entities

More like this (12)

Recent events (1)

4arXiv · cs.CL·22d ago·source ↗

WhoSaidIt: Human-LLM Collaborative Annotation for Multilingual Speaker-Attribute Classification

This paper proposes a human-LLM collaborative re-annotation framework for stabilizing noisy multilingual speaker-attribute labels under resource constraints. LLMs surface recurring annotation rationales through iterative expert interaction, combined with disagreement-focused sampling for targeted re-annotation. The resulting WhoSaidIt dataset covers nine speaker-attribute labels across multiple languages. Benchmarking of recent LLMs reveals substantial cross-lingual annotation divergence and highlights both capabilities and limitations of LLMs in this classification task.