菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-25
📄 Abstract - WhoSaidIt: Human-LLM Collaborative Annotation for Text-Based Multilingual Speaker-Attribute Classification

Annotating speaker attributes from text is inherently ambiguous, particularly in multilingual settings where demographic and social cues are implicit and culturally variable. We propose a human-large language model (LLM) collaborative re-annotation framework for stabilizing multilingual speaker-attribute labels under practical resource constraints. Starting from a noisy corpus, we use LLMs to surface recurring annotation rationales through iterative interaction with experts, and apply disagreement-focused sampling for targeted re-annotation. Using this framework, we construct WhoSaidIt, a multilingual dataset covering nine speaker-attribute labels. We quantify divergence between original and revised annotations, benchmark recent LLMs, and analyze the effect of explicit rationales on model behavior. Our results reveal substantial cross-lingual differences in annotation decisions and demonstrate both the strengths and limitations of LLMs in speaker-attribute classification.

顶级标签: llm natural language processing data
详细标签: annotation multilingual speaker attributes human-llm collaboration benchmark 或 搜索:

谁说的:基于人机协作的多语言文本说话人属性分类标注方法 / WhoSaidIt: Human-LLM Collaborative Annotation for Text-Based Multilingual Speaker-Attribute Classification


1️⃣ 一句话总结

本文提出一种人类与大语言模型协作的标注框架,通过迭代对话让模型提炼专家标注理由、并针对分歧样本重点修正,从而在资源有限的情况下更稳定地为多语言文本中的说话人属性(如性别、社会身份等)打标签,并构建了涵盖9种属性的多语言数据集WhoSaidIt,验证了该方法能有效捕捉跨语言标注差异,同时揭示了大模型在此类任务中的能力与局限。

源自 arXiv: 2605.26070