TruthStance:一个关于Truth Social平台对话的标注数据集 / TruthStance: An Annotated Dataset of Conversations on Truth Social
1️⃣ 一句话总结
这篇论文创建并发布了一个名为TruthStance的大规模标注数据集,专门用于研究主流平台之外、在Truth Social上的对话结构、论点挖掘和立场检测,并评估了大型语言模型在这些任务上的表现。
Argument mining and stance detection are central to understanding how opinions are formed and contested in online discourse. However, most publicly available resources focus on mainstream platforms such as Twitter and Reddit, leaving conversational structure on alt-tech platforms comparatively under-studied. We introduce TruthStance, a large-scale dataset of Truth Social conversation threads spanning 2023-2025, consisting of 24,378 posts and 523,360 comments with reply-tree structure preserved. We provide a human-annotated benchmark of 1,500 instances across argument mining and claim-based stance detection, including inter-annotator agreement, and use it to evaluate large language model (LLM) prompting strategies. Using the best-performing configuration, we release additional LLM-generated labels for 24,352 posts (argument presence) and 107,873 comments (stance to parent), enabling analysis of stance and argumentation patterns across depth, topics, and users. All code and data are released publicly.
TruthStance:一个关于Truth Social平台对话的标注数据集 / TruthStance: An Annotated Dataset of Conversations on Truth Social
这篇论文创建并发布了一个名为TruthStance的大规模标注数据集,专门用于研究主流平台之外、在Truth Social上的对话结构、论点挖掘和立场检测,并评估了大型语言模型在这些任务上的表现。
源自 arXiv: 2602.14406