多视角性作为叙事相似性预测的资源 / Multiperspectivity as a Resource for Narrative Similarity Prediction
1️⃣ 一句话总结
这篇论文提出,与其将解读叙事时的多种合理视角视为挑战,不如利用由不同角色大语言模型组成的“评审团”来整合这些多视角判断,从而更有效地预测叙事相似性,并揭示了当前单一标准答案评估框架的局限性。
Predicting narrative similarity can be understood as an inherently interpretive task: different, equally valid readings of the same text can produce divergent interpretations and thus different similarity judgments, posing a fundamental challenge for semantic evaluation benchmarks that encode a single ground truth. Rather than treating this multiperspectivity as a challenge to overcome, we propose to incorporate it in the decision making process of predictive systems. To explore this strategy, we created an ensemble of 31 LLM personas. These range from practitioners following interpretive frameworks to more intuitive, lay-style characters. Our experiments were conducted on the SemEval-2026 Task 4 dataset, where the system achieved an accuracy score of 0.705. Accuracy improves with ensemble size, consistent with Condorcet Jury Theorem-like dynamics under weakened independence. Practitioner personas perform worse individually but produce less correlated errors, yielding larger ensemble gains under majority voting. Our error analysis reveals a consistent negative association between gender-focused interpretive vocabulary and accuracy across all persona categories, suggesting either attention to dimensions not relevant for the benchmark or valid interpretations absent from the ground truth. This finding underscores the need for evaluation frameworks that account for interpretive plurality.
多视角性作为叙事相似性预测的资源 / Multiperspectivity as a Resource for Narrative Similarity Prediction
这篇论文提出,与其将解读叙事时的多种合理视角视为挑战,不如利用由不同角色大语言模型组成的“评审团”来整合这些多视角判断,从而更有效地预测叙事相似性,并揭示了当前单一标准答案评估框架的局限性。
源自 arXiv: 2603.22103