菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-19
📄 Abstract - Are Rationales Necessary and Sufficient? Tuning LLMs for Explainable Misinformation Detection

The rapid spread of misinformation on social media platforms has become a formidable challenge. To mitigate its proliferation, Misinformation Detection (MD) has emerged as a critical research topic. Traditional MD approaches based on small models typically perform binary classification through a black-box process. Recently, the rise of Large Language Models (LLMs) has enabled explainable MD, where models generate rationales that explain their decisions, thereby enhancing transparency. Existing explainable MD methods primarily focus on crafting sophisticated prompts to elicit rationales from off-the-shelf LLMs. In this work, we propose a pipeline to fine-tune a dedicated LLM specifically for explainable MD. Our pipeline begins by collecting large-scale fact-checked articles, and then uses multiple strong LLMs to produce veracity predictions and rationales. To ensure high-quality training data, we leverage a filtering strategy that selects only the correct instances for fine-tuning. While this pipeline is intuitive and prevalent, our experiments reveal that naive filtering based solely on label correctness is insufficient in practice and suffers from two critical limitations: (1) Coarse-grained labels cause insufficient rationales: Rationales filtered solely based on binary labels are insufficient to adequately support their decisions; (2) Over-verification behavior causes unnecessary rationales: Stronger LLMs tend to exhibit over-verification behavior, producing excessively verbose and unnecessary rationales. To address these issues, we introduce LONSREX, a novel data synthesis pipeline to Locate Necessary and Sufficient Rationales for Explainable MD. Specifically, we propose a metric that quantifies the contribution of each verification step to the final prediction, thereby evaluating its necessity and sufficiency. Experimental results demonstrate the effectiveness of LONSREX.

顶级标签: llm machine learning misinformation detection
详细标签: explainability fine-tuning rationale extraction data filtering verification behavior 或 搜索:

解释性虚假信息检测中的理由必要性与充分性:大语言模型微调研究 / Are Rationales Necessary and Sufficient? Tuning LLMs for Explainable Misinformation Detection


1️⃣ 一句话总结

本文针对大语言模型进行可解释虚假信息检测时,发现仅基于标签正确性筛选训练数据会导致理由不充分或冗余的问题,提出了一种名为LONSREX的新方法,通过量化每个验证步骤对最终预测的贡献,自动筛选出既必要又充分的解释理由,从而显著提升模型的可解释性和检测效果。

源自 arXiv: 2605.19285