菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-04-13
📄 Abstract - Uncertainty-Aware Web-Conditioned Scientific Fact-Checking

Scientific fact-checking is vital for assessing claims in specialized domains such as biomedicine and materials science, yet existing systems often hallucinate or apply inconsistent reasoning, especially when verifying technical, compositional claims against an evidence snippet under source and cost/latency constraints. We present a pipeline centered on atomic predicate-argument decomposition and calibrated, uncertainty-gated corroboration: atomic facts are aligned to local snippets via embeddings, verified by a compact evidence-grounded checker, and only facts with uncertain support trigger domain-restricted web search over authoritative sources. The system supports both binary and tri-valued classification where it predicts labels from Supported, Refuted, NEI for three-way tasks. We evaluate under two regimes, Context-Only (no web) and Context+Web (uncertainty-gated web corroboration); when retrieved evidence conflicts with the provided context, we abstain with NEI rather than overriding the context. On multiple benchmarks, our framework surpasses the strongest benchmarks. In our experiments, web corroboration was invoked for only a minority of atomic facts on average, indicating that external evidence is consulted selectively under calibrated uncertainty rather than routinely. Overall, coupling atomic granularity with calibrated, uncertainty-gated corroboration yields more interpretable and context-conditioned verification, making the approach well-suited to high-stakes, single-document settings that demand traceable rationales, predictable cost/latency, and conservative.

顶级标签: natural language processing llm model evaluation
详细标签: fact-checking uncertainty quantification evidence retrieval scientific claims interpretability 或 搜索:

基于不确定性感知与网络条件约束的科学事实核查 / Uncertainty-Aware Web-Conditioned Scientific Fact-Checking


1️⃣ 一句话总结

这篇论文提出了一种新的科学事实核查方法,它将复杂的科学论断拆解成原子事实进行逐一验证,并引入一个基于不确定度评估的智能开关,仅在必要时才去联网搜索权威证据,从而在保证核查准确性和可解释性的同时,有效控制了计算成本和响应延迟。

源自 arXiv: 2604.11036