菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-04-27
📄 Abstract - Faithful Autoformalization via Roundtrip Verification and Repair

When an LLM formalizes natural language, how do we know the output is faithful? We propose a roundtrip verification approach which does not require ground-truth annotations: formalize a statement, translate the result back to natural language, re-formalize, and use a formal tool to check logical equivalence. When the two formalizations agree, this provides evidence of a faithful formalization. When they disagree, a diagnosis step identifies which translation stage failed, and a targeted repair operator attempts to correct that stage. We evaluate our approach on 150 traffic rules using Claude Opus 4.6 and GPT-5.2. Diagnosis-guided repair raises formal equivalence from 45--61% to 83--85% for both models, outperforming a random-repair baseline. An independent NLI analysis confirms that formal equivalence is correlated with less semantic drift.

顶级标签: llm natural language processing
详细标签: autoformalization faithfulness verification repair 或 搜索:

通过往返验证与修复实现忠实的形式化自动转换 / Faithful Autoformalization via Roundtrip Verification and Repair


1️⃣ 一句话总结

本文提出了一种无需人工标注的往返验证方法,通过将自然语言形式化后再翻译回来并重新形式化,利用形式工具检查两次结果是否逻辑等价,从而自动检测和修复AI在形式化过程中的错误,实验表明该方法能将形式化准确率从约45-61%提升至83-85%。

源自 arXiv: 2604.25031