通过往返验证与修复实现忠实的形式化自动转换 / Faithful Autoformalization via Roundtrip Verification and Repair
1️⃣ 一句话总结
本文提出了一种无需人工标注的往返验证方法,通过将自然语言形式化后再翻译回来并重新形式化,利用形式工具检查两次结果是否逻辑等价,从而自动检测和修复AI在形式化过程中的错误,实验表明该方法能将形式化准确率从约45-61%提升至83-85%。
When an LLM formalizes natural language, how do we know the output is faithful? We propose a roundtrip verification approach which does not require ground-truth annotations: formalize a statement, translate the result back to natural language, re-formalize, and use a formal tool to check logical equivalence. When the two formalizations agree, this provides evidence of a faithful formalization. When they disagree, a diagnosis step identifies which translation stage failed, and a targeted repair operator attempts to correct that stage. We evaluate our approach on 150 traffic rules using Claude Opus 4.6 and GPT-5.2. Diagnosis-guided repair raises formal equivalence from 45--61% to 83--85% for both models, outperforming a random-repair baseline. An independent NLI analysis confirms that formal equivalence is correlated with less semantic drift.
通过往返验证与修复实现忠实的形式化自动转换 / Faithful Autoformalization via Roundtrip Verification and Repair
本文提出了一种无需人工标注的往返验证方法,通过将自然语言形式化后再翻译回来并重新形式化,利用形式工具检查两次结果是否逻辑等价,从而自动检测和修复AI在形式化过程中的错误,实验表明该方法能将形式化准确率从约45-61%提升至83-85%。
源自 arXiv: 2604.25031