← 返回列表

arXiv 提交日期: 2026-04-27

📄 Abstract - Faithful Autoformalization via Roundtrip Verification and Repair

When an LLM formalizes natural language, how do we know the output is faithful? We propose a roundtrip verification approach which does not require ground-truth annotations: formalize a statement, translate the result back to natural language, re-formalize, and use a formal tool to check logical equivalence. When the two formalizations agree, this provides evidence of a faithful formalization. When they disagree, a diagnosis step identifies which translation stage failed, and a targeted repair operator attempts to correct that stage. We evaluate our approach on 150 traffic rules using Claude Opus 4.6 and GPT-5.2. Diagnosis-guided repair raises formal equivalence from 45--61% to 83--85% for both models, outperforming a random-repair baseline. An independent NLI analysis confirms that formal equivalence is correlated with less semantic drift.

顶级标签: llm natural language processing

通过往返验证与修复实现忠实的形式化自动转换 / Faithful Autoformalization via Roundtrip Verification and Repair

1️⃣ 一句话总结

本文提出了一种无需人工标注的往返验证方法，通过将自然语言形式化后再翻译回来并重新形式化，利用形式工具检查两次结果是否逻辑等价，从而自动检测和修复AI在形式化过程中的错误，实验表明该方法能将形式化准确率从约45-61%提升至83-85%。

👋 没兴趣 ☆ 感兴趣 📌 待读

打开原文 PDF

源自 arXiv: 2604.25031

← 返回列表

菜单

AI 帮我研读全文

1️⃣ 一句话总结

密码管理

设置密码

修改密码

移除密码

菜单

AI 帮我研读全文

1️⃣ 一句话总结

获取最新论文摘要