辩论能高效利用你的时间 / Debate is efficient with your time
1️⃣ 一句话总结
这篇论文通过引入‘辩论查询复杂度’这一新概念,证明了在AI安全辩论机制中,人类裁判只需检查极少量的信息(通常与问题复杂度的对数成正比)就能可靠地判断复杂问题的答案,从而极大地提高了监督效率,并将这一效率问题与计算复杂性理论中的核心问题联系起来。
AI safety via debate uses two competing models to help a human judge verify complex computational tasks. Previous work has established what problems debate can solve in principle, but has not analysed the practical cost of human oversight: how many queries must the judge make to the debate transcript? We introduce Debate Query Complexity}(DQC), the minimum number of bits a verifier must inspect to correctly decide a debate. Surprisingly, we find that PSPACE/poly (the class of problems which debate can efficiently decide) is precisely the class of functions decidable with O(log n) queries. This characterisation shows that debate is remarkably query-efficient: even for highly complex problems, logarithmic oversight suffices. We also establish that functions depending on all their input bits require Omega(log n) queries, and that any function computable by a circuit of size s satisfies DQC(f) <= log(s) + 3. Interestingly, this last result implies that proving DQC lower bounds of log(n) + 6 for languages in P would yield new circuit lower bounds, connecting debate query complexity to central questions in circuit complexity.
辩论能高效利用你的时间 / Debate is efficient with your time
这篇论文通过引入‘辩论查询复杂度’这一新概念,证明了在AI安全辩论机制中,人类裁判只需检查极少量的信息(通常与问题复杂度的对数成正比)就能可靠地判断复杂问题的答案,从而极大地提高了监督效率,并将这一效率问题与计算复杂性理论中的核心问题联系起来。
源自 arXiv: 2602.08630