arXiv submission date: 2026-03-28
📄 Abstract - Multiple-Prediction-Powered Inference

Statistical estimation often involves tradeoffs between expensive, high-quality measurements and a variety of lower-quality proxies. We introduce Multiple-Prediction-Powered Inference (MultiPPI): a general framework for constructing statistically efficient estimates by optimally allocating resources across these diverse data sources. This work provides theoretical guarantees about the minimax optimality, finite-sample performance, and asymptotic normality of the MultiPPI estimator. Through experiments across three diverse large language model (LLM) evaluation scenarios, we show that MultiPPI consistently achieves lower estimation error than existing baselines. This advantage stems from its budget-adaptive allocation strategy, which strategically combines subsets of models by learning their complex cost and correlation structures.

Top-level tags: model evaluation, machine learning theory
Detailed tags: statistical inference, budget allocation, LLM evaluation, minimax optimality, data sources

Multiple-Prediction-Powered Inference


1️⃣ One-Sentence Summary

This paper proposes a new framework called MultiPPI that intelligently combines expensive-but-accurate data with cheap-but-noisy predictive models to produce more precise statistical estimates under a fixed budget, and demonstrates its advantages in practical settings such as large language model evaluation.
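To make the core idea concrete, below is a minimal sketch of the basic prediction-powered inference (PPI) recipe that MultiPPI generalizes: estimate a mean from many cheap proxy predictions, then correct the proxy's bias using a small set of expensive ground-truth labels. The data, the proxy's bias, and all sample sizes here are simulated assumptions for illustration, not taken from the paper, and this sketch uses a single proxy rather than MultiPPI's multi-source budget allocation.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical setup: a small "gold" set with expensive ground-truth labels,
# and a large unlabeled set scored only by a cheap proxy model.
n_gold, n_proxy = 100, 10_000
y_gold = rng.normal(1.0, 1.0, n_gold)            # true labels; true mean is 1.0
f_gold = y_gold + rng.normal(0.2, 0.5, n_gold)   # proxy predictions on the gold set (biased upward)
f_proxy = rng.normal(1.2, 1.1, n_proxy)          # proxy predictions on the large unlabeled set

# Classical estimate: uses only the expensive labels (high variance).
theta_classical = y_gold.mean()

# PPI-style estimate: proxy mean over the large set, plus a bias
# correction ("rectifier") measured where both labels and predictions exist.
theta_ppi = f_proxy.mean() + (y_gold - f_gold).mean()
```

The rectifier term `(y_gold - f_gold).mean()` is what keeps the estimate unbiased even when the proxy is systematically wrong; MultiPPI extends this idea to multiple proxies with differing costs and correlations.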

Source: arXiv:2603.27414