菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-04-07
📄 Abstract - Data Distribution Valuation Using Generalized Bayesian Inference

We investigate the data distribution valuation problem, which aims to quantify the values of data distributions from their samples. This is a recently proposed problem that is related to but different from classical data valuation and can be applied to various applications. For this problem, we develop a novel framework called Generalized Bayes Valuation that utilizes generalized Bayesian inference with a loss constructed from transferability measures. This framework allows us to solve, in a unified way, seemingly unrelated practical problems, such as annotator evaluation and data augmentation. Using the Bayesian principles, we further improve and enhance the applicability of our framework by extending it to the continuous data stream setting. Our experiment results confirm the effectiveness and efficiency of our framework in different real-world scenarios.

顶级标签: machine learning theory model evaluation
详细标签: data valuation bayesian inference distribution shift transferability data streams 或 搜索:

使用广义贝叶斯推断进行数据分布价值评估 / Data Distribution Valuation Using Generalized Bayesian Inference


1️⃣ 一句话总结

这篇论文提出了一个名为‘广义贝叶斯估值’的新框架,它利用广义贝叶斯推断来量化不同数据分布样本的价值,并能统一解决诸如评估数据标注者质量和优化数据增强等实际问题。

源自 arXiv: 2604.05993