菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-02-25
📄 Abstract - Global Sequential Testing for Multi-Stream Auditing

Across many risk-sensitive areas, it is critical to continuously audit the performance of machine learning systems and detect any unusual behavior quickly. This can be modeled as a sequential hypothesis testing problem with $k$ incoming streams of data and a global null hypothesis that asserts that the system is working as expected across all $k$ streams. The standard global test employs a Bonferroni correction and has an expected stopping time bound of $O\left(\ln\frac{k}{\alpha}\right)$ when $k$ is large and the significance level of the test, $\alpha$, is small. In this work, we construct new sequential tests by using ideas of merging test martingales with different trade-offs in expected stopping times under different, sparse or dense alternative hypotheses. We further derive a new, balanced test that achieves an improved expected stopping time bound that matches Bonferroni's in the sparse setting but that naturally results in $O\left(\frac{1}{k}\ln\frac{1}{\alpha}\right)$ under a dense alternative. We empirically demonstrate the effectiveness of our proposed tests on synthetic and real-world data.

顶级标签: machine learning model evaluation systems
详细标签: sequential hypothesis testing multi-stream auditing test martingales statistical testing anomaly detection 或 搜索:

多流审计的全局序贯检验 / Global Sequential Testing for Multi-Stream Auditing


1️⃣ 一句话总结

这篇论文提出了一种新的序贯检验方法,用于快速监测多个数据流中的异常,相比传统方法,它在不同异常模式下都能更快地发现问题。

源自 arXiv: 2602.21479