从点估计到分布:用于早产预测的多实例学习中的高斯混合模型池化 / From Point Estimates to Distributions: GMM Pooling for MIL in Preterm Birth Prediction
1️⃣ 一句话总结
本文提出了一种基于高斯混合模型的池化方法,将每个患者的多张超声图像特征分布建模为固定长度表示,从而更准确地预测早产风险,相比传统方法显著提升了性能。
Preterm birth (PTB) prediction can enable targeted surveillance and timely intervention, yet most ultrasound-based models use a single selected transvaginal ultrasound (TVUS) frame per patient despite routine exams acquiring multiple cervical images. We formulate PTB prediction as a multiple instance learning (MIL) problem, representing each patient as a variable-sized bag of TVUS images with a single outcome label. To move beyond standard MIL aggregators that collapse a bag into a point estimate, we propose a Gaussian Mixture Model (GMM) pooling, which summarizes all images in a bag into a fixed-length representation by modeling their feature distribution. This design captures intra-patient variability. We evaluate the method on a private clinical cohort and on a public lymph node metastasis benchmark. For PTB prediction, GMM pooling improves over the instance-based model PR-AUC from 0.44 to 0.56. On the lymph node benchmark, it achieves state-of-the-art performance with 0.91 F1-score and 0.89 ROC-AUC for classification and 0.18 MAE for regression. The code is publicly available at this https URL.
从点估计到分布:用于早产预测的多实例学习中的高斯混合模型池化 / From Point Estimates to Distributions: GMM Pooling for MIL in Preterm Birth Prediction
本文提出了一种基于高斯混合模型的池化方法,将每个患者的多张超声图像特征分布建模为固定长度表示,从而更准确地预测早产风险,相比传统方法显著提升了性能。
源自 arXiv: 2606.23005