菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-06-29
📄 Abstract - Residual-Guided Expert Specialization for Incomplete Multimodal Learning

As real-world prediction systems often face missing modalities at inference, incomplete multimodal learning (IML) remains a practical challenge. While prior methods aim to learn representations robust to missing inputs, representations from incomplete modalities inevitably deviate from their full-modality counterparts due to missing evidence. To explicitly leverage these deviations, we propose MARS (Missingness-Aware Residual-guided Specialization), a mixture-of-experts framework that guides expert specialization based on how representations are reshaped by missingness. By contrasting task representations derived from incomplete inputs with their complete counterparts during training, we derive a privileged residual signal that captures this representational gap. The residual signal guides a residual router to assign samples to experts specialized for the corresponding deviation patterns. In parallel, a feature router learns to imitate this routing behavior using only incomplete inputs, enabling deployment without access to full modalities. To mitigate this train-test router gap, we develop a discrepancy-aware noise regularization that adaptively perturbs the residual router's decisions when the feature router deviates, enhancing expert robustness under imperfect imitation. Experiments on multimodal classification (CASIA-SURF, CREMA-D, UPMC Food-101) and segmentation (MCubeS) under missing scenarios show that MARS consistently surpasses baselines while remaining efficient and extensible to diverse backbones and tasks.

顶级标签: multi-modal machine learning
详细标签: incomplete multimodal learning mixture of experts residual signal missing modality robustness 或 搜索:

基于残差引导的专家特化:面向不完整多模态学习 / Residual-Guided Expert Specialization for Incomplete Multimodal Learning


1️⃣ 一句话总结

本文提出了一种名为MARS的混合专家框架,通过对比完整与不完整多模态数据的表示差异(即残差信号),将不完整输入引导到专门处理该类缺失模式的专家模块中,从而在推理时即使部分模态缺失也能保持较高的预测性能。

源自 arXiv: 2606.30355