菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-02-19
📄 Abstract - Anti-causal domain generalization: Leveraging unlabeled data

The problem of domain generalization concerns learning predictive models that are robust to distribution shifts when deployed in new, previously unseen environments. Existing methods typically require labeled data from multiple training environments, limiting their applicability when labeled data are scarce. In this work, we study domain generalization in an anti-causal setting, where the outcome causes the observed covariates. Under this structure, environment perturbations that affect the covariates do not propagate to the outcome, which motivates regularizing the model's sensitivity to these perturbations. Crucially, estimating these perturbation directions does not require labels, enabling us to leverage unlabeled data from multiple environments. We propose two methods that penalize the model's sensitivity to variations in the mean and covariance of the covariates across environments, respectively, and prove that these methods have worst-case optimality guarantees under certain classes of environments. Finally, we demonstrate the empirical performance of our approach on a controlled physical system and a physiological signal dataset.

顶级标签: machine learning theory model training
详细标签: domain generalization anti-causal learning distribution shift unlabeled data robustness 或 搜索:

反因果域泛化:利用未标记数据 / Anti-causal domain generalization: Leveraging unlabeled data


1️⃣ 一句话总结

这篇论文提出了一种新的域泛化方法,它利用反因果关系的特性,通过分析未标记数据在不同环境中的变化来训练模型,从而在数据标签稀缺的情况下,也能提升模型在新环境中的预测鲁棒性。

源自 arXiv: 2602.17187