基于粒度球引导的稳定潜在域发现用于领域通用人群计数 / Granular Ball Guided Stable Latent Domain Discovery for Domain-General Crowd Counting
1️⃣ 一句话总结
这篇论文提出了一种新方法,通过将样本分组为‘粒度球’并对其中心进行聚类,来更稳定地识别训练数据中的隐藏子领域,并结合双分支学习框架来分别处理语义信息和外观风格,从而显著提升了人群计数模型在面对未知场景时的泛化能力。
Single-source domain generalization for crowd counting remains highly challenging because a single labeled source domain often contains heterogeneous latent domains, while test data may exhibit severe distribution shifts. A fundamental difficulty lies in stable latent domain discovery: directly performing flat clustering on evolving sample-level latent features is easily affected by feature noise, outliers, and representation drift, leading to unreliable pseudo-domain assignments and weakened domain-structured learning. To address this issue, we propose a granular ball guided stable latent domain discovery framework for domain-general crowd counting. Specifically, the proposed method first organizes samples into compact local granular balls and then clusters granular ball centers as representatives to obtain pseudo-domains, transforming direct sample-level clustering into a hierarchical representative-based clustering process. This design yields more stable and semantically consistent pseudo-domain assignments. Built upon the discovered latent domains, we further develop a two-branch learning framework that enhances transferable semantic representations via semantic codebook re-encoding while modeling domain-specific appearance variations through a style branch, thereby reducing semantic--style entanglement and improving generalization under domain shifts. Extensive experiments on ShanghaiTech A/B, UCF\_QNRF, and NWPU-Crowd under a strict no-adaptation protocol demonstrate that the proposed method consistently outperforms strong baselines, especially under large domain gaps.
基于粒度球引导的稳定潜在域发现用于领域通用人群计数 / Granular Ball Guided Stable Latent Domain Discovery for Domain-General Crowd Counting
这篇论文提出了一种新方法,通过将样本分组为‘粒度球’并对其中心进行聚类,来更稳定地识别训练数据中的隐藏子领域,并结合双分支学习框架来分别处理语义信息和外观风格,从而显著提升了人群计数模型在面对未知场景时的泛化能力。
源自 arXiv: 2603.24106