面向单域泛化的对抗性域提示调优与生成 / Adversarial Domain Prompt Tuning and Generation for Single Domain Generalization
1️⃣ 一句话总结
本文提出一种渐进式对抗提示调优框架,利用预训练的文本到图像生成模型自动学习抽象提示,从而生成多样化的域外训练数据,使仅基于单一训练域的模型能更好地泛化到未知领域。
Single domain generalization (SDG) aims to learn a robust model, which could perform well on many unseen domains while there is only one single domain available for training. One of the promising directions for achieving single-domain generalization is to generate out-of-domain (OOD) training data through data augmentation or image generation. Given the rapid advancements in AI-generated content (AIGC), this paper is the first to propose leveraging powerful pre-trained text-to-image (T2I) foundation models to create the training data. However, manually designing textual prompts to generate images for all possible domains is often impractical, and some domain characteristics may be too abstract to describe with words. To address these challenges, we propose a novel Progressive Adversarial Prompt Tuning (PAPT) framework for pre-trained diffusion models. Instead of relying on static textual domains, our approach learns two sets of abstract prompts as conditions for the diffusion model: one that captures domain-invariant category information and another that models domain-specific styles. This adversarial learning mechanism enables the T2I model to generate images in various domain styles while preserving key categorical features. Extensive experiments demonstrate the effectiveness of the proposed method, achieving superior performances to state-of-the-art single-domain generalization approaches.
面向单域泛化的对抗性域提示调优与生成 / Adversarial Domain Prompt Tuning and Generation for Single Domain Generalization
本文提出一种渐进式对抗提示调优框架,利用预训练的文本到图像生成模型自动学习抽象提示,从而生成多样化的域外训练数据,使仅基于单一训练域的模型能更好地泛化到未知领域。
源自 arXiv: 2606.21736