Diffusion-Based Data Augmentation for Image Recognition: A Systematic Analysis and Evaluation

📄 Abstract - Diffusion-Based Data Augmentation for Image Recognition: A Systematic Analysis and Evaluation

Diffusion-based data augmentation (DiffDA) has emerged as a promising approach to improving classification performance under data scarcity. However, existing works vary significantly in task configurations, model choices, and experimental pipelines, making it difficult to fairly compare methods or assess their effectiveness across different scenarios. Moreover, there remains a lack of systematic understanding of the full DiffDA workflow. In this work, we introduce UniDiffDA, a unified analytical framework that decomposes DiffDA methods into three core components: model fine-tuning, sample generation, and sample utilization. This perspective enables us to identify key differences among existing methods and clarify the overall design space. Building on this framework, we develop a comprehensive and fair evaluation protocol, benchmarking representative DiffDA methods across diverse low-data classification tasks. Extensive experiments reveal the relative strengths and limitations of different DiffDA strategies and offer practical insights into method design and deployment. All methods are re-implemented within a unified codebase, with full release of code and configurations to ensure reproducibility and to facilitate future research.

基于扩散模型的图像识别数据增强：系统性分析与评估 / Diffusion-Based Data Augmentation for Image Recognition: A Systematic Analysis and Evaluation

1️⃣ 一句话总结

这篇论文提出了一个名为UniDiffDA的统一分析框架，将基于扩散模型的数据增强方法分解为三个核心环节，并在此基础上建立了一套公平的评估体系，通过大量实验揭示了不同策略的优劣，为在数据稀缺情况下有效利用扩散模型生成数据来提升图像分类性能提供了实用指南。

← 返回列表

菜单

AI 帮我研读全文

1️⃣ 一句话总结

密码管理

设置密码

修改密码

移除密码

菜单

AI 帮我研读全文

1️⃣ 一句话总结

获取最新论文摘要