菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-04-29
📄 Abstract - SynSur: An end-to-end generative pipeline for synthetic industrial surface defect generation and detection

The bottleneck in learning-based industrial defect detection is often limited not by model capacity, but by the scarcity of labeled defect data: defects are rare, annotations are expensive, and collecting balanced training sets is slow. We present an end-to-end pipeline for synthetic defect generation and annotation, combining Vision-Language-Model-based prompts, LoRA-adapted diffusion, mask-guided inpainting, and sample filtering with automatic label derivation, and demonstrates the potential of real data with realistic synthetic samples to overcome data scarcity. The evaluation is conducted on, a challenging dataset of pitting defects on ball screw drives, and then on a subset of the Mobile phone screen surface defect segmentation dataset (MSD) dataset to test cross-domain transfer. Beyond downstream detector performance, we analyze key stages of the pipeline, including prompt construction, LoRA selection, and sample filtering with DreamSim and CLIPScore, to understand which synthetic samples are both realistic and useful. Experiments with YOLOv26, YOLOX, and LW-DETR show that synthetic-only training does not replace real data. When combined with real data, synthetic defects can preserve performance and yield modest gains in selected BSData training regimes. The MSD transfer study shows that the overall pipeline structure carries over to a second industrial inspection domain, while also highlighting the importance of domain-specific adaptation and annotation-quality control. Overall, the paper provides an end-to-end assessment of diffusion-based industrial defect synthesis and shows that its strongest value lies in strengthening scarce real datasets rather than substituting for them.

顶级标签: computer vision machine learning model training
详细标签: defect detection synthetic data generation diffusion models industrial inspection data augmentation 或 搜索:

SynSur:面向工业表面缺陷生成与检测的端到端合成管线 / SynSur: An end-to-end generative pipeline for synthetic industrial surface defect generation and detection


1️⃣ 一句话总结

该论文提出了一种端到端的合成缺陷生成与标注管线,利用视觉语言模型、扩散模型和图像修复技术自动生成逼真的工业表面缺陷样本,实验表明这些合成数据虽无法替代真实数据,但能在真实稀缺数据上补充训练,小幅提升检测性能,并具备跨工业领域的迁移潜力。

源自 arXiv: 2604.26633