菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-04-09
📄 Abstract - GroundingAnomaly: Spatially-Grounded Diffusion for Few-Shot Anomaly Synthesis

The performance of visual anomaly inspection in industrial quality control is often constrained by the scarcity of real anomalous samples. Consequently, anomaly synthesis techniques have been developed to enlarge training sets and enhance downstream inspection. However, existing methods either suffer from poor integration caused by inpainting or fail to provide accurate masks. To address these limitations, we propose GroundingAnomaly, a novel few-shot anomaly image generation framework. Our framework introduces a Spatial Conditioning Module that leverages per-pixel semantic maps to enable precise spatial control over the synthesized anomalies. Furthermore, a Gated Self-Attention Module is designed to inject conditioning tokens into a frozen U-Net via gated attention layers. This carefully preserves pretrained priors while ensuring stable few-shot adaptation. Extensive evaluations on the MVTec AD and VisA datasets demonstrate that GroundingAnomaly generates high-quality anomalies and achieves state-of-the-art performance across multiple downstream tasks, including anomaly detection, segmentation, and instance-level detection.

顶级标签: computer vision model training data
详细标签: anomaly synthesis diffusion models few-shot learning industrial inspection image generation 或 搜索:

GroundingAnomaly:基于空间定位扩散的少样本异常合成 / GroundingAnomaly: Spatially-Grounded Diffusion for Few-Shot Anomaly Synthesis


1️⃣ 一句话总结

这篇论文提出了一种名为GroundingAnomaly的新方法,它利用像素级语义图精准控制异常生成的位置,并通过门控注意力机制高效利用少量样本,从而合成高质量的异常图像,显著提升了工业质检中异常检测与分割的性能。

源自 arXiv: 2604.08301