回归点云:探索点-语言模型在零样本三维异常检测中的应用 / Back to Point: Exploring Point-Language Models for Zero-Shot 3D Anomaly Detection
1️⃣ 一句话总结
这篇论文提出了一种名为BTP的新方法,它直接利用三维点云数据和预训练的点-语言模型,通过将点云特征与文本描述对齐,实现了无需特定类别训练数据就能准确检测和定位三维物体缺陷的目标。
Zero-shot (ZS) 3D anomaly detection is crucial for reliable industrial inspection, as it enables detecting and localizing defects without requiring any target-category training data. Existing approaches render 3D point clouds into 2D images and leverage pre-trained Vision-Language Models (VLMs) for anomaly detection. However, such strategies inevitably discard geometric details and exhibit limited sensitivity to local anomalies. In this paper, we revisit intrinsic 3D representations and explore the potential of pre-trained Point-Language Models (PLMs) for ZS 3D anomaly detection. We propose BTP (Back To Point), a novel framework that effectively aligns 3D point cloud and textual embeddings. Specifically, BTP aligns multi-granularity patch features with textual representations for localized anomaly detection, while incorporating geometric descriptors to enhance sensitivity to structural anomalies. Furthermore, we introduce a joint representation learning strategy that leverages auxiliary point cloud data to improve robustness and enrich anomaly semantics. Extensive experiments on Real3D-AD and Anomaly-ShapeNet demonstrate that BTP achieves superior performance in ZS 3D anomaly detection. Code will be available at \href{this https URL}{this https URL}.
回归点云:探索点-语言模型在零样本三维异常检测中的应用 / Back to Point: Exploring Point-Language Models for Zero-Shot 3D Anomaly Detection
这篇论文提出了一种名为BTP的新方法,它直接利用三维点云数据和预训练的点-语言模型,通过将点云特征与文本描述对齐,实现了无需特定类别训练数据就能准确检测和定位三维物体缺陷的目标。
源自 arXiv: 2603.21511