BUSSARD:基于标准化流的双向通用场景特异性异常关系检测 / BUSSARD: Normalizing Flows for Bijective Universal Scene-Specific Anomalous Relationship Detection
1️⃣ 一句话总结
这篇论文提出了一种名为BUSSARD的新方法,它利用标准化流和语言模型来检测图像场景图中物体之间的异常关系,在保持高检测精度的同时,速度比现有最好方法快五倍,并且对词语变化(如同义词)具有更强的鲁棒性。
We propose Bijective Universal Scene-Specific Anomalous Relationship Detection (BUSSARD), a normalizing flow-based model for detecting anomalous relations in scene graphs, generated from images. Our work follows a multimodal approach, embedding object and relationship tokens from scene graphs with a language model to leverage semantic knowledge from the real world. A normalizing flow model is used to learn bijective transformations that map object-relation-object triplets from scene graphs to a simple base distribution (typically Gaussian), allowing anomaly detection through likelihood estimation. We evaluate our approach on the SARD dataset containing office and dining room scenes. Our method achieves around 10% better AUROC results compared to the current state-of-the-art model, while simultaneously being five times faster. Through ablation studies, we demonstrate superior robustness and universality, particularly regarding the use of synonyms, with our model maintaining stable performance while the baseline shows 17.5% deviation. This work demonstrates the strong potential of learning-based methods for relationship anomaly detection in scene graphs. Our code is available at this https URL .
BUSSARD:基于标准化流的双向通用场景特异性异常关系检测 / BUSSARD: Normalizing Flows for Bijective Universal Scene-Specific Anomalous Relationship Detection
这篇论文提出了一种名为BUSSARD的新方法,它利用标准化流和语言模型来检测图像场景图中物体之间的异常关系,在保持高检测精度的同时,速度比现有最好方法快五倍,并且对词语变化(如同义词)具有更强的鲁棒性。
源自 arXiv: 2603.16645