菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-03-03
📄 Abstract - From Local Matches to Global Masks: Novel Instance Detection in Open-World Scenes

Detecting and segmenting novel object instances in open-world environments is a fundamental problem in robotic perception. Given only a small set of template images, a robot must locate and segment a specific object instance in a cluttered, previously unseen scene. Existing proposal-based approaches are highly sensitive to proposal quality and often fail under occlusion and background clutter. We propose L2G-Det, a local-to-global instance detection framework that bypasses explicit object proposals by leveraging dense patch-level matching between templates and the query image. Locally matched patches generate candidate points, which are refined through a candidate selection module to suppress false positives. The filtered points are then used to prompt an augmented Segment Anything Model (SAM) with instance-specific object tokens, enabling reliable reconstruction of complete instance masks. Experiments demonstrate improved performance over proposal-based methods in challenging open-world settings.

顶级标签: computer vision robotics systems
详细标签: instance segmentation open-world detection patch matching segment anything model novel object detection 或 搜索:

从局部匹配到全局掩码:开放世界场景中的新实例检测 / From Local Matches to Global Masks: Novel Instance Detection in Open-World Scenes


1️⃣ 一句话总结

这篇论文提出了一种名为L2G-Det的新方法,它通过密集的局部图像块匹配来检测和分割开放世界中的新物体,无需依赖传统的物体候选框,从而在物体被遮挡或背景杂乱时表现更优。

源自 arXiv: 2603.03577