用于三维单目标跟踪的时序一致长时记忆模型 / Temporally Consistent Long-Term Memory for 3D Single Object Tracking
1️⃣ 一句话总结
这篇论文提出了一个名为ChronoTrack的新方法,通过引入一个紧凑且时序一致的长时记忆模块,有效解决了三维点云序列中目标跟踪因特征漂移和记忆开销大而难以长期稳定的问题,从而在多个标准测试集上取得了最佳性能并能实时运行。
3D Single Object Tracking (3D-SOT) aims to localize a target object across a sequence of LiDAR point clouds, given its 3D bounding box in the first frame. Recent methods have adopted a memory-based approach to utilize previously observed features of the target object, but remain limited to only a few recent frames. This work reveals that their temporal capacity is fundamentally constrained to short-term context due to severe temporal feature inconsistency and excessive memory overhead. To this end, we propose a robust long-term 3D-SOT framework, ChronoTrack, which preserves the temporal feature consistency while efficiently aggregating the diverse target features via long-term memory. Based on a compact set of learnable memory tokens, ChronoTrack leverages long-term information through two complementary objectives: a temporal consistency loss and a memory cycle consistency loss. The former enforces feature alignment across frames, alleviating temporal drift and improving the reliability of proposed long-term memory. In parallel, the latter encourages each token to encode diverse and discriminative target representations observed throughout the sequence via memory-point-memory cyclic walks. As a result, ChronoTrack achieves new state-of-the-art performance on multiple 3D-SOT benchmarks, demonstrating its effectiveness in long-term target modeling with compact memory while running at real-time speed of 42 FPS on a single RTX 4090 GPU. The code is available at this https URL
用于三维单目标跟踪的时序一致长时记忆模型 / Temporally Consistent Long-Term Memory for 3D Single Object Tracking
这篇论文提出了一个名为ChronoTrack的新方法,通过引入一个紧凑且时序一致的长时记忆模块,有效解决了三维点云序列中目标跟踪因特征漂移和记忆开销大而难以长期稳定的问题,从而在多个标准测试集上取得了最佳性能并能实时运行。
源自 arXiv: 2604.13789