Dual-Prompt CLIP with Hybrid Visual Encoders for Occluded Person Re-Identification

📄 Abstract - Dual-Prompt CLIP with Hybrid Visual Encoders for Occluded Person Re-Identification

Occluded person re-identification focuses on matching partially visible pedestrians across multiple camera views. However, occlusions disrupt body-region cues, thereby complicating cross-view matching. Most person ReID methods built on pretrained vision-language models only focus on enhancing prompt-based feature learning while ignoring the semantic information of occluders. Based on the success of CLIP-ReID, we propose a novel Dual Prompt Learning ReID (DPL-ReID) model for occluded person ReID. It incorporates a Dual Prompt Learning (Dual-PL) strategy, which can utilize textual cues to capture complete pedestrian semantics and keep robustness against occlusion, and a Real-World Occlusion Augmentation (RWOA) method that realistically simulates occlusion scenarios encountered in real word to enrich occluded samples. In addition, we also design a Weighted Gated Feature Fusion (WGFF) method, which in corporates LSNet to capture global information and act as a feature-gating mechanism. This mechanism can effectively guide the CLIP visual encoder toward generating more comprehensive feature representations. Extensive experiments on several benchmark occluded ReID datasets show that our proposed DPL-ReID achieves the state-of-the art performance. The occlusion instance library are available at this https URL.

基于双提示CLIP与混合视觉编码器的遮挡行人重识别方法 / Dual-Prompt CLIP with Hybrid Visual Encoders for Occluded Person Re-Identification

1️⃣ 一句话总结

本文针对行人被遮挡导致识别困难的问题，提出了一种结合双提示学习和视觉特征融合的新模型，通过模拟真实遮挡场景并利用文字提示来捕捉完整行人特征，从而在遮挡条件下显著提升行人匹配的准确率。

← 返回列表

菜单

AI 帮我研读全文

1️⃣ 一句话总结

密码管理

设置密码

修改密码

移除密码

菜单

AI 帮我研读全文

1️⃣ 一句话总结

获取最新论文摘要