Learning to Label: A Reinforced Self-Evolving Framework for Semi-supervised Referring Expression Segmentation

📄 Abstract - Learning to Label: A Reinforced Self-Evolving Framework for Semi-supervised Referring Expression Segmentation

Semi-supervised referring expression segmentation (SS-RES) aims to achieve precise pixel-level language grounding under limited annotation, yet suffers from limited supervision and unreliable pseudo-labels when exploiting unlabeled image-text pairs. In this work, we propose Learning to Label, a reinforced self-evolving framework (L2L) that casts pseudo-label construction as a learnable decision-making process. To build foundational understanding, we leverage a multimodal large language model to extract semantic-spatial priors, which are instantiated as initial soft segmentation proposals and elevated, together with textual cues, into learnable guidance signals that condition a hierarchical segmentation network. To ensure stable learning, reinforced pseudo-label selection is formulated as an exploratory decision process that adaptively rewards high-utility pixel-level supervision based on multimodal priors and model predictions. This reinforced self-evolving loop enables joint optimization of the segmentation model and pseudo-labels, progressively enhancing label reliability under sparse supervision. Extensive experiments on RefCOCO, RefCOCO+, and RefCOCOg demonstrate improvements over existing methods, validating its effectiveness and generalization.

学会标注：一种用于半监督指代表达式分割的强化自进化框架 / Learning to Label: A Reinforced Self-Evolving Framework for Semi-supervised Referring Expression Segmentation

1️⃣ 一句话总结

本文提出了一种名为L2L的强化自进化框架，通过将伪标签构建转化为可学习的决策过程，结合多模态大模型提取的先验知识与强化学习的自适应奖励机制，逐步提升半监督指代表达式分割中标签的可靠性，从而在仅少量标注下实现精准的像素级语言定位。

← 返回列表

菜单

AI 帮我研读全文

1️⃣ 一句话总结

密码管理

设置密码

修改密码

移除密码

菜单

AI 帮我研读全文

1️⃣ 一句话总结

获取最新论文摘要