arXiv submission date: 2026-04-06
📄 Abstract - Just Pass Twice: Efficient Token Classification with LLMs for Zero-Shot NER

Large language models encode extensive world knowledge valuable for zero-shot named entity recognition. However, their causal attention mechanism, in which tokens attend only to preceding context, prevents effective token classification when disambiguation requires future context. Existing approaches use LLMs generatively, prompting them to list entities or produce structured outputs, but suffer from slow autoregressive decoding, hallucinated entities, and formatting errors. We propose Just Pass Twice (JPT), a simple yet effective method that enables causal LLMs to perform discriminative token classification with full bidirectional context. Our key insight is that concatenating the input to itself lets each token in the second pass attend to the complete sentence, requiring no architectural modifications. We combine these representations with definition-guided entity embeddings for flexible zero-shot generalization. Our approach achieves state-of-the-art results on zero-shot NER benchmarks, surpassing the previous best method by +7.9 F1 on average across the CrossNER and MIT benchmarks, while being over 20x faster than comparable generative methods.
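The core trick described in the abstract can be illustrated with a small sketch: under a standard lower-triangular causal mask, duplicating a length-n input to length 2n means every token in the second copy can attend to the entire original sentence, so its hidden state carries bidirectional context. The sketch below is a minimal illustration of that masking insight, not the authors' implementation; the function names are hypothetical.

```python
import torch

def just_pass_twice_mask(n: int) -> torch.Tensor:
    """Standard causal (lower-triangular) attention mask over the
    duplicated sequence [x ; x] of length 2n. mask[q, k] is True
    where query position q may attend to key position k."""
    L = 2 * n
    return torch.tril(torch.ones(L, L, dtype=torch.bool))

def second_pass_states(hidden: torch.Tensor, n: int) -> torch.Tensor:
    """After one forward pass over the duplicated input, keep only the
    second copy's hidden states (positions n..2n-1): token i there has
    already attended to all n tokens of the original sentence."""
    return hidden[..., n:, :]
```

Note that position n (the first token of the second copy) already sees positions 0..n-1, i.e. the whole original sentence, so classifying from the second-copy states gives each token full left and right context without modifying the causal architecture.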

Top-level tags: llm, natural language processing, model evaluation
Detailed tags: zero-shot ner, token classification, efficiency, causal attention, bidirectional context

Just Pass Twice: Efficient Token Classification with LLMs for Zero-Shot NER


1️⃣ One-sentence summary

This paper proposes a simple and efficient method called "Just Pass Twice": by concatenating the input sentence to itself, a large language model that normally attends only to preceding context can exploit full bidirectional information for zero-shot named entity recognition, significantly improving accuracy while running more than 20x faster than existing generative methods.

Source: arXiv: 2604.05158