菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-13
📄 Abstract - Exploiting Pre-trained Encoder-Decoder Transformers for Sequence-to-Sequence Constituent Parsing

To achieve deep natural language understanding, syntactic constituent parsing plays a crucial role and is widely required by many artificial intelligence systems for processing both text and speech. A recent approach involves using standard sequence-to-sequence models to handle constituent parsing as a machine translation problem, moving away from traditional task-specific parsers. These models are typically initialized with pre-trained encoder-only language models like BERT or RoBERTa. However, the use of pre-trained encoder-decoder language models for constituency parsing has not been thoroughly explored. To bridge this gap, we extend the sequence-to-sequence framework by investigating parsers built on pre-trained encoder-decoder architectures, including BART, mBART, and T5. We fine-tune them to generate linearized parse trees and extensively evaluate them on different linearization strategies across both continuous treebanks and more complex discontinuous benchmarks. Our results demonstrate that our approach outperforms all prior sequence-to-sequence models and performs competitively with leading task-specific constituent parsers on continuous constituent parsing.

顶级标签: natural language processing machine learning
详细标签: constituent parsing sequence-to-sequence encoder-decoder transformer pre-trained model syntax 或 搜索:

利用预训练的编码器-解码器Transformer进行序列到序列的句法成分分析 / Exploiting Pre-trained Encoder-Decoder Transformers for Sequence-to-Sequence Constituent Parsing


1️⃣ 一句话总结

本文研究了如何利用预训练的编码器-解码器模型(如BART、mBART和T5)来将句法成分分析任务转化为序列到序列的翻译问题,实验证明该方法在连续和不连续句库上均显著优于此前基于BERT等编码器模型的序列到序列方法,并达到了与专为成分分析设计的顶尖系统相当的性能。

源自 arXiv: 2605.13373