菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-27
📄 Abstract - Better heads do not guarantee better binarized constituency parsing

We revisit punctuation-aware tree binarization for constituency parsing and ask whether dependency-induced headedness improves binary parser supervision. Although learned heads substantially outperform rule-based heads in intrinsic head prediction, they do not yield consistent parsing gains after debinarization. In particular, punctuation-conditioned evaluation shows that learned headedness underperforms rule-based binarization in macro-average punctuation-sensitive $F_1$, despite a small overall gain on CTB. Similar instability appears under cross-treebank transfer. These results suggest that \ycc{linguistically grounded} headedness is not necessarily parser-optimal when used as a binarization control signal. The paper presents a negative result: better head prediction does not imply better punctuation-sensitive constituency parsing.

顶级标签: natural language processing theory model evaluation
详细标签: constituency parsing tree binarization headedness punctuation negative result 或 搜索:

更好的头部选择并不保证更好的二值化成分句法分析 / Better heads do not guarantee better binarized constituency parsing


1️⃣ 一句话总结

本文通过实验挑战了一个常见假设:在成分句法分析中,使用更准确的基于学习的句法头部(head)来指导树结构的二值化,并不比简单的基于规则的头部带来更一致的解析性能提升,甚至在标点符号敏感的评估指标上表现更差,表明语言学上合理的头部选择并非总是对解析器最优。

源自 arXiv: 2605.28131