更好的头部选择并不保证更好的二值化成分句法分析 / Better heads do not guarantee better binarized constituency parsing
1️⃣ 一句话总结
本文通过实验挑战了一个常见假设:在成分句法分析中,使用更准确的基于学习的句法头部(head)来指导树结构的二值化,并不比简单的基于规则的头部带来更一致的解析性能提升,甚至在标点符号敏感的评估指标上表现更差,表明语言学上合理的头部选择并非总是对解析器最优。
We revisit punctuation-aware tree binarization for constituency parsing and ask whether dependency-induced headedness improves binary parser supervision. Although learned heads substantially outperform rule-based heads in intrinsic head prediction, they do not yield consistent parsing gains after debinarization. In particular, punctuation-conditioned evaluation shows that learned headedness underperforms rule-based binarization in macro-average punctuation-sensitive $F_1$, despite a small overall gain on CTB. Similar instability appears under cross-treebank transfer. These results suggest that \ycc{linguistically grounded} headedness is not necessarily parser-optimal when used as a binarization control signal. The paper presents a negative result: better head prediction does not imply better punctuation-sensitive constituency parsing.
更好的头部选择并不保证更好的二值化成分句法分析 / Better heads do not guarantee better binarized constituency parsing
本文通过实验挑战了一个常见假设:在成分句法分析中,使用更准确的基于学习的句法头部(head)来指导树结构的二值化,并不比简单的基于规则的头部带来更一致的解析性能提升,甚至在标点符号敏感的评估指标上表现更差,表明语言学上合理的头部选择并非总是对解析器最优。
源自 arXiv: 2605.28131