菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-04-15
📄 Abstract - Generating Concept Lexicalizations via Dictionary-Based Cross-Lingual Sense Projection

We study the task of automatically expanding WordNet-style lexical resources to new languages through sense generation. We generate senses by associating target-language lemmas with existing lexical concepts via semantic projection. Given a sense-tagged English corpus and its translation, our method projects English synsets onto aligned target-language tokens and assigns the corresponding lemmas to those synsets. To generate these alignments and ensure their quality, we augment a pre-trained base aligner with a bilingual dictionary, which is also used to filter out incorrect sense projections. We evaluate the method on multiple languages, comparing it to prior methods, as well as dictionary-based and large language model baselines. Results show that the proposed project-and-filter strategy improves precision while remaining interpretable and requiring few external resources. We plan to make our code, documentation, and generated sense inventories accessible.

顶级标签: natural language processing llm data
详细标签: lexical resource expansion cross-lingual sense projection wordnet dictionary augmentation sense generation 或 搜索:

基于词典的跨语言词义投射生成概念词汇化 / Generating Concept Lexicalizations via Dictionary-Based Cross-Lingual Sense Projection


1️⃣ 一句话总结

这篇论文提出了一种利用双语词典辅助跨语言词义对齐的方法,能够自动、高效地将WordNet这类词汇资源扩展到新语言,在保证准确性和可解释性的同时,几乎不需要额外的资源。

源自 arXiv: 2604.14397