菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-11
📄 Abstract - DECO-MWE: building a linguistic resource of Korean multiword expressions for feature-based sentiment analysis

This paper aims to construct a linguistic resource of Korean Multiword Expressions for Feature-Based Sentiment Analysis (FBSA): DECO-MWE. Dealing with multiword expressions (MWEs) has been a critical issue in FBSA since many constructs reveal lexical idiosyncrasy. To construct linguistic resources of sentiment MWEs efficiently, we utilize the Local Grammar Graph (LGG) methodology: DECO-MWE is formalized as a Finite-State Transducer that represents lexical-syntactic restrictions on MWEs. In this study, we built a corpus of cosmetics review texts, which show particularly frequent occurrences of MWEs. Based on an empirical examination of the corpus, four types of MWEs have been distinguished. The DECO-MWE thus covers the following four categories: Standard Polarity MWEs (SMWEs), Domain-Dependent Polarity MWEs (DMWEs), Compound Named Entity MWEs (EMWEs) and Compound Feature MWEs (FMWEs). The retrieval performance of the DECO-MWE shows 0.806 f-measure in the test corpus. This study brings a twofold outcome: first, a sizeable general-purpose polarity MWE lexicon, which may be broadly used in FBSA; second, a finite-state methodology adopted in this study to treat domain-dependent MWEs such as idiosyncratic polarity expressions, named entity expressions or feature expressions, and which may be reused in describing linguistic properties of other corpus domains.

顶级标签: natural language processing data
详细标签: multiword expressions sentiment analysis korean finite-state transducer lexical-syntactic 或 搜索:

DECO-MWE:面向基于特征的情感分析的韩语多词表达语言资源构建 / DECO-MWE: building a linguistic resource of Korean multiword expressions for feature-based sentiment analysis


1️⃣ 一句话总结

本文构建了一个名为DECO-MWE的韩语多词表达语言资源,通过局部语法图方法自动识别四种情感相关多词表达类型,在化妆品评论测试中达到0.806的F值,为基于特征的情感分析提供了通用领域和领域依赖的情感表达工具。

源自 arXiv: 2605.10295