基于大语言模型的阿拉伯语金融情感分析:来自沙特市场的证据 / LLM-Based Financial Sentiment Analysis in Arabic: Evidence from Saudi Markets
1️⃣ 一句话总结
该论文开发了一套专为沙特阿拉伯市场设计的阿拉伯语金融情感分析系统,通过整合官方新闻和社交媒体数据,结合命名实体识别与公司词典,成功构建了大规模标注语料库,实现了对股票市场投资者情绪的有效捕捉和分析。
Investor sentiment shapes financial markets, yet modeling sentiment in Arabic financial contexts remains challenging due to linguistic complexity and limited resources. We present an Arabic NLP framework for large-scale financial sentiment analysis tailored to the Saudi market, integrating official financial news and social media to capture institutional and public investor sentiment. The framework constructs a large Arabic financial corpus through a multi-stage pipeline encompassing data collection, cleaning, deduplication, entity linking, and sentiment annotation. Transformer-based NER combined with a curated company lexicon links textual mentions to canonical company identifiers, with sentiment labels assigned using a five-class scheme. The resulting dataset of 84K samples supports company-level sentiment aggregation and analysis of sentiment dynamics relative to stock market behavior on the Saudi Exchange. Experimental results demonstrate reliable and scalable Arabic financial sentiment analysis.
基于大语言模型的阿拉伯语金融情感分析:来自沙特市场的证据 / LLM-Based Financial Sentiment Analysis in Arabic: Evidence from Saudi Markets
该论文开发了一套专为沙特阿拉伯市场设计的阿拉伯语金融情感分析系统,通过整合官方新闻和社交媒体数据,结合命名实体识别与公司词典,成功构建了大规模标注语料库,实现了对股票市场投资者情绪的有效捕捉和分析。
源自 arXiv: 2605.19714