基于大语言模型的社交媒体抑郁症风险评估 / Depression Risk Assessment in Social Media via Large Language Models
1️⃣ 一句话总结
本研究利用大语言模型(LLM)对Reddit帖子进行八类与抑郁相关的情绪识别,并计算一个加权严重性指数,从而评估用户的抑郁症风险,该方法在零样本场景下与专为任务微调的模型表现相当,并成功应用于大规模社交媒体数据中,证明了其作为低成本、可扩展的群体心理健康监测工具的可行性。
Depression is one of the most prevalent and debilitating mental health conditions worldwide, frequently underdiagnosed and undertreated. The proliferation of social media platforms provides a rich source of naturalistic linguistic signals for the automated monitoring of psychological well-being. In this work, we propose a system based on Large Language Models (LLMs) for depression risk assessment in Reddit posts, through multi-label classification of eight depression-associated emotions and the computation of a weighted severity index. The method is evaluated in a zero-shot setting on the annotated DepressionEmo dataset (~6,000 posts) and applied in-the-wild to 469,692 comments collected from four subreddits over the period 2024-2025. Our best model, gemma3:27b, achieves micro-F1 = 0.75 and macro-F1 = 0.70, results competitive with purpose-built fine-tuned models (BART: micro-F1 = 0.80, macro-F1 = 0.76). The in-the-wild analysis reveals consistent and temporally stable risk profiles across communities, with marked differences between r/depression and r/anxiety. Our findings demonstrate the feasibility of a cost-effective, scalable approach for large-scale psychological monitoring.
基于大语言模型的社交媒体抑郁症风险评估 / Depression Risk Assessment in Social Media via Large Language Models
本研究利用大语言模型(LLM)对Reddit帖子进行八类与抑郁相关的情绪识别,并计算一个加权严重性指数,从而评估用户的抑郁症风险,该方法在零样本场景下与专为任务微调的模型表现相当,并成功应用于大规模社交媒体数据中,证明了其作为低成本、可扩展的群体心理健康监测工具的可行性。
源自 arXiv: 2604.19887