社交媒体上的网络欺凌治理:从内容识别到干预的统一框架 / Cyberbullying Governance on Social Media: A Unified Framework from Content Identification to Intervention
1️⃣ 一句话总结
本文提出了一个覆盖网络欺凌从发现到干预全过程的统一治理框架,将以往零散的、事后检测的方法转变为主动、持续的监管策略,并探讨了多模态内容、算法公平性和生成式AI带来的新挑战。
The proliferation of social media platforms and online communities has inadvertently catalyzed the spread of cyberbullying, hate speech, and other forms of online toxicity, making the effective governance of such harm a critical societal and computational challenge. While significant strides have been made in automating content moderation, existing research predominantly treats cyberbullying governance as passive, isolated detection at the post level. This reductionist view overlooks the continuous behavioral dynamics of users, the structural diffusion of toxic events, and the critical need for proactive mitigation. To bridge these gaps, this paper proposes a unified full-lifecycle governance framework that shifts the paradigm of cyberbullying governance from isolated static detection toward integrated, continuous, and proactive moderation. Drawing on cyberbullying research and adjacent fields, we systematically synthesize the state-of-the-art literature across four interconnected stages: (1) Content Identification, (2) User and Behavior Modeling, (3) Diffusion Dynamics and Early Warning, and (4) Intervention and Governance. Furthermore, we review available datasets and evaluation practices, and discuss emerging challenges including multimodality, explainability, algorithmic fairness, and the dual-use risks of generative AI, providing a roadmap for future research toward a safer and more resilient digital ecosystem.
社交媒体上的网络欺凌治理:从内容识别到干预的统一框架 / Cyberbullying Governance on Social Media: A Unified Framework from Content Identification to Intervention
本文提出了一个覆盖网络欺凌从发现到干预全过程的统一治理框架,将以往零散的、事后检测的方法转变为主动、持续的监管策略,并探讨了多模态内容、算法公平性和生成式AI带来的新挑战。
源自 arXiv: 2605.27584