
arXiv submission date: 2026-02-19

📄 Abstract - PEACE 2.0: Grounded Explanations and Counter-Speech for Combating Hate Expressions

The increasing volume of hate speech on online platforms poses significant societal challenges. While the Natural Language Processing community has developed effective methods to automatically detect hate speech, generating responses to it, known as counter-speech, remains an open challenge. We present PEACE 2.0, a novel tool that, besides analysing and explaining why a message is considered hateful or not, also generates a response to it. More specifically, PEACE 2.0 has three main new functionalities, leveraging a Retrieval-Augmented Generation (RAG) pipeline: i) grounding hate speech (HS) explanations in evidence and facts, ii) automatically generating evidence-grounded counter-speech, and iii) exploring the characteristics of counter-speech replies. By integrating these capabilities, PEACE 2.0 enables in-depth analysis and response generation for both explicit and implicit hateful messages.

Top-level tags: natural language processing llm systems

PEACE 2.0: Grounded Explanations and Counter-Speech for Combating Hate Expressions

1️⃣ One-Sentence Summary

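This paper introduces PEACE 2.0, a new tool that not only analyzes and explains why a message is judged to be hate speech, but also automatically generates counter-speech grounded in factual evidence as a response.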



Source: arXiv:2602.17467
