菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-06-22
📄 Abstract - UnBias-Plus: Detect, Explain, and Rewrite Bias

Bias in natural language remains a persistent challenge in both human-written and AI-generated content, affecting domains such as journalism, education, and AI research. Most existing detection methods identify only the presence of bias, with limited support for granular detection, interpretable explanations, neutral rewriting, and openly available trained models. We present UnBias-Plus, an open-source toolkit unifying (1) segment-level multi-class bias classification, (2) biased span localization, (3) neutral text rewriting, and (4) reasoning for each decision. Available via Python, CLI, REST API, and web interfaces, UnBias-Plus supports accessible bias analysis. The toolkit, source code, models, datasets, and documentation are publicly available.

顶级标签: natural language processing llm
详细标签: bias detection text rewriting explainability open-source toolkit 或 搜索:

UnBias-Plus:检测、解释与改写偏见 / UnBias-Plus: Detect, Explain, and Rewrite Bias


1️⃣ 一句话总结

本文提出了一个名为UnBias-Plus的开源工具包,它不仅能判断文本中是否存在偏见,还能精确定位偏见的片段、解释原因并将其改写为中性表达,从而帮助用户更全面地理解和纠正语言偏见。

源自 arXiv: 2606.23412