📄 Abstract - EthicsMH: A Pilot Benchmark for Ethical Reasoning in Mental Health AI

The deployment of large language models (LLMs) in mental health and other sensitive domains raises urgent questions about ethical reasoning, fairness, and responsible alignment. Yet, existing benchmarks for moral and clinical decision-making do not adequately capture the unique ethical dilemmas encountered in mental health practice, where confidentiality, autonomy, beneficence, and bias frequently intersect. To address this gap, we introduce Ethical Reasoning in Mental Health (EthicsMH), a pilot dataset of 125 scenarios designed to evaluate how AI systems navigate ethically charged situations in therapeutic and psychiatric contexts. Each scenario is enriched with structured fields, including multiple decision options, expert-aligned reasoning, expected model behavior, real-world impact, and multi-stakeholder viewpoints. This structure enables evaluation not only of decision accuracy but also of explanation quality and alignment with professional norms. Although modest in scale and developed with model-assisted generation, EthicsMH establishes a task framework that bridges AI ethics and mental health decision-making. By releasing this dataset, we aim to provide a seed resource that can be expanded through community and expert contributions, fostering the development of AI systems capable of responsibly handling some of society's most delicate decisions.
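The per-scenario structure described in the abstract can be pictured with a minimal Python sketch. The abstract lists the kinds of fields but not the dataset's actual schema, so every field name below is a hypothetical stand-in. The `preferred_option` index and the `decision_accuracy` helper are also assumptions, added only to illustrate what "decision accuracy" could mean for this kind of record.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Scenario:
    """One ethically charged scenario; all field names are hypothetical."""
    scenario_id: str
    description: str
    options: List[str]            # multiple decision options
    expert_reasoning: str         # expert-aligned reasoning
    expected_model_behavior: str  # expected model behavior
    real_world_impact: str        # real-world impact
    preferred_option: int         # assumed: index of the reference choice in `options`
    viewpoints: Dict[str, str] = field(default_factory=dict)  # stakeholder -> viewpoint


def decision_accuracy(scenarios: List[Scenario], predictions: Dict[str, int]) -> float:
    """Fraction of scenarios where the predicted option index matches the reference.

    `predictions` maps scenario_id to the option index chosen by a model.
    Scenarios with no prediction count as misses.
    """
    if not scenarios:
        return 0.0
    hits = sum(
        1 for s in scenarios
        if predictions.get(s.scenario_id) == s.preferred_option
    )
    return hits / len(scenarios)
```

Decision accuracy is only one of the axes the abstract mentions. Explanation quality and alignment with professional norms would need a separate comparison against fields such as `expert_reasoning`, which this sketch does not attempt.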

Top-level tags: llm benchmark medical
Detailed tags: ethical reasoning mental health ai evaluation clinical decision-making responsible ai

📄 Paper Summary

EthicsMH: A Pilot Benchmark for Ethical Reasoning in Mental Health AI


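1️⃣ One-Sentence Summary

This paper introduces EthicsMH, a pilot dataset designed specifically to evaluate how well AI systems handle ethical dilemmas in mental health, with the aim of advancing responsible AI.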

