菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-03-23
📄 Abstract - Agentic Personas for Adaptive Scientific Explanations with Knowledge Graphs

AI explanation methods often assume a static user model, producing non-adaptive explanations regardless of expert goals, reasoning strategies, or decision contexts. Knowledge graph-based explanations, despite their capacity for grounded, path-based reasoning, inherit this limitation. In complex domains such as scientific discovery, this assumption fails to capture the diversity of cognitive strategies and epistemic stances among experts, preventing explanations that foster deeper understanding and informed decision-making. However, the scarcity of human experts limits the use of direct human feedback to produce adaptive explanations. We present a reinforcement learning approach for scientific explanation generation that incorporates agentic personas, structured representations of expert reasoning strategies, that guide the explanation agent towards specific epistemic preferences. In an evaluation of knowledge graph-based explanations for drug discovery, we tested two personas that capture distinct epistemic stances derived from expert feedback. Results show that persona-driven explanations match state-of-the-art predictive performance while persona preferences closely align with those of their corresponding experts. Adaptive explanations were consistently preferred over non-adaptive baselines (n = 22), and persona-based training reduces feedback requirements by two orders of magnitude. These findings demonstrate how agentic personas enable scalable adaptive explainability for AI systems in complex and high-stakes domains.

顶级标签: agents natural language processing model training
详细标签: explainable ai knowledge graphs reinforcement learning scientific discovery adaptive explanations 或 搜索:

基于知识图谱与智能体角色实现自适应科学解释 / Agentic Personas for Adaptive Scientific Explanations with Knowledge Graphs


1️⃣ 一句话总结

这篇论文提出了一种利用强化学习和模拟专家思维方式的智能体角色,来为知识图谱生成更贴合不同专家认知偏好的自适应科学解释方法,在药物发现等复杂领域中既能保持高预测性能,又大幅减少了对真人反馈的依赖。

源自 arXiv: 2603.21846