开源大语言模型在辅助日语病理报告撰写中的性能评估 / Performance Evaluation of Open-Source Large Language Models for Assisting Pathology Report Writing in Japanese
1️⃣ 一句话总结
这项研究评估了七种开源大语言模型在辅助撰写日语病理报告方面的表现,发现它们在需要推理的结构化报告生成和错字纠正任务中很有用,但在生成解释性文本时效果因人而异。
The performance of large language models (LLMs) for supporting pathology report writing in Japanese remains unexplored. We evaluated seven open-source LLMs from three perspectives: (A) generation and information extraction of pathology diagnosis text following predefined formats, (B) correction of typographical errors in Japanese pathology reports, and (C) subjective evaluation of model-generated explanatory text by pathologists and clinicians. Thinking models and medical-specialized models showed advantages in structured reporting tasks that required reasoning and in typo correction. In contrast, preferences for explanatory outputs varied substantially across raters. Although the utility of LLMs differed by task, our findings suggest that open-source LLMs can be useful for assisting Japanese pathology report writing in limited but clinically relevant scenarios.
开源大语言模型在辅助日语病理报告撰写中的性能评估 / Performance Evaluation of Open-Source Large Language Models for Assisting Pathology Report Writing in Japanese
这项研究评估了七种开源大语言模型在辅助撰写日语病理报告方面的表现,发现它们在需要推理的结构化报告生成和错字纠正任务中很有用,但在生成解释性文本时效果因人而异。
源自 arXiv: 2603.11597