Supervised Fine-tuning with Synthetic Rationale Data Hurts Real-World Disease Prediction

📄 Abstract - Supervised Fine-tuning with Synthetic Rationale Data Hurts Real-World Disease Prediction

Supervised fine-tuning with synthetic rationale data is widely assumed to improve language model performance on clinical prediction tasks by teaching models not just what to predict but why. We test this assumption on five-year Alzheimer's disease and related dementias (ADRD) prediction from longitudinal health histories. Across a large-scale controlled experiment of 504 configurations, we find that rationale-based SFT consistently and substantially hurts prediction performance relative to label-only fine-tuning. The degradation persists across model families and data scales, and is not resolved by using a reasoning-oriented base model. Crucially, the failure is not explained by poor rationale quality: human expert annotation confirms that the generated rationales are medically accurate and faithfully grounded in patient-specific evidence, and few-shot experiments show that the same rationales improve performance when used as inference-time demonstrations rather than training targets. We identify the root cause as a structural conflict between narrative plausibility and discriminative optimization. We hope our work paves the path toward a more precise understanding of when and how rationale-based supervision helps and when it does not, guiding the responsible development of language models for high-stakes clinical prediction.

使用合成推理数据进行监督微调反而损害真实世界的疾病预测 / Supervised Fine-tuning with Synthetic Rationale Data Hurts Real-World Disease Prediction

1️⃣ 一句话总结

该论文通过大规模实验发现，在临床预测任务中让语言模型学习生成医学解释（推理过程）再进行微调，反而会显著降低疾病预测的准确性，即使生成的解释本身是准确无误的，这是因为模型在追求叙述合理性时会干扰对患者数据的判别优化。

← 返回列表

菜单

AI 帮我研读全文

1️⃣ 一句话总结

密码管理

设置密码

修改密码

移除密码

菜单

AI 帮我研读全文

1️⃣ 一句话总结

获取最新论文摘要