减少来回沟通:结构化提示的对比研究 / Less Back-and-Forth: A Comparative Study of Structured Prompting
1️⃣ 一句话总结
这项研究通过对比三种提示方式(原始提示、清单式提示和追问式提示),发现给大语言模型提供简单的检查清单式提示,能显著提升回答质量并减少用户反复修改的次数,效果优于让模型主动追问的提示方法。
Large language models (LLMs) are widely used for open-ended tasks, but underspecified prompts can lead to low-quality answers and additional interaction. This paper studies whether structured prompt design improves response quality while reducing user effort. We compare three prompt conditions: a raw prompt, a checklist-improved prompt, and a clarifying-question prompt. We evaluate these conditions across four task types--summarization, planning, explanation, and coding--using three LLM systems: ChatGPT, Claude, and Grok. Each output is scored with a unified rubric covering task completion, correctness, compliance, and clarity. Checklist-improved prompts achieved the highest mean rubric score, 7.50 out of 8, compared with 5.67 for raw prompts and 6.67 for clarifying-question prompts. Checklist prompts also produced the best quality-effort tradeoff, using fewer average tokens than both raw and clarifying prompts. These results suggest that a simple prompt checklist can improve LLM responses while reducing unnecessary interaction.
减少来回沟通:结构化提示的对比研究 / Less Back-and-Forth: A Comparative Study of Structured Prompting
这项研究通过对比三种提示方式(原始提示、清单式提示和追问式提示),发现给大语言模型提供简单的检查清单式提示,能显著提升回答质量并减少用户反复修改的次数,效果优于让模型主动追问的提示方法。
源自 arXiv: 2605.20149