菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-19
📄 Abstract - Less Back-and-Forth: A Comparative Study of Structured Prompting

Large language models (LLMs) are widely used for open-ended tasks, but underspecified prompts can lead to low-quality answers and additional interaction. This paper studies whether structured prompt design improves response quality while reducing user effort. We compare three prompt conditions: a raw prompt, a checklist-improved prompt, and a clarifying-question prompt. We evaluate these conditions across four task types--summarization, planning, explanation, and coding--using three LLM systems: ChatGPT, Claude, and Grok. Each output is scored with a unified rubric covering task completion, correctness, compliance, and clarity. Checklist-improved prompts achieved the highest mean rubric score, 7.50 out of 8, compared with 5.67 for raw prompts and 6.67 for clarifying-question prompts. Checklist prompts also produced the best quality-effort tradeoff, using fewer average tokens than both raw and clarifying prompts. These results suggest that a simple prompt checklist can improve LLM responses while reducing unnecessary interaction.

顶级标签: llm natural language processing
详细标签: prompt engineering structured prompting prompt checklist response quality comparative study 或 搜索:

减少来回沟通:结构化提示的对比研究 / Less Back-and-Forth: A Comparative Study of Structured Prompting


1️⃣ 一句话总结

这项研究通过对比三种提示方式(原始提示、清单式提示和追问式提示),发现给大语言模型提供简单的检查清单式提示,能显著提升回答质量并减少用户反复修改的次数,效果优于让模型主动追问的提示方法。

源自 arXiv: 2605.20149