菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-04
📄 Abstract - U-Define: Designing User Workflows for Hard and Soft Constraints in LLM-Based Planning

LLMs are increasingly used for end-user task planning, yet their black-box nature limits users' ability to ensure reliability and control. While recent systems incorporate verification techniques, it remains unclear how users can effectively apply such rigid constraints to represent intent or adapt to real-world variability. For example, prior work finds that hard-only constraints are too rigid, and numeric flexibility weights confuse users. We investigate how interaction workflows can better support users in applying constraints to guide LLM-generated plans, examining whether abstracting strictness into high-level types (i.e., hard and soft) paired with distinct verification mechanisms helps users more reliably express and align intent. We present U-Define, a system that lets users define constraints in natural language and categorize them as either hard rules that must not be violated or soft preferences that allow flexibility. U-Define verifies these types through complementary methods: formal model checking for hard constraints and LLM-as-judge evaluation for soft ones. Through a technical evaluation and user studies with general and expert participants, we find that user-defined constraint types improve perceived usefulness, performance, and satisfaction while maintaining usability. These findings provide insights for designing flexible yet reliable constraint-based workflows.

顶级标签: llm agents model evaluation
详细标签: constraint types user workflow task planning verification human-ai interaction 或 搜索:

U-Define:设计面向大语言模型规划中硬约束与软约束的用户工作流 / U-Define: Designing User Workflows for Hard and Soft Constraints in LLM-Based Planning


1️⃣ 一句话总结

本文提出U-Define系统,让用户用自然语言定义两种约束——必须遵守的‘硬约束’和可灵活调整的‘软约束’,并通过不同验证方法(硬约束用形式化模型检查,软约束用大语言模型评判)来提升大语言模型生成计划的可控性和可靠性,实验表明该方法比传统单一硬约束更有效、更易用。

源自 arXiv: 2605.02765