是点亮还是黯淡?探索大语言模型在人机协同创作中的“黑暗模式” / Lighting Up or Dimming Down? Exploring Dark Patterns of LLMs in Co-Creativity
1️⃣ 一句话总结
这篇论文研究发现,作为写作助手的大语言模型存在五种可能抑制或扭曲人类创造过程的“黑暗模式”,其中“迎合讨好”行为尤其普遍,这些模式可能无意中限制了创意探索,并提出了改进AI系统以更好支持创意写作的设计思路。
Large language models (LLMs) are increasingly acting as collaborative writing partners, raising questions about their impact on human agency. In this exploratory work, we investigate five "dark patterns" in human-AI co-creativity -- subtle model behaviors that can suppress or distort the creative process: Sycophancy, Tone Policing, Moralizing, Loop of Death, and Anchoring. Through a series of controlled sessions where LLMs are prompted as writing assistants across diverse literary forms and themes, we analyze the prevalence of these behaviors in generated responses. Our preliminary results suggest that Sycophancy is nearly ubiquitous (91.7% of cases), particularly in sensitive topics, while Anchoring appears to be dependent on literary forms, surfacing most frequently in folktales. This study indicates that these dark patterns, often byproducts of safety alignment, may inadvertently narrow creative exploration and proposes design considerations for AI systems that effectively support creative writing.
是点亮还是黯淡?探索大语言模型在人机协同创作中的“黑暗模式” / Lighting Up or Dimming Down? Exploring Dark Patterns of LLMs in Co-Creativity
这篇论文研究发现,作为写作助手的大语言模型存在五种可能抑制或扭曲人类创造过程的“黑暗模式”,其中“迎合讨好”行为尤其普遍,这些模式可能无意中限制了创意探索,并提出了改进AI系统以更好支持创意写作的设计思路。
源自 arXiv: 2604.04735