菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-06-28
📄 Abstract - Cognitive World Models for Process-Level Social Influence Evaluation

Social influence dialogue changes user behavior by altering internal cognitive states. The central evaluation question is whether the user's beliefs, desires, intentions, and emotions measurably change over the course of conversation, a process-oriented criterion that neither surface-level text metrics (BLEU/ROUGE) nor single-score LLM judgments can capture. We propose the \textbf{Cog}nitive \textbf{W}orld \textbf{M}odel \textbf{(CogWM)}, an LLM-based user model that reframes multi-turn dialogue evaluation from ``what did the user say'' to ``how did the user's internal cognitive state evolves.'' CogWM jointly predicts BDI/E cognitive states and user utterances and serves as both a user simulator and an evaluation platform, using a three-tier evaluation framework that covers turn-level fidelity, trajectory-level state dynamics, and task-level composite scoring. Trained via our \textbf{S}ummarize-\textbf{a}nd-\textbf{A}llocate \textbf{(SaA)} annotation pipeline on 150,454 user-turn samples across four social influence scenarios, CogWM achieves 77.6\% emotion accuracy (2.1$\times$ over GPT-5.5). In 3600 multi-agent discrimination trials, it distinguishes six commercial agents by their cognitive influence, with Llama-4-Scout ranking first (CTS +0.233). CogWM moves social influence dialogue evaluation from terminal judgment to process tracking. We have released our code\footnote{\scriptsize Code: this https URL} and models\footnote{Model: this https URL}.

顶级标签: llm natural language processing agents
详细标签: social influence cognitive states user modeling dialogue evaluation multi-agent 或 搜索:

面向过程层社会影响力评估的认知世界模型 / Cognitive World Models for Process-Level Social Influence Evaluation


1️⃣ 一句话总结

本文提出一种名为CogWM的AI用户模型,通过追踪对话中用户信念、欲望、意图和情感等认知状态的变化,代替传统基于文本表面得分或单一评分的方法,更准确地评估对话系统对用户行为的影响过程。

源自 arXiv: 2606.29495