菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-04-30
📄 Abstract - Context as Prior: Bayesian-Inspired Intent Inference for Non-Speaking Agents with a Household Cat Testbed

Many agents in real-world environments cannot reliably communicate their goals through language, including household pets, pre-verbal infants, and other non-speaking embodied agents. In such settings, intent must be inferred from incomplete behavioral observations in context-rich environments. This creates a core ambiguity: observable behavior is often noisy or underspecified, while context provides strong prior information but can also induce brittle shortcut predictions if used naively. We present CatSignal, a Bayesian-inspired probabilistic framework for multimodal intent inference that models spatial context as a prior-like constraint and behavioral observations as evidence. Rather than treating context as an ordinary input feature, our method uses a context-gated Product-of-Experts formulation to compute posterior-like intent distributions from context, pose dynamics, and acoustic cues. We instantiate this formulation in a household cat setting as a focused proof-of-concept for intent inference in non-speaking agents. Under Leave-One-Video-Out evaluation on a multimodal domestic cat dataset, the proposed prior-guided fusion achieves the best overall accuracy of 77.72%, outperforming feature concatenation (71.83%) and stronger late-fusion baselines. More importantly, it substantially reduces context-driven shortcut failures in ambiguous cases. While simpler fusion strategies remain competitive in Macro-F1 and selective prediction, the proposed model provides the strongest overall accuracy and the best suppression of context-based shortcut collapse.

顶级标签: machine learning agents multi-modal
详细标签: intent inference bayesian methods non-speaking agents product-of-experts context prior 或 搜索:

上下文作为先验:针对非语言智能体基于贝叶斯启发的意图推断——以家猫测试平台为例 / Context as Prior: Bayesian-Inspired Intent Inference for Non-Speaking Agents with a Household Cat Testbed


1️⃣ 一句话总结

本文提出了一种受贝叶斯思想启发的概率模型CatSignal,通过将环境上下文视为先验知识、行为观测视为证据,来推断无法用语言交流的智能体(如家猫)的意图,实验证明该方法比传统融合方式更准确,并能有效减少因过度依赖上下文而导致的错误预测。

源自 arXiv: 2604.27445