菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-04-08
📄 Abstract - LaScA: Language-Conditioned Scalable Modelling of Affective Dynamics

Predicting affect in unconstrained environments remains a fundamental challenge in human-centered AI. While deep neural embeddings dominate contemporary approaches, they often lack interpretability and limit expert-driven refinement. We propose a novel framework that uses Language Models (LMs) as semantic context conditioners over handcrafted affect descriptors to model changes in Valence and Arousal. Our approach begins with interpretable facial geometry and acoustic features derived from structured domain knowledge. These features are transformed into symbolic natural-language descriptions encoding their affective implications. A pretrained LM processes these descriptions to generate semantic context embeddings that act as high-level priors over affective dynamics. Unlike end-to-end black-box pipelines, our framework preserves feature transparency while leveraging the contextual abstraction capabilities of LMs. We evaluate the proposed method on the Aff-Wild2 and SEWA datasets for affect change prediction. Experimental results show consistent improvements in accuracy for both Valence and Arousal compared to handcrafted-only and deep-embedding baselines. Our findings demonstrate that semantic conditioning enables interpretable affect modelling without sacrificing predictive performance, offering a transparent and computationally efficient alternative to fully end-to-end architectures

顶级标签: natural language processing multi-modal model evaluation
详细标签: affective computing language-conditioned modeling interpretable ai valence-arousal prediction semantic context 或 搜索:

LaScA:基于语言条件的大规模情感动态建模 / LaScA: Language-Conditioned Scalable Modelling of Affective Dynamics


1️⃣ 一句话总结

这篇论文提出了一种新方法,通过使用语言模型来解释面部几何和声音特征的情感含义,从而在保持模型可解释性的同时,更准确地预测人的情绪变化。

源自 arXiv: 2604.07193