基于颜色的情感表征在语音情感识别中的应用 / Color-based Emotion Representation for Speech Emotion Recognition
1️⃣ 一句话总结
这篇论文提出了一种用颜色属性(如色调、饱和度、明度)来连续、直观地表示语音情感的新方法,并通过构建回归模型和多任务学习,证明了该方法能有效提升语音情感识别的性能和可解释性。
Speech emotion recognition (SER) has traditionally relied on categorical or dimensional labels. However, this technique is limited in representing both the diversity and interpretability of emotions. To overcome this limitation, we focus on color attributes, such as hue, saturation, and value, to represent emotions as continuous and interpretable scores. We annotated an emotional speech corpus with color attributes via crowdsourcing and analyzed them. Moreover, we built regression models for color attributes in SER using machine learning and deep learning, and explored the multitask learning of color attribute regression and emotion classification. As a result, we demonstrated the relationship between color attributes and emotions in speech, and successfully developed color attribute regression models for SER. We also showed that multitask learning improved the performance of each task.
基于颜色的情感表征在语音情感识别中的应用 / Color-based Emotion Representation for Speech Emotion Recognition
这篇论文提出了一种用颜色属性(如色调、饱和度、明度)来连续、直观地表示语音情感的新方法,并通过构建回归模型和多任务学习,证明了该方法能有效提升语音情感识别的性能和可解释性。
源自 arXiv: 2602.16256