节点文献
基于复数帧段特征的语音情感识别方法
A Method of Speech Emotion Recognition Based on Complex Frame Segment Feature
【摘要】 提出了一种基于复数帧段特征的语音情感识别方法,采用相继的复数帧组成的特征参数矢量作为语音情感识别GMM的输入,能有效地在语音情感识别GMM中引入帧间相关动态信息,同时为了改善复数帧段输入GMM的输出概率密度函数性能,在GMM的前端增加语音帧段参数压缩的主分量分析神经网络(PCANN)。语音情感识别实验证实了引入帧间相关动态信息方法的有效性,新方法在识别率上较状态输出独立GMM方法有一定程度的提升。
【Abstract】 A method of speech emotion recognition is proposed based on complex frame segment feature. Through combining several successive frames as a segmental unit witch is treated as an input vector for Gaussian Mixture Model(GMM). The inter-frame correlation information is effectively introduced into the process of speech emotion recognition. Furthermore, principal components analysis neural nerwork(PCANN)is adopted before GMM for the purpose of frame parameter compression, to improve the performance of output probability density function. Corresponding experiments are performed and the results show that the recognition rate of the proposed method is improved to some extend comparing with the traditional status output independent GMM,thus the effectiveness of introducing dynamic inter-frame correlation information into the process of speech emotion recognition is validated.
【Key words】 speech emotion recognition; Gaussian mixture model; principal components analysis neural network; complex frame segment feature;
- 【文献出处】 电子器件 ,Chinese Journal of Electron Devices , 编辑部邮箱 ,2022年02期
- 【分类号】TN912.34
- 【下载频次】51