nav emailalert searchbtn searchbox tablepage yinyongbenwen piczone journalimg journalInfo journalinfonormal searchdiv searchzone qikanlogo popupnotification paper paperNew
2012, 04, v.44 68-72
一种改进的HMM-RBF情感语音识别方法
基金项目(Foundation): 国家自然科学基金资助项目,编号60905039
邮箱(Email):
DOI:
摘要:

针对隐马尔科夫模型和径向基神经网络识别语音情感的缺陷,提出了一种新的基于两者的混合模型识别方法.将神经预测器引入隐马尔科夫模型计算状态观察概率,使得隐马尔科夫模型能有效利用语音帧间信息,同时又利用状态累计概率输入径向基神经网络分类,避免特征向量时间规整的麻烦.选取高兴、愤怒、悲伤、惊奇、平静五类情感,以平静情感为参照求取特征向量进行实验,结果表明,相对于单一隐马尔科夫模型和常用混合模型方法,该混合模型识别性能有明显改善,并且加入噪声后的识别效果仍然较好.

Abstract:

According to the analysis of the emotional speech signal,a hybrid model for speech emotion recognition,consisting of a neural network with radial basis functions and hidden Markov models,was described.Because radial basis functions could describe correlation between the frames,this hybrid model utilized a neural net with radial basis functions to approximate posterior probabilities of hidden Markov models states.In addition,the discriminating recognition results were recognized by radial basis functions.In the experiments,five kinds of emotional speech such as happy,anger,sadness,surprise,and calm were recorded.The relative values of the first four kinds of emotional characteristic parameters and calm emotional characteristic parameters were regarded as the input characteristic vector.The experiments showed that the improved hybrid model with additional noise to the signal was better than hidden Markov models and the pre-existing hybrid model.

参考文献

[1]赵腊生,张强,魏小鹏.语音情感识别研究进展[J].计算机应用研究,2009,26(2):428-432.

[2]章国宝,宋清华,费树岷,等.语音情感识别研究[J].计算机技术与发展,2009,19(1):92-96.

[3]王朋,陈树中.基于混合模型HMM/RBF的数字语音识别[J].计算机工程,2002,28(12):136-138.

[4]何振亚,顾明亮,王太君,等.基于HMM与RBF的混合语音识别新方法[J].数据采集与处理,1999,14(2):153-156.

[5]罗毅.一种基于HMM和ANN的语音情感识别分类器[J].微计算机信息,2007,23(12):218-219.

[6]Polur P D,Miller G E.Investigation of an HMM/ANN hybrid structure in pattern recognition application using cepstral analysisof dysarthric(distorted)speech signals[J].Medical Engineering&Physics,2006,28(8):741-748.

[7]王海龙,孟令启,马金亮,等.基于RBF神经网络的热轧碳钢变形抗力预测[J].郑州大学学报:理学版,2007,39(3):131-135.

[8]Guo Xianhai.Study of emotion recognition based on electrocardiogram and RBF neural network[J].Procedia Engineering,2011,15:2408-2412.

[9]Moataz E A,Mohamed S K,Fakhri K.Survey on speech emotion recognition:features,classification schemes,and databases[J].Pattern Recognition,2011,44(3):572-587.

[10]Shahnaz C,Zhu Weiping,Ahmad M O,et al.Pitch estimation based on a harmonic sinusoidal autocorrelation model and atime-domain matching scheme[J].IEEE Transactions on Audio,Speech,and Language Processing,2012,20(1):322-335.

[11]Yeh J H,Pao T L,Lin C Y,et al.Segment-based emotion recognition from continuous Mandarin Chinese speech[J].Com-puters in Human Behavior,2011,27(5):1545-1552.

[12]Nwe T L,Foo S W,Silva L C,et al.Speech emotion recognition using hidden Markov models[J].Speech Communication,2003,41(4):603-623.

[13]Ho K,Leung A C,Sum J.Training RBF network to tolerate single node fault[J].Neurocomputing,2011,74(6):1046-1052.

基本信息:

中图分类号:TN912.34

引用信息:

[1]王杰,耿丽红,朱晓东.一种改进的HMM-RBF情感语音识别方法[J],2012,44(04):68-72.

基金信息:

国家自然科学基金资助项目,编号60905039

检 索 高级检索