节点文献

应用倒谱特征的带噪语音端点检测方法

Endpoint Detection of Noisy Speech by the Use of Cepstrum

  • 推荐 CAJ下载
  • PDF下载
  • 不支持迅雷等下载工具,请取消加速工具后下载。

【作者】 韦晓东胡光锐任晓林

【Author】 WEI Xiao dong 1, HU Guang rui 2, REN Xiao lin 2 1. Shanghai Jiaotong Univ. Bell Labs Comm. and Network Joint Lab. Shanghai 200030, China; 2. Dept. of Electronic Eng. Shanghai Jiaotong Univ., Shanghai 200030

【机构】 上海交通大学与贝尔实验室通信和网络联合实验室上海交通大学电子工程系!上海200030

【摘要】 传统的语音端点检测方法以信号的短时能量、过零率等简单特征作为判决特征参数.这些方法在实际应用中,尤其当信号信噪比比较低时,无法满足系统的需要.文中利用语音信号的倒谱特征作为判决抽样信号帧是否为语音信号的依据,并提出了倒谱距离测量法和循环神经网络法.通过对宽带噪声-白噪声干扰情况和一种特殊噪声——汽车噪声情况的实验,发现倒谱特征参数的语音信号端点检测方法在噪声环境下具有传统的能量方法无法比拟的优越性,更适合于实际应用

【Abstract】 Most practical automatic speech recognition(ASR) systems must work with a small signal noise ratio(SNR), and the conventional speech detection methods based on some simple features such as energy cannot work well in the noisy environments. In this paper, cepstrum was used as the feature to detect the voice activity. Two algorithms for endpoint detection of noisy speech signal were proposed. The first one takes the cepstral distances as the decision thresholds instead of short time energy. The second approach takes advantages of recurrent neural networks. The experiments show that the high accurate rates can be obtained in the noisy cases.

【关键词】 端点检测倒谱距离神经网络
【Key words】 endpoint detectioncepstral distanceneural network
【基金】 贝尔实验室(中国)上海分部资助
  • 【文献出处】 上海交通大学学报 ,JOURNAL OF SHANGHAI JIAOTONG UNIVERSITY , 编辑部邮箱 ,2000年02期
  • 【分类号】TN912.34
  • 【被引频次】43
  • 【下载频次】357
节点文献中: 

本文链接的文献网络图示:

本文的引文网络