节点文献

最大后验估计和加权近邻回归结合的说话人自适应方法

Speaker adaptation with MAP estimation and weighted neighbor regression

  • 推荐 CAJ下载
  • PDF下载
  • 不支持迅雷等下载工具,请取消加速工具后下载。

【作者】 何磊方棣棠吴文虎

【Author】 HE Lei, FANG Ditang, WU Wenhu (Center of Speech Technology, State Key Laboratory of Intelligent Technology and Systems, Department of Computer Science and Technolgy, Tsinghua University, Beijing 100084, China)

【机构】 清华大学计算机科学与技术系!智能技术与系统国家重点实验室语音技术中心北京100084

【摘要】 提出了一种最大后验 (m aximum a posteriori,MAP)估计和加权近邻回归 (weighted neighbors regression,WNR)相结合的说话人自适应方法。在 MAP自适应中 ,只有自适应数据对应的模型参数可以得到调整。针对这一缺点 ,提出一种基于变换的模型插值 /平滑方法 - WNR,利用模型近邻信息和 MAP自适应结果 ,建立距离加权的回归模型 ,对没有自适应数据的模型完成模型调整。实验证明 ,该方法可以有效地提高 MAP自适应的速度。在自适应数据为 10句时 ,音节误识率降低近 15 % ;而在自适应数据为 2 5 0句时 ,误识率降低 5 0 %以上。此外 ,证明了向量域平滑 (vectorfield sm oothing,VFS)是 WNR方法的一种退化的特例

【Abstract】 This paper describes a novel speaker adaptation framework that combines the maximum a posteriori (MAP) estimation and wighted neighbor regression (WNR) methods. A great deal of adaptation data is required in MAP adaptation because only the parameters of those models with adaptation data can be updated. To alleviate this disadvantage, a technique called WNR is presented in which the parameter relationships between the speaker independent models and the speaker adaptation models are trained by applying distance weighted regression to a set of neighbor model parameters with and without MAP adaptation. The Chinese syllable recognition error is reduced nearly 15 percent with 10 adaptation utterances and more than 50 percent with 250 utterances. In addition, vector field smoothing (VFS) can be proved to be a degenerate case of WNR.

  • 【文献出处】 清华大学学报(自然科学版) ,Journal of Tsinghua University(Science and Technology) , 编辑部邮箱 ,2001年01期
  • 【分类号】TN912.3
  • 【被引频次】3
  • 【下载频次】107
节点文献中: 

本文链接的文献网络图示:

本文的引文网络