节点文献
基于指数加权的基音平滑算法
Method of pitch smoothing based on exponential weighting
【摘要】 提出了一种用于语音合成的语音片断基音平滑技术。在基于波形拼接的语音合成中,一般使用TD-PSOLA算法进行基频和时长的修改,但是用传统的TD-PSOLA算法进行的基频修改是针对片断整体而言,所以仍然不能很好的解决语音合成中的拼接单元之间的基频不连续问题,特别是在片断接合处。由于基元片断提取自不同语境的语料,合成语音听起来明显感觉到音高的不自然。对传统的TD-PSOLA算法进行了改进,以基音周期为间隔对语音片断信号进行分帧,通过指数加权相应帧的方法来进行平滑处理,经听音测试,较好的解决了拼接片断间的不连续现象。
【Abstract】 A method of pitch smoothing for unit segments is presented. In speech synthsis based on waveform concatenative synthesis,time domain pitch synchronous overlap add (TD-PSOLA) algorithm is often used to modify pitch and duration. However, conventionalTD-PSOLA algorighm works on whole segment in pitch smoothing, it is still poor to resolve discontinuity between unit segment, parti-cularly at the joint. Because the unit is selected from different context corpus, so it is felt obviously that discontinuous pitch appears asif synthetic speech comes from different intonation. Smooth pitch is tried to smooth according to context by weighting exponential functionon each-frame splitted by pitch mark. The listenning test shows that this method well resolve discontinuity problem at the joint.
【Key words】 pitch smoothing; exponential weighting; TD-PSOLA; speech synthesis; F0 contours; pitch marking;
- 【文献出处】 计算机工程与设计 ,Computer Engineering and Design , 编辑部邮箱 ,2006年17期
- 【分类号】TN912.33
- 【被引频次】1
- 【下载频次】130