节点文献
平行语料库对齐技术的语言学思考
Parallel Corpus Alignment from the Perspective of Linguistics
【摘要】 双语对齐是平行语料库处理的基本环节之一。文章从语言学角度强调当前双语对齐研究能够为今后的工作起到指导作用。通过结合象征单位和翻译单位等概念分析认为,对齐结果应该符合语言学意义上的象征单位属性,对齐模型应该在合适的语言层次即翻译单位上建立双语对应关系,而且语料质量和数量上也应该加以控制。
【Abstract】 Bilingual alignment plays an essential part in parallel corpora processing.Reflections on the contemporary research into bilingual alignment from the perspective of linguistics can offer guidance for future study.Using such concepts as symbolic unit and translation unit,this paper concludes that the alignment results should acquire unit status,that the source text and target text should be alignment dynamically on appropriate scales,and that the quality and quantity of the data should also be scrutinized by the alignment model.
【关键词】 平行语料库;
双语对齐;
象征单位;
翻译单位;
【Key words】 parallel corpus; bilingual alignment; symbolic unit; translation unit;
【Key words】 parallel corpus; bilingual alignment; symbolic unit; translation unit;
【基金】 合肥工业大学科学研究发展基金项目(081705F)
- 【文献出处】 合肥工业大学学报(社会科学版) ,Journal of Hefei University of Technology(Social Sciences) , 编辑部邮箱 ,2009年03期
- 【分类号】H059
- 【被引频次】12
- 【下载频次】404