节点文献

面向依存文法分析的搭配抽取方法研究

A Method to Fetch Collocations Orienting DependencyGrammar

推荐 CAJ下载
PDF下载
不支持迅雷等下载工具，请取消加速工具后下载。

【Author】 Che Wanxiang Liu Ting Qin Bing Li ShengInformation Retrieval Group, Harbin Institute of Technology 15001

【摘要】本文通过对经分词和词性标注的大规模语料库(1.8GB)的统计,计算出语料库中出现的词对个数、距离及方差,并应用t检验的改进方法,得到了词对之间的“搭配强度系数”值R,以此来衡量它们之间这种搭配关系的强弱.这一系数直接面向依存文法分析,以此得到一个句子中各个词的搭配关系强弱序列表,以后将要从此表中得到依存文法树.目前我们可以在智能搜索引擎等多种场合找到此种方法的应用.更多还原

【Abstract】 In this paper, we statistic a very large corpus (1.8GB) and work out the word-pairs’ number, distance’s mean and variance. And then we use a modified t test method to fetch a "Collocations Coefficient" R in order to weigh how strong of their relationship. This coefficient orients to the analysis of dependency grammar straight. In this way, we have a word-pairs list in a sentence sorted by R’s order. Later, we can get a dependency grammar tree from this list. Now we can find several applications using this method such as intelligent searching engine, etc.更多还原

【关键词】搭配；搭配强度系数； t检验；依存文法；智能搜索引擎；
【Key words】 collocations Collocations Coefficient t test dependency grammarintelligent searching engine；

【会议录名称】自然语言理解与机器翻译——全国第六届计算语言学联合学术会议论文集

【会议名称】全国第六届计算语言学联合学术会议

【会议时间】2001-08
【会议地点】中国山西
【分类号】H085

【主办单位】山西大学计算机系

知网节下载

节点文献中：

本文链接的文献网络图示:

本文的引文网络

节点文献