节点文献
基于双标注框架的实体关系联合抽取
Joint extraction of entities and relations based on double-labeled architecture
【摘要】 实体关系抽取有流水线和联合抽取两种,联合抽取能更有效地抽取实体关系,流水线的适应能力更灵活。为解决实体关系抽取中的关系重叠问题,提出一种双标注实体关系抽取框架。使用联合解码的方式抽取自然文本中的主体实体,使用流水线方式抽取出客体实体。使用联合解码保证抽取精度的同时继承流水线的灵活性。所提模型在信息抽取数据集DUIE和远程监督数据集NYT上进行实验,其结果表明,该模型与基线模型相比具有竞争力。
【Abstract】 Relations extraction methods can be divided into two types including pipeline method and joint extraction, and the joint extraction model can extract the relation more effectively, and the adaptability of pipeline is more flexible. To solve the problem of relation overlap in relation extraction, the double-labeled relations extraction framework was proposed. The joint decoding was used to extract the subject entity in the natural text, and the object entity was extracted by pipeline. This technique ensured the extraction accuracy using the joint decoding method, and inherited the flexibility of the pipeline method. The proposed framework was experimented on the information extraction dataset DUIE and the remote supervision dataset NYT. The results show that this model can achieve competitive performance compared with the baseline model.
【Key words】 entity and relations extraction; sequence tagging; joint extraction; relation overlap; information extraction; attention mechanism; natural language processing(NLP);
- 【文献出处】 计算机工程与设计 ,Computer Engineering and Design , 编辑部邮箱 ,2024年06期
- 【分类号】TP391.1
- 【下载频次】69