
Entity Recognition Method for Judicial Documents Based on BERT Model


【Author】 CHEN Jian; HE Tao; WEN Ying-you; MA Lin-tao

【Institution】 School of Computer Science and Engineering / Neusoft Research Institute, Northeastern University

【Abstract】 Manual analysis of case files is prone to omitting case entities and extracts features inefficiently. To address this, a pre-trained bidirectional encoder representations from Transformers (BERT) model is used: its parameters are fine-tuned on a manually annotated corpus, and the semantic encodings output by the encoder are then decoded by a long short-term memory (LSTM) network with a conditional random field (CRF) layer to complete entity extraction. The pre-trained model's large parameter count, strong feature-extraction ability, and multi-dimensional semantic representation of entities effectively improve entity extraction. Experimental results show that the proposed model achieves over 89% entity-extraction accuracy, significantly outperforming traditional recurrent neural network and convolutional neural network models.
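The abstract's final decoding step — a CRF layer choosing the best tag sequence over the encoder's per-token scores — can be illustrated with a standalone Viterbi search. This is a hypothetical sketch, not the authors' implementation: the BIO tag set, emission scores (which would come from the BERT + LSTM encoder), and transition matrix below are all made up for illustration.

```python
# Hypothetical sketch of CRF decoding as described in the abstract:
# Viterbi search over per-token emission scores and a tag-transition
# matrix recovers the most likely BIO tag sequence. All scores here
# are invented; in the paper they would come from the trained model.

TAGS = ["O", "B-PER", "I-PER"]  # minimal BIO tag set for illustration

def viterbi_decode(emissions, transitions):
    """Return the highest-scoring tag sequence.

    emissions: one row of per-tag scores per token.
    transitions: transitions[i][j] = score of moving from tag i to tag j.
    """
    n_tags = len(emissions[0])
    # scores[j] = best score of any path ending in tag j at the current token
    scores = list(emissions[0])
    backpointers = []
    for row in emissions[1:]:
        step_back, new_scores = [], []
        for j in range(n_tags):
            best_i = max(range(n_tags),
                         key=lambda i: scores[i] + transitions[i][j])
            step_back.append(best_i)
            new_scores.append(scores[best_i] + transitions[best_i][j] + row[j])
        scores = new_scores
        backpointers.append(step_back)
    # Trace the best path backwards from the highest-scoring final tag
    best = max(range(n_tags), key=lambda j: scores[j])
    path = [best]
    for step_back in reversed(backpointers):
        best = step_back[best]
        path.append(best)
    path.reverse()
    return [TAGS[t] for t in path]

# Toy scores for a two-token person name:
emissions = [
    [0.1, 2.0, 0.0],   # token 1 strongly prefers B-PER
    [0.2, 0.0, 1.5],   # token 2 prefers I-PER
]
# Discourage I-PER unless it follows B-PER or I-PER:
transitions = [
    [0.0, 0.0, -5.0],  # O     -> {O, B-PER, I-PER}
    [0.0, 0.0,  1.0],  # B-PER -> {O, B-PER, I-PER}
    [0.0, 0.0,  1.0],  # I-PER -> {O, B-PER, I-PER}
]
print(viterbi_decode(emissions, transitions))  # -> ['B-PER', 'I-PER']
```

The transition scores are what let the CRF enforce label consistency (e.g. I-PER cannot start a span), which per-token classification alone cannot guarantee.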

【Funding】 National Key R&D Program of China (2018YFC0830601); Key R&D Program of Liaoning Province (2019JH2/10100027); Fundamental Research Funds for the Central Universities (N171802001); Liaoning Revitalization Talents Program (XLYC1802100)
  • 【Source】 Journal of Northeastern University (Natural Science), No. 10, 2020
  • 【CLC Number】 TP391.1
  • 【Cited by】 22
  • 【Downloads】 553