èŠ‚ç‚¹æ–‡çŒ®

åŸºäºŽAndroidçš„ä¼—åŒ…æ–‡æœ¬æ ‡æ³¨ç³»ç»Ÿçš„è®¾è®¡ä¸Žå®žçŽ°

The Design and Implementation of Crowdsourcing Platform for Text Labeling System Based on Android

åˆ†é¡µä¸‹è½½
åˆ†ç« ä¸‹è½½
æ•´æœ¬ä¸‹è½½
åœ¨çº¿é˜…è¯»
ä¸æ”¯æŒè¿…é›·ç‰ä¸‹è½½å·¥å…·ï¼Œè¯·å–æ¶ˆåŠ é€Ÿå·¥å…·åŽä¸‹è½½ã€‚

ã€ä½œè€…ã€‘ å”æ•ï¼›

ã€æ‘˜è¦ã€‘ æ–‡æœ¬ä¿¡æ¯æ˜¯æœ€åŸºæœ¬çš„ä¿¡æ¯å½¢å¼,åˆ©ç”¨è‡ªç„¶è¯è¨€å¤„ç†æŠ€æœ¯å¯ä»¥å¯¹æµ·é‡çš„æ–‡æœ¬æ•°æ®è¿›è¡Œåˆ†æžå¤„ç†ã€‚è€Œæ™ºèƒ½åŒ–è‡ªåŠ¨å¤„ç†ä¿¡æ¯çš„é¦–è¦æ¡ä»¶æ˜¯è¦æœ‰å·²ç»æ ‡æ³¨çš„æ•°æ®ä½œä¸ºè®ç»ƒé›†å¯¹æ•°æ®æ¨¡åž‹è¿›è¡Œè®ç»ƒã€‚å› æ¤,å¯¹æ–‡æœ¬æ•°æ®è¿›è¡Œæ ‡æ³¨å°±æˆä¸ºåœ¨å¯¹è‡ªç„¶è¯è¨€å¤„ç†ç®—æ³•è¿›è¡Œç ”ç©¶ä¹‹å‰éœ€è¦è§£å†³çš„ä¸€ä¸ªé—®é¢˜ã€‚ç”±äºŽæ–‡æœ¬å¤„ç†ç®—æ³•å¤šç§å¤šæ ·,éœ€è¦å¯¹æ–‡æœ¬è¿›è¡Œä¸åŒè§’åº¦çš„ç ”ç©¶,å°±éœ€è¦å®žçŽ°å¤šç§ç±»åž‹çš„æ–‡æœ¬æ ‡æ³¨ã€‚æœ¬æ–‡æ€»ç»“äº†å›½å†…å¤–æ•°æ®æ ‡æ³¨å¹³å°çš„å‘å±•çŽ°çŠ¶,é’ˆå¯¹ç›®å‰æ•°æ®æ ‡æ³¨å¹³å°æ ‡æ³¨ç±»åž‹ç¹å¤š,ä½†æ˜¯é²œæœ‰ä¸“ä¸šçš„æ–‡æœ¬æ ‡æ³¨å¹³å°çš„ç‰¹ç‚¹;ç»“åˆä¼—åŒ…å¹³å°ç”¨æˆ·é‡å¤§ã€æ•ˆçŽ‡é«˜ã€æˆæœ¬ä½Žçš„ç‰¹ç‚¹,æå‡ºæž„å»ºåŸºäºŽä¼—åŒ…çš„æ–‡æœ¬æ ‡æ³¨ç³»ç»Ÿçš„å¿…è¦æ€§å’Œå¯è¡Œæ€§,ä»Žè€Œæœ‰æ•ˆè§£å†³æ–‡æœ¬æ ‡æ³¨é—®é¢˜ã€‚æœ¬æ–‡è®¾è®¡å®žçŽ°äº†ä¸€ä¸ªåŸºäºŽä¼—åŒ…å¹³å°çš„æ–‡æœ¬æ ‡æ³¨ç³»ç»Ÿã€‚è¯¥ç³»ç»Ÿåˆ†ä¸ºä»»åŠ¡å‘å¸ƒã€ä»»åŠ¡æ‰§è¡Œå’Œä»»åŠ¡ç®¡ç†ä¸‰ä¸ªæ¨¡å—ã€‚åœ¨è¯¥ç³»ç»Ÿä¸,æ–‡æœ¬æ ‡æ³¨å·¥ä½œä»¥ä»»åŠ¡ä¸ºè½½ä½“,æ–‡æœ¬æ ‡æ³¨ä»»åŠ¡è¢«åˆ’åˆ†æˆä¸åŒçš„ç±»åž‹ã€‚åœ¨ä»»åŠ¡å‘å¸ƒæ¨¡å—ç”¨æˆ·é€‰æ‹©ä»»åŠ¡ç±»åž‹,ç„¶åŽæŠŠéœ€è¦æ ‡æ³¨çš„æ–‡æœ¬å†…å®¹ä»¥æ–‡ä»¶çš„å½¢å¼ä¸Šä¼ åˆ°è¯¥ç³»ç»Ÿã€‚åœ¨ä»»åŠ¡æ‰§è¡Œæ¨¡å—ç”¨æˆ·å¯ä»¥é€šè¿‡é€‰å–æ–‡ä»¶å†…å®¹ã€é€‰æ‹©æ ‡ç¾ã€è¿žçº¿å’Œæ‹–æ‹½æ–‡æœ¬ç‰ä¸åŒæ“ä½œæ–¹å¼,å¯¹æ–‡æœ¬æ•°æ®è¿›è¡Œä¸åŒç±»åž‹çš„æ ‡æ³¨ã€‚åœ¨ä»»åŠ¡ç®¡ç†æ¨¡å—ç”¨æˆ·å¯ä»¥æŸ¥çœ‹è‡ªå·±å‘å¸ƒæˆ–å‚ä¸Žçš„ä»»åŠ¡ã€‚è¯¥ç³»ç»ŸåŽå°ä½¿ç”¨Spring Bootæ¡†æž¶è¿›è¡Œæå»º,å‰ç«¯ä½¿ç”¨And roidç§»åŠ¨ç«¯é¡µé¢å±•ç¤ºæ•°æ®ã€‚è¯¥ç³»ç»Ÿè®¾è®¡å¹¶å®žçŽ°äº†å¯¹æ–‡æœ¬çš„å…ç§ç±»åž‹çš„æ ‡æ³¨,å®Œæˆäº†é¢„æœŸåŠŸèƒ½,åŽæœŸå¯ä»¥æ‰©å±•æ–°çš„æ–‡æœ¬æ ‡æ³¨ç±»åž‹ã€‚æœ¬æ–‡ä¸»è¦å¯¹ä¸‰ç§æ–‡æœ¬æ ‡æ³¨ç±»åž‹çš„è®¾è®¡ä¸Žå®žçŽ°è¿›è¡Œäº†æè¿°ã€‚è¯¥ç³»ç»Ÿè‡´åŠ›äºŽä¸ºè‡ªç„¶è¯è¨€å¤„ç†çš„æ‰€æœ‰ç®—æ³•æä¾›é«˜è´¨é‡ã€å¤šç§ç±»çš„å¯é æ ‡æ³¨æ•°æ®é›†;åˆ©ç”¨å¯é æ•°æ®æé«˜ç®—æ³•è®ç»ƒçš„å‡†ç¡®åº¦,ç¼©å‡è®ç»ƒç®—æ³•å‰æœŸå‡†å¤‡çš„æ—¶é—´,æŽ¨åŠ¨è‡ªç„¶è¯è¨€å¤„ç†æŠ€æœ¯çš„å‘å±•ã€‚æ›´å¤š è¿˜åŽŸ

ã€Abstractã€‘ Text information is the most basic form of information,and natural language processing technology can be used to analyze and process massive amounts of text data.The first condition for processing information intelligently and automatically is to own the text data that has already been labeled as thetraining set to train the data model.Therefore,labeling text data has become a problem to be solved before the study on natural language processing algorithms.Because there are many kinds of text processing algorithms,it is necessary to study the text at different angles,and it is necessary to implement multiple types of text labeling.This thesis has summarized the development status of data labeling platform at home and abroad,aiming at the characteristics of the current data labeling platform:there are many kinds of data labeling types,but there are few professional text labeling platforms;combined with the characteristics of crowdsourcing platform:users with large quantity,high efficiency and low cost,so the necessity and feasibility of constructing a crowdsourcing-based text labeling system is proposed to solve the data labeling problem effectivelyThis thesis has designed and implemented a text labeling system based on crowdsourcing platform.The system is divided into three modules:task publishing module task executing module and task management module.In this system,the text labeling work is task-based and the text labeling tasks are divided into different types.In the task publishing module,users can select a text labeling type,and then upload the text content that wants to be labeled to the system in the form of file.In the task executing module,users can choose different ways of operation such as selecting file content,selecting labels,connecting lines and draging text to implement different types of text labeling.In the task management module,users can view tasks that are published or participated in by himself.The systemâ€™s back end uses the Spring Boot framework to build,and the front end uses Android mobile pages to display data.The system has designed and implemented six types of text labeling to label text,and has completed the expected functions,the system can extend new text labeling types in late period.The system is dedicated to providing high-quality,multi-category and reliable labeling data sets for all algorithms of natural language processing;and improving the accuracy of algorithm training by using reliable data,reducing the preparation time required for training algorithms,and promoting the development of natural language processing technology.æ›´å¤š è¿˜åŽŸ

ã€å…³é”®è¯ã€‘ ä¼—åŒ…ï¼› æ–‡æœ¬æ ‡æ³¨ï¼› ä¿¡æ¯æŠ½å–ï¼› å…³ç³»æŠ½å–ï¼› Spring Bootï¼› Androidï¼›
ã€Key wordsã€‘ Crowdsourcingï¼› Text Annotationï¼› Information Extractionï¼› Relationship Extractionï¼› Spring Bootï¼› Androidï¼›

ã€ç½‘ç»œå‡ºç‰ˆæŠ•ç¨¿äººã€‘ å—äº¬å¤§å¦

ã€åˆ†ç±»å·ã€‘TP391.1
ã€è¢«å¼•é¢‘æ¬¡ã€‘2
ã€ä¸‹è½½é¢‘æ¬¡ã€‘320

çŸ¥ç½‘èŠ‚ä¸‹è½½

èŠ‚ç‚¹æ–‡çŒ®ä¸ï¼š

æœ¬æ–‡é“¾æŽ¥çš„æ–‡çŒ®ç½‘ç»œå›¾ç¤º:

æœ¬æ–‡çš„å¼•æ–‡ç½‘ç»œ

èŠ‚ç‚¹æ–‡çŒ®