èŠ‚ç‚¹æ–‡çŒ®

Webå†…å®¹æŒ–æŽ˜æŠ€æœ¯ç ”ç©¶

Research on Web Content Mining

æŽ¨è CAJä¸‹è½½
PDFä¸‹è½½
ä¸æ”¯æŒè¿…é›·ç‰ä¸‹è½½å·¥å…·ï¼Œè¯·å–æ¶ˆåŠ é€Ÿå·¥å…·åŽä¸‹è½½ã€‚

ã€ä½œè€…ã€‘ æ¶‚æ‰¿èƒœï¼› é²æ˜Žç¾½ï¼› é™†çŽ‰æ˜Œï¼›

ã€Authorã€‘ TU Chengî€‘shengî€‹î€ˆ1,2î€‰,LU Mingî€‘yuî€‹2,LU Yuî€‘changî€‹2 (1.Dept.of Computer Science,Chongqing Three Gorges College,Chongqing 404000,China; 2.Dept.of Computer Science & Technology,State Key Laboratory of Intelligent Technology & System,Tsinghua University,Beijing 100084,China)

ã€æœºæž„ã€‘ é‡åº†ä¸‰å³¡å¦é™¢è®¡ç®—æœºç§‘å¦ç³»ï¼› æ¸…åŽå¤§å¦è®¡ç®—æœºç§‘å¦ä¸ŽæŠ€æœ¯ç³»æ™ºèƒ½æŠ€æœ¯ä¸Žç³»ç»Ÿå›½å®¶é‡ç‚¹å®žéªŒå®¤ï¼› æ¸…åŽå¤§å¦è®¡ç®—æœºç§‘å¦ä¸ŽæŠ€æœ¯ç³»æ™ºèƒ½æŠ€æœ¯ä¸Žç³»ç»Ÿå›½å®¶é‡ç‚¹å®žéªŒå®¤ é‡åº†404000æ¸…åŽå¤§å¦è®¡ç®—æœºç§‘å¦ä¸ŽæŠ€æœ¯ç³»æ™ºèƒ½æŠ€æœ¯ä¸Žç³»ç»Ÿå›½å®¶é‡ç‚¹å®žéªŒå®¤åŒ—äº¬100084ï¼› åŒ—äº¬100084ï¼› åŒ—äº¬100084ï¼›

ã€æ‘˜è¦ã€‘ ç®€è¦ä»‹ç»äº†WebæŒ–æŽ˜çš„æ¦‚å¿µã€åˆ†ç±»ä»¥åŠå…¶åŠŸèƒ½,é˜è¿°äº†WebæŒ–æŽ˜ä¸Žä¼ ç»Ÿæ•°æ®æŒ–æŽ˜ä»¥åŠWebä¿¡æ¯æ£€ç´¢ä¹‹é—´çš„å…³ç³»ã€‚ç»™å‡ºäº†Webå†…å®¹æŒ–æŽ˜çš„ä¸åŒåˆ†ç±»æ–¹æ³•ã€æ–‡æœ¬ä»¥åŠå¤šåª’ä½“æ–‡æœ¬æ•°æ®æŒ–æŽ˜çš„å®šä¹‰ã€åˆ†ç±»ä¸Žåº”ç”¨ã€‚é‡ç‚¹åˆ†æžäº†Webæ–‡æœ¬æŒ–æŽ˜çš„æ–¹æ³•,åŒ…æ‹¬æ–‡æœ¬çš„ç‰¹å¾è¡¨ç¤ºä¸ŽæŠ½å–ã€æ–‡æœ¬çš„åˆ†ç±»ä¸Žèšç±»ç‰,è®¨è®ºäº†å¤šåª’ä½“æ–‡æœ¬åˆ†ç±»æŒ–æŽ˜æ–¹æ³•ã€‚æ›´å¤š è¿˜åŽŸ

ã€Abstractã€‘ This paper briefly introduces the conception of web mining,including the taxonomy and function,and discusses the relationship between information mining and retrieval on the web,and the difference between web mining and data mining.Then definition and classifications and applications of web text data mining are given,including a taxonomy of content mining.The method of text mining on web are discussed in detail,including text categorization and text clustering,etc.It discusses multimedia text data categorization and its alterationæ›´å¤š è¿˜åŽŸ

ã€å…³é”®è¯ã€‘ WebæŒ–æŽ˜ï¼› Webå†…å®¹æŒ–æŽ˜ï¼› æ–‡æœ¬çš„åˆ†ç±»ï¼› æ–‡æœ¬èšç±»ï¼› å¤šåª’ä½“æ–‡æœ¬æŒ–æŽ˜ï¼›
ã€Key wordsã€‘ Web Miningï¼› Web Content Miningï¼› Text Categorizationï¼› Text Clusteringï¼› Multimedia Text Miningï¼›

ã€åŸºé‡‘ã€‘ å›½å®¶è‡ªç„¶ç§‘å¦åŸºé‡‘é‡å¤§é¡¹ç›®(79990580);å›½å®¶"973"é‡ç‚¹åŸºç¡€ç ”ç©¶å‘å±•é¡¹ç›®(G1998030414)

ã€æ–‡çŒ®å‡ºå¤„ã€‘ è®¡ç®—æœºåº”ç”¨ç ”ç©¶ ,Application Research of Computers , ç¼–è¾‘éƒ¨é‚®ç®± ,2003å¹´11æœŸ

ã€åˆ†ç±»å·ã€‘TP393.09
ã€è¢«å¼•é¢‘æ¬¡ã€‘85
ã€ä¸‹è½½é¢‘æ¬¡ã€‘1040

çŸ¥ç½‘èŠ‚ä¸‹è½½

èŠ‚ç‚¹æ–‡çŒ®ä¸ï¼š

æœ¬æ–‡é“¾æŽ¥çš„æ–‡çŒ®ç½‘ç»œå›¾ç¤º:

æœ¬æ–‡çš„å¼•æ–‡ç½‘ç»œ

èŠ‚ç‚¹æ–‡çŒ®