Deep neural networks compression based on improved clustering
【Abstract】 Deep neural networks are typically over-parametrized, and deep learning models contain significant redundancy, which wastes both computation and memory. To address this problem, a method based on improved clustering is proposed to compress deep neural networks. First, the normally trained network is pruned. Then, K-Means++ clustering is applied to obtain the cluster centers of each layer's weights, achieving weight sharing. Finally, the weights of each layer are quantized. Experiments were carried out on LeNet, AlexNet and VGG-16, in which the proposed method compressed the deep neural networks by a factor of 30 to 40 without any loss of accuracy. The results show that the clustering-based compression method compresses deep neural networks effectively without accuracy loss, making the deployment of deep networks on mobile devices feasible.
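For illustration, below is a minimal sketch of the three-step pipeline the abstract describes (prune, cluster with K-Means++, quantize by weight sharing), assuming magnitude-threshold pruning and scikit-learn's K-Means++ initialization. The function name `compress_layer`, the pruning ratio, and the cluster count are illustrative assumptions, not values taken from the paper.

```python
import numpy as np
from sklearn.cluster import KMeans

def compress_layer(weights, prune_ratio=0.7, n_clusters=32):
    """Sketch of the pipeline: prune a layer's weights by magnitude,
    then share values via K-Means++ centroids. prune_ratio and
    n_clusters are illustrative, not from the paper."""
    w = weights.flatten()
    # Step 1: magnitude pruning -- zero out the smallest weights.
    threshold = np.quantile(np.abs(w), prune_ratio)
    mask = np.abs(w) > threshold
    # Step 2: K-Means++ clustering of the surviving weights.
    survivors = w[mask].reshape(-1, 1)
    km = KMeans(n_clusters=n_clusters, init="k-means++",
                n_init=1, random_state=0).fit(survivors)
    # Step 3: quantization by weight sharing -- each surviving weight
    # is replaced by its cluster centroid, so only n_clusters distinct
    # values (plus zero) remain in the layer.
    w_q = np.zeros_like(w)
    w_q[mask] = km.cluster_centers_[km.labels_].ravel()
    return w_q.reshape(weights.shape), mask.reshape(weights.shape)

# Example: compress a random 256x128 fully connected layer.
layer = np.random.randn(256, 128).astype(np.float32)
compressed, mask = compress_layer(layer)
print("distinct non-zero values:", len(np.unique(compressed[compressed != 0])))
```

Because the compressed layer holds at most `n_clusters` distinct non-zero values, each weight can be stored as a small cluster index plus a shared codebook, which is where the storage savings come from.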
【Key words】 deep neural networks; pruning; K-Means++; deep network compression
- 【Source】 Control Theory & Applications (控制理论与应用), 2019, No. 07
- 【CLC Classification】 TP183
- 【Cited By】 4
- 【Downloads】 170