èŠ‚ç‚¹æ–‡çŒ®

æ·±åº¦ä¸‰ç»´é‡å»º:æ–¹æ³•ã€æ•°æ®å’ŒæŒ‘æˆ˜ï¼ˆè‹±æ–‡ï¼‰

Deep 3D reconstruction:methods, data, and challenges

æŽ¨è CAJä¸‹è½½
PDFä¸‹è½½
ä¸æ”¯æŒè¿…é›·ç‰ä¸‹è½½å·¥å…·ï¼Œè¯·å–æ¶ˆåŠ é€Ÿå·¥å…·åŽä¸‹è½½ã€‚

ã€ä½œè€…ã€‘ åˆ˜å½©éœžï¼› å”å¾·æ…§ï¼› çŽ‹å°‘å¸†ï¼› çŽ‹å¿—å‹‡ï¼› æŽæ•¬åŽï¼› å°¹å®æ‰ï¼›

ã€Authorã€‘ Caixia LIU;Dehui KONG;Shaofan WANG;Zhiyong WANG;Jinghua LI;Baocai YIN;Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Beijing Institute of Artificial Intelligence,Faculty of Information Technology, Beijing University of Technology;Multimedia Laboratory, School of Computer Science, University of Sydney;

ã€é€šè®¯ä½œè€…ã€‘ çŽ‹å°‘å¸†;

ã€æœºæž„ã€‘ åŒ—äº¬å·¥ä¸šå¤§å¦ä¿¡æ¯å¦éƒ¨åŒ—äº¬äººå·¥æ™ºèƒ½ç ”ç©¶é™¢å¤šåª’ä½“ä¸Žæ™ºèƒ½è½¯ä»¶æŠ€æœ¯åŒ—äº¬å¸‚é‡ç‚¹å®žéªŒå®¤ï¼› æ‚‰å°¼å¤§å¦è®¡ç®—æœºç§‘å¦å¦é™¢å¤šåª’ä½“å®žéªŒå®¤ï¼›

ã€æ‘˜è¦ã€‘ ä¸‰ç»´å½¢çŠ¶é‡å»ºæ˜¯è®¡ç®—æœºè§†è§‰ã€è®¡ç®—æœºå›¾å½¢å¦ã€æ¨¡å¼è¯†åˆ«å’Œè™šæ‹ŸçŽ°å®žç‰é¢†åŸŸçš„é‡è¦ç ”ç©¶è¯¾é¢˜ã€‚çŽ°æœ‰ä¸‰ç»´é‡å»ºæ–¹æ³•é€šå¸¸å˜åœ¨ä¸¤ä¸ªç“¶é¢ˆ:(1)å®ƒä»¬æ¶‰åŠå¤šä¸ªäººå·¥è®¾è®¡é˜¶æ®µ,å¯¼è‡´ç´¯ç§¯è¯¯å·®,ä¸”éš¾ä»¥è‡ªåŠ¨å¦ä¹ ä¸‰ç»´å½¢çŠ¶çš„è¯ä¹‰ç‰¹å¾;(2)å®ƒä»¬ä¸¥é‡ä¾èµ–å›¾åƒå†…å®¹å’Œè´¨é‡,ä»¥åŠç²¾ç¡®æ ¡å‡†çš„æ‘„åƒæœºã€‚å› æ¤,è¿™äº›æ–¹æ³•çš„é‡å»ºç²¾åº¦éš¾ä»¥æé«˜ã€‚åŸºäºŽæ·±åº¦å¦ä¹ çš„ä¸‰ç»´é‡å»ºæ–¹æ³•é€šè¿‡åˆ©ç”¨æ·±åº¦ç½‘ç»œè‡ªåŠ¨å¦ä¹ ä½Žè´¨é‡å›¾åƒä¸çš„ä¸‰ç»´å½¢çŠ¶è¯ä¹‰ç‰¹å¾,å…‹æœäº†è¿™ä¸¤ä¸ªç“¶é¢ˆã€‚ç„¶è€Œ,è¿™äº›æ–¹æ³•å…·æœ‰å¤šç§ä½“ç³»æ¡†æž¶,ä½†æ˜¯è‡³ä»Šæœªæœ‰æ–‡çŒ®å¯¹å®ƒä»¬ä½œæ·±å…¥åˆ†æžå’Œæ¯”è¾ƒã€‚æœ¬æ–‡å¯¹åŸºäºŽæ·±åº¦å¦ä¹ çš„ä¸‰ç»´é‡å»ºæ–¹æ³•è¿›è¡Œå…¨é¢ç»¼è¿°ã€‚é¦–å…ˆ,åŸºäºŽä¸åŒæ·±åº¦å¦ä¹ æ¨¡åž‹æ¡†æž¶,å°†åŸºäºŽæ·±åº¦å¦ä¹ çš„ä¸‰ç»´é‡å»ºæ–¹æ³•åˆ†ä¸º4ç±»:é€’å½’ç¥žç»ç½‘ç»œã€æ·±è‡ªç¼–ç å™¨ã€ç”Ÿæˆå¯¹æŠ—ç½‘ç»œå’Œå·ç§¯ç¥žç»ç½‘ç»œ,å¹¶å¯¹ç›¸åº”æ–¹æ³•ä½œè¯¦ç»†åˆ†æžã€‚å…¶æ¬¡,è¯¦ç»†ä»‹ç»ä¸Šè¿°æ–¹æ³•å¸¸ç”¨çš„4ä¸ªä»£è¡¨æ€§æ•°æ®åº“ã€‚å†æ¬¡,å¯¹åŸºäºŽæ·±åº¦å¦ä¹ çš„ä¸‰ç»´é‡å»ºæ–¹æ³•è¿›è¡Œç»¼åˆæ¯”è¾ƒ,åŒ…æ‹¬ä¸åŒæ–¹æ³•åœ¨åŒä¸€æ•°æ®åº“ã€åŒä¸€æ–¹æ³•åœ¨ä¸åŒæ•°æ®åº“ä»¥åŠåŒä¸€æ–¹æ³•å¯¹äºŽä¸åŒè§†è§’ä¸ªæ•°è¾“å…¥çš„ç»“æžœæ¯”è¾ƒã€‚æœ€åŽ,è®¨è®ºäº†åŸºäºŽæ·±åº¦å¦ä¹ çš„ä¸‰ç»´é‡å»ºæ–¹æ³•çš„å‘å±•è¶‹åŠ¿ã€‚æ›´å¤š è¿˜åŽŸ

ã€Abstractã€‘ Three-dimensional(3D) reconstruction of shapes is an important research topic in the fields of computer vision, computer graphics, pattern recognition, and virtual reality. Existing 3D reconstruction methods usually suffer from two bottlenecks:(1) they involve multiple manually designed states which can lead to cumulative errors,but can hardly learn semantic features of 3D shapes automatically;(2) they depend heavily on the content and quality of images, as well as precisely calibrated cameras. As a result, it is difficult to improve the reconstruction accuracy of those methods. 3D reconstruction methods based on deep learning overcome both of these bottlenecks by automatically learning semantic features of 3D shapes from low-quality images using deep networks. However, while these methods have various architectures, in-depth analysis and comparisons of them are unavailable so far. We present a comprehensive survey of 3D reconstruction methods based on deep learning. First, based on different deep learning model architectures, we divide 3D reconstruction methods based on deep learning into four types, recurrent neural network, deep autoencoder, generative adversarial network, and convolutional neural network based methods,and analyze the corresponding methodologies carefully. Second, we investigate four representative databases that are commonly used by the above methods in detail. Third, we give a comprehensive comparison of 3D reconstruction methods based on deep learning, which consists of the results of different methods with respect to the same database,the results of each method with respect to different databases, and the robustness of each method with respect to the number of views. Finally, we discuss future development of 3D reconstruction methods based on deep learning.æ›´å¤š è¿˜åŽŸ

ã€å…³é”®è¯ã€‘ æ·±åº¦å¦ä¹ æ¨¡åž‹ï¼› ä¸‰ç»´é‡å»ºï¼› å¾ªçŽ¯ç¥žç»ç½‘ç»œï¼› æ·±åº¦è‡ªç¼–ç å™¨ï¼› ç”Ÿæˆå¯¹æŠ—ç½‘ç»œï¼› å·ç§¯ç¥žç»ç½‘ç»œï¼›
ã€Key wordsã€‘ Deep learning modelsï¼› Three-dimensional reconstructionï¼› Recurrent neural networkï¼› Deep autoencoderï¼› Generative adversarial networkï¼› Convolutional neural networkï¼›

ã€åŸºé‡‘ã€‘ Project supported by the National Natural Science Foundation of China (Nos. 61772049, 61632006, 61876012, U19B2039, and 61906011);the Beijing Natural Science Foundation of China(No. 4202003)

ã€æ–‡çŒ®å‡ºå¤„ã€‘ Frontiers of Information Technology & Electronic Engineering ,ä¿¡æ¯ä¸Žç”µåå·¥ç¨‹å‰æ²¿(è‹±æ–‡) , ç¼–è¾‘éƒ¨é‚®ç®± ,2021å¹´05æœŸ

ã€åˆ†ç±»å·ã€‘TP391.41
ã€è¢«å¼•é¢‘æ¬¡ã€‘1
ã€ä¸‹è½½é¢‘æ¬¡ã€‘213

çŸ¥ç½‘èŠ‚ä¸‹è½½

èŠ‚ç‚¹æ–‡çŒ®ä¸ï¼š

æœ¬æ–‡é“¾æŽ¥çš„æ–‡çŒ®ç½‘ç»œå›¾ç¤º:

æœ¬æ–‡çš„å¼•æ–‡ç½‘ç»œ

èŠ‚ç‚¹æ–‡çŒ®

èŠ‚ç‚¹æ–‡çŒ®

æ·±åº¦ä¸‰ç»´é‡å»º:æ–¹æ³•ã€æ•°æ®å’ŒæŒ‘æˆ˜ï¼ˆè‹±æ–‡ï¼‰

Deep 3D reconstruction:methods, data, and challenges

æœ¬æ–‡é“¾æŽ¥çš„æ–‡çŒ®ç½‘ç»œå›¾ç¤º:

æ·±åº¦ä¸‰ç»´é‡å»º:æ–¹æ³•ã€æ•°æ®å’ŒæŒ‘æˆ˜ï¼ˆè‹±æ–‡ï¼‰