
Research on the Mapping of TDOA to DOA for Sound Source DOA Estimation


【Author】 Zhang Feng (张峰)

【Author information】 Nanjing University of Aeronautics and Astronautics, Communication and Information Systems, 2014, Master's degree

【Abstract】 Sound source direction of arrival (DOA) estimation is a key technology in microphone array signal processing and is widely used in video conferencing, fault detection, medical diagnosis, military applications, and many other fields. Methods based on multichannel time differences of arrival (TDOAs) are an important class of sound source DOA estimators. However, current research concentrates on acquiring the TDOAs, and the mapping from TDOAs to DOA has received far less attention. A mapping based on least squares support vector regression (LS-SVR) estimates the DOA well, but it has not been studied comprehensively. This thesis studies the LS-SVR-based TDOA-DOA mapping in depth, covering kernel function selection, the construction of multi-kernel LS-SVR, and sparsification analysis. It also proposes a tuning-parameter-free TDOA-DOA mapping method based on sparse representation theory. The main contributions are:
1) Because different kernel functions give different mapping performance, the thesis evaluates LS-SVR built with three common kernels (radial basis, polynomial, and linear) for sound source DOA estimation in reverberant and noisy environments, and compares them with the least squares mapping. The results show that the radial basis kernel gives better estimation performance.
2) Estimated time delays contain outliers in strongly reverberant environments. Based on the characteristics of the TDOA-DOA mapping, the thesis proposes a TDOA processing method based on median filtering to eliminate these outliers. The results show that this method effectively improves DOA mapping performance under strong reverberation.
3) To further improve DOA estimation performance, the thesis combines multiple kernel learning theory with the K-means clustering algorithm and proposes a clustering-based multi-kernel LS-SVR mapping method. Simulation results show that multi-kernel LS-SVR outperforms both single-kernel LS-SVR and the least squares method. In general, the more kernels are used, the better multi-kernel LS-SVR performs, and its advantage becomes more pronounced as the reverberation time increases.
4) To address redundancy in the LS-SVR training set, the thesis applies a sparsification method that prunes the smallest support values to sound source DOA estimation, and analyzes the sparsification of both single-kernel and multi-kernel LS-SVR mappings. The results show that, compared with basic LS-SVR, sparse LS-SVR maintains good DOA estimation performance while effectively reducing the computational cost at test time.
5) The thesis proposes a tuning-parameter-free TDOA-DOA mapping method based on sparse representation theory. To further reduce the computational cost, it applies a two-step grid search to match the TDOA vector against the data dictionary. The results show that the proposed algorithm has some performance advantage over the traditional tuning-parameter-free mapping method.
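To make the idea of a TDOA-DOA mapping concrete, the following is a minimal Python sketch that contrasts a far-field least squares mapping with an LS-SVR using a radial basis kernel. It is not the thesis's implementation. The array geometry (`MICS`), the speed of sound (`C`), the training grid, the hyperparameters `GAMMA` and `SIGMA`, and all function names are assumptions chosen for illustration. The TDOAs are generated noise-free from a far-field model and are not estimated from signals.

```python
import numpy as np

C = 343.0  # assumed speed of sound, m/s
# Assumed geometry: 4 microphones on a 0.1 m square, coordinates in metres.
MICS = np.array([[0.0, 0.0], [0.1, 0.0], [0.1, 0.1], [0.0, 0.1]])
D = MICS[1:] - MICS[0]  # baselines relative to microphone 0


def tdoa_ms(az_deg):
    """Far-field TDOAs (ms) of mics 1..3 relative to mic 0 for one azimuth."""
    az = np.deg2rad(az_deg)
    u = np.array([np.cos(az), np.sin(az)])  # unit vector pointing at the source
    return -1e3 * (D @ u) / C


def ls_map(tau_ms):
    """Least squares baseline: solve D u = -c * tau for u, then take its angle."""
    u, *_ = np.linalg.lstsq(D, -C * tau_ms / 1e3, rcond=None)
    return np.degrees(np.arctan2(u[1], u[0]))


def rbf(A, B, sigma):
    """Radial basis kernel matrix between the rows of A and the rows of B."""
    d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(axis=2)
    return np.exp(-d2 / (2.0 * sigma**2))


def lssvr_fit(X, y, gamma, sigma):
    """Solve the LS-SVR linear system [[0, 1^T], [1, K + I/gamma]] [b; alpha] = [0; y]."""
    n = len(y)
    A = np.zeros((n + 1, n + 1))
    A[0, 1:] = 1.0
    A[1:, 0] = 1.0
    A[1:, 1:] = rbf(X, X, sigma) + np.eye(n) / gamma
    sol = np.linalg.solve(A, np.concatenate(([0.0], y)))
    return sol[0], sol[1:]  # bias b, support values alpha


def lssvr_predict(Xq, X, b, alpha, sigma):
    """Evaluate the regression f(x) = sum_i alpha_i k(x, x_i) + b."""
    return rbf(Xq, X, sigma) @ alpha + b


# Training set: TDOA vectors on an assumed 5-degree azimuth grid over [0, 180].
az_train = np.arange(0.0, 181.0, 5.0)
X_train = np.array([tdoa_ms(a) for a in az_train])
GAMMA, SIGMA = 1e4, 0.1  # assumed values, not tuned
b, alpha = lssvr_fit(X_train, az_train, GAMMA, SIGMA)

# Test azimuths that fall between the training grid points.
az_test = np.array([22.5, 47.5, 92.5, 137.5])
X_test = np.array([tdoa_ms(a) for a in az_test])
az_svr = lssvr_predict(X_test, X_train, b, alpha, SIGMA)
az_ls = np.array([ls_map(x) for x in X_test])
print(np.round(az_svr, 2), np.round(az_ls, 2))
```

With noise-free TDOAs the least squares mapping recovers the azimuth exactly, so this run only confirms that both mappings are wired correctly. Comparing them meaningfully would require TDOAs estimated from noisy, reverberant signals, as in the thesis. The support values `alpha` returned by `lssvr_fit` are the quantities that a pruning-based sparsification would rank in order to discard training samples.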

【Keywords】 microphone array; sound source DOA estimation; time delay estimation; least squares support vector regression (LS-SVR); multi-kernel learning; sparse representation

【Online publication submitter】 Nanjing University of Aeronautics and Astronautics

【Classification number】 TN911.23
【Times cited】 1
【Downloads】 188