電子耳蝸前端麥克風陣列語音增強技術的研究與進展_《生物醫學工程學雜志》

作者：

 陳又圣 , 陳偉芳 , 張璞 , 陳培培

深圳信息職業技術學院（廣東深圳 ?518000）;

關鍵詞：

電子耳蝸麥克風陣列語音增強波束形成

DOI：

10.7507/1001-5515.201805050

視頻：

導出 下載 收藏 掃碼 引用

摘要 全文 圖表 視頻 參考文獻 施引文獻 補充材料

麥克風陣列的方法在近年來被逐漸應用在電子耳蝸前端語音增強和提高言語識別率的研究里。該方法通過在空間不同的位置上放置若干麥克風，可以采集包含大量空間位置和方位信息的多通道信號，并形成增強目標信號和抑制干擾信號的特定波束指向模式。該方法更加適合用于電子耳蝸增強面對面交流的應用場景，其應用價值受到越來越多研究人員的關注。本文對麥克風陣列波束形成的原理進行闡述，并對目前文獻中基于麥克風陣列的語音增強技術進行分析，歸納和總結了其中的技術難點和發展趨勢。

引用本文： 陳又圣, 陳偉芳, 張璞, 陳培培. 電子耳蝸前端麥克風陣列語音增強技術的研究與進展. 生物醫學工程學雜志, 2019, 36(4): 696-704. doi: 10.7507/1001-5515.201805050 復制

圖1 麥克風陣列信號采集原理圖

Figure1. Schematic diagram of signal acquisition principle in microphone array

圖選項

下載全尺寸圖像

下載幻燈片

圖2 特定參數條件下的不同方位的系統幅頻響應曲線

Figure2. System amplitude based on specific parameters

圖選項

下載全尺寸圖像

下載幻燈片

圖3 不同延遲參數值的極性圖和系統零點

Figure3. Beam patterns and system nulls for different delay parameters

圖選項

下載全尺寸圖像

下載幻燈片

圖4 雙耳佩戴電子耳蝸和助聽器的示意圖

Figure4. A schematic diagram of binaural cochlear and hearing aids

圖選項

下載全尺寸圖像

下載幻燈片

圖5 單通道語音增強技術和麥克風陣列結合的去噪方法

Figure5. Noise suppression method based on the combination of single channel speech enhancement technology and microphone array technology

圖選項

下載全尺寸圖像

下載幻燈片

圖6 不同頻率條件下的雙麥克風極性圖

Figure6. Beam patterns of dual-microphone system based on different frequencies

圖選項

下載全尺寸圖像

下載幻燈片

圖7 語音信號和環境噪聲的頻譜對比

Figure7. Spectrum comparison of speech signal and environ mental noise

圖選項

下載全尺寸圖像

下載幻燈片

圖8 角度偏移 1～8° 的雙指向性麥克風極性圖對比

Figure8. Comparison of beam patterns for 1–8° angle offset in dual-microphone system

圖選項

下載全尺寸圖像

下載幻燈片

圖9 雙耳佩戴麥克風的雙麥克風極性圖的波束變化

Figure9. Changing of beams in beam patterns of dual-microphone system for situation of biauricular distance

圖選項

下載全尺寸圖像

下載幻燈片

1.	World Hearth Organization (WHO). Deafness and hearing loss[EB/OL]. (2018-03-15)[2019-02-20]. http://www.who.int/en/news-room/fact-sheets/detail/deafness-and-hearing-loss.
2.	銀力, 屠文河, 高姍仙, 等. 耳聾與助聽設備的選擇. 中國醫療器械信息, 2016(5): 23-29, 63.
3.	向琳. 兒童人工耳蝸植入后康復效果及影響因素研究. 長春: 吉林大學, 2017.
4.	National Institute on Deafness and Other Communication Disorders (NIDCD). Cochlear implants[EB/OL]. (2017-03-06)[2019-02-20]. https://www.nidcd.nih.gov/health/cochlear-implants.
5.	Lu C K, Wang S W. Peak-triggered sampling circuitry for a fine-structure-aware cochlear implant//IEEE Region 10 Conference (TENCON). Penang, Malaysia: IEEE, 2017: 31-34.
6.	Langner F, Saoji A A, Büchner A, et al. Adding simultaneous stimulating channels to reduce power consumption in cochlear implants. Hear Res, 2017, 345: 96-107.
7.	Padilla M, Stupak N, Landsberger D M. Pitch ranking with different virtual channel configurations in electrical hearing. Hear Res, 2017, 348: 54-62.
8.	Guan T, Yang M, Wei Z, et al. Simulation of the optical stimulation mechanism of cochlear nerves. Journal of Tsinghua University, 2017, 57(10): 1102-1105.
9.	Jiang Bin, Xia Nan, Wang Xing, et al. Auditory responses to short-wavelength infrared neural stimulation of the rat cochlear nucleus//39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC’17). Seogwipo, South Korea: IEEE, 2017: 1942-1945.
10.	Wang Jingxuan, Lu Jianren, Tian Lan. Effect of fiberoptic collimation technique on 808 nm wavelength laser stimulation of cochlear neurons. Photomed Laser Surg, 2016, 34(6): 252-257.
11.	Anderson S R, Kan A, Thakkar T, et al. Pitch magnitude estimation can predict across-ear pitch comparisons in cochlear-implant users. J Acoust Soc Amer, 2017, 141(5): 3815.
12.	Van Eyndhoven S, Francart T, Bertrand A. EEG-informed attended speaker extraction from recorded speech mixtures with application in neuro-steered hearing prostheses. IEEE Trans Biomed Eng, 2017, 64(5): 1045-1056.
13.	Ma X J, Sudanthi W, Zhou Y, et al. Simulation for training cochlear implant electrode insertion//30th IEEE International Symposium on Computer-Based Medical Systems (IEEE CBMS 2017). Thessaloniki, Greece: IEEE, 2017: 1–6.
14.	Chen Yousheng, Chen Weifang. Research on fractional delay filter and mismatch feature based on least mean square rule for CI device//9th International Conference on Intelligent Human-Machine Systems and Cybernetics (IHMSC). Hangzhou, China: IEEE, 2017: 308-311.
15.	Arora S V, Vig R. Comparison of speech intelligibility parameter in cochlear implants by spatial filtering and coherence function methods. International Conference on Micro-Electronics and Telecommunication Engineering (ICMETE). Ghaziabad, India: IEEE, 2016: 573-577.
16.	Wimmer W, Kompis M, Stieger C, et al. Directional microphone contralateral routing of signals in cochlear implant users: a within-subjects comparison. Ear Hear, 2017, 38(3): 368-373.
17.	Mosnier I, Mathias N, Flament J, et al. Benefit of the UltraZoom beamforming technology in noise in cochlear implant users. Eur Arch Otorhinolaryngol, 2017, 274(9): 3335-3342.
18.	Gong Qin, Chen Yousheng. Parameter selection methods of delay and beamforming for cochlear implant speech enhancement. Acoust Phys, 2011, 57(4): 542-550.
19.	Li Xingxing, Wang Dangwei, Ma Xiaoyan, et al. Robust adaptive beamforming using iterative variable loaded sample matrix inverse. Electron Lett, 2018, 54(9): 546-548.
20.	Zohourian M, Enzner G, Martin R. Binaural speaker localization integrated into an adaptive beamformer for hearing aids. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2018, 26(3): 515-528.
21.	Xiao Jinjun, Luo Zhiquan, Merks I, et al. A robust adaptive binaural beamformer for hearing devices//2017 51st Asilomar Conference on Signals, Systems, and Computers. Pacific Grove, USA: IEEE, 2017: 1885-1889.
22.	Zeng Fangang. Challenges in improving cochlear implant performance and accessibility. IEEE Trans Biomed Eng, 2017, 64(8): 1662-1664.
23.	Lockwood M E, Jones D L, Bilger R C, et al. Performance of time- and frequency-domain binaural beamformers based on recorded signals from real rooms. J Acoust Soc Am, 2004, 115(1): 379-391.
24.	Ehlers E, Goupell M J, Zheng Yi, et al. Binaural sensitivity in children who use bilateral cochlear implants. J Acoust Soc Am, 2017, 141(6): 4264-4277.
25.	Lopez-Poveda E A, Eustaquio-Martín A, Stohl J S, et al. Intelligibility in speech maskers with a binaural cochlear implant sound coding strategy inspired by the contralateral medial olivocochlear reflex. Hear Res, 2017, 348: 134-137.
26.	Sheffield B M, Schuchman G, Bernstein J G. Pre- and postoperative binaural unmasking for bimodal cochlear implant listeners. Ear Hear, 2017, 38(5): 554-567.
27.	Goupell M J, Stakhovskaya O A, Bernstein J G. Contralateral interference caused by binaurally presented competing speech in adult bilateral cochlear-implant users. Ear Hear, 2018, 39(1): 110-123.
28.	羅鑫, 傅前杰, 王仁華. 聯合使用助聽器和增強電子耳蝸的使用者的中文語音識別. 北京生物醫學工程, 2005, 24(4): 250-253, 267.
29.	Kates J M, Weiss M R. A comparison of hearing-aid array-processing techniques. J Acoust Soc Am, 1996, 99(5): 3138-3148.
30.	Chen Yousheng, Gong Qin. Real-time spectrum estimation-based dual-channel speech-enhancement algorithm for cochlear implant. Biomed Eng Online, 2012, 11(74). DOI: 10.1186/1475-925X-11-74.
31.	Kim J, Lee H, Hoonteak L, et al. A micromachined microphone based on the electret membrane and field-effect-transistor mechano-electrical transduction. J Acoust Soc Am, 2017, 142(4): 2567.
32.	朱振嶺, 陳日林, 楊亦春. 差分傳聲器陣列低頻特性優化研究. 應用聲學, 2016, 35(6): 505-510.
33.	Duan X, Giddings R P, Mansoor S, et al. Performance tolerance of IMDD DFMA PONs to channel frequency response roll-off. IEEE Photonics Technology Letters, 2017, 29(19): 1655-1658.
34.	Chen Yousheng, Gong Qin. Broadband beamforming compensation algorithm in CI front-end acquisition. Biomed Eng Online, 2013, 12(18). DOI: 10.1186/1475-925X-12-18.

1. World Hearth Organization (WHO). Deafness and hearing loss[EB/OL]. (2018-03-15)[2019-02-20]. http://www.who.int/en/news-room/fact-sheets/detail/deafness-and-hearing-loss.
2. 銀力, 屠文河, 高姍仙, 等. 耳聾與助聽設備的選擇. 中國醫療器械信息, 2016(5): 23-29, 63.
3. 向琳. 兒童人工耳蝸植入后康復效果及影響因素研究. 長春: 吉林大學, 2017.
4. National Institute on Deafness and Other Communication Disorders (NIDCD). Cochlear implants[EB/OL]. (2017-03-06)[2019-02-20]. https://www.nidcd.nih.gov/health/cochlear-implants.
5. Lu C K, Wang S W. Peak-triggered sampling circuitry for a fine-structure-aware cochlear implant//IEEE Region 10 Conference (TENCON). Penang, Malaysia: IEEE, 2017: 31-34.
6. Langner F, Saoji A A, Büchner A, et al. Adding simultaneous stimulating channels to reduce power consumption in cochlear implants. Hear Res, 2017, 345: 96-107.
7. Padilla M, Stupak N, Landsberger D M. Pitch ranking with different virtual channel configurations in electrical hearing. Hear Res, 2017, 348: 54-62.
8. Guan T, Yang M, Wei Z, et al. Simulation of the optical stimulation mechanism of cochlear nerves. Journal of Tsinghua University, 2017, 57(10): 1102-1105.
9. Jiang Bin, Xia Nan, Wang Xing, et al. Auditory responses to short-wavelength infrared neural stimulation of the rat cochlear nucleus//39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC’17). Seogwipo, South Korea: IEEE, 2017: 1942-1945.
10. Wang Jingxuan, Lu Jianren, Tian Lan. Effect of fiberoptic collimation technique on 808 nm wavelength laser stimulation of cochlear neurons. Photomed Laser Surg, 2016, 34(6): 252-257.
11. Anderson S R, Kan A, Thakkar T, et al. Pitch magnitude estimation can predict across-ear pitch comparisons in cochlear-implant users. J Acoust Soc Amer, 2017, 141(5): 3815.
12. Van Eyndhoven S, Francart T, Bertrand A. EEG-informed attended speaker extraction from recorded speech mixtures with application in neuro-steered hearing prostheses. IEEE Trans Biomed Eng, 2017, 64(5): 1045-1056.
13. Ma X J, Sudanthi W, Zhou Y, et al. Simulation for training cochlear implant electrode insertion//30th IEEE International Symposium on Computer-Based Medical Systems (IEEE CBMS 2017). Thessaloniki, Greece: IEEE, 2017: 1–6.
14. Chen Yousheng, Chen Weifang. Research on fractional delay filter and mismatch feature based on least mean square rule for CI device//9th International Conference on Intelligent Human-Machine Systems and Cybernetics (IHMSC). Hangzhou, China: IEEE, 2017: 308-311.
15. Arora S V, Vig R. Comparison of speech intelligibility parameter in cochlear implants by spatial filtering and coherence function methods. International Conference on Micro-Electronics and Telecommunication Engineering (ICMETE). Ghaziabad, India: IEEE, 2016: 573-577.
16. Wimmer W, Kompis M, Stieger C, et al. Directional microphone contralateral routing of signals in cochlear implant users: a within-subjects comparison. Ear Hear, 2017, 38(3): 368-373.
17. Mosnier I, Mathias N, Flament J, et al. Benefit of the UltraZoom beamforming technology in noise in cochlear implant users. Eur Arch Otorhinolaryngol, 2017, 274(9): 3335-3342.
18. Gong Qin, Chen Yousheng. Parameter selection methods of delay and beamforming for cochlear implant speech enhancement. Acoust Phys, 2011, 57(4): 542-550.
19. Li Xingxing, Wang Dangwei, Ma Xiaoyan, et al. Robust adaptive beamforming using iterative variable loaded sample matrix inverse. Electron Lett, 2018, 54(9): 546-548.
20. Zohourian M, Enzner G, Martin R. Binaural speaker localization integrated into an adaptive beamformer for hearing aids. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2018, 26(3): 515-528.
21. Xiao Jinjun, Luo Zhiquan, Merks I, et al. A robust adaptive binaural beamformer for hearing devices//2017 51st Asilomar Conference on Signals, Systems, and Computers. Pacific Grove, USA: IEEE, 2017: 1885-1889.
22. Zeng Fangang. Challenges in improving cochlear implant performance and accessibility. IEEE Trans Biomed Eng, 2017, 64(8): 1662-1664.
23. Lockwood M E, Jones D L, Bilger R C, et al. Performance of time- and frequency-domain binaural beamformers based on recorded signals from real rooms. J Acoust Soc Am, 2004, 115(1): 379-391.
24. Ehlers E, Goupell M J, Zheng Yi, et al. Binaural sensitivity in children who use bilateral cochlear implants. J Acoust Soc Am, 2017, 141(6): 4264-4277.
25. Lopez-Poveda E A, Eustaquio-Martín A, Stohl J S, et al. Intelligibility in speech maskers with a binaural cochlear implant sound coding strategy inspired by the contralateral medial olivocochlear reflex. Hear Res, 2017, 348: 134-137.
26. Sheffield B M, Schuchman G, Bernstein J G. Pre- and postoperative binaural unmasking for bimodal cochlear implant listeners. Ear Hear, 2017, 38(5): 554-567.
27. Goupell M J, Stakhovskaya O A, Bernstein J G. Contralateral interference caused by binaurally presented competing speech in adult bilateral cochlear-implant users. Ear Hear, 2018, 39(1): 110-123.
28. 羅鑫, 傅前杰, 王仁華. 聯合使用助聽器和增強電子耳蝸的使用者的中文語音識別. 北京生物醫學工程, 2005, 24(4): 250-253, 267.
29. Kates J M, Weiss M R. A comparison of hearing-aid array-processing techniques. J Acoust Soc Am, 1996, 99(5): 3138-3148.
30. Chen Yousheng, Gong Qin. Real-time spectrum estimation-based dual-channel speech-enhancement algorithm for cochlear implant. Biomed Eng Online, 2012, 11(74). DOI: 10.1186/1475-925X-11-74.
31. Kim J, Lee H, Hoonteak L, et al. A micromachined microphone based on the electret membrane and field-effect-transistor mechano-electrical transduction. J Acoust Soc Am, 2017, 142(4): 2567.
32. 朱振嶺, 陳日林, 楊亦春. 差分傳聲器陣列低頻特性優化研究. 應用聲學, 2016, 35(6): 505-510.
33. Duan X, Giddings R P, Mansoor S, et al. Performance tolerance of IMDD DFMA PONs to channel frequency response roll-off. IEEE Photonics Technology Letters, 2017, 29(19): 1655-1658.
34. Chen Yousheng, Gong Qin. Broadband beamforming compensation algorithm in CI front-end acquisition. Biomed Eng Online, 2013, 12(18). DOI: 10.1186/1475-925X-12-18.

《生物醫學工程學雜志》

電子耳蝸前端麥克風陣列語音增強技術的研究與進展

摘要 全文 圖表 視頻 參考文獻 施引文獻 補充材料

引言

1 麥克風陣列信號采集和波束形成原理

2 電子耳蝸麥克風陣列語音增強的方法

2.1 固定波束形成方法

2.2 自適應波束形成方法

2.3 雙耳電子耳蝸的方法

2.4 單通道語音增強技術和麥克風陣列結合方法

2.5 麥克風陣列語音增強方法的總結和言語識別率的關聯分析

3 麥克風陣列語音增強技術在電子耳蝸應用中存在的問題

3.1 低頻滾降失真

3.2 信號補償中的噪聲過度放大

3.3 電極數量限制及信號分辨率問題

3.4 麥克風間的增益失配和運動偏移失配問題

3.5 雙耳信號采集及波束變化問題

4 總結與展望

引言

1 麥克風陣列信號采集和波束形成原理

2 電子耳蝸麥克風陣列語音增強的方法

2.1 固定波束形成方法

2.2 自適應波束形成方法

2.3 雙耳電子耳蝸的方法

2.4 單通道語音增強技術和麥克風陣列結合方法

2.5 麥克風陣列語音增強方法的總結和言語識別率的關聯分析

3 麥克風陣列語音增強技術在電子耳蝸應用中存在的問題

3.1 低頻滾降失真

3.2 信號補償中的噪聲過度放大

3.3 電極數量限制及信號分辨率問題

3.4 麥克風間的增益失配和運動偏移失配問題

3.5 雙耳信號采集及波束變化問題

4 總結與展望

上一篇

Format

Content

《生物醫學工程學雜志》

電子耳蝸前端麥克風陣列語音增強技術的研究與進展

摘要 全文 圖表 視頻 參考文獻 施引文獻 補充材料

引言

1 麥克風陣列信號采集和波束形成原理

2 電子耳蝸麥克風陣列語音增強的方法

2.1 固定波束形成方法

2.2 自適應波束形成方法

2.3 雙耳電子耳蝸的方法

2.4 單通道語音增強技術和麥克風陣列結合方法

2.5 麥克風陣列語音增強方法的總結和言語識別率的關聯分析

3 麥克風陣列語音增強技術在電子耳蝸應用中存在的問題

3.1 低頻滾降失真

3.2 信號補償中的噪聲過度放大

3.3 電極數量限制及信號分辨率問題

3.4 麥克風間的增益失配和運動偏移失配問題

3.5 雙耳信號采集及波束變化問題

4 總結與展望

引言

1 麥克風陣列信號采集和波束形成原理

2 電子耳蝸麥克風陣列語音增強的方法

2.1 固定波束形成方法

2.2 自適應波束形成方法

2.3 雙耳電子耳蝸的方法

2.4 單通道語音增強技術和麥克風陣列結合方法

2.5 麥克風陣列語音增強方法的總結和言語識別率的關聯分析

3 麥克風陣列語音增強技術在電子耳蝸應用中存在的問題

3.1 低頻滾降失真

3.2 信號補償中的噪聲過度放大

3.3 電極數量限制及信號分辨率問題

3.4 麥克風間的增益失配和運動偏移失配問題

3.5 雙耳信號采集及波束變化問題

4 總結與展望

上一篇

Format

Content

摘要全文圖表視頻參考文獻施引文獻補充材料