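Joint Optic Cup and Optic Disc Segmentation Based on a Multi-Scale Residual Convolutional Neural Network (Journal of Biomedical Engineering)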

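Authors:

Yuan Xin ¹, Zheng Xiujuan ¹, Ji Bin ¹,², Li Miao ¹, Li Bin ¹

1. Department of Automation, College of Electrical Engineering, Sichuan University (Chengdu 610065);
2. China Mobile (Chengdu) Industrial Research Institute (Chengdu 610041)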

Keywords:

deep learning; fully convolutional neural network; optic disc segmentation; optic cup segmentation; glaucoma screening

DOI:

10.7507/1001-5515.201909006


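Glaucoma is the leading cause of irreversible blindness. Its early symptoms are subtle and easily overlooked, so early screening is particularly important. The cup-to-disc ratio is an important clinical indicator for glaucoma screening, and computing it depends on accurate segmentation of the optic cup and optic disc. This paper proposes an optic cup and optic disc segmentation method based on a fully convolutional multi-scale residual neural network. First, the contrast of the fundus image is enhanced and a polar coordinate transformation is applied. Next, W-Net is constructed as the backbone network: residual multi-scale fully convolutional modules replace the standard convolution units, an image pyramid is added at the input to build multi-scale inputs, and side-output layers act as early classifiers that generate local prediction outputs. Finally, a new multi-label loss function is proposed to guide the segmentation. The method was validated on the REFUGE dataset. The mean intersection over union (MIoU) for optic cup and optic disc segmentation was 0.9040 and 0.9553, respectively, and the overlap error was 0.1780 and 0.0665, respectively. The results show that the method not only achieves joint segmentation of the optic cup and optic disc but also effectively improves segmentation accuracy, and it should help promote large-scale early screening for glaucoma.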

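Cite this article: Yuan Xin, Zheng Xiujuan, Ji Bin, Li Miao, Li Bin. Joint optic cup and optic disc segmentation based on multi-scale residual convolutional neural network. Journal of Biomedical Engineering, 2020, 37(5): 875-884. doi: 10.7507/1001-5515.201909006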

Introduction

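Glaucoma is the second leading cause of blindness worldwide (after cataract) and the leading cause of irreversible blindness^[1]. The visual impairment it causes cannot be reversed, and its early symptoms are hard to detect, so early screening and diagnosis are essential. At present, fundus images and three-dimensional optical coherence tomography (OCT) are commonly used to assist glaucoma diagnosis. OCT imaging is relatively expensive and not widely available, which makes it unsuitable for large-scale screening, so most clinicians choose fundus images, which are cheaper and easier to obtain, for glaucoma screening and diagnosis. The mainstream screening technique today is assessment of the optic nerve head in fundus images^[2], which uses a binary classification to decide whether glaucoma is present. However, fundus images are usually annotated by hand by experienced ophthalmologists, which is time-consuming, laborious, and highly subjective, so manual annotation is not suitable for large-scale glaucoma screening.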

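In a color fundus image, the optic disc appears bright yellow and roughly elliptical, and it can be divided into two distinct regions: a bright central region (the optic cup) and a peripheral region (the neuroretinal rim). For large-scale glaucoma screening, a variety of measures for automatically assessing the optic nerve head have been proposed, such as the vertical cup-to-disc ratio (CDR)^[3], the area ratio of the optic cup (OC) to the optic disc (OD), and the optic disc diameter^[4]. In clinical practice, ophthalmologists mainly use the cup-to-disc ratio to assess the optic nerve head. The cup-to-disc ratio is the ratio of the vertical cup diameter to the vertical disc diameter. In general, the larger the ratio, the higher the probability of glaucoma. Accurate segmentation of the optic cup and optic disc is therefore the key to screening for and diagnosing glaucoma.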
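The vertical cup-to-disc ratio can be read directly off a pair of segmentation masks. The following is a minimal sketch, not the authors' code: it assumes binary masks stored as lists of rows and measures each vertical diameter as the number of rows spanned by the mask. The function names and the toy masks are hypothetical.

```python
def vertical_extent(mask):
    """Vertical diameter in pixels: number of rows spanned by a binary mask."""
    rows = [r for r, row in enumerate(mask) if any(row)]
    return rows[-1] - rows[0] + 1 if rows else 0


def vertical_cdr(cup_mask, disc_mask):
    """Vertical cup-to-disc ratio = vertical cup diameter / vertical disc diameter."""
    disc_height = vertical_extent(disc_mask)
    if disc_height == 0:
        raise ValueError("disc mask is empty")
    return vertical_extent(cup_mask) / disc_height


# Toy 6x6 masks (illustrative only): the disc spans rows 1-4, the cup rows 2-3.
disc = [
    [0, 0, 0, 0, 0, 0],
    [0, 0, 1, 1, 0, 0],
    [0, 1, 1, 1, 1, 0],
    [0, 1, 1, 1, 1, 0],
    [0, 0, 1, 1, 0, 0],
    [0, 0, 0, 0, 0, 0],
]
cup = [
    [0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0],
    [0, 0, 1, 1, 0, 0],
    [0, 0, 1, 1, 0, 0],
    [0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0],
]
cdr = vertical_cdr(cup, disc)
print(cdr)  # 0.5
```

Because the ratio depends only on the top and bottom boundaries of each mask, small boundary errors in either segmentation translate directly into errors in the ratio.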

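Medical image segmentation algorithms fall mainly into two groups: traditional methods based on handcrafted features^[5-8] and deep learning frameworks that use convolutional neural networks to extract features automatically^[9-11]. Early optic cup and optic disc segmentation methods typically relied on handcrafted visual features, including color, texture, contrast thresholding, edge detection, segmentation models, and region-based segmentation^[12-19]. These methods are sensitive to the imaging conditions and to the quality of the fundus image itself, which degrades the segmentation of the target. In addition, training a classifier on features extracted from a very large number of pixels is hard to do in practice, so some researchers^[20] proposed a superpixel strategy to reduce the number of pixels and used superpixel classification to segment the optic cup and optic disc. That method, however, still requires handcrafted features to obtain the segmentation, and its implementation is tedious and poorly reproducible.

Deep learning overcomes the limitations of hand-designed features in computer vision tasks and can automatically learn highly discriminative feature representations. In medical image segmentation, early deep learning methods were mostly patch-based^[21-22]; their limitation is that the sliding window leads to redundant computation and prevents the network from learning global image features. End-to-end fully convolutional neural networks (FCN) were then proposed and became widely used in image classification and segmentation^[23-24]. Building on the FCN, Ronneberger et al.^[25] proposed the U-Net architecture, which performs outstandingly in medical image segmentation and has become an important architecture in the field. U-Net^[25] can generally be viewed as an encoder-decoder structure. The encoder progressively reduces the spatial dimensions of the feature maps in order to capture higher-level semantic features, while the decoder restores the spatial dimensions and structural detail of the target. The encoder should therefore capture as many high-level features as possible, and the decoder should preserve as much spatial information as possible. Many improved architectures have since been developed from U-Net, including M-Net^[26-27] and U-Net++^[28-29], all of which have achieved good results in medical image segmentation tasks.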

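Most existing fundus image analysis methods segment the optic cup and the optic disc separately^[12-15,30]. References [12-13] treat cup and disc segmentation as two independent stages, each using its own features. To segment the cup, the disc is first located and segmented, and a region of interest is then extracted for cup segmentation, which wastes the information obtained from disc segmentation. Reference [14] instead proposed integrating cup and disc segmentation into a single framework. However, these methods ignore the prior information that links the optic cup and the optic disc, such as shape and structural constraints: both are approximately elliptical, and the cup lies inside the disc. Moreover, all of these methods treat the cup and disc as non-independent (mutually exclusive) labels, so that each pixel receives exactly one label (cup, disc, or background), which severs the relationship between the two structures.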
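The difference between single-label and multi-label annotation can be made concrete with a small sketch. It assumes a hypothetical integer coding (0 = background, 1 = disc rim, 2 = cup) that is not taken from the paper, and converts it into two independent binary maps in which a cup pixel is also a disc pixel, so the containment prior is built into the targets.

```python
# Hypothetical single-label coding, chosen only for this illustration.
BACKGROUND, DISC_RIM, CUP = 0, 1, 2


def to_multilabel(label_map):
    """Split a single-label map into two binary maps (disc, cup).

    In the multi-label view every cup pixel also belongs to the disc,
    so the two maps overlap instead of excluding each other.
    """
    disc = [[1 if v in (DISC_RIM, CUP) else 0 for v in row] for row in label_map]
    cup = [[1 if v == CUP else 0 for v in row] for row in label_map]
    return disc, cup


def cup_inside_disc(disc, cup):
    """Check the structural prior: no cup pixel lies outside the disc."""
    return all(d >= c for d_row, c_row in zip(disc, cup) for d, c in zip(d_row, c_row))


label_map = [
    [0, 1, 1, 0],
    [1, 2, 2, 1],
    [1, 2, 2, 1],
    [0, 1, 1, 0],
]
disc, cup = to_multilabel(label_map)
print(disc[1], cup[1])  # [1, 1, 1, 1] [0, 1, 1, 0]
print(cup_inside_disc(disc, cup))  # True
```

With this encoding a network can predict the two maps with two independent binary outputs, rather than one three-way choice per pixel.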

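To address these problems, this paper builds on U-Net^[25] and M-Net^[26] and proposes a new network architecture, W-Net, for joint optic cup and optic disc segmentation. Compared with U-Net^[25] or M-Net^[26], the W-Net backbone is a W-shaped convolutional neural network, which increases the network depth and favors the learning of deep information. W-Net also introduces multi-scale inputs and supervision at different network depths. The multi-scale inputs form an image pyramid that enriches the input information. The deep supervision added at different depths lets the gradients computed at the side-output layers propagate back to the earlier convolutional layers more easily, which helps suppress vanishing gradients and benefits early training. In addition, this paper proposes a residual multi-scale fully convolutional module to replace the standard convolution units in U-Net and M-Net; this increases the network width, so that more scale features can be captured within a limited range of scales. Finally, a new multi-label loss function is proposed to guide the network in segmenting the optic cup and optic disc.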
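The multi-scale input idea can be illustrated with a small image pyramid. This excerpt does not say how the pyramid is built, so the sketch below makes its own assumptions: each level is produced by 2x2 average pooling of the previous one, the image is a single-channel list of rows, and its side lengths are divisible by 2 at every level. The function names are hypothetical.

```python
def average_pool_2x2(image):
    """Halve height and width by averaging each non-overlapping 2x2 block."""
    height, width = len(image), len(image[0])
    return [
        [
            (image[r][c] + image[r][c + 1] + image[r + 1][c] + image[r + 1][c + 1]) / 4
            for c in range(0, width - 1, 2)
        ]
        for r in range(0, height - 1, 2)
    ]


def image_pyramid(image, levels):
    """Return [full resolution, 1/2, 1/4, ...] with `levels` entries."""
    pyramid = [image]
    for _ in range(levels - 1):
        pyramid.append(average_pool_2x2(pyramid[-1]))
    return pyramid


# Toy 8x8 single-channel image whose pixel value encodes its position.
image = [[float(r * 8 + c) for c in range(8)] for r in range(8)]
pyramid = image_pyramid(image, levels=4)
sizes = [(len(level), len(level[0])) for level in pyramid]
print(sizes)  # [(8, 8), (4, 4), (2, 2), (1, 1)]
```

In a network with multi-scale inputs, each pyramid level would be fed to the encoder stage whose feature maps have the same spatial size, so that coarse stages still see image evidence directly.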

1 Methods

1.1 Dataset

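This paper uses the REFUGE dataset (https://refuge.grand-challenge.org/Home/) to validate the method. It is currently the most comprehensively annotated of the finely labeled glaucoma fundus photograph databases and contains two types of data, glaucoma and non-glaucoma, which account for 10% and 90% of the images, respectively. Each fundus image carries three kinds of information (diagnosis, segmentation, and localization), labeled manually by seven experts and then fused. This overcomes shortcomings of many earlier public glaucoma datasets, which provide only diagnostic labels, lack annotations of key structures such as the optic cup and optic disc, and involve few annotating experts. All images are centered on the posterior pole and contain both the macula and the optic disc. The dataset consists of three parts: a training set, a validation set, and a test set. The training set contains 400 fundus images of 2142 × 2056 pixels, captured with a Zeiss Visucam 500 fundus camera; the validation and test sets each contain 400 fundus images of 1634 × 1634 pixels, captured with a Canon CR-2 fundus camera. Because the sets were captured with different fundus cameras, the images differ in optical properties such as color and texture, as shown in Figure 1.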

Figure 1. Fundus images taken by different fundus cameras

Network	Disc OE	Disc ACC	Disc MA	Disc MIoU	Disc FWIoU	Cup OE	Cup ACC	Cup MA	Cup MIoU	Cup FWIoU
U-Net^[25]	0.0877	0.9772	0.9716	0.9409	0.9569	0.2364	0.9828	0.9098	0.8726	0.9669
M-Net^[26]	0.0819	0.9811	0.9745	0.9428	0.9602	0.1959	0.9854	0.9384	0.8970	0.9732
U-Net-Mcon	0.0781	0.9793	0.9713	0.9471	0.9607	0.2175	0.9840	0.9268	0.8827	0.9697
M-Net-Mcon	0.0772	0.9798	0.9752	0.9481	0.9610	0.1922	0.9858	0.9387	0.8981	0.9735
W-Net	0.0805	0.9819	0.9748	0.9458	0.9608	0.1907	0.9861	0.9402	0.8993	0.9738
W-Net-Mcon	0.0665	0.9833	0.9786	0.9553	0.9674	0.1780	0.9870	0.9496	0.9040	0.9753
Note: the best result in every column is achieved by W-Net-Mcon (marked in bold in the original table).
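The column names in the table correspond to common segmentation metrics. The section that defines them is not part of this excerpt, so the sketch below assumes the standard definitions for one binary structure: overlap error OE = 1 - IoU of the foreground, pixel accuracy ACC, mean per-class accuracy MA, mean IoU over foreground and background MIoU, and frequency-weighted IoU FWIoU. It also assumes that both foreground and background pixels are present. Function names and masks are hypothetical.

```python
def confusion_counts(pred, truth):
    """Count true/false positives and negatives over two binary masks."""
    tp = fp = fn = tn = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            if p and t:
                tp += 1
            elif p:
                fp += 1
            elif t:
                fn += 1
            else:
                tn += 1
    return tp, fp, fn, tn


def segmentation_metrics(pred, truth):
    """Standard binary segmentation metrics (assumed definitions, see lead-in)."""
    tp, fp, fn, tn = confusion_counts(pred, truth)
    total = tp + fp + fn + tn
    iou_fg = tp / (tp + fp + fn)  # IoU of the structure itself
    iou_bg = tn / (tn + fp + fn)  # IoU of the background class
    return {
        "OE": 1 - iou_fg,
        "ACC": (tp + tn) / total,
        "MA": (tp / (tp + fn) + tn / (tn + fp)) / 2,
        "MIoU": (iou_fg + iou_bg) / 2,
        "FWIoU": ((tp + fn) * iou_fg + (tn + fp) * iou_bg) / total,
    }


# Toy 4x4 masks: the prediction has one false-positive pixel.
truth = [
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
]
pred = [
    [0, 0, 0, 0],
    [0, 1, 1, 1],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
]
metrics = segmentation_metrics(pred, truth)
print({name: round(value, 4) for name, value in metrics.items()})
# {'OE': 0.2, 'ACC': 0.9375, 'MA': 0.9583, 'MIoU': 0.8583, 'FWIoU': 0.8875}
```

Under these definitions OE is the only metric for which lower is better, and MIoU is not simply 1 - OE because it also averages in the background class.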

References

1.	Tham Y C, Li Xiang, Wong T Y, et al. Global prevalence of glaucoma and projections of glaucoma burden through 2040: a systematic review and meta-analysis. Ophthalmology, 2014, 121(11): 2081-2090.
2.	Garway-Heath D F, Hitchings R A. Quantitative evaluation of the optic nerve head in early glaucoma. Br J Ophthalmol, 1998, 82(4): 352-361.
3.	Jonas J B, Bergua A, Schmitz-Valckenberg P, et al. Ranking of optic disc variables for detection of glaucomatous optic nerve damage. Invest Ophthalmol Vis Sci, 2000, 41(7): 1764-1773.
4.	Michael D H O D. Optic disc size, an important consideration in the glaucoma evaluation. Clin Eye Vis Care, 1999, 11(2): 59-62.
5.	Wang Jinke, Cheng Yuanzhi, Guo Changyong, et al. Shape-intensity prior level set combining probabilistic atlas and probability map constrains for automatic liver segmentation from abdominal CT images. Int J Comput Assist Radiol Surg, 2016, 11(5): 817-826.
6.	Shi Changfa, Cheng Yuanzhi, Wang Jinke, et al. Low-rank and sparse decomposition based shape model and probabilistic atlas for automatic pathological organ segmentation. Med Image Anal, 2017, 38: 30-49.
7.	Zhang Jianpeng, Xia Yong, Xie Yutong, et al. Classification of medical images in the biomedical literature by jointly using deep and handcrafted visual features. IEEE J Biomed Health Inform, 2018, 22(5): 1521-1530.
8.	Shi Changfa, Cheng Yuanzhi, Liu Fei, et al. A hierarchical local region-based sparse shape composition for liver segmentation in CT scans. Pattern Recognit, 2016, 50: 88-106.
9.	de Oliveira L A Jr, Medeiros H R, Macêdo D, et al. SegNetRes-CRF: A deep convolutional encoder-decoder architecture for semantic image segmentation//2018 International Joint Conference on Neural Networks (IJCNN). Rio de Janeiro: IEEE, 2018, 39(12): 2481-2495.
10.	Guo Z, Li X, Huang H, et al. Deep learning-based image segmentation on multi-modal medical imaging. IEEE Trans Radiat Plasma Med Sci, 2019, 3(2): 162-169.
11.	Liu Shouqiang, Li Miao, Li Min, et al. Research of animals image semantic segmentation based on deep learning. Concurrency and Computation: Practice and Experience, 2020, 32(1): e4892.
12.	Joshi G D, Sivaswamy J, Krishnadas S R. Optic disk and cup segmentation from monocular color retinal images for glaucoma assessment. IEEE Trans Med Imaging, 2011, 30(6): 1192-1205.
13.	Dehghani A, Moghaddam H A, Mohammad-Shahram M. Optic disc localization in retinal images using histogram matching. EURASIP Journal on Image and Video Processing, 2012, 2012(1): 19.
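14.	Xiao Zhitao, Shao Yiting, Zhang Fang, et al. Optic disc localization method for color fundus images based on fundus structural features. Chinese Journal of Biomedical Engineering, 2016, 35(3): 257-263.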
15.	Almazroa A, Burman R, Raahemifar K, et al. Optic disc and optic cup segmentation methodologies for glaucoma image detection: a survey. J Ophthalmol, 2015, 2015: 180972.
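16.	Zheng Shaohua, Chen Jian, Pan Lin, et al. Optic disc detection method for fundus images based on directional local contrast. Chinese Journal of Biomedical Engineering, 2014, 33(3): 289-296.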
17.	Zheng Yuanjie, Stambolian D, O'brien J, et al. Optic disc and cup segmentation from color fundus photograph using graph cut with priors. Med Image Comput Comput Assist Interv, 2013, 16(Pt 2): 75-82.
18.	Aquino A, Gegúndez-Arias M E, Marín D. Detecting the optic disc boundary in digital fundus images using morphological, edge detection, and feature extraction techniques. IEEE Trans Med Imaging, 2010, 29(11): 1860-1869.
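19.	Liu Zhenyu, Wang Miao. Application of an improved region growing algorithm in optic cup image segmentation. Journal of Liaoning University: Natural Science Edition, 2017, 44(2): 105-113.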
20.	Cheng Jun, Liu Jiang, Xu Yanwu, et al. Superpixel classification based optic disc and optic cup segmentation for glaucoma screening. IEEE Trans Med Imaging, 2013, 32(6): 1019-1032.
21.	Sironi A, Turetken E, Lepetit V, et al. Multiscale centerline detection. IEEE Trans Pattern Anal Mach Intell, 2016, 38(7): 1327-1341.
22.	Kamnitsas K, Ledig C, Newcombe V F J, et al. Efficient multi-scale 3D CNN with fully connected CRF for accurate brain lesion segmentation. Med Image Anal, 2017, 36: 61-78.
23.	Shen Wei, Zhou Mu, Yang Feng, et al. Learning from experts: Developing transferable deep features for patient-level lung cancer prediction//Ourselin S, Joskowicz L, Sabuncu M, et al. Medical image computing and computer-assisted intervention: Lecture notes in computer science. Cham: Springer, 2016, 9901: 124-131.
24.	Song Jiangdian, Yang Caiyun, Fan Li, et al. Lung lesion extraction using a toboggan based growing automatic segmentation approach. IEEE Trans Med Imaging, 2016, 35(1): 337-353.
25.	Ronneberger O, Fischer P, Brox T, et al. U-net: Convolutional networks for biomedical image segmentation//Navab N, Hornegger J, Wells W, et al. Medical image computing and computer-assisted intervention: Lecture notes in computer science. Cham: Springer, 2015: 234-241.
26.	Fu Huazhu, Cheng Jun, Xu Yanwu, et al. Joint optic disc and cup segmentation based on multi-label deep network and polar transformation. IEEE Trans Med Imaging, 2018, 37(7): 1597-1605.
27.	Mehta R, Sivaswamy J. M-net: A Convolutional Neural Network for deep brain structure segmentation//2017 IEEE 14th International Symposium on Biomedical Imaging (ISBI 2017). Melbourne: IEEE, 2017: 437-440.
28.	Zhou Z, Rahman Siddiquee M M, Tajbakhsh N, et al. UNet++: A nested U-Net architecture for medical image segmentation//Stoyanov D, Taylor Z, Carneiro G, et al. Deep learning in medical image analysis and multimodal learning for clinical decision support. DLMIA 2018, ML-CDS 2018. Lecture notes in computer science. Cham: Springer, 2018, 11045: 3-11.
29.	Yu F, Wang Dequan, Shelhamer E, et al. Deep layer aggregation//2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City: IEEE, 2018: 2403-2412.
30.	Sevastopolsky A. Optic disc and cup segmentation methods for glaucoma detection with modification of U-Net convolutional neural network. Pattern Recogn Image Anal, 2017, 27(3): 618-624.
31.	Aganj I, Harisinghani M G, Weissleder R, et al. Unsupervised medical image segmentation based on the local center of mass. Sci Rep, 2018, 8(1): 13012.
32.	Zuiderveld K. Contrast limited adaptive histogram equalization. San Diego: Academic Press Professional, Inc, 1994.
33.	Crum W R, Camara O, Hill D L. Generalized overlap measures for evaluation and validation in medical image analysis. IEEE Trans Med Imaging, 2006, 25(11): 1451-1461.
34.	Lin T Y, Goyal P, Girshick R, et al. Focal loss for dense object detection//Proceedings of the IEEE International Conference on Computer Vision. Venice: IEEE, 2017: 2980-2988.
35.	Huang G, Liu Z, Van Der Maaten L, et al. Densely connected convolutional networks//2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Honolulu: IEEE, 2017: 4700-4708.
36.	He Kaiming, Zhang Xiangyu, Ren Shaoqing, et al. Deep residual learning for image recognition//2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Las Vegas: IEEE, 2016: 770-778.
37.	Szegedy C, Ioffe S, Vanhoucke V, et al. Inception-v4, inception-resnet and the impact of residual connections on learning//Thirty-First AAAI Conference on Artificial Intelligence. San Francisco: AAAI, 2017: 1-12.
38.	Gu Zaiwang, Cheng Jun, Fu Huazhu, et al. CE-Net: context encoder network for 2D medical image segmentation. IEEE Trans Med Imaging, 2019, 38(10): 2281-2292.
