基于線性化注意力和雙重注意力的視杯盤分割模型_《生物醫學工程學雜志》

作者：

藍子俊 ¹ , 謝珺 ¹ ,  郭燕 ² , 張喆 ³ , 孫彬 ¹

1. 太原理工大學電子信息與光學工程學院（山西晉中 030600）;
2. 北京中醫藥大學第三附屬醫院（北京 100029）;
3. 太原理工大學電氣與動力工程學院（太原 030024）;

關鍵詞：

青光眼視杯盤分割線性化注意力雙重注意力知識蒸餾

DOI：

10.7507/1001-5515.202208061

視頻：

導出 下載 收藏 掃碼 引用

摘要 全文 圖表 視頻 參考文獻 施引文獻 補充材料

青光眼是致盲性眼病之一，視杯盤比是篩查青光眼的主要依據，因此準確分割視杯盤具有重要意義。本文提出一種基于線性化注意力和雙重注意力的視杯盤分割模型。首先，根據視盤特性定位裁剪感興趣區域。其次，引入線性化注意力的殘差網絡-34（ResNet-34）作為特征提取網絡。最后，通過線性化注意力的輸出特征生成通道和空間雙重注意力權重，用于校準解碼器輸出特征獲取視杯盤分割圖像。實驗結果表明，所提模型在視神經頭分割的視網膜圖像（DRISHTI-GS）數據集中，視盤、視杯交并比分別為0.962 3、0.856 4；用于視神經評估的開放式視網膜圖像-V3（RIM-ONE-V3）數據集中，視盤、視杯交并比分別為0.956 3、0.784 4。所提模型優于對比算法，在青光眼的早期篩查中具有一定的醫學價值。此外，本文利用知識蒸餾技術生成兩種規模更小的模型，有利于將模型應用于嵌入式設備。

引用本文： 藍子俊, 謝珺, 郭燕, 張喆, 孫彬. 基于線性化注意力和雙重注意力的視杯盤分割模型. 生物醫學工程學雜志, 2023, 40(5): 920-927. doi: 10.7507/1001-5515.202208061 復制

0 引言

青光眼是導致視力下降甚至失明的嚴重眼病之一。截止2020年，全球青光眼患者高達7 600萬，其中我國青光眼患者數約2 100萬，是全球青光眼患者最多的國家^[1]。青光眼篩查的重要依據是視杯盤比（cut to disc radio，CDR），眼底視杯盤區域如圖1所示，視杯盤比越大，患青光眼概率越高。近年來，隨著人工智能技術的發展，其在醫學影像領域的應用愈加廣泛，有效輔助專業醫師對眼病的診斷。因此，利用計算機技術快速準確地進行視盤（optic disc，OD）與視杯（optic cup，OC）分割，已成為青光眼輔助篩查的重要手段。

圖1 眼底圖像視杯與視盤區域 Figure1. Fundus image showing optic cup and optic disc areas

圖選項

MSA	MLA	DA	DRISHTI-GS				RIM-ONE-V3
			視盤		視杯		視盤		視杯
			Dice	IOU	Dice	IOU	Dice	IOU	Dice	IOU
—	—	—	0.976 5	0.954 3	0.915 1	0.845 7	0.975 9	0.953 1	0.869 1	0.777 3
—	√	—	0.978 2	0.957 5	0.918 5	0.851 6	0.976 2	0.953 6	0.872 3	0.784 4
√	—	√	0.971 3	0.944 7	0.907 6	0.833 7	0.976 2	0.953 7	0.873 2	0.782 7
—	√	√	0.980 8	0.962 3	0.921 5	0.856 4	0.977 6	0.956 3	0.872 9	0.784 4
√	√	√	0.971 3	0.944 7	0.905 8	0.831 5	0.964 6	0.932 1	0.804 3	0.6888

MSA	MLA	DA	參數量/個	計算速度/(幀·s^?1)
—	—	—	2.166 × 10⁷	111.71
—	√	—	2.298 × 10⁷	114.73
√	—	√	2.402 × 10⁷	97.39
—	√	√	2.300 × 10⁷	107.79
√	√	√	2.405 × 10⁷	89.85

方法	DRISHTI-GS				RIM-ONE-V3
	視盤		視杯		視盤		視杯
	Dice	IOU	Dice	IOU	Dice	IOU	Dice	IOU
U-Net	0.965 0	0.933 0	0.866 2	0.733 3	0.951 1	0.907 5	0.745 7	0.627 6
CENet	0.977 8	0.956 6	0.907 5	0.834 8	0.977 2	0.955 5	0.861 8	0.767 7
DeeplabV3	0.974 8	0.951 0	0.906 4	0.832 4	0.975 7	0.952 7	0.862 5	0.769 0
CGNet	0.964 7	0.932 3	0.858 4	0.766 6	0.964 2	0.931 3	0.775 7	0.666 5
RAU-Net	0.977 8	0.956 7	0.915 2	0.846 9	0.976 1	0.953 4	0.867 2	0.774 4
本文算法	0.980 8	0.962 3	0.921 5	0.856 4	0.977 6	0.956 3	0.872 9	0.784 4

特征提取網絡	知識蒸餾	參數量/個	計算速度/(幀·s^?1)	RIM-ONE-V3
				視盤		視杯
				Dice	IOU	Dice	IOU
ResNet-18	—	1.289 × 10⁷	135.58	0.973 4	0.948 4	0.866 2	0.777 8
ResNet-18	√	1.289 × 10⁷	135.58	0.974 8	0.950 9	0.871 9	0.781 6
MobileNetV2	—	2.16 × 10⁶	91.11	0.971 9	0.945 5	0.862 5	0.768 5
MobileNetV2	√	2.16 × 10⁶	91.11	0.973 9	0.949 3	0.866 4	0.774 2

1.	梁遠波, 江俊宏. 我國青光眼防治問題與展望. 浙江醫學, 2020, 42(22): 2377-2382.
2.	Aquino A, Gegúndez-Arias M E, Marín D. Detecting the optic disc boundary in digital fundus images using morphological, edge detection, and feature extraction techniques. IEEE Transactions on Medical Imaging, 2010, 29(11): 1860-1869.
3.	Cheng J, Liu J, Xu Y, et al. Superpixel classification based optic disc and optic cup segmentation for glaucoma screening. IEEE Transactions on Medical Imaging, 2013, 32(6): 1019-1032.
4.	Yu T, Ma Y, Li W. Automatic localization and segmentation of optic disc in fundus image using morphology and level set//2015 9th International Symposium on Medical Information and Communication Technology (ISMICT). Kamakura: IEEE, 2015: 195-199.
5.	Issac A, Parthasarthi M, Dutta M K. An adaptive threshold based algorithm for optic disc and cup segmentation in fundus images//2015 2nd International Conference on Signal Processing and Integrated Networks (SPIN). Delhi: IEEE, 2015: 143-147.
6.	曹新容, 薛嵐燕, 林嘉雯, 等. 基于視覺顯著性和旋轉掃描的視盤分割新方法. 生物醫學工程學雜志, 2018, 35(2): 229-236.
7.	Ramani R G, Shanthamalar J J. Improved image processing techniques for optic disc segmentation in retinal fundus images. Biomedical Signal Processing and Control, 2020, 58: 101832.
8.	Sevastopolsky A. Optic disc and cup segmentation methods for glaucoma detection with modification of U-Net convolutional neural network. Pattern Recognition and Image Analysis, 2017, 27(3): 618-624.
9.	Ronneberger O, Fischer P, Brox T. U-Net: convolutional networks for biomedical image segmentation//International Conference on Medical Image Computing and Computer-Assisted Intervention. Munich: Springer, 2015: 234-241.
10.	Al-Bander B, Williams B M, Al-Nuaimy W, et al. Dense fully convolutional segmentation of the optic disc and cup in colour fundus for glaucoma diagnosis. Symmetry, 2018, 10(4): 87.
11.	Fu H, Cheng J, Xu Y, et al. Joint optic disc and cup segmentation based on multi-label deep network and polar transformation. IEEE Transactions on Medical Imaging, 2018, 37(7): 1597-1605.
12.	Yu S, Xiao D, Frost S, et al. Robust optic disc and cup segmentation with deep learning for glaucoma detection. Computerized Medical Imaging and Graphics, 2019, 74: 61-71.
13.	侯向丹, 趙一浩, 劉洪普, 等. 融合殘差注意力機制的U-Net視盤分割. 中國圖象圖形學報, 2020, 25(9): 1915-1929.
14.	呂鵬飛, 王瑩, 王思齊, 等. 視覺顯著性的眼底圖像視盤檢測. 中國圖象圖形學報, 2021, 26(9): 2293-2304.
15.	劉洪普, 趙一浩, 侯向丹, 等. 融合上下文和注意力的視盤視杯分割. 中國圖象圖形學報, 2021, 26(5): 1041-1057.
16.	劉熠翕, 江旻珊, 張學典. 融合金字塔切分注意力模塊的視杯視盤分割. 上海理工大學學報, 2022, 44(6): 532-539, 545.
17.	Sivaswamy J, Krishnadas S, Chakravarty A, et al. A comprehensive retinal image dataset for the assessment of glaucoma from the optic nerve head analysis. JSM Biomedical Imaging Data Papers, 2015, 2(1): 1004.
18.	Sivaswamy J, Krishnadas S R, Joshi G D, et al. DRISHTI-GS: retinal image dataset for optic nerve head (ONH) segmentation//2014 IEEE 11th International Symposium on Biomedical Imaging (ISBI). Beijing: IEEE, 2014: 53-56.
19.	Fumero F, Alayón S, Sanchez J L, et al. RIM-ONE: an open retinal image database for optic nerve evaluation//2011 24th International Symposium on Computer-Based Medical Systems (CBMS). Bristol: IEEE, 2011: 1–6.
20.	Gu Z, Cheng J, Fu H, et al. CE-NET: context encoder network for 2D medical image segmentation. IEEE Transactions on Medical Imaging, 2019, 38(10): 2281-2292.
21.	Chen L C, Papandreou G, Schroff F, et al. Rethinking atrous convolution for semantic image segmentation. arXiv preprint, 2017, arXiv: 1706.05587.
22.	Wu T, Tang S, Zhang R, et al. CGNEt: a light-weight context guided network for semantic segmentation. IEEE Transactions on Image Processing, 2020, 30: 1169-1179.
23.	Ni Z L, Bian G B, Zhou X H, et al. RAUNEt: residual attention U-Net for semantic segmentation of cataract surgical instruments//Neural Information Processing: 26th International Conference (ICONIP 2019). Sydney: Springer, 2019: 139-149.

《生物醫學工程學雜志》

基于線性化注意力和雙重注意力的視杯盤分割模型

摘要 全文 圖表 視頻 參考文獻 施引文獻 補充材料

0 引言

1 方法實現

1.1 視盤定位

1.2 線性化注意力

1.3 融合多頭線性化注意力和雙重注意力的視杯盤分割模型

1.4 基于多頭線性化注意力的雙重注意力

1.4.1 通道注意力

1.4.2 空間注意力

1.5 模型壓縮優化

2 實驗結果

2.1 實驗數據

2.2 實驗環境

2.3 實驗細節

2.4 評價指標

2.5 實驗結果與分析

2.5.1 消融實驗

2.5.2 對比實驗

2.5.3 模型大小和推理速度

2.5.4 輕量化設計

2.5.5 模型分割可視化結果

3 結論

0 引言

1 方法實現

1.1 視盤定位

1.2 線性化注意力

1.3 融合多頭線性化注意力和雙重注意力的視杯盤分割模型

1.4 基于多頭線性化注意力的雙重注意力

1.4.1 通道注意力

1.4.2 空間注意力

1.5 模型壓縮優化

2 實驗結果

2.1 實驗數據

2.2 實驗環境

2.3 實驗細節

2.4 評價指標

2.5 實驗結果與分析

2.5.1 消融實驗

2.5.2 對比實驗

2.5.3 模型大小和推理速度

2.5.4 輕量化設計

2.5.5 模型分割可視化結果

3 結論

上一篇

下一篇

Format

Content

摘要全文圖表視頻參考文獻施引文獻補充材料