圖像增強下基于生成對抗網絡和卷積神經網絡的CT與MRI融合方法_《生物醫學工程學雜志》

作者：

劉云鵬 ¹ ,  李瑾 ² , 王宇 ¹ , 蔡文立 ³ , 陳飛 ² , 劉文潔 ¹ , 毛顯昊 ¹ , 干開豐 ⁴ , 王仁芳 ² , 孫德超 ⁵ , 邱虹 ² , 劉邦權 ⁵

1. 寧波工程學院國交學院信科專業（浙江寧波 315000）;
2. 浙江萬里學院（浙江寧波 315000）;
3. 哈佛醫學院放射學圖像實驗室（美國馬薩諸塞州波士頓 02114）;
4. 寧波大學附屬李惠利醫院（浙江寧波 315000）;
5. 寧波財經學院數字技術與工程學院（浙江寧波 315000）;

關鍵詞：

圖像增強圖像融合生成對抗網絡深度學習醫學圖像

DOI：

10.7507/1001-5515.202209050

視頻：

導出 下載 收藏 掃碼 引用

摘要 全文 圖表 視頻 參考文獻 施引文獻 補充材料

針對多模態醫學圖像融合中的重要特征丟失、細節表現不突出和紋理不清晰等問題，提出一種圖像增強下使用生成對抗網絡（GAN）和卷積神經網絡（CNN）進行電子計算機斷層掃描（CT）圖像與磁共振成像（MRI）圖像融合的方法。生成器針對高頻特征圖像，雙鑒別器針對逆變換后的融合圖像；高頻特征圖像通過GAN模型進行特征融合，低頻特征圖像通過基于遷移學習的CNN預訓練模型進行特征融合。實驗結果表明，與當前先進融合算法相比，所提方法在主觀表現上紋理細節特征更加豐富，輪廓邊緣信息更加清晰突出；在客觀指標評估中，融合質量評價指標（Q^AB/F）、信息熵（IE）、空間頻率（SF）、結構相似性（SSIM）、互信息（MI）和融合視覺信息保真度（VIFF）等關鍵指標比其他最佳測試結果分別提高了2.0%、6.3%、7.0%、5.5%、9.0%和3.3%。融合后圖像可以有效地應用于醫學診斷，進一步提高診斷效率。

引用本文： 劉云鵬, 李瑾, 王宇, 蔡文立, 陳飛, 劉文潔, 毛顯昊, 干開豐, 王仁芳, 孫德超, 邱虹, 劉邦權. 圖像增強下基于生成對抗網絡和卷積神經網絡的CT與MRI融合方法. 生物醫學工程學雜志, 2023, 40(2): 208-216. doi: 10.7507/1001-5515.202209050 復制

0 引言

為了避免重復多次讀取和觀察相同部位下多種模態醫學影像，一種有效的解決方案是將多模態的醫學影像進行融合，同時最大限度地保留原來每個模態下的圖像特征。針對電子計算機斷層掃描（computed tomography，CT）與磁共振成像（magnetic resonance imaging，MRI）醫學圖像融合的研究，主要包括傳統空域方法、傳統變換域方法和完全深度學習的方法。傳統空域方法^[1-6]的核心思想是以圖像原有的像素、像素塊或指定區域為計算單元，通過權重的方法控制不同圖像融合的比例與強度。該方法簡單快速，但是對比度信息、亮度信息和特征細節容易丟失。傳統空域融合方法的缺陷和不足可以使用變換域的方法^[7-19]進行彌補，該方法對原始多模態圖像進行頻域或者其他相關類型分解，不同分解域上可以使用相同或不同的方法分別進行融合，比如不同分解域中可以使用相同或不同的深度學習方法，然后通過分解域的逆操作合成最終的融合圖像。

在深度學習應用領域，一方面可以直接使用卷積神經網絡（convolutional neural network，CNN）來獲取醫學圖像的特征用于融合，另一方面也可以使用生成對抗網絡（generative adversarial network，GAN）的思想來直接融合，由于GAN模型^[20-27]需要的訓練數據更少，表達能力更為強大，在醫學圖像融合領域受到了極大的關注。CNN和GAN的方法既可以直接用于原始數據圖像，也可以用于分解后的特征圖像。

如前所述，在變換域的融合方法中，不同分解域可以使用不同的方法，例如在其中一個分解域中使用傳統的空域或變換域算法，而在另一個分解域中使用深度學習的方法。但在目前已知的研究范圍內，還沒有在高頻和低頻域中使用兩種不同的深度學習方法將CT和MRI進行融合的論文和技術。本研究使用變分模態分解（variational mode decomposition，VMD）方法進行變換，可以獲取更加豐富和精準的高頻特征圖像與低頻特征圖像，并且創新性地提出基于GAN的生成器與鑒別器不在同一個變換域中的端到端訓練模型，生成器針對高頻特征圖像，而鑒別器針對逆變換后的融合圖像，對于細節和紋理豐富的高頻圖像使用訓練后GAN模型的生成器進行融合，而對于低頻圖像，由于特征相對較少，通過基于遷移學習的CNN預訓練模型進行融合。同時結合圖像銳化、濾波、亮度調節和對比度調節方法，通過一種多參數搜索的方法進行圖像增強，用于訓練圖像和融合后圖像處理兩個方面。

1 研究方法

1.1 系統概述

整個研究過程包括訓練和融合生成兩個基本過程，如圖1和圖2所示。在訓練過程中，首先對輸入的MRI和CT分別進行圖像增強，然后使用VMD方法對MRI和CT進行分解同時生成增強后的邊緣圖像，對于分解產生的低頻圖像使用CNN預訓練模型進行融合。對于分解產生的高頻圖像和邊緣圖像，則作為擴展U-Net網絡模型的輸入，然后對形成的特征圖連接后再輸入到GAN的生成器中進行訓練。這是因為，CT和MRI這樣的醫學圖像大多用于腫瘤等病變的診斷，所以在圖像中往往會有一些需要識別的小目標，這些區域是醫學圖像中最重要和關鍵的部分，而U-Net結構的網絡模型對于最大程度地獲取微小目標組織和結構的紋理特征與邊緣輪廓信息有著很好的效果。

圖1 訓練基本過程 Figure1. Basic training process

圖選項

方法	Q^AB/F	IE	SF	SSIM	MI	VIFF
GM	0.524 2	4.795 3	21.078 2	0.643 8	2.705 2	0.353 1
GC	0.597 1	5.085 6	22.316 1	0.651 2	2.763 9	0.391 0
PS	0.509 3	5.433 8	20.517 8	0.660 9	2.688 9	0.373 5
HID	0.580 2	5.544 1	21.661 9	0.718 3	2.805 3	0.410 6
FusionGAN	0.542 8	5.219 0	21.328 3	0.701 8	2.897 1	0.379 3
DDcGAN	0.677 7	5.766 7	24.056 6	0.749 2	3.400 5	0.452 8
本文方法	0.691 5	6.129 3	25.750 1	0.790 6	3.706 1	0.467 7
注：粗體值表示測試方法中的最優值

方案	Q^AB/F	IE	SF	SSIM	MI	VIFF
1	0.651 9	5.725 1	16.201 7	0.740 8	2.715 2	0.311 0
2	0.584 4	5.621 8	23.771 2	0.691 5	3.417 0	0.302 6
3	0.680 4	6.124 0	25.291 1	0.782 4	3.633 7	0.448 7
4	0.591 0	5.715 9	23.351 8	0.660 9	3.303 1	0.313 8
本文方法	0.691 5	6.129 3	25.750 1	0.790 6	3.706 1	0.467 7
注：粗體值表示測試方法中的最優值

1.	Liu Y, Chen X, Ward R K, et al. Medical image fusion via convolutional sparsity based morphological component analysis. IEEE Signal Process Lett, 2019, 26(3): 485-489.
2.	Yan H, Li Z. A multi-modal medical image fusion method in spatial domain// 2019 IEEE 3rd Information Technology, Networking, Electronic and Automation Control Conference (ITNEC). Chengdu: IEEE, 2019: 597-601.
3.	Li W, Jia L, Du J. Multi-modal sensor medical image fusion based on multiple salient features with guided image filter. IEEE Access, 2019, 7: 173019-173033.
4.	Yadav S P, Yadav S. Image fusion using hybrid methods in multimodality medical images. Med Biol Eng Comput, 2020, 58(4): 669-687.
5.	Zhang S, Li X, Zhu R, et al. Medical image fusion algorithm based on L0 gradient minimization for CT and MRI. Multimed Tools Appl, 2021, 80(14): 21135-21164.
6.	Irshad M T, Rehman H U. Gradient compass-based adaptive multimodal medical image fusion. IEEE Access, 2021, 9: 22662-22670.
7.	Zhu Z, Zheng M, Qi G, et al. A phase congruency and local Laplacian energy based multi-modality medical image fusion method in NSCT domain. IEEE Access, 2019, 7: 20811-20824.
8.	Gai D, Shen X, Cheng H, et al. Medical image fusion via PCNN based on edge preservation and improved sparse representation in NSST domain. IEEE Access, 2019, 7: 85413-85429.
9.	Singh S, Anand R S. Multimodal medical image sensor fusion model using sparse K-SVD dictionary learning in nonsubsampled shearlet domain. IEEE Trans Instrum Meas, 2019, 69(2): 593-607.
10.	Meng L, Guo X, Li H. MRI/CT fusion based on latent low rank representation and gradient transfer. Biomed Signal Process Control, 2019, 53: 101536.
11.	Ullah H, Ullah B, Wu L, et al. Multi-modality medical images fusion based on local-features fuzzy sets and novel sum-modified-Laplacian in non-subsampled shearlet transform domain. Biomed Signal Process Control, 2020, 57: 101724.
12.	Sayadi M, Ghassemian H, Naimi R, et al. A new composite multimodality image fusion method based on shearlet transform and retina inspired model// 2020 International Conference on Machine Vision and Image Processing (MVIP). Iran: IEEE, 2020: 1-5.
13.	Panigrahy C, Seal A, Mahato N K. MRI and SPECT image fusion using a weighted parameter adaptive dual channel PCNN. IEEE Signal Process Lett, 2020, 27: 690-694.
14.	Zhu R, Li X, Zhang X, et al. MRI and CT medical image fusion based on synchronized-anisotropic diffusion model. IEEE Access, 2020, 8: 91336-91350.
15.	Goyal B, Dogra A, Khoond R, et al. An efficient medical assistive diagnostic algorithm for visualisation of structural and tissue details in CT and MRI fusion. Cognit Comput, 2021, 13(6): 1471-1483.
16.	Zhang L, Zhang Y, Ma S, et al. CT and MRI image fusion algorithm based on hybrid ?0?1 layer decomposing and two-dimensional variation transform. Biomed Signal Process Control, 2021, 70: 103024.
17.	Zhu R, Li X, Zhang X, et al. HID: the hybrid image decomposition model for MRI and CT fusion. IEEE J Biomed Health Inform, 2021, 26(2): 727-739.
18.	Faragallah O S, Muhammed A N, Taha T S, et al. PCA based SVD fusion for MRI and CT medical images. J Intell Fuzzy Syst, 2021, 41(2): 4021-4033.
19.	Polinati S, Bavirisetti D P, Rajesh K N, et al. The fusion of MRI and CT medical images using variational mode decomposition. Appl Sci, 2021, 11(22): 10975.
20.	Nian Z, Jung C. CNN-based multi-focus image fusion with light field data// 2019 IEEE International Conference on Image Processing (ICIP). Taipei: IEEE, 2019: 1044-1048.
21.	Liang X, Hu P, Zhang L, et al. MCFNet: multi-layer concatenation fusion network for medical images fusion. IEEE Sens J, 2019, 19(16): 7107-7119.
22.	Maneesha P, Singh T, Nayar R, et al. Multi modal medical image fusion using convolution neural network// 2019 Third International Conference on Inventive Systems and Control (ICISC). Coimbatore: IEEE, 2019: 351-357.
23.	Zhang Y, Liu Y, Sun P, et al. IFCNN: a general image fusion framework based on convolutional neural network. Information Fusion, 2020, 54: 99-118.
24.	Wang K, Zheng M, Wei H, et al. Multi-modality medical image fusion using convolutional neural network and contrast pyramid. Sensors, 2020, 20(8): 2169.
25.	Ma J, Yu W, Liang P, et al. FusionGAN: a generative adversarial network for infrared and visible image fusion. Inform Fusion, 2019, 48: 11-26.
26.	Xu H, Liang P, Yu W, et al. Learning a generative model for fusing infrared and visible images via conditional generative adversarial network with dual discriminators// Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence (IJCAI). Macao: ACM, 2019: 3954-3960.
27.	Ma J, Xu H, Jiang J, et al. DDcGAN: a dual-discriminator conditional generative adversarial network for multi-resolution image fusion. IEEE Trans Image Process, 2020, 29: 4980-4995.
28.	Isola P, Zhu J Y, Zhou T H, et al. Image-to-image translation with conditional adversarial networks// Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. Honolulu: IEEE, 2017: 5967-5976.
29.	Shi W Z, Caballero J, Huszár F, et al. Real-time single image and video super-resolution using an efficient sub-pixel convolutional neural network// Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas: IEEE, 2016: 1874-1883.
30.	Oktay O, Schlemper J, Folgoc L L, et al. Attention U-Net: Learning where to look for the pancreas. arXiv, 2018: 1804.03999.

《生物醫學工程學雜志》

圖像增強下基于生成對抗網絡和卷積神經網絡的CT與MRI融合方法

摘要 全文 圖表 視頻 參考文獻 施引文獻 補充材料

0 引言

1 研究方法

1.1 系統概述

1.2 圖像融合

1.2.1 高頻圖像融合

1.2.2 低頻圖像融合

2 實驗與分析

2.1 實驗環境與數據

2.2 定量分析與討論

2.2.1 與當前先進方法整體對比

2.2.2 消融實驗

2.3 典型圖片展示

3 結論

0 引言

1 研究方法

1.1 系統概述

1.2 圖像融合

1.2.1 高頻圖像融合

1.2.2 低頻圖像融合

2 實驗與分析

2.1 實驗環境與數據

2.2 定量分析與討論

2.2.1 與當前先進方法整體對比

2.2.2 消融實驗

2.3 典型圖片展示

3 結論

上一篇

下一篇

Format

Content

摘要全文圖表視頻參考文獻施引文獻補充材料