基于深度學習的微創手術工具檢測與跟蹤研究綜述_《生物醫學工程學雜志》

作者：

劉玉瑩 ,  趙子健

山東大學控制科學與工程學院（濟南 250061）;

關鍵詞：

深度學習卷積神經網絡完全監督的弱監督的手術工具檢測與跟蹤

DOI：

10.7507/1001-5515.201904061

視頻：

導出 下載 收藏 掃碼 引用

摘要 全文 圖表 視頻 參考文獻 施引文獻 補充材料

基于深度學習的微創手術工具檢測與跟蹤技術在微創外科手術中的應用是目前的一個研究熱點。本文首先對微創手術工具檢測與跟蹤的相關技術內容進行系統闡述，主要介紹了基于深度學習算法的優勢。然后，本文概述了基于完全監督的深度神經網絡手術工具檢測與跟蹤算法以及新興的基于弱監督的深度神經網絡手術工具檢測與跟蹤的算法，重點歸納了基于深度卷積神經網絡及遞歸神經網絡的幾種典型算法框架及其流程圖，以便相關領域的科研工作者更系統地了解目前研究進展，同時可為微創外科手術醫生選擇導航技術時提供參考。最后，本文為基于深度學習的微創手術工具檢測與跟蹤技術的進一步研究提供了一個大致的方向。

引用本文： 劉玉瑩, 趙子健. 基于深度學習的微創手術工具檢測與跟蹤研究綜述. 生物醫學工程學雜志, 2019, 36(5): 870-878. doi: 10.7507/1001-5515.201904061 復制

引言

微創外科手術（minimally invasive surgeries, MIS）與傳統外科手術相比具有許多優勢，例如：創傷小、疼痛少、術后恢復時間短等，現已成為通用的外科手術選擇^[1]。但 MIS 采用間接觀察和操縱的方式，使得深度感知復雜化、手術視野和操作空間狹窄，降低了醫生的手眼協調，因而可能會在實際操作中對器官或組織造成損傷，所以需要以其他手段獲取額外信息來監測在體內移動的手術工具，而微創手術工具檢測與跟蹤算法就可以為 MIS 中的操作導航提供這樣的重要信息。微創手術工具的檢測與跟蹤算法可以確定微創手術工具的位置和空間姿態，為執行手術的臨床醫生或機器人提供精確實時的導航，使手術過程更順利安全。

微創手術工具檢測與跟蹤，有基于硬件和基于視覺的兩種解決方式，但由于基于視覺的解決方式，既簡單又不需要額外的設備，現已成為目前主要的研究方向。通過系統軟件，基于視覺的解決方式可以直接對圖像中的手術工具進行某些特征提取，即可檢測與跟蹤微創手術工具在圖像中的位置，從而為微創手術工具操縱提供導航。

值得一提的是，現在微創手術工具檢測與跟蹤技術日趨發展成熟，既有從傳統的自行設計特征、分類器，發展到現在的基于深度學習的端到端（end to end）算法；也有基于深度學習的從先檢測后跟蹤，發展到檢測與跟蹤融為一體的算法。本文主要介紹了基于完全監督深度學習神經網絡和基于弱監督深度學習神經網絡的微創手術工具檢測與跟蹤算法，以便相關領域的科研工作者更系統地了解目前研究進展，同時為微創外科手術醫生選擇合適的導航技術提供參考。最后，本文為基于深度學習的微創手術工具檢測與跟蹤技術的進一步研究提供了一個大致的方向。

1 深度學習理論

目前微創手術工具檢測與跟蹤算法可以被劃分為生成式（generative model）模型和判別式（discriminative model）模型兩大類別^[2]。生成式模型主要有基于特征匹配的方法、基于貝葉斯跟蹤的方法以及運動檢測的方法等^[3-4]。判別式模型則采用模式分類方法，例如：支持向量機（support vector machine，SVM）、隨機森林、大部分基于深度學習的方法等^[4-5]。判別式模型因為能夠明顯地區分背景和前景的信息，逐漸在微創手術工具檢測與跟蹤領域占據主流地位。

上文所提到的生成式模型中所有的方法和判別式模型中的 SVM 及隨機森林方法通常有較多的局限，比如：需要手工設計特征，不精確且耗費人力；難以構建高級的語義信息；無法應用于復雜場景等。但是基于深度學習的模型以人工神經網絡為架構對數據進行表征學習，具有學習能力強、特征表達能力高效、語義特征更高級的特點，所以從 2013 年以來研究者們普遍關注怎樣解決基于深度學習的微創手術工具檢測與跟蹤問題^[6]。

解決基于深度學習的微創手術工具檢測與跟蹤問題與解決基于深度學習的車輛檢測與跟蹤問題不同，前者存在目標更容易丟失、數據集更少、場景更加復雜等難題。同時微創手術工具的數據集也會因為遮擋、鏡面反射、手術場景中出血或煙霧導致信息丟失，進而影響微創手術工具檢測與跟蹤的測試效果^[2]。多年來研究者們直面難題，搭建全新的深度神經網絡，或者將用于解決各種醫學圖像分割或識別問題中的深度神經網絡進一步拓展應用在基于深度學習的微創手術工具檢測與跟蹤任務中^[7]。研究者們的努力使得基于深度學習的微創手術工具檢測與跟蹤技術不斷發展，至今已有多種基于深度學習的微創手術工具檢測與跟蹤算法被應用于 MIS 中。

現在基于深度學習的微創手術工具檢測與跟蹤算法中有兩種主流深度神經網絡模型：完全監督深度神經網絡模型和弱監督深度神經網絡模型。下面介紹近年來基于深度學習的微創手術工具檢測與跟蹤算法中測試效果較好的基于完全監督深度神經網絡和基于弱監督深度神經網絡的微創手術工具檢測與跟蹤算法。

2 基于完全監督深度神經網絡的微創手術工具檢測與跟蹤算法

2.1 以卷積神經網絡為骨干的經典算法

2.1.1 卷積神經網絡結合線段檢測

神經網絡的層數并不是越深越好，過深的神經網絡會帶來一系列的問題，比如圖像細節信息丟失、更新困難、定位更加復雜等。適當深度的神經網絡層數結合經典算法取得的效果或許更佳，例如 Chen 等^[8]提出了一種卷積神經網絡（convolutional neural network，CNN）結合傳統線段檢測（line segment detection，LSD）的微創手術工具檢測與跟蹤算法。該研究使用了 CNN 檢測和時空上下文（spatio-temporal context，STC）學習跟蹤算法，在視頻幀之間進行微創手術工具的檢測與跟蹤。在檢測之前，先用 LSD 來檢測微創手術工具外觀的線段并標記它們，然后使用有標記的圖像作為數據集來訓練 CNN，這些線段的位置有助于快速準確地檢測微創手術工具的尖端。最后使用 STC 學習算法逐幀跟蹤微創手術工具，STC 關鍵點在于有效利用快速傅里葉變換（fast fourier transform，FFT）和逆快速傅里葉變換（inverse fast fourier transform，IFFT）。實驗結果表明，在微創手術工具檢測與跟蹤任務中，Chen 等^[8]提出的方法具有先進的二維跟蹤性能，準確率為 93.2%，處理速度達到了 25 幀/s，故而該方法適合用于對實時性要求不是很高，但是對準確率要求較高的 MIS 中。如圖1 所示是其微創手術工具檢測與跟蹤算法流程圖。

圖1 CNN 結合 LSD 的微創手術工具檢測與跟蹤算法 Figure1. CNN combined with LSD for minimally invasive surgical tool detection and tracking algorithm

圖選項

1.	Zhao Zijian, Voros S, Weng Ying, et al. Tracking-by-detection of surgical instruments in minimally invasive surgery via the convolutional neural network deep learning-based method. Computer Assisted Surgery, 2017, 22(1): 26-35.
2.	Bouget D, Allan M, Stoyanov D, et al. Vision-based and marker-less surgical tool detection and tracking: a review of the literature. Med Image Anal, 2017, 35: 633-654.
3.	Du Xiaofei, Allan M, Dore A, et al. Combined 2D and 3D tracking of surgical instruments for minimally invasive and robotic-assisted surgery. Int J Comput Assist Radiol Surg, 2016, 11(6): 1109-1119.
4.	Twinanda A P, Shehata S M, Mutter D, et al. EndoNet: a deep architecture for recognition tasks on laparoscopic videos. IEEE Trans Med Imaging, 2017, 36(1): 86-97.
5.	Rieke N, Tan D J, Tombari F, et al. Real-time online adaption for robust instrument tracking and pose estimation. Medical Image Computing and Computer Assisted Intervention (MICCAI 2016), Springer, 2016: 422-430.
6.	Sahu M, Moerman D, Mewes P, et al. Instrument state recognition and tracking for effective control of robotized laparoscopic systems. International Journal of Mechanical Engineering and Robotics Research, 2016, 5(1): 33-38.
7.	García-Peraza-Herrera L C, Li Wenqi, Fidon L, et al. ToolNet: holistically-nested real-time segmentation of robotic surgical tools//IEEE/RSJ International Conference on Intelligent Robots and Systems, IEEE, 2017: 5717-5722.
8.	Chen Zhaorui, Zhao Zijian, Cheng Xiaolin. Surgical instruments tracking based on deep learning with lines detection and spatio-temporal context//Chinese Automation Congress (CAC 2017). IEEE, 2018. DOI: 10.1109/CAC.2017.8243236.
9.	Sahu M, Mukhopadhyay A, Szengel A, et al. Addressing multi-label imbalance problem of surgical tool detection using CNN. Int J Comput Assist Radiol Surg, 2017, 12(6): 1013-1020.
10.	García-Peraza-Herrera L C, Li Wenqi, Gruijthuijsen C, et al. Real-time segmentation of non-rigid surgical tools based on deep learning and tracking// Computer-Assisted and Robotic Endoscopy (CARE 2016). Springer, 2016: 84-95.
11.	Zhao Z, Voros S, Chen Z, et al. Surgical tool tracking based on two CNNs: from coarse to fine. The Journal of Engineering, 2019(14): 467-472.
12.	Zhao Zijian, Chen Zhaorui, Voros S, et al. Real-time tracking of surgical instruments based on spatio-temporal context and deep learning. Computer Assisted Surgery, 2019: 1-10.
13.	Sarikaya D, Corso J J, Guru K A. Detection and localization of robotic tools in robot-assisted surgery videos using deep neural networks for region proposal and detection. IEEE Trans Med Imaging, 2017, 36(7): 1542-1549.
14.	Jin A, Yeung S, Jopling J, et al. Tool detection and operative skill assessment in surgical videos using region-based convolutional neural networks//31st Conference on Neural Information Processing Systems (NIPS 2017), 2018. arXiv: 1802.08774v1.
15.	Choi B, Jo K, Choi S, et al. Surgical-tools detection based on convolutional neural network in laparoscopic robot-assisted surgery//39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), 2017. DOI: 10.1109/embc.2017.8037183.
16.	Redmon J, Divvala S K, Girshick R B, et al. You only look once: unified, real-time object detection// Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, 2016. arXiv: 1506.02640v5.
17.	Kurmann T, Neila P M, Du Xiaofei, et al. Simultaneous recognition and pose estimation of instruments in minimally invasive surgery//Medical Image Computing and Computer-Assisted Intervention (MICCAI 2017), Springer, 2017: 505-513.
18.	Gao Cong, Unberath M, Taylor R H, et al. Localizing dexterous surgical tools in X-ray for image-based navigation. arXiv: Computer Vision and Pattern Recognition, 2019. arXiv: 1901.06672v2.
19.	Du X, Kurmann T, Chang P L, et al. Articulated Multi-Instrument 2D pose estimation using fully convolutional networks. IEEE Trans Med Imaging, 2018, 37(5): 1276-1287.
20.	Laina I, Rieke N, Rupprecht C, et al. Concurrent segmentation and localization for tracking of surgical instruments//Medical Image Computing and Computer-Assisted Intervention (MICCAI 2017), Springer, 2017: 664-672.
21.	He Kaiming, Zhang Xiangyu, Ren Shaoqing, et al. Deep residual learning for image recognition// 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, 2016. arXiv: 1512.03385v1.
22.	Mishra K, Sathish R, Sheet D. Learning latent temporal connectionism of deep residual visual abstractions for identifying surgical tools in laparoscopy procedures//2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 2017: 2233-2240.
23.	Hajj H A, Lamard M, Conze P, et al. Monitoring tool usage in surgery videos using boosted convolutional and recurrent neural networks. Med Image Anal, 2018, 47: 203-218.
24.	Hwang S, Kim H E. Self-transfer learning for weakly supervised lesion localization// Medical Image Computing and Computer-Assisted Intervention (MICCAI 2016). 2016: 239-246.
25.	Jia Zhipeng, Huang Xingyi, Chang E I, et al. Constrained deep weak supervision for histopathology image segmentation. IEEE Trans Med Imaging, 2017, 36(11): 2376-2388.
26.	Singh K K, Lee Y J. Hide-and-seek: forcing a network to be meticulous for weakly-supervised object and action localization// IEEE International Conference on Computer Vision (ICCV), 2017: 3524-3533.
27.	Kim D, Cho D, Yoo D, et al. Two-phase learning for weakly supervised object localization//IEEE Computer Society and the Computer Vision Foundation (CVF), 2017: 3554-3563. arXiv: 1708.02108v3.
28.	Vardazaryan A, Mutter D, Marescaux J, et al. Weakly-supervised learning for tool localization in laparoscopic videos//LABELS 2018, CVII 2018, STENT 2018: Intravascular Imaging and Computer Assisted Stenting and Large-Scale Annotation of Biomedical Data and Expert Label Synthesis, 2018: 169-179. arXiv: 1806.05573v2.
29.	Nwoye C I, Mutter D, Marescaux J A. Weakly supervised convolutional LSTM approach for tool tracking in laparoscopic videos. Int J Comput Assist Radiol Surg, 2019, 14(6): 1059-1067.

《生物醫學工程學雜志》

基于深度學習的微創手術工具檢測與跟蹤研究綜述

摘要 全文 圖表 視頻 參考文獻 施引文獻 補充材料

引言

1 深度學習理論

2 基于完全監督深度神經網絡的微創手術工具檢測與跟蹤算法

2.1 以卷積神經網絡為骨干的經典算法

2.1.1 卷積神經網絡結合線段檢測

2.1.2 卷積神經網絡結合支持向量機以及隱馬爾科夫模型

2.1.3 全卷積網絡結合光流跟蹤

2.2 基于從粗略到精細級聯卷積神經網絡的算法

2.3 基于區域提案的兩階段卷積神經網絡算法

2.4 基于回歸的一階段卷積神經網絡算法

2.5 基于先收縮后擴展路徑的卷積神經網絡算法

2.6 基于卷積神經網絡結合遞歸神經網絡的算法

3 基于弱監督的深度神經網絡微創手術工具檢測與跟蹤算法

4 總結及展望

引言

1 深度學習理論

2 基于完全監督深度神經網絡的微創手術工具檢測與跟蹤算法

2.1 以卷積神經網絡為骨干的經典算法

2.1.1 卷積神經網絡結合線段檢測

2.1.2 卷積神經網絡結合支持向量機以及隱馬爾科夫模型

2.1.3 全卷積網絡結合光流跟蹤

2.2 基于從粗略到精細級聯卷積神經網絡的算法

2.3 基于區域提案的兩階段卷積神經網絡算法

2.4 基于回歸的一階段卷積神經網絡算法

2.5 基于先收縮后擴展路徑的卷積神經網絡算法

2.6 基于卷積神經網絡結合遞歸神經網絡的算法

3 基于弱監督的深度神經網絡微創手術工具檢測與跟蹤算法

4 總結及展望

上一篇

下一篇

Format

Content

摘要全文圖表視頻參考文獻施引文獻補充材料