Start Over

Monitoring tool usage in surgery videos using boosted convolutional and recurrent neural networks.

Authors :: Al Hajj, Hassan
Lamard, Mathieu
Conze, Pierre-Henri
Cochener, Béatrice
Quellec, Gwenolé
Source :: Medical Image Analysis. Jul2018, Vol. 47, p203-218. 16p.
Publication Year :: 2018
Abstract: This paper investigates the automatic monitoring of tool usage during a surgery, with potential applications in report generation, surgical training and real-time decision support. Two surgeries are considered: cataract surgery, the most common surgical procedure, and cholecystectomy, one of the most common digestive surgeries. Tool usage is monitored in videos recorded either through a microscope (cataract surgery) or an endoscope (cholecystectomy). Following state-of-the-art video analysis solutions, each frame of the video is analyzed by convolutional neural networks (CNNs) whose outputs are fed to recurrent neural networks (RNNs) in order to take temporal relationships between events into account. Novelty lies in the way those CNNs and RNNs are trained. Computational complexity prevents the end-to-end training of “CNN+RNN” systems. Therefore, CNNs are usually trained first, independently from the RNNs. This approach is clearly suboptimal for surgical tool analysis: many tools are very similar to one another, but they can generally be differentiated based on past events. CNNs should be trained to extract the most useful visual features in combination with the temporal context. A novel boosting strategy is proposed to achieve this goal: the CNN and RNN parts of the system are simultaneously enriched by progressively adding weak classifiers (either CNNs or RNNs) trained to improve the overall classification accuracy. Experiments were performed in a dataset of 50 cataract surgery videos, where the usage of 21 surgical tools was manually annotated, and a dataset of 80 cholecystectomy videos, where the usage of 7 tools was manually annotated. Very good classification performance are achieved in both datasets: tool usage could be labeled with an average area under the ROC curve of A z = 0.9961 and A z = 0.9939 , respectively, in offline mode (using past, present and future information), and A z = 0.9957 and A z = 0.9936 , respectively, in online mode (using past and present information only). [ABSTRACT FROM AUTHOR]

Subjects :: *VIDEO recording in medicine
*RECURRENT neural networks
*DECISION support systems
*CHOLECYSTECTOMY
DIGESTIVE organ surgery

Details

Language :: English
ISSN :: 13618415
Volume :: 47
Database :: Academic Search Index
Journal :: Medical Image Analysis
Publication Type :: Academic Journal
Accession number :: 129908645
Full Text :: https://doi.org/10.1016/j.media.2018.05.001

Full Text Access

View/download PDF

Tools

Email
Cite

Printer

Authors Abstract Subjects Details

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Monitoring tool usage in surgery videos using boosted convolutional and recurrent neural networks.

Abstract

Subjects

Details

Tools

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Monitoring tool usage in surgery videos using boosted convolutional and recurrent neural networks.

Abstract

Subjects

Details

Tools

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources