Welcome to the new version of CaltechAUTHORS. Login is currently restricted to library staff. If you notice any issues, please email coda@library.caltech.edu
Published June 2023 | Supplemental Material + Published
Journal Article Open

A vision transformer for decoding surgeon activity from surgical videos

Abstract

The intraoperative activity of a surgeon has substantial impact on postoperative outcomes. However, for most surgical procedures, the details of intraoperative surgical actions, which can vary widely, are not well understood. Here we report a machine learning system leveraging a vision transformer and supervised contrastive learning for the decoding of elements of intraoperative surgical activity from videos commonly collected during robotic surgeries. The system accurately identified surgical steps, actions performed by the surgeon, the quality of these actions and the relative contribution of individual video frames to the decoding of the actions. Through extensive testing on data from three different hospitals located in two different continents, we show that the system generalizes across videos, surgeons, hospitals and surgical procedures, and that it can provide information on surgical gestures and skills from unannotated videos. Decoding intraoperative activity via accurate machine learning systems could be used to provide surgeons with feedback on their operating skills, and may allow for the identification of optimal surgical behaviour and for the study of relationships between intraoperative factors and postoperative outcomes.

Additional Information

© 2023. The Author(s). This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. We are grateful to T. Chu for the annotation of videos with gestures. We also thank J. Laca and J. Nguyen for early feedback on the presentation of the manuscript. A.J.H. discloses support for the research described in this study from the National Cancer Institute under award no. R01CA251579-01A1 and a multi-year Intuitive Surgical Clinical Research Grant. Contributions. D.K. and A.J.H. contributed to the conception of the study. D.K. contributed to the study design, developed the deep learning models and wrote the manuscript. R.M. and T.H. provided annotations for the video samples. D.A.D. provided extensive feedback on the manuscript. B.J.M. provided data for the study. C.W. collected data from SAH and provided feedback on the manuscript. A.J.H. and A.A. provided supervision and contributed to edits of the manuscript. Data availability. Data supporting the results in this study involve surgeon and patient data. As such, while the data from SAH and HMH are not publicly available, de-identified data from USC can be made available upon reasonable request from the authors. Code availability. Code is made available at https://github.com/danikiyasseh/SAIS. Competing interests. D.K. is a paid employee of Vicarious Surgical and a consultant of Flatiron Health. C.W. is a paid consultant of Intuitive Surgical. A.A. is an employee of Nvidia. A.J.H. is a consultant of Intuitive Surgical. The other authors declare no competing interests.

Attached Files

Published - 41551_2023_Article_1010.pdf

Supplemental Material - 41551_2023_1010_MOESM1_ESM.pdf

Files

41551_2023_Article_1010.pdf
Files (3.0 MB)
Name Size Download all
md5:b2f272aacbad49add88ef8a8a2f22748
2.4 MB Preview Download
md5:ae411eb3e0658c3ba94496f7e5afd416
646.0 kB Preview Download

Additional details

Created:
August 22, 2023
Modified:
October 18, 2023