Welcome to the new version of CaltechAUTHORS. Login is currently restricted to library staff. If you notice any issues, please email coda@library.caltech.edu
Published March 30, 2023 | Published + Supplemental Material
Journal Article Open

A multi-institutional study using artificial intelligence to provide reliable and fair feedback to surgeons

Abstract

Background. Surgeons who receive reliable feedback on their performance quickly master the skills necessary for surgery. Such performance-based feedback can be provided by a recently-developed artificial intelligence (AI) system that assesses a surgeon's skills based on a surgical video while simultaneously highlighting aspects of the video most pertinent to the assessment. However, it remains an open question whether these highlights, or explanations, are equally reliable for all surgeons. Methods. Here, we systematically quantify the reliability of AI-based explanations on surgical videos from three hospitals across two continents by comparing them to explanations generated by humans experts. To improve the reliability of AI-based explanations, we propose the strategy of training with explanations –TWIX –which uses human explanations as supervision to explicitly teach an AI system to highlight important video frames. Results. We show that while AI-based explanations often align with human explanations, they are not equally reliable for different sub-cohorts of surgeons (e.g., novices vs. experts), a phenomenon we refer to as an explanation bias. We also show that TWIX enhances the reliability of AI-based explanations, mitigates the explanation bias, and improves the performance of AI systems across hospitals. These findings extend to a training environment where medical students can be provided with feedback today. Conclusions. Our study informs the impending implementation of AI-augmented surgical training and surgeon credentialing programs, and contributes to the safe and fair democratization of surgery.

Additional Information

© The Author(s) 2023. This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. Research reported in this publication was supported by the National Cancer Institute under Award No. R01CA251579-01A1. Contributions. D.K. contributed to the conception of the study and the study design, developed the deep learning models, and wrote the manuscript. J.L. collected the data from the training environment. D.K., J.L., T.H., and M.O. provided annotations for the video samples. D.A.D. provided feedback on the manuscript. C.W. collected data from St. Antonius-Hospital and B.J.M. collected data from Houston Methodist Hospital, and provided feedback on the manuscript. A.J.H., and A.A. provided supervision and contributed to edits of the manuscript. Data availability. As the data contain protected health information, the videos of live surgical procedures and the patients' corresponding demographic information from the University of Southern California, St. Antonius Hospital, and Houston Methodist Hospital are not publicly available. However, since the data from the training environment do not involve patients, those videos and annotations are available on Zenodo (https://zenodo.org/record/7221656#.Y-ZIfi_MI2y) upon reasonable request from the authors. Source data for Fig. 1 is in Supplementary Data 1. Source data for Fig. 3 is in Supplementary Data 2. Source data for Fig. 4 is in Supplementary Data 3 and 4. Source data for Fig. 5 is in Supplementary Data 5. Code availability. While SAIS, the underlying AI system, can be accessed at https://github.com/danikiyasseh/SAIS, the code for the existing study can be found at https://github.com/danikiyasseh/TWIX. Competing interests. The authors declare the following competing interests: D.K. is a paid consultant of Flatiron Health and an employee of Vicarious Surgical. C.W. is a paid consultant of Intuitive Surgical. A.A. is an employee of Nvidia. A.J.H is a consultant of Intuitive Surgical. The remaining authors declare no competing interests.

Attached Files

Published - 43856_2023_Article_263.pdf

Supplemental Material - 43856_2023_263_MOESM1_ESM.csv

Supplemental Material - 43856_2023_263_MOESM2_ESM.csv

Supplemental Material - 43856_2023_263_MOESM3_ESM.xlsx

Supplemental Material - 43856_2023_263_MOESM4_ESM.xlsx

Supplemental Material - 43856_2023_263_MOESM5_ESM.xlsx

Supplemental Material - 43856_2023_263_MOESM6_ESM.pdf

Supplemental Material - 43856_2023_263_MOESM7_ESM.pdf

Files

43856_2023_263_MOESM1_ESM.csv
Files (2.3 MB)
Name Size Download all
md5:d4dacf40b75e5cbde5a430838ce12370
7.4 kB Preview Download
md5:aef73249aee386ee2ecfabca8555e4a6
81.4 kB Preview Download
md5:5d30200852941cc28695c753b3a79cd3
93.0 kB Preview Download
md5:66be8082385ddf75bc8a6d823e33a961
1.8 MB Preview Download
md5:9ab3c0a3517ae99d22ae24cd09ee2b73
158.2 kB Download
md5:b4d19ba4c3c9d5d2bd198333889c1b07
4.2 kB Preview Download
md5:f9bbb59e3ff9b0056832bcec779ebbfd
50.1 kB Download
md5:8512d2b696082b1150825cfafd4c8432
48.1 kB Download

Additional details

Created:
August 20, 2023
Modified:
October 20, 2023