daVinciNet: Joint Prediction of Motion and Surgical State in Robot-Assisted Surgery
Abstract
This paper presents a technique to concurrently and jointly predict the future trajectories of surgical instruments and the future states of surgical subtasks in robot-assisted surgery (RAS) using multiple input sources. Such predictions are a necessary first step towards shared control and supervised autonomy of surgical subtasks. Minute-long surgical subtasks, such as suturing or ultrasound scanning, often have distinguishable tool kinematics and visual features, and can be described as a series of fine-grained states with transition schematics. We propose daVinciNet, an end-to-end dual-task model for robot motion and surgical state prediction. daVinciNet performs concurrent end-effector trajectory and surgical state predictions using features extracted from multiple data streams, including robot kinematics, endoscopic vision, and system events. We evaluate our proposed model on an extended Robotic Intra-Operative Ultrasound (RIOUS+) imaging dataset collected on a da Vinci® Xi surgical system and on the JHU-ISI Gesture and Skill Assessment Working Set (JIGSAWS). Our model achieves up to 93.85% short-term (0.5 s) and 82.11% long-term (2 s) state prediction accuracy, with trajectory prediction errors as low as 1.07 mm (short-term) and 5.62 mm (long-term).
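The record itself contains no implementation details, but the abstract's description of a dual-task model over fused kinematics, vision, and event streams can be illustrated with a short sketch. The PyTorch module below is a minimal, hypothetical illustration: the per-stream feature dimensions, the shared LSTM encoder, the two linear task heads, and the 10-step prediction horizon are all assumptions chosen for clarity, not the authors' actual daVinciNet architecture.

```python
# Hypothetical sketch of a dual-task (trajectory + state) predictor.
# All dimensions and the encoder/head design are illustrative assumptions,
# not the architecture described in the paper.
import torch
import torch.nn as nn

class DualTaskPredictor(nn.Module):
    def __init__(self, kin_dim=14, vis_dim=128, evt_dim=4,
                 hidden=256, horizon=10, n_states=8):
        super().__init__()
        # Fuse the three input streams into one feature vector per time step.
        self.fuse = nn.Linear(kin_dim + vis_dim + evt_dim, hidden)
        # Shared temporal encoder over the observed window.
        self.encoder = nn.LSTM(hidden, hidden, batch_first=True)
        # Task heads: future 3D end-effector positions and future state logits.
        self.traj_head = nn.Linear(hidden, horizon * 3)
        self.state_head = nn.Linear(hidden, horizon * n_states)
        self.horizon, self.n_states = horizon, n_states

    def forward(self, kin, vis, evt):
        # kin: (B, T, kin_dim); vis: (B, T, vis_dim); evt: (B, T, evt_dim)
        x = torch.relu(self.fuse(torch.cat([kin, vis, evt], dim=-1)))
        _, (h, _) = self.encoder(x)           # h: (1, B, hidden)
        h = h.squeeze(0)
        traj = self.traj_head(h).view(-1, self.horizon, 3)
        states = self.state_head(h).view(-1, self.horizon, self.n_states)
        return traj, states                   # positions and per-step state logits

model = DualTaskPredictor()
kin = torch.randn(2, 30, 14)    # 30 observed steps of kinematics
vis = torch.randn(2, 30, 128)   # visual features from the endoscope stream
evt = torch.randn(2, 30, 4)     # system event indicators
traj, states = model(kin, vis, evt)
print(traj.shape, states.shape)  # (2, 10, 3), (2, 10, 8)
```

A single shared encoder with separate heads is one common way to realize joint prediction, since the two outputs can exploit the same temporal context; the paper's reported short- and long-term horizons (0.5 s and 2 s) would correspond to different values of the horizon parameter here.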
Additional Information
© 2020 IEEE. This work was funded by Intuitive Surgical, Inc. We would like to thank Dr. Azad Shademan and Dr. A. Jonathan McLeod for their support of this research.
Files
- Accepted Version: 2009.11937.pdf (7.1 MB, md5:23655d1c7f9c77fe26e0cfb428be159e)
Additional details
- Eprint ID: 107578
- Resolver ID: CaltechAUTHORS:20210119-161653290
- Funder: Intuitive Surgical, Inc.
- Created: 2021-01-20 (from EPrint's datestamp field)
- Updated: 2021-11-16 (from EPrint's last_modified field)