학술논문

Bimodal Camera Pose Prediction for Endoscopy
Document Type
Periodical
Source
IEEE Transactions on Medical Robotics and Bionics IEEE Trans. Med. Robot. Bionics Medical Robotics and Bionics, IEEE Transactions on. 5(4):978-989 Nov, 2023
Subject
Bioengineering
Robotics and Control Systems
Computing and Processing
Cameras
Colonoscopy
Colon
Three-dimensional displays
Image reconstruction
Synthetic data
Medical robotics
Pose estimation
Simultaneous localization and mapping
Endoscopes
3D reconstruction
camera pose estimation
endoscopy
SLAM
surgical AI
Language
ISSN
2576-3202
Abstract
Deducing the 3D structure of endoscopic scenes from images is exceedingly challenging. In addition to deformation and view-dependent lighting, tubular structures like the colon present problems stemming from their self-occluding and repetitive anatomical structure. In this paper, we propose SimCol, a synthetic dataset for camera pose estimation in colonoscopy, and a novel method that explicitly learns a bimodal distribution to predict the endoscope pose. Our dataset replicates real colonoscope motion and highlights the drawbacks of existing methods. We publish 18k RGB images from simulated colonoscopy with corresponding depth and camera poses and make our data generation environment in Unity publicly available. We evaluate different camera pose prediction methods and demonstrate that, when trained on our data, they generalize to real colonoscopy sequences, and our bimodal approach outperforms prior unimodal work. Our project and dataset can be found here: https://www.github.com/anitarau/simcol.