학술논문

Interactive Deep Learning-Based Retinal OCT Layer Segmentation Refinement by Regressing Translation Maps
Document Type
Periodical
Source
IEEE Access Access, IEEE. 12:47009-47023 2024
Subject
Aerospace
Bioengineering
Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
Computing and Processing
Engineered Materials, Dielectrics and Plasmas
Engineering Profession
Fields, Waves and Electromagnetics
General Topics for Engineers
Geoscience
Nuclear Engineering
Photonics and Electrooptics
Power, Energy and Industry Applications
Robotics and Control Systems
Signal Processing and Analysis
Transportation
Retina
Image segmentation
Deep learning
Task analysis
Predictive models
Tuning
Training
Interactive systems
Annotations
Human in the loop
Optical coherence tomography
interactive annotation
deep learning
human-in-the-loop
optical coherence tomography
retina
Language
ISSN
2169-3536
Abstract
Retinal layer segmentation in optical coherence tomography (OCT) is essential for the diagnosis and follow-up of several diseases. Despite the success of deep learning approaches for this task, their clinical applicability is limited, since they neither account for pathologies other than those present in the training set nor for the specialists’ subjectivity. Thus, we propose an interactive layer segmentation approach that allows to obtain an initial segmentation and, more importantly, to interactively correct those segmentations. Our deep learning-based approach predicts the translation required to correct layer boundary segmentations by regressing pixel-wise translation maps that account for the user input. The method is designed to allow for segmentation correction by interactions with point-clicks or line-scribbles. Additionally, the system outputs a coordinate-wise confidence, allowing to automatically identify regions of possible segmentation failure that may require user attention. We extensively validate our approach on multiple private and public datasets with different pathomorphological complexities, achieving state-of-the-art performance, while allowing for a simple and efficient user interaction.