학술논문

CMC-Net: A New Transformer-Based Architecture for Polyp Segmentation
Document Type
Conference
Source
2023 RIVF International Conference on Computing and Communication Technologies (RIVF) Computing and Communication Technologies (RIVF), 2023 RIVF International Conference on. :13-18 Dec, 2023
Subject
Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
Computing and Processing
Fields, Waves and Electromagnetics
Power, Energy and Industry Applications
Robotics and Control Systems
Signal Processing and Analysis
Transportation
Pyramid Vision Transformer
Polyp Segmentation
Mixer
Attention module
Language
ISSN
2473-0130
Abstract
Polyps are pathological and usually appear in the gastrointestinal, most of which are not dangerous to health, but some are at risk of developing into cancer, especially polyps in the colon. Therefore, accurate polyp segmentation from colonoscopy images is necessary to help doctors come up with a treatment plan. Based on the effectiveness of convolutional neural networks (CNNs) and encoder-decoder architecture, U-Net has become the baseline model for most medical image segmentation tasks. However, due to limitations of long-range dependency of CNNs and an information imbalance of skip connection between different-level features from the encoder to the decoder, the Unet architecture gives low-accuracy seg-mentation results for complex datasets like polyps. To improve the above limitations, we introduce a new transformer-based architecture termed CMC-Net. A backbone of the encoder of our proposed model is PVT-v2 which produces a global receptive field instead of local receptive fields in traditional CNN backbones. We design the ConvMLP-Mixer block to exploit context information from the encoder and connect it to the feature map from the decoding process. An attention map is proposed parallel with the decoder to combine all levels of context information on full scale from the decoder. Our experiments with the K vasir-SEG set and the CVC-ClinicDB set to show that CMC-Net gives better-improved accuracy than other state-of-the-art models. The code is available at: https://github.com/thanhthu152/CMC-Net.