학술논문

A CNN-Transformer-based Approach for Medical Image Segmentation
Document Type
Conference
Source
2023 International Conference on System Science and Engineering (ICSSE) System Science and Engineering (ICSSE), 2023 International Conference on. :22-27 Jul, 2023
Subject
Bioengineering
Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
Computing and Processing
Power, Energy and Industry Applications
Robotics and Control Systems
Signal Processing and Analysis
Measurement
Image segmentation
Neural networks
Data science
Transformers
Feature extraction
Decoding
Medical image segmentation
Vision Transformer
Fully convolutional neural networks
Unet
PASPP
Convmixer
local and global context
Language
ISSN
2325-0925
Abstract
Advances in deep convolutional neural networks (CNNs) have shown excellent performances on image processing applications including segmentation for medical images. Nevertheless, CNN-based approaches like the Fully Convolutional Neural Networks (FCNs), Unet and variants for image segmentation often meet difficulties when expressing long-range dependency because of the locality properties of convolutional operations. In an alternative, the network models based on transformers have global context of the image and features, thus better expressing long-range dependency. Though having advantages, the transformer-based approach often lacks local information context, thus limiting certain applications like medical images. In the current study, we propose a new model that can inherit advantages of both global and local contexts of the two above approaches by using CNN and Transformer branches, and introduced the Convmixer and Progressive Atrous Spatial Pyramidal Pooling modules in the bottlenecks of each branches. The proposed model has been validated on various medical image databases including the Data science bowls 2018, and GlaS datasets. High evaluation scores including Dice score and Intersection Over Union metric have shown performance of the proposed segmentation model while compared with recent neural network models.