학술논문

MSSINet: Real-Time Segmentation Based on Multi-Scale Strip Integration
Document Type
Periodical
Source
IEEE Journal of Radio Frequency Identification IEEE J. Radio Freq. Identif. Radio Frequency Identification, IEEE Journal of. 8:241-251 2024
Subject
Fields, Waves and Electromagnetics
Communication, Networking and Broadcast Technologies
Signal Processing and Analysis
Strips
Feature extraction
Semantic segmentation
Semantics
Real-time systems
Computer architecture
Training
Real-time semantic segmentation
dual-resolution network
strip pooling
deep supervision
Language
ISSN
2469-7281
2469-729X
Abstract
Semantic segmentation plays a fundamental role in computer vision, underpinning applications such as autonomous driving and scene analysis. Although dual-branch networks have marked advancements in accuracy and processing speed, they falter in the context extraction phase within the low-resolution branch. Traditionally, square pooling is used at this juncture, leading to the oversight of stripe-shaped contextual information. In response, we introduce a novel architecture based on a deep aggregation pyramid, engineered for both real-time processing and precise segmentation. Central to our approach is a pioneering contextual information extractor designed to expand the effective receptive fields and fuse multi-scale context from low-resolution feature maps. Additionally, we have developed a feature fusion module to enhance the integration and differentiation of high-level semantic information across branches. To further refine the fidelity of segmentation, we implement dual deep supervisions within the high-resolution branchs intermediate layer, concentrating on boundary delineation and global features to enrich spatial detail capture. Our comprehensive experimental analysis, conducted on the Cityscapes and CamVid datasets, affirms MSSINets superior performance, showcasing its competitiveness against existing leading methodologies across a variety of scenarios.