학술논문

SmokeSeger: A Transformer-CNN Coupled Model for Urban Scene Smoke Segmentation
Document Type
Periodical
Source
IEEE Transactions on Industrial Informatics IEEE Trans. Ind. Inf. Industrial Informatics, IEEE Transactions on. 20(2):1385-1396 Feb, 2024
Subject
Power, Energy and Industry Applications
Signal Processing and Analysis
Computing and Processing
Communication, Networking and Broadcast Technologies
Semantics
Transformers
Semantic segmentation
Convolutional neural networks
Task analysis
Decoding
Visualization
Convolutional neural network
dual-branch encoder
smoke semantic segmentation
transformer
urban smoke scene
Language
ISSN
1551-3203
1941-0050
Abstract
Smoke is an informative indicator of early fire and gas leakage. Segmenting the smoke from images can provide detailed information about the smoke volume, dispersion direction, and source location, which has significant implications considering the proliferation of video surveillance systems in cities. Focusing on smoke segmentation in the urban scene, we designed a dual-branch segmentation model, named SmokeSeger, which couples a transformer branch and a convolutional neural network (CNN) branch to enhance the representation of both global and local features. To address the lack of real-scene smoke datasets, we built an urban scene smoke segmentation dataset containing 3217 images of fire smoke and exhaust emissions with accurate annotations. Experiments validate that the SmokeSeger outperforms other mainstream segmentation methods on the proposed dataset. Visualization of attention maps reveals that the model could effectively capture the semantic relationship between the smoke and the corresponding source, which benefits the discrimination between smoke and smoke-like objects. More details available at https://github.com/VisAcademic/SmokeSeger.