학술논문

Remote Sensing Image Scene Classification Method Combined Attention Mechanism and Multiscale Feature
Document Type
Conference
Source
2021 6th International Conference on Image, Vision and Computing (ICIVC) Image, Vision and Computing (ICIVC), 2021 6th International Conference on. :186-190 Jul, 2021
Subject
Computing and Processing
Computer vision
Image analysis
Fuses
Semantics
Benchmark testing
Feature extraction
Information retrieval
scene classification
attention mechanism
multiscale
CNN
remote sensing
Language
Abstract
Remote sensing image scene classification is a hot topic in computer vision field. However, in remote sensing image, the background information is complex and the scale of objects varies greatly. To address these limitations, in this paper, we propose a method combined attention mechanism and multi-scale features fusion. First, ResNet50 is employed as the feature extraction backbone network to extract features, and multiple attention networks are used to extract attention maps from channels and spaces simultaneously. Second, a sampling branch is utilized to further amplify important features. In addition, a context information extraction network is designed to pool the features into three-scale and fuse them with the convolutional layer to provide richer feature information. In order to avoid network overfitting and improve the generalization ability of the model. Extensive experiments were evaluated on AID and NWPU-RESISC45 dataset. Our method outperforms the baseline by 2.71% on AID and 2.75% on NWPU benchmark respectively. The experimental results demonstrate that comparing with the state-of-the-art methods and baseline, the accuracy of our method is significantly improved.