학술논문

CrackHAM: A Novel Automatic Crack Detection Network Based on U-Net for Asphalt Pavement
Document Type
Periodical
Author
Source
IEEE Access Access, IEEE. 12:12655-12666 2024
Subject
Aerospace
Bioengineering
Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
Computing and Processing
Engineered Materials, Dielectrics and Plasmas
Engineering Profession
Fields, Waves and Electromagnetics
General Topics for Engineers
Geoscience
Nuclear Engineering
Photonics and Electrooptics
Power, Energy and Industry Applications
Robotics and Control Systems
Signal Processing and Analysis
Transportation
Feature extraction
Deep learning
Computer architecture
Task analysis
Semantic segmentation
Convolutional neural networks
Convolution
Pavement crack detection
deep learning
fusion module
attention mechanism
atrous convolution
Language
ISSN
2169-3536
Abstract
The maintenance of pavements takes considerable time and poses a significant task, especially when it comes to detecting cracks at the pixel level. Due to the complexity of pavement conditions, such as road markings, shadows, and oil stains, deep learning techniques are still a challenge in automating crack detection. This paper presents a novel methodology termed as CrackHAM, which is an encoder-decoder network founded on the U-Net architecture. The primary objectives of CrackHAM are twofold: to achieve accurate and robust pavement crack detection while reducing the parameters of the network. Our study introduces two significant improvements to the existing neural network architecture, namely the phased multi-fusion module and the dual attention mechanisms. These improvements improve the process of defect extraction, resulting in an improved level of performance. Furthermore, a novel module named HASPP is devised to augment the network’s capacity to acquire more comprehensive receptive fields. In order to lower the number of network parameters, a technique is employed whereby only use half of the number of input channels and output channels in the VGG16 are utilized as U-Net encoder modules. The empirical findings demonstrate that in the Deepcrack, Crack500, and FIND public datasets, CrackHAM achieves superior segmentation performance compared to the FCN, Deeplabv3, Swin-Unet, and U-Net models while utilizing only one-third of the computational resources.