학술논문

Mask R-CNN End-to-End Text Detection and Recognition
Document Type
Conference
Source
2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA) Machine Learning And Applications (ICMLA), 2019 18th IEEE International Conference On. :1787-1793 Dec, 2019
Subject
Computing and Processing
Engineering Profession
Robotics and Control Systems
Signal Processing and Analysis
Text recognition
Proposals
Image recognition
Feature extraction
Computational modeling
Neural networks
Image segmentation
Convolutional Neural Networks, Scene Text Recognition, Cascade Region Proposal Network, Mask Text Recognition
Language
Abstract
Text detection and recognition have witnessed drastic improvements in the field of computer vision. This end-toend model comprising of the detection and recognition models scales to provide higher accuracy. The most important phase in this end-to-end approach is the detection phase, as it plays an important role to identify the text. To address this issue, different approaches have been proposed. However, most of the methods produce lower efficiency to detect and recognize real world text. In this paper, we propose a new approach to investigate the challenges that the existing models possess and improve the efficiency of the detection and in turn increases the accuracy of text recognition. The proposed method outperforms the state-ofthe-art approaches due to the use of deblurring and sharpening to reduce noise in the pre-processing stage, followed by the cascade region proposal network model to improve the detection of real world text using non max suppression. Experimentations on real word datasets highlight the effectiveness of our method.