학술논문

Text Detection Based on MSER and CNN Features
Document Type
Conference
Source
2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR) ICDAR Document Analysis and Recognition (ICDAR), 2017 14th IAPR International Conference on. 01:949-954 Nov, 2017
Subject
Computing and Processing
Image edge detection
Image color analysis
Merging
Fractals
Feature extraction
Filtering
Support vector machines
Scene text detection
MSERs
HOG
CNN
SVM
Language
ISSN
2379-2140
Abstract
Text detection in natural scenes holds great importance in the field of research and still remains a challenge and an important task because of size, various fonts, line orientation, different illumination conditions, weak characters and complex backgrounds in image. The contribution of our proposed method is to filtering out complex backgrounds by combining three strategies. These are enhancing the edge candidate detection in HSV space color using the fractal dimension (FD) to transform the image intensities, then using MSER candidate detection to get different masks applied in HSV space color as well as gray color. After that, we opt for the Stroke Width Transform (SWT) and heuristic filtering. Such strategies are followed so as to maximize the capacity of zones text pixels candidates and distinguish between text boxes and the rest of the image. The components selected non text are filtered by classifying the characters candidates using Support Vector Machines (SVM) exploring Convolutional Neural Networks (CNN) features and Histogram of Oriented Gradients (HOG) vector features. We use the technique of word grouping who the boundary box localization select different words in the image where false positives text blocks are eliminated by geometrical properties. The evaluation of the proposed method demonstrate the effectiveness of our method for complex foreground through the experimental results tested on three benchmarks ICDAR2013, ICDAR2015 and MSRA-TD500.