학술논문

Correcting broken characters in the recognition of historical printed documents
Document Type
Conference
Source
Proceedings of the 3rd ACM/IEEE-CS joint conference on Digital libraries. :364-366
Subject
Language
English
Abstract
This paper presents a new technique for dealing with broken characters, one of the major challenges in the optical character recognition (OCR) of degraded historical printed documents. A technique based on graph combinatorics is used to rejoin the appropriate connected components. It has been applied to real data with successful results.

Online Access