Structure Extraction from Decorated Characters Using Multiscale Images
Shin'ichiro Omachi, Masaki Inoue, and Hirotomo Aso
IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.23, no.3, pp.315-322, March 2001

Abstract
Decorated characters are widely used in various documents. Practical optical character reader is required to deal with not only common fonts but also complex designed fonts. However, since appearances of decorated characters are complicated, most general character recognition systems cannot give good performances on decorated characters. In this paper, an algorithm that can extract character's essential structure from a decorated character is proposed. This algorithm is applied in preprocessing of character recognition. The proposed algorithm consists of three procedures: global structure extraction, interpolation of structure and smoothing. By using multi-scale images, topographical features such as ridges and ravines are detected for structure extraction. Ridges are used for extracting global structure, and ravines are used for interpolation. Experimental results show character structures are clearly extracted from very complex decorated characters.
Keywords
character recognition, OCR, decorated character, structure extraction
Full paper
PDF
Gzipped Postscript