ABSTRACT
No abstract available.
Recommendations
Word-Wise Thai and Roman Script Identification
In some Thai documents, a single text line of a printed document page may contain words of both Thai and Roman scripts. For the Optical Character Recognition (OCR) of such a document page it is better to identify, at first, Thai and Roman script ...
Benchmark databases of handwritten Bangla-Roman and Devanagari-Roman mixed-script document images
Handwritten document image dataset is one of the basic necessities to conduct research on developing Optical Character Recognition (OCR) systems. In a multilingual country like India, handwritten documents often contain more than one script, leading to ...
Italic or Roman: Word Style Recognition without A Priori Knowledge for Old Printed Documents
ICDAR '09: Proceedings of the 2009 10th International Conference on Document Analysis and RecognitionThis paper presents an Italic/Roman word type recognition system without a priori knowledge on the characters' font. This method aims at analyzing old documents in which character segmentation is not trivial. Therefore our approach segments the document ...
Comments