Script Identification of Multi-Script Documents A Survey
In recent years, with the widespread of Internet and digitized processing of multi-script doc-uments worldwide, script identication techniques have become more important in the pattern recognition eld. Script identication concerns methods for identifying different scripts in multi-lingual, multi-script documents. This paper presents a comprehensive overview on research activities in the eld and focuses on the most valuable results obtained so far. The most vital processes in script identication are addressed in detail: identication and discriminating methods, features extraction (local and global), and classication. Different kinds of approaches have been developed and promising results have been achieved. This paper reports SoA performance results. This paper reports methods concerning handwritten, printed, and hybrid document processing. More research is necessary to meet the performance levels essential for everyday applications.
Handwriting recognition, optical character recognition (OCR), character recognition, multi-script documents, script identication.