G. S. Lehal - Journal Author

Identification of Printed Punjabi Words and English Numerals Using Gabor Features

Year: 2011 Volume: 5 Issue: 1 120 - 123 Pages

Abstract: Script identification is one of the challenging steps in the development of optical character recognition system for bilingual or multilingual documents. In this paper an attempt is made for identification of English numerals at word level from Punjabi documents by using Gabor features. The support vector machine (SVM) classifier with five fold cross validation is used to classify the word images. The results obtained are quite encouraging. Average accuracy with RBF kernel, Polynomial and Linear Kernel functions comes out to be greater than 99%.

A Study of Touching Characters in Degraded Gurmukhi Text

Year: 2007 Volume: 1 Issue: 4 1147 - 1150 Pages

Abstract: Character segmentation is an important preprocessing step for text recognition. In degraded documents, existence of touching characters decreases recognition rate drastically, for any optical character recognition (OCR) system. In this paper a study of touching Gurmukhi characters is carried out and these characters have been divided into various categories after a careful analysis.Structural properties of the Gurmukhi characters are used for defining the categories. New algorithms have been proposed to segment the touching characters in middle zone. These algorithms have shown a reasonable improvement in segmenting the touching characters in degraded Gurmukhi script. The algorithms proposed in this paper are applicable only to machine printed text.

Segmentation Problems and Solutions in Printed Degraded Gurmukhi Script

Year: 2008 Volume: 2 Issue: 9 2973 - 2982 Pages

Abstract: Character segmentation is an important preprocessing step for text recognition. In degraded documents, existence of touching characters decreases recognition rate drastically, for any optical character recognition (OCR) system. In this paper we have proposed a complete solution for segmenting touching characters in all the three zones of printed Gurmukhi script. A study of touching Gurmukhi characters is carried out and these characters have been divided into various categories after a careful analysis. Structural properties of the Gurmukhi characters are used for defining the categories. New algorithms have been proposed to segment the touching characters in middle zone, upper zone and lower zone. These algorithms have shown a reasonable improvement in segmenting the touching characters in degraded printed Gurmukhi script. The algorithms proposed in this paper are applicable only to machine printed text. We have also discussed a new and useful technique to segment the horizontally overlapping lines.

Top Journal

SUGGEST A JOURNAL

Identification of Printed Punjabi Words and English Numerals Using Gabor Features

A Study of Touching Characters in Degraded Gurmukhi Text

Segmentation Problems and Solutions in Printed Degraded Gurmukhi Script