A Persian OCR System using Morphological Operators

Optical Character Recognition (OCR) is a very old and of great interest in pattern recognition field. In this paper we introduce a very powerful approach to recognize Persian text. We have used morphological operators, especially Hit/Miss operator to descript each sub-word and by using a template matching approach we have tried to classify generated description. We used just one font in two different sizes to verify our approach. We achieved a very good rate, up to 99.9%.





References:
[1] Badr Al-Badr and Saberi A.Mahmoud, "Survey and bibliograghy of Arabic optical text recognition",Elsevier Signal Processing,1995, pp.49-77.
[2] seera, J.,Image Analysis and Mathematical Morphology", Acrdemic
Press, NewYork, 1982
[3] J.W. Smith and Z. Merali, "Optical character recognition", The British
Library, Wetherby, West Yorkshire LS23 7BQ, UK, 1985.
[4] E.M. Welch, "Can you read this? OCR software", MacUser, Vol. 9, No.
8, November 1993, pp. 169-178.
[5] R.Azmi and A.Kabir "A new segmentation technique for omnifont
Farsi text",Elsevier Pattern Recognition Letters, 2001, pp. 97-104.
[6] B. Parhami and M. Tarighi, "Automatic recognition of printed Farsi
text", Pattern Recognition, Vol. 14, No. 1,1981, pp. 1-6.
[7] H. Almuallim and S. Yamaguchi, "A method of recognition of Arabic
cursive handwriting," IEEE Trans. Patt. Anal. Machine Intell., vol.
PAMI-9, no. 5, Sept. 1987.
[8] T. El-Sheikh and R. Guindi, "Computer recognition of Arabic scripts,"
Patt. Recogn., vol. 21, no. 4, pp. 293-302, 1988.
[9] M. El-Wakil and A. Shoukry, "On-line recognition of handwritten
isolated Arabic characters," Patt. Recogn., vol. 22, no. 2, pp. 97-105,
1989.
[10] B. Timsari, Character recognition in typed Persian words: a
morphological approach, M.S. thesis, Isfahan Univ. of Tech., Iran,
1992.
[11] B. K. Jang and R. T. Chin, "Analysis of thinning algorithms using
mathematical morphology,"IEEE Trans. Patt. Anal. Machine Intell., vol.
PAMI-12, no. 6, pp. 541-551, June 1990.
[12] A. Amin and G. Masini, "Machine recognition of multifont printed
Arabic texts," in Proc. 8th Int. Conf. Patt. recogn., pp. 392-395, Paris,
1986.