Study of Features for Hand-printed Recognition

The feature extraction method(s) used to recognize hand-printed characters play an important role in ICR applications. In order to achieve high recognition rate for a recognition system, the choice of a feature that suits for the given script is certainly an important task. Even if a new feature required to be designed for a given script, it is essential to know the recognition ability of the existing features for that script. Devanagari script is being used in various Indian languages besides Hindi the mother tongue of majority of Indians. This research examines a variety of feature extraction approaches, which have been used in various ICR/OCR applications, in context to Devanagari hand-printed script. The study is conducted theoretically and experimentally on more that 10 feature extraction methods. The various feature extraction methods have been evaluated on Devanagari hand-printed database comprising more than 25000 characters belonging to 43 alphabets. The recognition ability of the features have been evaluated using three classifiers i.e. k-NN, MLP and SVM.

Authors:



References:
[1] V. K. Govindan and A. P. Shivaprasad, "Character Recognition - A
Review", Pattern Recognition, Vol. 23, No. 7 , 1990.
[2] R. Plamondon and S. N. Srihari, "On-line and Off-line Handwriting
Recognition: a Comprehensive Survey", IEEE Transactions on Pattern
Analysis and Machine Intelligence, Vol. 22, No.1,pp. 63-84, 2000.
[3] A. L. Koerich, R. Sabourin and C.Y. Suen, "Large Vocabulary Off-line
Handwriting Recognition: a Survey", Pattern Analysis Applications,
Vol. 6, pp. 97-121, 2003.
[4] N. Arica and T. Yarman-Vural, "An Overview of Character Recognition
Focused on Off-line Handwriting", IEEE Transactions on Systems, Man,
and Cybernetics-Part C: Applications and Reviews, Vol. 31, No. 2,
2001.
[5] Satish Kumar, "The Headline Removal Algorithm and its Effect on
Recognition of Devanagari Handwritten Characters", International
Journal of Systemics, Cybernetics and Informatics, April 2009.
[6] U. Pal, B.B. Chaudhuri, "Indian script character recognition: a survey",
Pattern Recognition, 37, pp. 1887-1899, 2004.
[7] R. Kirsch, "Computer Determination of the Constituent Structure of
Biomedical Images," Computers and Biomedical Research, Vol 4, pp.
315-328, 1971.
[8] J. Birk, R. Kelley, N. Chen and L. Wilson, "Image Feature Extraction
using Diameter-Limited Gradient Direction Histograms", IEEE
Transactions on Pattern Analysis and Machine Intelligence, Vol. 1, pp.
228-235, 1979.
[9] W. K. Pratt, "Digital Image Processing", Third Edition, Wiley, New
York , 2001.
[10] M. Bosker, "Omnidocument Technologies", Proceedings of the IEEE,
Vol. 80, No. 7 , 1992.
[11] J. Cao, M. Ahmadi and M. Shridhar, Recognition of Handwritten
Numerals with Multiple Feature and Multistage Classifier, Pattern
Recognition, Vol. 28, No. 2, pp. 153-160, 1995.
[12] K. M. Kim, J.J. Park, Y.G. Song, I. C. Kim and C. Y. Suen,
"Recognition of Handwritten Numerals Using a Combined Classifier
with Hybrid Features", SSPR & SPR, LNCS 3138, pp. 992-1000, 2004.
[13] N. Arica and F. T. Yarman-Vural, "Optical Character Recognition for
Cursive Handwriting", IEEE Transactions on Pattern Analysis and
Machine Intelligence, Vol. 24, No. 6, 2002.
[14] M. H. Glauberman, "Character Recognition for Business Machines",
Electronics, pp. 132-136, 1956.
[15] O. D. Trier, A. K. Jain and T. Taxt, "Feature Extraction Method for
Character Recognition - a Survey", Pattern Recognition, Vol. 29, No. 4,
pp. 641-662, 1996.
[16] M. Shridhar and A. Badreldin, "Recognition of Isolated and Connected
Handwritten Numerals", Proceedings of the IEEE International
Conference on Systems, Man and Cybernetics, pp. 142-146, 1984.
[17] L. Heutte, T. Paquet, J. Moreau, Y. Lecourtier and C. Olivier, "A
Sturctural / Statistical Feature Based Vector for Handwritten Character
Recognition", Pattern Recognition Letter, Vol. 19, pp. 629-641, 1998.
[18] C.- L. Liu, K. Nakashima, H. Sako and H. Fujisawa, "Handwritten Digit
Recognition: Benchmarking of State-of-the-Art", Pattern Recognition,
No. 36, pp. 2271-2285 , 2003.
[19] L. Koerich, "Large Vocabulary Off-line Handwritten Word
Recognition", Ph. D. Thesis, ├ëcole de Technologie Supérieure,
Montreal-Canada , 2004.
[20] R. M. Bozinovic and S. N. Srihari, Offline Cursive Script Word
Recognition, IEEE Transactions on Pattern Analysis and Machine
Intelligence, Vol. 11, No. 1, pp. 68-83 , 1989.
[21] A. D. S. Britto Jr., R. Sabourin, E. Lethelier, F. Bortolozzi and C. Y.
Suen, "Improvement in Handwritten Numeral String Recognition by
Slant Normalization and Contextual Information", Proceedings of the
Seventh International Workshop on Frontiers of Handwriting
Recognition, Amsterdam-Netherlands, pp. 323-332 , 2000.
[22] E. Kavallieratou, N. Fakotakis and G. Kokkinakis, "Slant Estimation
Algorithm for OCR Systems", Pattern Recognition, Vol. 34, pp. 2515-
2522, 2001.
[23] D. Guillevic and C. Y. Suen, "Cursive Script Recognition: A Sentence
Level Recognition Scheme", Proceedings of International Workshop on
Frontiers of Handwriting Recognition, pp. 216-223 , 1994.
[24] G. Nicchiotti and C. Scagliola, "Generalized Projections: A Tool for
Cursive Handwriting Normalization", Proceedings of the Fifth
International Conference on Document Analysis and Recognition,
Bangalore, India, pp. 729-732 , 1999.
[25] S. Madhvanath, G. Kim and V. Govindaraju, "Chaincode Contour
Processing for Handwritten Word Recognition", IEEE Transactions on
Pattern Analysis and Machine Intelligence, Vol. 21, No. 9, pp. 928-932
, 1999.
[26] F. Kimura, Y. Miyake, and M. Shridhar, "Handwritten ZIP Code
Recognition using Lexicon Free Word Recognition Algorithm",
International Conference on Document Analysis and Recognition,
Montreal, Que., Canada, pp. 906-910 , 1995.
[27] Y. Wen, Y. Lu and P. Shi, "Handwritten Bangla Numeral Recognition
System and its Application to Postal Automation", Pattern Recognition,
Vol. 40, pp. 99-107, 2007.
[28] S. Knerr, L. Personnaz and G. Dreyfus, "Handwritten Digit Recognition
by Neural Networks with Single-Layer Training", IEEE Transactions on
Neural Networks, Vol. 3, No. 6, pp. 962-968, 1992.
[29] S.-B. Cho, "Neural-Network Classifiers for Recognizing Totally
Unconstrained Handwritten Numerals", IEEE Transactions on Neural
Networks, Vol. 4, No. 1, pp. 43-53, 1997.
[30] L. S. Davis, "Survey of Edge Detection Techniques", Computer
Graphics and Image Processing, Vol. 4, pp. 248-270, 1975.
[31] G. Srikantan, S. W. Lam and S.N. Srihari, "Gradient Based Contour
Encoding for Character Recognition", Pattern Recognition, Vol. 29, No.
7, pp. 1147-1160, 1996.
[32] C.- L. Liu, K. Nakashima, H. Sako and H. Fujisawa, "Handwritten Digit
Recognition: Benchmarking of State-of-the-Art", Pattern Recognition,
No. 36, pp. 2271-2285 , 2003.
[33] H. Liu and X. Ding, "Handwritten Character Recognition using Gradient
Feature and Quadratic Classifier with Multiple Discrimination
Schemes", Proceedings of the Eighth International Conference on
Document Analysis and Recognition, pp. 19-25, 2005.
[34] H. Fujisawa and C.-L. Liu, "Directional Pattern Matching for Character
Recognition Revisited", Proceedings of the Seventh International
Conference on Document Analysis and Recognition, pp. 794-798, 2003.
[35] A. Kawamura, et al, "On-line Recognition of Freely Handwritten
Japanese Characters using Directional Feature Densities", Proceedings
of the Eleventh International Conference on Pattern Recognition, Vol. II,
pp. 183-186, 1992.
[36] G. Borgefors, "Distance Transformations in Digital Images", Computer
Vision, Graphics and Image Processing, Vol. 34, pp. 344-371, 1986.
[37] A. Negi, C. Bhagvati and B. Krishna, "An OCR System for Telugu",
Proceedings of the Sixth International Conference on Document
Processing, pp. 1110-1114, 2001.
[38] S. J. Smith, M. O. Bourgoin, K. Sims and H.L. Voorhees, "Handwritten
Character Classification using Nearest Neighbor in Large Database",
IEEE Transactions on Pattern Analysis and Machine Intelligence,
Vol.16, No. 9, 915-919, 1994.
[39] Zs. M. Kovics and R. Guerrieri, "Massively-Parallel Handwritten
Character Recognition Based on the Distance Transform", Pattern
Recognition, Vol. 28, No. 3, pp. 293-301, 1995.
[40] II-S. Oh, C.Y. Suen, "Distance Features for Neural Network-based
Recognition of Handwritten Characters", International Journal on
Document Analysis and Recognition, Vol.1, pp. 73-88, 1998.
[41] H. Freeman, "Computer Processing of Line Drawings", Computing
Surveys, Vol. 6, pp. 57-97, 1974.
[42] R. C. Gonzalez and R. E. Woods, "Digital Image Processing", 2nd Ed.,
Pearson Education , 2002.
[43] F. Kimura and M. Shridhar, Handwritten Numeral Recognition Based on
Multiple Algorithms, Pattern Recognition, Vol. 24, No. 10, pp. 969-
983,1991.
[44] A. Rosenfeld and J. L. Pfaltz, "Distance Functions on Digital Pictures",
Pattern Recognition, Vol. 1, No. 1, pp. 33-61, 1968.
[45] C.-L. Liu, K. Nakashima, H. Sako and H. Fujisawa, "Handwritten Digit
Recognition: Investigation of Normalization and Feature Extraction
Techniques", Pattern Recognition, Vol. 37, pp. 265-279, 2004.
[46] N. Arica and F. T. Yarman-Vural, "Optical Character Recognition for
Cursive Handwriting", IEEE Transactions on Pattern Analysis and
Machine Intelligence, Vol. 24, No. 6, 2002.
[47] Y. Le Cun, O. Mattan, B. Boser, J.S. Denker et al, "Handwritten Zip
Code Recognition with Multilayer Networks", Proceedings of
International Conference on Pattern Recognition, Atlantic City, USA,
Vol. 2, pp. 35-40, 1990.
[48] P. S. Deshpande, L. Malik and S. Arora, Journal of Computers, Vol. 3,
No. 5, pp. 11-17, May 2008.
[49] S. K. Parui and B. Shaw, "Off-line Devanagari Handwritten Word
Recognition: An HMM based approach", Proc. PReMI-2007(Springer),
LNCS-4815, pp. 528-535, Dec. 2007.
[50] B. Shaw, S. K. Parui and M. Shridhar, "Off-line Handwritten
Devanagari Word Recognition: A Segmentation Based Approach",
IEEE ,2008.
[51] Satish Kumar, "Performance Comparison of Features on Devanagari
Hand-printed Dataset", International Journal of Recent Trends in
Engineering, Vol. 1, No. 2, pp. 33-37, May 2009.
[52] Satish Kumar, "Neighborhood Pixels Weights-A New Feature
Extractor", International Journal of Computer Theory and Engineering,
Vol. 1, No. 6, pp. 69-77, Feb 2010.
[53] Satish Kumar, " A Study of Discrimination Ability of Features on Handprinted
Characters in Context to Noisy and Slanted Patterns" , Punjab
Institute of Management and Technology(PIMT) Journal of Research,
Vol. 2, No. 1, pp. 66-72, March 2009.
[54] Satish Kumar, "Devanagari Hand-printed Character Recognition using
Multiple Features and Multi-stage Classifier", International Journal of
Computer Information Systems and Industrial Management Applications
(IJCISIM), Vol. 2, pp.039-055, 2010.