Word Base Line Detection in Handwritten Text Recognition Systems

An approach is offered for more precise definition of base lines- borders in handwritten cursive text and general problems of handwritten text segmentation have also been analyzed. An offered method tries to solve problems arose in handwritten recognition with specific slant or in other words, where the letters of the words are not on the same vertical line. As an informative features, some recognition systems use ascending and descending parts of the letters, found after the word-s baseline detection. In such recognition systems, problems in baseline detection, impacts the quality of the recognition and decreases the rate of the recognition. Despite other methods, here borders are found by small pieces containing segmentation elements and defined as a set of linear functions. In this method, separate borders for top and bottom border lines are found. At the end of the paper, as a result, azerbaijani cursive handwritten texts written in Latin alphabet by different authors has been analyzed.





References:
[1] C.Faure and E.Lecolinet. OCR: Handwriting. In R.A.Cole et al, editor,
"Survey of the State of the Artin Human Language Technology", Center
for Spoken Language Understanding, Oregon Graduate Institute, pp 86-
89, 1995.
[2] W. Senior. "Off-line handwritten recognition: A review and experiments".
Technical Report CUED/F-INFENG/TR105, Cambridge University Engineering
Department, Dec. 1992.
[3] C.Higgins and P.Bramall. "A non-line cursive script recognition system".
Handwriting and Drawing Research - Basic and Applied Issues, IOS
Press, pp. 285-298, 1996.
[4] R. K. Powalka, N. Sherkat, L. J. Evett and R. J. Whitrow. "Dynamic cursive
script recognition: A hybrid approach". In Advances in Handwriting
and Drawing: A multidisciplinary approach, 1994.
[5] S. Wesolkowski. "Cursive script recognition: A survey". Handwriting and
Drawing Research -Basic and Advanced Issues, IOS Press, pp. 267-284,
1996.
[6] R.G. Casey and E. Lecolinet. "Strategies in character segmentation: A
survey". Proc. of the 3rd International Conference on Document Analysis
and Recognition, Montreal, Canada, pp. 1028-1033, 1995.
[7] K.R. Ayda-zad . C.Z. H s nov. Latin lifbali lyazmalari tanima sisteml
rind sozl rin seqmentl nm si ucun usul. AMEA XEBERLERI. F.-t.
v r. elml ri seriyasi. XXVI cild. 3. 2006.
[8] K.R.Aida-zade, J.Z. Hasanov. "Handwritten recognition system for azerbaijani
latin text". Proc. of PCI 2008 International conference, pp. xx-yy,
2008.
[9] S. Srihari and R.Bozinovic. "Multi-level perceptron approach to reading
cursive script". Artificial Intelligence, vol. 33, pp. 217-255, 1987.
[10] A.W. Senior. "Off-line cursive handwritting recognition using recurrent
neural networks", Trinity Hall, Cambridge, England, 1994.
[11] B. Yanikoglu and P. A. Sandon. "Segmentation of off-line cursive
handwriting using linear programming". Pattern Recognition, Vol. 31,
No. 12, pp. 1825-1833, 1998.