Application of Genetic Algorithms to Feature Subset Selection in a Farsi OCR

Dealing with hundreds of features in character recognition systems is not unusual. This large number of features leads to the increase of computational workload of recognition process. There have been many methods which try to remove unnecessary or redundant features and reduce feature dimensionality. Besides because of the characteristics of Farsi scripts, it-s not possible to apply other languages algorithms to Farsi directly. In this paper some methods for feature subset selection using genetic algorithms are applied on a Farsi optical character recognition (OCR) system. Experimental results show that application of genetic algorithms (GA) to feature subset selection in a Farsi OCR results in lower computational complexity and enhanced recognition rate.

Authors:



References:
[1] Oliveira, L. S., Benahmed, N., Sabourin, R., Bortolozzi, F., Suen, C. Y.,
"Feature Subset Selection Using Genetic Algorithms for Handwritten
Digit Recognition" Proc. XIV Brazilian Symposium on Computer
Graphics and Image Processing (SIBGRAPI-01), P.362, 2001.
[2] Yang, J., Honavar, V., "Feature Subset Selection Using a Genetic
Algorithm," Proc. IEEE Intelligent Systems, vol. 13, no. 2, pp. 44-
49, 1998.
[3] Sarfraz, M., Nawaz, S., N., Al-Khuraidly A., "Offline Arabic Text
Recognition System" Proc. 2003 International Conference on Geometric
Modeling and Graphics (GMAG'03), 2003.
[4] Deb, K., "Genetic Algorithm in Search and Optimization: the Technique
and Applications" Proc. International Workshop on Soft Computing and
Intelligent Systems, pp. 58-87, Calcutta, India, 1998.
[5] Kudo M, Sklansky J. , "Comparison of Algorithms that Select Features
for Pattern Classifiers" Pattern Recognition, Vol.33, pp.25-41, 2000.
[6] Kim, G., Kim, S., "Feature Selection Using Genetic Algorithms for
Handwritten Character Recognition" Proc. Seventh International
Workshop on Frontiers in Handwritten Recognition, Amsterdam, 2000.
[7] Sural, S., Das, P. K., "A Genetic Algorithm for Feature Selection in a
Neuro-Fuzzy OCR System" Proc. Sixth International Conference on
Document Analysis and Recognition (ICDAR-01), P.0987, 2001.
[8] Morita, M., Sabourin, R., Bortolozzi, F., Suen, C. Y., "Unsupervised
Feature Selection Using Multi-Objective Genetic Algorithms for
Handwritten Word Recognition " Proc. Seventh International
Conference on Document Analysis and Recognition (ICDAR-03), Vol.2,
P.666, 2003.
[9] Shi, D., Shu, W., Liu, H., "Feature Selection for Handwritten Chinese
Character Recognition Based on Genetic Algorithms" Proc. IEEE Int.
Conference on Systems, Man, and Cybernetics, vol. 5, pp. 4201-6, 1998.
[10] Ebrahimi, A., Kabir, E., "A Two Step Method for the Recognition of
Printed Subwords", Iranian Journal of Electrical and Computer
Engineering, Vol.2, No.2, pp.57-62, 2005 (in Farsi).