Binarization of Text Region based on Fuzzy Clustering and Histogram Distribution in Signboards

In this paper, we present a novel approach to accurately detect text regions including shop name in signboard images with complex background for mobile system applications. The proposed method is based on the combination of text detection using edge profile and region segmentation using fuzzy c-means method. In the first step, we perform an elaborate canny edge operator to extract all possible object edges. Then, edge profile analysis with vertical and horizontal direction is performed on these edge pixels to detect potential text region existing shop name in a signboard. The edge profile and geometrical characteristics of each object contour are carefully examined to construct candidate text regions and classify the main text region from background. Finally, the fuzzy c-means algorithm is performed to segment and detected binarize text region. Experimental results show that our proposed method is robust in text detection with respect to different character size and color and can provide reliable text binarization result.




References:
[1] G. Nagy, "Twenty years of document image analysis" IEEE Transactions
on Pattern Analysis and Machine Intelligence, vol. 22, no. 1, pp. 38-62,
2000.
[2] H. Peng, F. Long, Z. Chi, "Document image recognition based on
template matching of component block projections," IEEE Transactions
on Pattern Analysis and Machine Intelligence, vol. 25, no. 9, pp.
1188-1192, 2003.
[3] I.S. Oh, C.Y. Suen, "Distance features for neural network-based
recognition of handwritten characters," International Journal on
Document Analysis and Recognition, vol. 1, pp. 73-88, 1998.
[4] H. Li, D. Doermann, O. Kia, "Automatic text detection and tracking in
digital videos," IEEE Transactions on Image Processing, vol. 9, no. 1, pp.
147-156, 2000.
[5] J. Xi, X. Hua, L. Wenyin, H.J. Zhang, "A video text detection and
recognition system," International Conference on Multimedia and Expo,
pp. 873-876, 2001
[6] A.K. Jain, B. Yu, "Automatic text location in images and video frames,"
Pattern Recognition, vol. 31, no. 12, pp. 2055-2076, 1998.
[7] N. Ezaki, K. Kiyota, B.T. Minh, M. Bulacu, L. Schomaker, "Improved
text-detection methods for a camera-based text reading system for blind
persons," International Conference on Document Analysis and
Recognition, pp. 257-261, 2005.
[8] X. Chen, J. Yang, J. Zhang, A. Waibel, "Automatic detection and
recognition of signs from natural scenes," IEEE Transactions on Image
Processing, vol. 13, no. 1, pp. 87-99, 2004.
[9] K. Jung, K.I. Kim, A.K. Jain, "Text information extraction in images and
video: a survey," Pattern Recognition, vol. 37, pp. 977-997, 2004.
[10] J. Gao, Y. Yang, Y. Zhang, A. Waibel, "Text detection and translation from
natural scenes," Tech. Report CMU-CS-01-139, 2001.
[11] Q. Ye, J. Jiao, J. Huang, H. Yu, "Text detection and restoration in natural
scene images," Journal of Visual Communication and Image
Representation, vol. 18, pp. 504-513, 2007.
[12] G. Obinata, A. Dutta, Vision Systems: Segmentation and Pattern
Recognition, I-Tech, pp. 307-332, 2007.
[13] J. Gllavata, R. Ewerth, B. Freisleben, "A robust algorithm for text
detection in images," International Symposium on Image and Signal
Processing, vol. 2, pp. 611-616, 2003.
[14] W. Wu, X. Chen, J. Yang, "Detection of text on road signs from video,"
IEEE transaction Intelligent Transportation Systems, vol. 6, no. 4, pp.
378-390, 2005.
[15] D. Chen, H. Bourlard, J.P. Thiran, "Text identification in complex
background using SVM," IEEE Computer Society Conference on
Computer Vision and Pattern Recognition, vol. 2, pp. 621-626, 2001.
[16] Y. Zhong, K. Karu, A.K. Jain, "Locating text in complex color images,"
pattern Recognition, vol. 28, no. 10, pp. 1523-1536, 1995.
[17] A.K. Jain, B. Yu, "Automatic text location in image and video frames,"
International Conference on Pattern Recognition, vol. 2, pp. 1497-1499,
1998.
[18] R. Lienhart, A.Wernicke, "Localizing and segmenting text in images and
videos," IEEE Transactions on Circuits System, vol. 12, no. 4, pp.
256-268, 2002.
[19] Y. Zhong, H. Zhang, A.K. Jain, "Automatic caption localization in
compressed video," IEEE Transactions on Pattern Analysis and Machine
Intelligence, vol. 22, no. 4, pp. 385-392, April 2000.
[20] X. Tang, X. Gao, J. Liu, H. Zhang, "A spatial-temporal approach for video
caption detection and recognition," IEEE Transactions on Neural
Network, vol. 13, no. 4, pp. 961-971, 2002.
[21] K. In Kim, K. Jung, J. Hyung, "Texture-based approach for text detection
in images using support vector machines and continuously adaptive mean
shift algorithm," IEEE Transactions on Pattern Analysis and Machine
Intelligence, vol. 25, no. 12, pp. 1631-1639, 2003.
[22] M. Fujii, W.J.R. Hoefer, "Filed-singularity correction in 2-D time-domain
Haar-wavelet Modeling of waveguide components," IEEE Transactions
on Microwave Theory and Techniques, vol. 49, no. 4, pp. 685-691, 2001.
[23] Q. Ye, W. Gao, W. Wang, W. Zeng, "A robust text detection algorithm in
images and video frames," International Conference on Information,
Communications and Signal Processing, vol. 2, pp. 802-806, 2003.
[24] Y. Zheng, H. Li, D. Doermann, "Machine printed text and handwriting
identification in noisy document images," IEEE Transactions on Pattern
Analysis and Machine Intelligence, vol. 26, no. 3, pp. 337-353, 2004.
[25] S. Belongie, J. Malik, J. Puzicha, "Matching shapes," International
Conference on Computer Vision, vol. 1, pp. 454-461, 2001.
[26] M. Sonka, V. Hlavac, R. Boyle, Image Processing, Analysis and Machine
Vision, Thomson Asia Pte Led.
[27] R.L. Cannon, J.V. Dave, J.C. Bezdek, "Efficient implementation of the
fuzzy c-means clustering algorithm," IEEE Transactions on Pattern
Analysis and Machine Intelligence, vol. 8. No. 2, pp. 248-255, 1986.
[28] Y. Liu, S. Goto, T. Ikenaga, "A contour-based robust algorithm for text
detection in color images," IEICE Transaction Information and System,
vol. E89-D, no. 3, pp. 1221-1230, 2006.