Binarization of Text Region based on Fuzzy Clustering and Histogram Distribution in Signboards
In this paper, we present a novel approach to accurately
detect text regions including shop name in signboard images with
complex background for mobile system applications. The proposed
method is based on the combination of text detection using edge
profile and region segmentation using fuzzy c-means method. In the
first step, we perform an elaborate canny edge operator to extract all
possible object edges. Then, edge profile analysis with vertical and
horizontal direction is performed on these edge pixels to detect
potential text region existing shop name in a signboard. The edge
profile and geometrical characteristics of each object contour are
carefully examined to construct candidate text regions and classify the
main text region from background. Finally, the fuzzy c-means
algorithm is performed to segment and detected binarize text region.
Experimental results show that our proposed method is robust in text
detection with respect to different character size and color and can
provide reliable text binarization result.
[1] G. Nagy, "Twenty years of document image analysis" IEEE Transactions
on Pattern Analysis and Machine Intelligence, vol. 22, no. 1, pp. 38-62,
2000.
[2] H. Peng, F. Long, Z. Chi, "Document image recognition based on
template matching of component block projections," IEEE Transactions
on Pattern Analysis and Machine Intelligence, vol. 25, no. 9, pp.
1188-1192, 2003.
[3] I.S. Oh, C.Y. Suen, "Distance features for neural network-based
recognition of handwritten characters," International Journal on
Document Analysis and Recognition, vol. 1, pp. 73-88, 1998.
[4] H. Li, D. Doermann, O. Kia, "Automatic text detection and tracking in
digital videos," IEEE Transactions on Image Processing, vol. 9, no. 1, pp.
147-156, 2000.
[5] J. Xi, X. Hua, L. Wenyin, H.J. Zhang, "A video text detection and
recognition system," International Conference on Multimedia and Expo,
pp. 873-876, 2001
[6] A.K. Jain, B. Yu, "Automatic text location in images and video frames,"
Pattern Recognition, vol. 31, no. 12, pp. 2055-2076, 1998.
[7] N. Ezaki, K. Kiyota, B.T. Minh, M. Bulacu, L. Schomaker, "Improved
text-detection methods for a camera-based text reading system for blind
persons," International Conference on Document Analysis and
Recognition, pp. 257-261, 2005.
[8] X. Chen, J. Yang, J. Zhang, A. Waibel, "Automatic detection and
recognition of signs from natural scenes," IEEE Transactions on Image
Processing, vol. 13, no. 1, pp. 87-99, 2004.
[9] K. Jung, K.I. Kim, A.K. Jain, "Text information extraction in images and
video: a survey," Pattern Recognition, vol. 37, pp. 977-997, 2004.
[10] J. Gao, Y. Yang, Y. Zhang, A. Waibel, "Text detection and translation from
natural scenes," Tech. Report CMU-CS-01-139, 2001.
[11] Q. Ye, J. Jiao, J. Huang, H. Yu, "Text detection and restoration in natural
scene images," Journal of Visual Communication and Image
Representation, vol. 18, pp. 504-513, 2007.
[12] G. Obinata, A. Dutta, Vision Systems: Segmentation and Pattern
Recognition, I-Tech, pp. 307-332, 2007.
[13] J. Gllavata, R. Ewerth, B. Freisleben, "A robust algorithm for text
detection in images," International Symposium on Image and Signal
Processing, vol. 2, pp. 611-616, 2003.
[14] W. Wu, X. Chen, J. Yang, "Detection of text on road signs from video,"
IEEE transaction Intelligent Transportation Systems, vol. 6, no. 4, pp.
378-390, 2005.
[15] D. Chen, H. Bourlard, J.P. Thiran, "Text identification in complex
background using SVM," IEEE Computer Society Conference on
Computer Vision and Pattern Recognition, vol. 2, pp. 621-626, 2001.
[16] Y. Zhong, K. Karu, A.K. Jain, "Locating text in complex color images,"
pattern Recognition, vol. 28, no. 10, pp. 1523-1536, 1995.
[17] A.K. Jain, B. Yu, "Automatic text location in image and video frames,"
International Conference on Pattern Recognition, vol. 2, pp. 1497-1499,
1998.
[18] R. Lienhart, A.Wernicke, "Localizing and segmenting text in images and
videos," IEEE Transactions on Circuits System, vol. 12, no. 4, pp.
256-268, 2002.
[19] Y. Zhong, H. Zhang, A.K. Jain, "Automatic caption localization in
compressed video," IEEE Transactions on Pattern Analysis and Machine
Intelligence, vol. 22, no. 4, pp. 385-392, April 2000.
[20] X. Tang, X. Gao, J. Liu, H. Zhang, "A spatial-temporal approach for video
caption detection and recognition," IEEE Transactions on Neural
Network, vol. 13, no. 4, pp. 961-971, 2002.
[21] K. In Kim, K. Jung, J. Hyung, "Texture-based approach for text detection
in images using support vector machines and continuously adaptive mean
shift algorithm," IEEE Transactions on Pattern Analysis and Machine
Intelligence, vol. 25, no. 12, pp. 1631-1639, 2003.
[22] M. Fujii, W.J.R. Hoefer, "Filed-singularity correction in 2-D time-domain
Haar-wavelet Modeling of waveguide components," IEEE Transactions
on Microwave Theory and Techniques, vol. 49, no. 4, pp. 685-691, 2001.
[23] Q. Ye, W. Gao, W. Wang, W. Zeng, "A robust text detection algorithm in
images and video frames," International Conference on Information,
Communications and Signal Processing, vol. 2, pp. 802-806, 2003.
[24] Y. Zheng, H. Li, D. Doermann, "Machine printed text and handwriting
identification in noisy document images," IEEE Transactions on Pattern
Analysis and Machine Intelligence, vol. 26, no. 3, pp. 337-353, 2004.
[25] S. Belongie, J. Malik, J. Puzicha, "Matching shapes," International
Conference on Computer Vision, vol. 1, pp. 454-461, 2001.
[26] M. Sonka, V. Hlavac, R. Boyle, Image Processing, Analysis and Machine
Vision, Thomson Asia Pte Led.
[27] R.L. Cannon, J.V. Dave, J.C. Bezdek, "Efficient implementation of the
fuzzy c-means clustering algorithm," IEEE Transactions on Pattern
Analysis and Machine Intelligence, vol. 8. No. 2, pp. 248-255, 1986.
[28] Y. Liu, S. Goto, T. Ikenaga, "A contour-based robust algorithm for text
detection in color images," IEICE Transaction Information and System,
vol. E89-D, no. 3, pp. 1221-1230, 2006.
[1] G. Nagy, "Twenty years of document image analysis" IEEE Transactions
on Pattern Analysis and Machine Intelligence, vol. 22, no. 1, pp. 38-62,
2000.
[2] H. Peng, F. Long, Z. Chi, "Document image recognition based on
template matching of component block projections," IEEE Transactions
on Pattern Analysis and Machine Intelligence, vol. 25, no. 9, pp.
1188-1192, 2003.
[3] I.S. Oh, C.Y. Suen, "Distance features for neural network-based
recognition of handwritten characters," International Journal on
Document Analysis and Recognition, vol. 1, pp. 73-88, 1998.
[4] H. Li, D. Doermann, O. Kia, "Automatic text detection and tracking in
digital videos," IEEE Transactions on Image Processing, vol. 9, no. 1, pp.
147-156, 2000.
[5] J. Xi, X. Hua, L. Wenyin, H.J. Zhang, "A video text detection and
recognition system," International Conference on Multimedia and Expo,
pp. 873-876, 2001
[6] A.K. Jain, B. Yu, "Automatic text location in images and video frames,"
Pattern Recognition, vol. 31, no. 12, pp. 2055-2076, 1998.
[7] N. Ezaki, K. Kiyota, B.T. Minh, M. Bulacu, L. Schomaker, "Improved
text-detection methods for a camera-based text reading system for blind
persons," International Conference on Document Analysis and
Recognition, pp. 257-261, 2005.
[8] X. Chen, J. Yang, J. Zhang, A. Waibel, "Automatic detection and
recognition of signs from natural scenes," IEEE Transactions on Image
Processing, vol. 13, no. 1, pp. 87-99, 2004.
[9] K. Jung, K.I. Kim, A.K. Jain, "Text information extraction in images and
video: a survey," Pattern Recognition, vol. 37, pp. 977-997, 2004.
[10] J. Gao, Y. Yang, Y. Zhang, A. Waibel, "Text detection and translation from
natural scenes," Tech. Report CMU-CS-01-139, 2001.
[11] Q. Ye, J. Jiao, J. Huang, H. Yu, "Text detection and restoration in natural
scene images," Journal of Visual Communication and Image
Representation, vol. 18, pp. 504-513, 2007.
[12] G. Obinata, A. Dutta, Vision Systems: Segmentation and Pattern
Recognition, I-Tech, pp. 307-332, 2007.
[13] J. Gllavata, R. Ewerth, B. Freisleben, "A robust algorithm for text
detection in images," International Symposium on Image and Signal
Processing, vol. 2, pp. 611-616, 2003.
[14] W. Wu, X. Chen, J. Yang, "Detection of text on road signs from video,"
IEEE transaction Intelligent Transportation Systems, vol. 6, no. 4, pp.
378-390, 2005.
[15] D. Chen, H. Bourlard, J.P. Thiran, "Text identification in complex
background using SVM," IEEE Computer Society Conference on
Computer Vision and Pattern Recognition, vol. 2, pp. 621-626, 2001.
[16] Y. Zhong, K. Karu, A.K. Jain, "Locating text in complex color images,"
pattern Recognition, vol. 28, no. 10, pp. 1523-1536, 1995.
[17] A.K. Jain, B. Yu, "Automatic text location in image and video frames,"
International Conference on Pattern Recognition, vol. 2, pp. 1497-1499,
1998.
[18] R. Lienhart, A.Wernicke, "Localizing and segmenting text in images and
videos," IEEE Transactions on Circuits System, vol. 12, no. 4, pp.
256-268, 2002.
[19] Y. Zhong, H. Zhang, A.K. Jain, "Automatic caption localization in
compressed video," IEEE Transactions on Pattern Analysis and Machine
Intelligence, vol. 22, no. 4, pp. 385-392, April 2000.
[20] X. Tang, X. Gao, J. Liu, H. Zhang, "A spatial-temporal approach for video
caption detection and recognition," IEEE Transactions on Neural
Network, vol. 13, no. 4, pp. 961-971, 2002.
[21] K. In Kim, K. Jung, J. Hyung, "Texture-based approach for text detection
in images using support vector machines and continuously adaptive mean
shift algorithm," IEEE Transactions on Pattern Analysis and Machine
Intelligence, vol. 25, no. 12, pp. 1631-1639, 2003.
[22] M. Fujii, W.J.R. Hoefer, "Filed-singularity correction in 2-D time-domain
Haar-wavelet Modeling of waveguide components," IEEE Transactions
on Microwave Theory and Techniques, vol. 49, no. 4, pp. 685-691, 2001.
[23] Q. Ye, W. Gao, W. Wang, W. Zeng, "A robust text detection algorithm in
images and video frames," International Conference on Information,
Communications and Signal Processing, vol. 2, pp. 802-806, 2003.
[24] Y. Zheng, H. Li, D. Doermann, "Machine printed text and handwriting
identification in noisy document images," IEEE Transactions on Pattern
Analysis and Machine Intelligence, vol. 26, no. 3, pp. 337-353, 2004.
[25] S. Belongie, J. Malik, J. Puzicha, "Matching shapes," International
Conference on Computer Vision, vol. 1, pp. 454-461, 2001.
[26] M. Sonka, V. Hlavac, R. Boyle, Image Processing, Analysis and Machine
Vision, Thomson Asia Pte Led.
[27] R.L. Cannon, J.V. Dave, J.C. Bezdek, "Efficient implementation of the
fuzzy c-means clustering algorithm," IEEE Transactions on Pattern
Analysis and Machine Intelligence, vol. 8. No. 2, pp. 248-255, 1986.
[28] Y. Liu, S. Goto, T. Ikenaga, "A contour-based robust algorithm for text
detection in color images," IEICE Transaction Information and System,
vol. E89-D, no. 3, pp. 1221-1230, 2006.
@article{"International Journal of Information, Control and Computer Sciences:60985", author = "Jonghyun Park and Toan Nguyen Dinh and Gueesang Lee", title = "Binarization of Text Region based on Fuzzy Clustering and Histogram Distribution in Signboards", abstract = "In this paper, we present a novel approach to accurately
detect text regions including shop name in signboard images with
complex background for mobile system applications. The proposed
method is based on the combination of text detection using edge
profile and region segmentation using fuzzy c-means method. In the
first step, we perform an elaborate canny edge operator to extract all
possible object edges. Then, edge profile analysis with vertical and
horizontal direction is performed on these edge pixels to detect
potential text region existing shop name in a signboard. The edge
profile and geometrical characteristics of each object contour are
carefully examined to construct candidate text regions and classify the
main text region from background. Finally, the fuzzy c-means
algorithm is performed to segment and detected binarize text region.
Experimental results show that our proposed method is robust in text
detection with respect to different character size and color and can
provide reliable text binarization result.", keywords = "Text detection, edge profile, signboard image, fuzzy
clustering.", volume = "2", number = "7", pages = "2510-6", }