A Comparison of Fuzzy Clustering Algorithms to Cluster Web Messages

This paper proposes an approach for clustering web messages. Texts
written by the same web user are assigned, with a certain probability,
to the same cluster on the basis of stylometric features and fuzzy
clustering algorithms. The present work focuses on comparing the most
popular algorithms in fuzzy clustering theory, namely Fuzzy C-Means,
Possibilistic C-Means, and Fuzzy Possibilistic C-Means.
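
As an illustration of the first of these algorithms, the following is a minimal sketch of textbook Fuzzy C-Means in plain Python. It is not the implementation evaluated in this paper: the function name `fuzzy_c_means`, the fuzzifier `m = 2`, the convergence tolerance, and the two-dimensional feature vectors (average word length and punctuation rate) are assumptions chosen only to keep the example self-contained.

```python
import math
import random


def fuzzy_c_means(points, n_clusters, m=2.0, max_iter=100, tol=1e-6, seed=0):
    """Textbook Fuzzy C-Means on a list of equal-length feature vectors.

    Returns (centers, memberships), where memberships[i][k] is the degree
    to which points[i] belongs to cluster k; each row sums to 1.
    """
    rng = random.Random(seed)
    dim = len(points[0])

    # Random initial memberships, normalised so each row sums to 1.
    u = []
    for _ in points:
        row = [rng.random() + 1e-9 for _ in range(n_clusters)]
        total = sum(row)
        u.append([v / total for v in row])

    centers = []
    for _ in range(max_iter):
        # Center update: mean of the points weighted by u_ik ** m.
        centers = []
        for k in range(n_clusters):
            weights = [u[i][k] ** m for i in range(len(points))]
            total = sum(weights)
            centers.append([
                sum(w * p[d] for w, p in zip(weights, points)) / total
                for d in range(dim)
            ])

        # Membership update: u_ik = 1 / sum_j (d_ik / d_ij) ** (2 / (m - 1)).
        exponent = 2.0 / (m - 1.0)
        new_u = []
        for p in points:
            dists = [math.dist(p, c) for c in centers]
            if any(d == 0.0 for d in dists):
                # The point sits exactly on a center: share membership
                # among the coinciding centers only.
                zeros = [1.0 if d == 0.0 else 0.0 for d in dists]
                new_u.append([z / sum(zeros) for z in zeros])
            else:
                new_u.append([
                    1.0 / sum((d_k / d_j) ** exponent for d_j in dists)
                    for d_k in dists
                ])

        # Stop once no membership changes by more than tol.
        change = max(
            abs(a - b) for ra, rb in zip(u, new_u) for a, b in zip(ra, rb)
        )
        u = new_u
        if change < tol:
            break

    return centers, u


# Hypothetical stylometric vectors:
# [average word length, punctuation marks per 100 characters].
messages = [
    [4.1, 2.0], [4.3, 2.2], [4.0, 1.9],   # messages in one writing style
    [6.0, 5.1], [6.2, 4.8], [5.9, 5.0],   # messages in another
]
centers, memberships = fuzzy_c_means(messages, n_clusters=2)
for vector, row in zip(messages, memberships):
    print(vector, [round(v, 3) for v in row])
```

Each row of `memberships` sums to 1, so a message can belong partly to several clusters rather than to exactly one. Possibilistic C-Means relaxes this sum-to-one constraint; that variant is not shown here.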




