Finding Authoritative Researchers on Academic Web Sites

In this paper, we present a methodology for finding authoritative researchers by analyzing academic Web sites. We show a case study in which we concentrate on a set of Czech computer science departments- Web sites. We analyze the relations between them via hyperlinks and find the most important ones using several common ranking algorithms. We then examine the contents of the research papers present on these sites and determine the most authoritative Czech authors.




References:
[1] S. Brin and L. Page, "The Anatomy of a Large-Scale Hypertextual Web
Search Engine," in Proc. 7th World Wide Web Conference, Brisbane,
Australia, 1998, pp. 107-117.
[2] S. Chakrabarti, Mining the Web: Analysis of Hypertext and Semi
Structured Data. San Francisco, CA: Morgan Kaufmann Publishers,
2003, pp. 209-218.
[3] S. Chakrabarti, B. E. Dom, D. Gibson, R. Kumar, P. Raghavan,
S. Rajagopalan, and A. Tomkins, "Spectral Filtering for Resource
Discovery," in Proc. ACM SIGIR Workshop on Hypertext Information
Retrieval on the Web, Melbourne, Australia, pp. 13-21, 1998.
[4] M. Diligenti, M. Gori, and M. Maggini, "A Unified Probabilistic
Framework for Web Page Scoring Systems," IEEE Trans. Knowledge
and Data Engineering, vol. 16, no. 1, 2004, pp. 4-16.
[5] C. Ding, X. He, P. Husbands, H. Zha, and H. Simon, "PageRank, HITS
and a Unified Framework for Link Analysis," in Proc. 25th ACM SIGIR
Conf. Research and Development in Information Retrieval, Tampere,
Finland, 2002, pp. 353-354.
[6] C. Ding, X. He, P. Husbands, H. Zha, and H. Simon, "PageRank, HITS
and a Unified Framework for Link Analysis," Lawrence Berkeley
National Laboratory, University of California, Berkeley, CA, Technical
Report 49372, Nov. 2001.
[7] D. Gibson, J. Kleinberg, and P. Raghavan, "Inferring Web Communities
from Link Topology," in Proc. 9th ACM Conference on Hypertext and
Hypermedia, Pittsburgh, PA, 1998, pp. 225-234.
[8] H. Han, H. Zha, and C. L. Giles, "Name Disambiguation in Author
Citations Using a K-way Spectral Clustering Method," in Proc. 5th
ACM/IEEE-CS Int. Conf. Digital Libraries, Denver, CO, 2005, pp.
334-343.
[9] J. Kleinberg, "Authoritative Sources in a Hyperlinked Environment,"
Journal of the ACM, vol. 46, no. 5, 1999, pp. 604-632.
[10] A. K. McCallum, K. Nigam, J. Rennie, and K. Seymore, "Automating
the Construction of Internet Portals with Machine Learning,"
Information Retrieval Journal, vol. 3, no. 2, 2000, pp. 127-163.
[11] L. Page, S. Brin, R. Motwani, and T. Winograd, "The PageRank Citation
Ranking: Bringing Order to the Web," Computer Science Department,
Stanford University, CA, Technical Report 1999-66, Nov. 1999.
[12] K. Seymore, A. McCallum, and R. Rosenfeld, "Learning Hidden
Markov Model Structure for Information Extraction," in Proc. AAAI-99
Workshop Machine Learning for Information Extraction, Orlando, FL,
1999, pp. 37-42.
[13] M. Thelwall, "Extracting Macroscopic Information from Web Links,"
Journal of the American Society for Information Science and
Technology, vol. 52, no. 13, 2001, pp.1157-1168.
[14] M. Thelwall, "The Relationship between the WIFs or Inlinks of
Computer Science Departments in UK and Their RAE Ratings or
Research Productivities in 2001," Scientometrics, vol. 57, no. 2, 2003,
pp. 239-255.