Lexical Database for Multiple Languages: Multilingual Word Semantic Network

Data mining and knowledge engineering have become a tough task due to the availability of large amount of data in the web nowadays. Validity and reliability of data also become a main debate in knowledge acquisition. Besides, acquiring knowledge from different languages has become another concern. There are many language translators and corpora developed but the function of these translators and corpora are usually limited to certain languages and domains. Furthermore, search results from engines with traditional 'keyword' approach are no longer satisfying. More intelligent knowledge engineering agents are needed. To address to these problems, a system known as Multilingual Word Semantic Network is proposed. This system adapted semantic network to organize words according to concepts and relations. The system also uses open source as the development philosophy to enable the native language speakers and experts to contribute their knowledge to the system. The contributed words are then defined and linked using lexical and semantic relations. Thus, related words and derivatives can be identified and linked. From the outcome of the system implementation, it contributes to the development of semantic web and knowledge engineering.





References:
[1] M. Steyvers and JB Tenenbaum. 2005. The Large-ScaleStructure of
Semantic Networks: Statistical Analyses and a Model of Semantic
Growth. Cognitive Science, 29: 41-78.
[2] Huysman, M. and Wulf, V. (2006) IT to support knowledge sharing in
communities, towards a social capital analysis. Journal of Information
Technology, 21, 40-51.
[3] C. Havasi, R. Speer, and J. Alonso. ConceptNet 3: a flexible,
multilingual semantic network for common sense knowledge. In Recent
Advances in Natural Language Processing, Borovets, Bulgaria,
September 2007.
[4] Darren Cook. 2008. MLSN: A multi-lingual semantic network. In 14th
Annual Meeting of the Association.
[5] WordNet, "http://wordnet.princeton.edu/", last accessed in June 2011.
[6] Miller, G. A. 1995. WORDNET: A Lexical Database for English.
Communications of ACM(11): 39-41.
[7] B. Hamp and H. Feldweg. Germanet - a lexical-semantic net for
German. In Proceedings of ACL workshop Automatic Information
Extraction and Building of Lexical Semantic Re-sources for NLP
Applications, Madrid., 1997.
[8] R. Navigli, S.P. Ponzetto, BabelNet: Building a very large multilingual
semantic network, in: Proceedings of the 48th Annual Meeting of the
Associationfor Computational Linguistics, Uppsala, Sweden, 11-16
July 2010, pp. 216-225.
[9] Hippel, E. v., How Open Source Software Works: "Free" User-to-User
Assistance., Research Policy, 32, 923-943., 2002.
[10] Sowa, J. F. (Eds.). (1992). Principles of Semantic Networks. San Mateo,
CA: Morgan Kaufmann Publishers.
[11] D. Hindle and M. Rooth. Structural ambiguity and lexical relations.
Computational Linguistics,19(1):103-120, 1993.
[12] Open Source, "http://en.wikipedia.org/wiki/Open_source", last accessed
in June 2011.