Latent Semantic Inference for Agriculture FAQ Retrieval

FAQ system can make user find answer to the problem that puzzles them. But now the research on Chinese FAQ system is still on the theoretical stage. This paper presents an approach to semantic inference for FAQ mining. To enhance the efficiency, a small pool of the candidate question-answering pairs retrieved from the system for the follow-up work according to the concept of the agriculture domain extracted from user input .Input queries or questions are converted into four parts, the question word segment (QWS), the verb segment (VS), the concept of agricultural areas segment (CS), the auxiliary segment (AS). A semantic matching method is presented to estimate the similarity between the semantic segments of the query and the questions in the pool of the candidate. A thesaurus constructed from the HowNet, a Chinese knowledge base, is adopted for word similarity measure in the matcher. The questions are classified into eleven intension categories using predefined question stemming keywords. For FAQ mining, given a query, the question part and answer part in an FAQ question-answer pair is matched with the input query, respectively. Finally, the probabilities estimated from these two parts are integrated and used to choose the most likely answer for the input query. These approaches are experimented on an agriculture FAQ system. Experimental results indicate that the proposed approach outperformed the FAQ-Finder system in agriculture FAQ retrieval.





References:
[1] S. Oyama,T. Kokubo, and T. Ishida, Domain-Specific Web Search with Keyword Spices .EEE Trans. Knowledge and Data Eng.vol.16, no. 1, pp. 17-27, Jan. 2004.
[2] C.O.Kwok , O. Etzioni , and D.S. Weld, Scaling Question Answering to the Web, ACM Trans. Information Systems, vol. 19,no. 3, pp. 242-262, 2001.
[3] R.D. Burke, K.J. Hammond, V.A. Kulyukin, S.L. Lytinen, N.Tomuro, and S. Schoenber, Question Answering from Frequently-Asked Question Files Experiences with the FAQ Finder System, Technical Report TR-97-05, Univ.of Chicago, pp. 1-38,1997.
[4] C.H. Wu, J.F. Yeh, and M.J. Chen, Domain-Specific FAQ Retrieval
Using Independent Aspects, ACM Trans. Asian Language Information
Processing, vol. 4, no. 1, 2005.
[5] D. Camacho, "Using Hierarchical Knowledge Structure to Implement
Dynamic FAQ System, Proc. Fifth Int-l Conf. Practical Aspects of
Knowledge Management (PAKM -04), 2004.
[6] V. Jijkoun, J. Mur, and M. de Rijke, Information Extraction for Question
Answering: Improving Recall through Syntactic Patterns,Proc. Int-l Conf.
Computational Linguistics, 2004.
[7] R. Soricut and E. Brill, Automatic Question Answering: Beyond the
Factoid," Proc. Human Language Technology Conf., 2004.
[8] E. Sneiders, Automated Question Answering Using Question Templates
that Cover the Conceptual Model of the Database,Natural Language
Processing and Information Systems, Proc.Int-l Workshop Applications
of Natural Language to Information Systems, pp. 235-239, 2002.
[9] Chung-Hsien Wu, Senior Member, IEEE, Jui-Feng Yeh, and Yu-Sheng
Lai, Semantic Segment Extraction and Matching for Internet FAQ
Retrieval, IEEE Transactions on Knowledge and Data Engineering, Vol.
18, No. 7, July 2006
[10] Zhou Qiang, Huang Changning´╝îAn Improved Approach for Chinese
Parsing Based on Local Preference Information´╝îJournal of Software´╝î
Vol.10´╝îNo.1´╝îpp1-6´╝î1999
[11] Qun Liu, Sujian LI .Word Similarity Computing Based on How-net.
Computational Linguistics and Chinese Language
Processing,China(Taiwan), 2002(7): 59 ´¢× 76.
[12] R. Baeza-Yates and B. Ribeiro-Neto, Modern Information Retrieval.
Addison-Wesley, 1999.