Speech Recognition Using Scaly Neural Networks

This work addresses speech recognition using scaly neural networks. A small vocabulary of 11 words was established first: "word, file, open, print, exit, edit, cut, copy, paste, doc1, doc2". These words correspond to computer functions such as opening a file, printing a text document, cutting, copying, pasting, editing, and exiting. The spoken words are input to the computer and then subjected to feature extraction using LPC (linear prediction coefficients). The extracted features are used as input to an artificial neural network in speaker-dependent mode. Half of the words are used for training the artificial neural network and the other half for testing the system; the latter are also used for information retrieval. The system consists of three parts: speech processing and feature extraction, training and testing using neural networks, and information retrieval. The retrieval process was 79.5-88% successful, which is quite acceptable considering variation in the surroundings, the state of the speaker, and the microphone type.
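The abstract does not state how the LPC features were computed, so the following is only a minimal sketch of one common approach: the autocorrelation method solved with the Levinson-Durbin recursion, applied to a single speech frame. The function names `autocorrelation` and `lpc`, the frame length, and the prediction order are illustrative assumptions, not details taken from this work.

```python
def autocorrelation(frame, max_lag):
    """Return autocorrelation values r[0..max_lag] of one speech frame."""
    n = len(frame)
    return [sum(frame[i] * frame[i + lag] for i in range(n - lag))
            for lag in range(max_lag + 1)]


def lpc(frame, order):
    """Estimate LPC coefficients a[1..order] for one frame (hypothetical helper).

    The assumed predictor is s[n] ~ sum_k a[k] * s[n - k].
    Returns (coefficients, residual_prediction_error).
    """
    r = autocorrelation(frame, order)
    if r[0] == 0.0:
        # A silent frame carries no information; return all-zero features.
        return [0.0] * order, 0.0

    a = [0.0] * (order + 1)   # a[0] is unused; a[1..order] are the coefficients
    error = r[0]              # prediction error of the order-0 predictor
    for i in range(1, order + 1):
        # Reflection coefficient for the step from order i-1 to order i.
        acc = r[i] - sum(a[j] * r[i - j] for j in range(1, i))
        k = acc / error
        # Update the lower-order coefficients and append the new one.
        new_a = a[:]
        new_a[i] = k
        for j in range(1, i):
            new_a[j] = a[j] - k * a[i - j]
        a = new_a
        error *= (1.0 - k * k)
    return a[1:], error


# Usage: a decaying exponential behaves like a first-order predictable signal.
frame = [0.9 ** n for n in range(200)]
coeffs, err = lpc(frame, 2)
print(coeffs, err)
```

For this synthetic frame the first coefficient comes out very close to 0.9 and the second very close to 0, as expected for a signal where each sample is 0.9 times the previous one. In a recognizer of the kind described, one such coefficient vector per frame would form the input to the neural network.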



