Voice Disorders Identification Using Hybrid Approach: Wavelet Analysis and Multilayer Neural Networks

This paper presents a new strategy of identification and classification of pathological voices using the hybrid method based on wavelet transform and neural networks. After speech acquisition from a patient, the speech signal is analysed in order to extract the acoustic parameters such as the pitch, the formants, Jitter, and shimmer. Obtained results will be compared to those normal and standard values thanks to a programmable database. Sounds are collected from normal people and patients, and then classified into two different categories. Speech data base is consists of several pathological and normal voices collected from the national hospital “Rabta-Tunis". Speech processing algorithm is conducted in a supervised mode for discrimination of normal and pathology voices and then for classification between neural and vocal pathologies (Parkinson, Alzheimer, laryngeal, dyslexia...). Several simulation results will be presented in function of the disease and will be compared with the clinical diagnosis in order to have an objective evaluation of the developed tool.




References:
[1] V. Parsa and D. G. Jamieson, "Interactions between speech coders and
disordered speech," Speech Communication, vol. 40, no. 7, pp. 365-385,
2003.
[2] S. B. Davis, "Acoustic characteristics of normal and pathological
voices," Speech and Language: Advances inBasic Research and
Practice, vol. 1, pp. 271-335, 1979.
[3] F. Plant, H Kessler, B Cheetham, J Earis, "Speech Monitoring of
Infective Laryngitis", Proceedings of ICSLP96, Philadelphia, pp. 749 -
752 , 1996
[4] M.N. Viera, F.R. McInnes, M.A. Jack "Robust F0 and Jitter estimation
in the Pathological voices ", Proceedings of ICSLP96, Philadelphia,
pp.745 -748, 1996.
[5] J.Nayak, P.S.Bhat "Classification and analysis of speech abnormalities",
ITBM-RBM 26 (2005) 319-327.
[6] S. Mallat, "A Theory for multiresolution signal decomposition: Wavelet
representation", IEEE Trans. Pattern Analysis and Machine Intelligence.
Vol. 11. No. 7 pp674-693 July 1989.
[7] B. Boyanov, S.Hadjitodorov: "Acoustic analysis of pathological voices:
a voice analysis system for screening of laryngeal diseases", Proc. IEEE
Engineering in Medical and Biology, (1997), vol. 16, no. 4, 74-82.
[8] J.J. Jiang,Yu Zhang: "Nonlinear dynamic analysis of speech from
pathological subjects", Proc IEEE Electronics Letters, March (2002),
vol.38, no.6.
[9] P.Yu, M.Ouaknine, J.Revis, and A.Giovanni, "Objective Voice Analysis
for Dysphonic Patients: A Multiparametric Protocol Including Acoustic
and Aerodynamic Measurements", Journal of Voice Vol. 15, No. 4, pp.
529-542, 2001
[10] J.Wang, Jo.Cheolwoo, "Performance of Gaussian Mixture Model as a
classifier for Pathological Voice", proceeding of the ASST in Auckland
2006, pp 165-169.
[11] J.Kortelainen, K.Noponen, « Neural networks », Intelligent Systems
2005
[12] S.Lotfi, C.Adnène, "A Speech Processing Interface for Analysis of
Pathological Voices", in proceeding of ICTTA conference, Damascus
2006.
[13] S.Lotfi, B.Haythem, C.Adnène, " Interface d-analyse vocale a
l-identification de certaines pathologies d-origine neurologique et
vocale", in proceeding of JTM conference, Tunis 2007.
[14] A.chérif, « Pitch detection and formant extraction of Arabic speech
processing » Journal of applied acoustics, January 2001
[15] A.M. Gaouda, M. Salama, A. Chikhani, and M. Sultan, "Application of
wavelet analysis for monitoring dynamic performance in industrial
plants," North American Power Symposium, Oct. 1997, Laramie,
Wyoming.
[16] C.Fredouille, G.Pouchoulin, J-F. Bonastre, M.Azzarello, A.Giovanni,
A.Ghio "Application of automatic speaker recognition techniques to
pathological voice assessment" in proceeding of international conference
on acoustic speech and signal processing (ICASSP 2005)
[17] C. M. Bishop, "Neural Networks for Pattern Recognition". Oxford:
Clarendon Press, 1996.