Automotive 3-Microphone Noise Canceller in a Frequently Moving Noise Source Environment

A combined three-microphone voice activity detector (VAD) and noise-canceling system is studied to enhance speech recognition in an automobile environment. A previous experiment clearly shows the ability of the composite system to cancel a single noise source outside of a defined zone. This paper investigates the performance of the composite system when there are frequently moving noise sources (noise sources are coming from different locations but are not always presented at the same time) e.g. there is other passenger speech or speech from a radio when a desired speech is presented. To work in a frequently moving noise sources environment, whilst a three-microphone voice activity detector (VAD) detects voice from a “VAD valid zone", the 3-microphone noise canceller uses a “noise canceller valid zone" defined in freespace around the users head. Therefore, a desired voice should be in the intersection of the noise canceller valid zone and VAD valid zone. Thus all noise is suppressed outside this intersection of area. Experiments are shown for a real environment e.g. all results were recorded in a car by omni-directional electret condenser microphones.


Authors:



References:
[1] M. Shozakai, S. Nakamura, and K. Shikano, "Robust speech recognition
in car environments," presented at Acoustics, Speech, and Signal
Processing, 1998. ICASSP '98. Proceedings of the 1998 IEEE
International Conference on, 1998.
[2] L. Griffiths and C. Jim, "An alternative approach to linearly constrained
adaptive beamforming," Antennas and Propagation, IEEE Transactions
on [legacy, pre - 1988], vol. 30, pp. 27-34, 1982.
[3] B. Widrow and F. Luo, "Microphone arrays for hearing aids: An
overview," Speech Communication, vol. 39, pp. 27-34, 2003.
[4] T. Nishiura, R. Gruhn, and S. Nakamura, "Collaborative steering of
microphone array and video camera toward multi-lingual tele-conference
through speech-to-speech translation," presented at Automatic Speech
Recognition and Understanding, 2001. ASRU '01. IEEE Workshop on,
2001.
[5] S. Stergiopoulos and A. C. Dhanantwari, "Implementation of adaptive
processing in integrated active-passive sonars with multi-dimensional
arrays," presented at Advances in Digital Filtering and Signal
Processing, 1998 IEEE Symposium on, 1998.
[6] G. W. Elko, "Microphone array systems for hands-free
telecommunication," Speech Communication, vol. 22, pp. 229-240,
1996.
[7] B. Widrow, J. R. Glover, Jr., J. M. McCool, J. Kaunitz, C. S. Williams,
R. H. Hearn, J. R. Zeidler, J. Eugene Dong, and R. C. Goodlin,
"Adaptive noise cancelling: Principles and applications," Proceedings of
the IEEE, vol. 63, pp. 1692-1716, 1975.
[8] B. Widrow and M. E. Hoff, "Adaptive switching circuits," IRE Wescon
Convention Record, pp. 94-104, 1960.
[9] M. M. Goulding and J. S. Bird, "Speech enhancement for mobile
telephony," Vehicular Technology, IEEE Transactions on, vol. 39, pp.
316-326, 1990.
[10] W. Armbruester, R. Czarnach, and P. Vary, "Adaptive Noise
Cancellation with Reference Input - Possible Applications and
Theoretical Limits," in Signal Processing III: Theories and Applications,
I. T. Young, Ed.: Elsevier, 1986, pp. 391-394.
[11] H. Agaiby and T. J. Moir, "A robust word boundary detection algorithm
with application to speech recognition," presented at Digital Signal
Processing Proceedings, 1997. DSP 97., 1997 13th International
Conference on, 1997.
[12] Z. Qi and T. J. Moir, "An Automotive three-microphone Voice Activity
Detector and noise canceller," presented at 2005 International
Conference on Intelligent Sensors, Sensor Networks and Information,
Melbourne, 2005.
[13] G. Carter, C. Knapp, and A. Nuttall, "Estimation of the magnitudesquared
coherence function via overlapped fast Fourier transform
processing," Audio and Electroacoustics, IEEE Transactions on, vol. 21,
pp. 337-344, 1973.
[14] S. Haykin, Adaptive Filter Theory, 4 ed: Prentice Hall, 2002.
[15] G. Barrault, M. H. Costa, J. C. M. Bermudez, and A. Lenzi, "A new
analytical model for the NLMS algorithm," presented at Acoustics,
Speech, and Signal Processing, 2005. Proceedings. (ICASSP '05). IEEE
International Conference on, 2005.
[16] C. Rulph, DSP applications using C and the TMS320C6x DSK: J.
Wiley, 2002.
[17] W. Herbordt, Sound Capture for Human/machine Interfaces - Practical
Aspects of Microphone Array Signal Processing: Springer-Verlag, 2005.
[18] W. Herbordt, T. Horiuchi, M. Fujimoto, T. Jitsuhiro, and S. Nakamura,
"Hands-Free Speech Recognition and Communication on PDAS Using
Microphone Array Technology," presented at Automatic Speech
Recognition and Understanding, 2005 IEEE Workshop on, 2005.
[19] O. Hoshuyama and A. Sugiyama, "Robust Adaptive Beamforming," in
Microphone Arrays: Signal Processing Techniques and Applications
(Digital Signal Processing), M. Brandstein and Ward, Eds.: Springer-
Verlag, 2001.
[20] R. B. Wallace and R. A. Goubran, "Improved tracking adaptive noise
canceler for nonstationary environments," Signal Processing, IEEE
Transactions on (see also Acoustics, Speech, and Signal Processing,
IEEE Transactions on), vol. 40, pp. 700-703, 1992.
[21] R. B. Wallace and R. A. Goubran, "Noise cancellation using parallel
adaptive filters," Circuits and Systems II: Analog and Digital Signal
Processing, IEEE Transactions on (see also Circuits and Systems II:
Express Briefs, IEEE Transactions on), vol. 39, pp. 239-243, 1992.
[22] D. Van Compernolle, "Switching adaptive filters for enhancing noisy
and reverberant speech from microphone array recordings," presented at
Acoustics, Speech, and Signal Processing, 1990. ICASSP-90., 1990
International Conference on, 1990.