Enhancement of a 3D Sound Using Psychoacoustics

Generally, in order to create 3D sound using binaural systems, we use head related transfer functions (HRTF) including the information of sounds which is arrived to our ears. But it can decline some three-dimensional effects in the area of a cone of confusion between front and back directions, because of the characteristics of HRTF. In this paper, we propose a new method to use psychoacoustics theory that reduces the confusion of sound image localization. In the method, HRTF spectrum characteristic is enhanced by using the energy ratio of the bark band. Informal listening tests show that the proposed method improves the front-back sound localization characteristics much better than the conventional methods




References:
[1] Chong-Jin Tan and Woon-Seng Gan, "User-defined spectral manipulation
of HRTF for improved localization in 3D sound systems," IEEE electronic
letters, vol. 34, pp. 2387-2389, Dec. 1998.
[2] H. Hachbibogle, B. Gunel and A. M. Kondoz, "Head-related transfer
function filter interpolation by root displacement," in IEEE Workshop on
Applications of Signal Processing to Audio and Acoustics, USA, Oct.
2005, pp. 134-137.
[3] K. Koo and H. Cha, "Improvement of front/back Sound Localization
Characteristics using Psychoacoustics of Head Related Transfer
Function", Journal of The Korean Society of Broadcast Engineers, vol.
11, no. 4, 2006, pp. 448-457.
[4] D. N. Zotkin and R. Duraiswami, "Rendering localized spatial audio in a
virtual auditory space," IEEE transaction on multimedia, vol. 6, no. 4, pp.
553-564, Aug. 2004.
[5] N. Gupta, A. Barreto and C. Ordonez, "Spectral modification of
head-related transfer functions for improved virtual sound spatialization,"
in IEEE International Conference on Acoustics, Speech, and Signal
Processing, USA, 2002, pp. 1953-1956.
[6] M. Park, S. Choi, S. Kim and K. Bae, "Improvement of front-back sound
localization characteristics in headphone-based 3D sound generation," in
International Conference on Advanced Communication Technology,
Korea, 2005, pp. 273-276.
[7] C. P. Brown and O. Duda, "A structural model for binaural sound
synthesis," IEEE Transaction on speech and audio processing, vol. 5, no. 5,
1998, pp. 476-488
[8] F. Baumgarte, "Improved audio coding using a psychoacoustic model
based on a cochlear filter bank," IEEE transactions on speech and audio
processing, vol. 10, no. 7, 2002, pp. 495-503.