A High Quality Speech Coder at 600 bps

This paper presents a vocoder that produces high-quality synthetic speech at 600 bps. To reduce the bit rate, the algorithm uses a sinusoidally excited linear prediction model, which requires only a few coding parameters. Three consecutive frames are grouped into a superframe and jointly vector quantized to obtain high coding efficiency. Inter-frame redundancy is exploited by applying distinct quantization schemes to the different unvoiced/voiced frame combinations within the superframe. Experimental results show that the quality of the proposed coder is better than that of the 2.4 kbps LPC10e coder and approximately equal to that of the 2.4 kbps MELP coder, and that the coder is highly robust.
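
The superframe idea can be illustrated with a minimal Python sketch. It is not the coder described here: the frame duration, the parameter dimensions, the codebook contents, and the names bits_per_superframe, nearest_codeword, and quantize_superframe are all assumptions chosen for illustration. Only the grouping of three frames, the joint vector quantization, and the selection of a quantization scheme by unvoiced/voiced pattern come from the text above.

```python
FRAMES_PER_SUPERFRAME = 3  # from the text: three consecutive frames per superframe


def bits_per_superframe(bit_rate_bps, frame_ms):
    """Bit budget for one superframe; frame_ms is an assumed frame duration."""
    return bit_rate_bps * frame_ms * FRAMES_PER_SUPERFRAME / 1000.0


def nearest_codeword(vector, codebook):
    """Return the index of the codeword with minimum squared Euclidean distance."""
    best_index, best_dist = 0, float("inf")
    for index, codeword in enumerate(codebook):
        dist = sum((v - c) ** 2 for v, c in zip(vector, codeword))
        if dist < best_dist:
            best_index, best_dist = index, dist
    return best_index


def quantize_superframe(frames, voicing, codebooks):
    """Jointly quantize three frames with a codebook chosen by voicing pattern.

    frames:    three per-frame parameter lists (hypothetical features)
    voicing:   three booleans, True for a voiced frame
    codebooks: dict mapping a pattern such as "VVU" to a list of codewords
    """
    pattern = "".join("V" if voiced else "U" for voiced in voicing)
    # Concatenate the frames so that a single index encodes all three at once.
    joint_vector = [x for frame in frames for x in frame]
    return pattern, nearest_codeword(joint_vector, codebooks[pattern])


if __name__ == "__main__":
    # With an assumed 25 ms frame, 600 bps leaves 45 bits per superframe.
    print(bits_per_superframe(600, 25))

    # Toy two-entry codebook for the voiced-voiced-unvoiced pattern only.
    toy_codebooks = {"VVU": [[0.0] * 6, [1.0] * 6]}
    frames = [[0.9, 1.0], [1.1, 0.8], [1.0, 1.0]]
    print(quantize_superframe(frames, [True, True, False], toy_codebooks))
```

Quantizing the concatenated vector lets one index describe all three frames, so correlation between neighbouring frames is paid for only once. Keeping a separate codebook per voicing pattern lets each pattern use a codebook suited to it.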




