著者
Masayuki Nishiguchi
出版者
ACOUSTICAL SOCIETY OF JAPAN
雑誌
Acoustical Science and Technology (ISSN:13463969)
巻号頁・発行日
vol.27, no.6, pp.375-383, 2006 (Released:2006-11-01)
参考文献数
19

A coding algorithm for speech called harmonic vector excitation coding (HVXC) has been developed that encodes speech at very low bit rates (2.0–4.0 kbit/s). It breaks speech signals down into two types of segments: voiced segments, for which a parametric representation of harmonic spectral magnitudes of LPC residual signals is used; and unvoiced segments, for which the CELP coding algorithm is used. This combination provides near toll-quality speech at 4.0 kbit/s, and communication-quality speech at 2.0 kbit/s, thus outperforming FS1016 4.8-kbit/s CELP. This paper discusses the encoder and decoder algorithms for HVXC, including fast harmonic synthesis, time scale modification, and pitch-change decoding. Due to its high coding efficiency and new functionality, HVXC has been adopted as the ISO/IEC International Standard for MPEG-4 audio.
著者
Masayuki Nishiguchi
出版者
一般社団法人 日本音響学会
雑誌
Acoustical Science and Technology (ISSN:13463969)
巻号頁・発行日
vol.27, no.1, pp.43-49, 2006 (Released:2006-01-01)
参考文献数
21
被引用文献数
1 1

Harmonic coding is a very powerful technique for the coding of speech at very low bit rates; and the efficient coding of spectral magnitudes sampled at harmonic frequencies is the key to obtaining good coded-speech quality. This paper presents a weighted vector quantization method for spectral vectors composed of a variable number of harmonic magnitudes. It is based on simple, efficient linear dimension conversion and employs a weighted distortion measure that exploits the human auditory sense. A codebook training algorithm using the weighting matrix is also presented. Finally, a low-complexity VQ codebook search technique based on pre-selection is described that reduces the computational complexity to less than 10% of that of an exhaustive search, without perceptible loss of quality. The proposed quantization scheme is used in Harmonic Vector eXcitation Coding (HVXC), which is a very low-bit-rate speech coding algorithm that combines harmonic and stochastic vector representations of LPC residual signals. Due to the high efficiency of this VQ scheme, HVXC provides good communication-quality speech at bit rates as low as 2–4 kbit/s, and was adopted as the ISO/IEC International Standard for MPEG-4 Audio.