JP2558876B2

JP2558876B2 - Speech coding device

Info

Publication number: JP2558876B2
Application number: JP1141708A
Authority: JP
Inventors: 茂細井
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1989-06-02
Filing date: 1989-06-02
Publication date: 1996-11-27
Anticipated expiration: 2011-11-27
Also published as: JPH036600A

Description

【発明の詳細な説明】産業上の利用分野本発明はディジタル通信、ボイスメール等に利用する
音声符号化装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a voice encoding device used for digital communication, voice mail and the like.

従来の技術従来の音声符号化装置は、第３図（ａ）に示すように
予測器１、量子化器２、予測器１および量子化器２から
の伝送データを多重化する多重化器３を有する符号器14
と、第３図（ｂ）に示すようにこの符号器14からの伝送
データを分解する分解器４、量粒子化器５、分離器４お
よび逆量子化器５からの伝送データを合成する合成器６
を有する複合器15とで構成されている。2. Description of the Related Art A conventional speech coding apparatus, as shown in FIG. 3 (a), includes a predictor 1, a quantizer 2, and a multiplexer 3 for multiplexing transmission data from the predictor 1 and the quantizer 2. Encoder 14 with
Then, as shown in FIG. 3 (b), a synthesis for synthesizing the transmission data from the decomposing unit 4, the quantity particle demultiplexing unit 5, the demultiplexing unit 5 and the dequantizing unit 5 for decomposing the transmission data from the encoder 14. Bowl 6
And a compound device 15 having

上記予測器１は第４図に示すように、アナログ／ディ
ジタル（A/D）変換された音声信号の近接サンプル値間
の相関を除去するために、短時間予測分析器（線形予測
器）８により短時間予測フィルタ係数を求め、短時間予
測フィルタ７により短時間予測誤差を求める。As shown in FIG. 4, the predictor 1 has a short-term predictive analyzer (linear predictor) 8 in order to remove a correlation between adjacent sample values of an analog / digital (A / D) converted voice signal. The short-time prediction filter coefficient is obtained by the following, and the short-time prediction error is obtained by the short-time prediction filter 7.

そして、短時間予測フィルタ７の出力から音源ピッチ
の周期的な相関を除去するために、長時間予測分析器
（ピッチ予測器）10により長時間予測フィルタ係数を求
め、長時間予測フィルタ９により予測誤差を求める。こ
の予測誤差は量子化器２に入力される。Then, in order to remove the periodic correlation of the sound source pitch from the output of the short-term prediction filter 7, the long-term prediction analyzer (pitch predictor) 10 obtains the long-term prediction filter coefficient, and the long-term prediction filter 9 predicts it. Find the error. This prediction error is input to the quantizer 2.

一方短時間予測フィルタ係数および長時間予測フィル
タ係数は量子化器11により量子化され、伝送データとし
て多重化器３に入力される。On the other hand, the short-term prediction filter coefficient and the long-term prediction filter coefficient are quantized by the quantizer 11 and input to the multiplexer 3 as transmission data.

上記量子器２は第５図に示すように、入力された予測
誤差をベクトル量子化するベクトル量子化器12と、複数
個の一定長の信号列（代表ベクトルという）を用意した
コードブック13とを有し、入力された予測誤差の信号列
と各代表ベクトルとの２乗距離を計算し、その値の最も
小さい代表ベクトルの番号を量子化値として伝送データ
する。この伝送データは、多重化部３により予測器１か
らの伝送データ（予測フィルタの係数）と多重化され、
復号器に送出される。As shown in FIG. 5, the quantizer 2 includes a vector quantizer 12 that vector-quantizes an input prediction error, and a codebook 13 that prepares a plurality of constant-length signal sequences (referred to as representative vectors). Then, the square distance between the input prediction error signal sequence and each representative vector is calculated, and the number of the representative vector having the smallest value is transmitted as the quantized value for transmission data. This transmission data is multiplexed with the transmission data (coefficient of the prediction filter) from the predictor 1 by the multiplexing unit 3,
It is sent to the decoder.

一方、復号器では、分離器４により、量子化器２の出
力（代表ベクトル番号）と、予測器１からの出力（フィ
ルタ係数）に分離される。量子化器２の出力は逆量子化
器５に、予測器１の出力は合成器６にそれぞれ入力され
る。逆量子化器５では入力された代表ベクトルの番号に
相当する信号列をコードブックを用いて復号し、予測誤
差とする。この予測誤差は合成器６において、予測器１
のフィルタと逆特性を持つフィルタに入力され、音声信
号が復号される。On the other hand, in the decoder, the separator 4 separates the output of the quantizer 2 (representative vector number) and the output of the predictor 1 (filter coefficient). The output of the quantizer 2 is input to the inverse quantizer 5, and the output of the predictor 1 is input to the combiner 6. The inverse quantizer 5 decodes the signal sequence corresponding to the input representative vector number using a codebook to obtain a prediction error. This prediction error is calculated by the combiner 6 in the predictor 1
The audio signal is input to the filter having the inverse characteristic of that of the above, and the audio signal is decoded.

発明が解決しようとする課題しかしながら、上記従来の音声符号化装置では、音声
の特性によらず音声信号のどの部分に対しても常に同じ
処理を行っているため、低ビットレートの音声符号化に
おいては、音声の特性に従った適切な符号化処理が行な
われず、復号後の音声品質が良好でなく、符号化効率を
向上できないという問題があった。DISCLOSURE OF THE INVENTION Problems to be Solved by the Invention However, in the above-described conventional speech coding apparatus, since the same processing is always performed on any part of the speech signal regardless of the characteristics of the speech, in speech coding at a low bit rate. Has a problem that an appropriate encoding process according to the characteristics of the voice is not performed, the voice quality after decoding is not good, and the encoding efficiency cannot be improved.

本発明はこのような従来の問題を解決するものであ
り、符号化効率を改善した音声符号化装置を提供するこ
とを目的とするものである。The present invention solves such a conventional problem, and an object of the present invention is to provide a speech coding apparatus with improved coding efficiency.

課題を解決するための手段本発明は上記目的を達成するために、入力された音声
信号を短時間予測器で短時間予測した後の短時間予測誤
差の特性に従って、分類器で定常部と過渡部に分類し、
この分類出力により、長時間予測器と定包絡化器のいず
れか一方を用いることにより、音声の特性に従った処理
を行い、符号化効率を改善するようにしたものである。Means for Solving the Problems In order to achieve the above object, the present invention provides a classifier with a stationary part and a transient part according to the characteristics of a short-term prediction error after short-term prediction of an input voice signal by the short-term predictor. Divided into parts,
With this classification output, either one of the long-term predictor and the constant enveloper is used to perform the processing according to the characteristics of the voice and improve the coding efficiency.

作用本発明は上記構成により、分類器の出力により、音声
信号を類似した波形が続き、ピッチ周期間の相関の強い
定常部と、信号の振幅変化が激しく、ピッチ周期間の相
関が弱い過渡部とに分類する。Operation According to the present invention, with the above configuration, the output of the classifier causes a waveform similar to that of a voice signal to continue, a stationary portion with a strong correlation between pitch periods, and a transient with a strong amplitude change of the signal and a weak correlation between pitch periods. Classify into divisions.

そして、定常部においては、ピッチ周期間の相関が強
いことから、長時間予測を用いてピッチ周期間の相関を
除去する。この処理からは長時間予測フィルタ係数と長
時間予測誤差信号が出力される。Then, in the stationary part, since the correlation between pitch periods is strong, the correlation between pitch periods is removed using long-term prediction. From this process, the long-term prediction filter coefficient and the long-term prediction error signal are output.

また、過渡部においては、信号の振幅変化が大きく、
特に音声信号の立上り、立下りでは、その区間の初めの
部分と終わりの部分では振幅が大きく異なっているの
で、振幅包絡を検出し、振幅包絡情報を分離して定包絡
化する。この処理からは、振幅包絡情報と定包絡化され
た端義予測誤差信号が出力される。Also, in the transient part, the amplitude change of the signal is large,
In particular, at the rising and falling edges of the audio signal, the amplitudes at the beginning and the end of the section are greatly different, so the amplitude envelope is detected, and the amplitude envelope information is separated to form a constant envelope. From this process, the amplitude envelopment information and the fixed-envelope simple sense prediction error signal are output.

この処理により、量子化器に入力される信号は、相関
が除去されるか、あるいは、定包絡化されているので、
ベクトル量子化等による量知化器を用いることによっ
て、低ビットレートにおいても効率良く符号化できると
いう効果を有する。By this processing, the signal input to the quantizer has the correlation removed or the constant envelope,
By using a quantizer based on vector quantization or the like, there is an effect that efficient coding can be performed even at a low bit rate.

実施例第１図は本発明の一実施例を示すものである。第１図
（ａ）において、21は音声信号の短時間予測を行う短時
間予測器、22は短時間予測器21から出力された短時間予
測誤差信号の特性に従って分類する分類器、23a、23bは
分類器22の出力により処理を切換える連動スイッチ、24
は、短時間予測器21からの短時間予測誤差信号により長
時間予測を行う長時間予測器、25は、短時間予測器21か
らの短時間誤差信号により信号の振幅包絡を検出し、定
包絡化を行う定包絡化器、26は長時間予測器24からの長
時間予測誤差信号又は定包絡化器25からの振幅包絡情報
を量子化する量子化器、27は多重化器であり、その多重
化器27は短時間予測器21からの出力信号ａ、分類器22か
ら出力信号ｂ、長時間予測器24からの出力信号ｃ、定包
絡化器25からの出力信号ｄ、量子化器26からの出力信号
ｅを多重化して、伝送データを出力する。そして、上記
の符号21〜26を付したものによって符号器34を構成して
いる。Embodiment FIG. 1 shows an embodiment of the present invention. In FIG. 1 (a), 21 is a short-term predictor that performs short-term prediction of a speech signal, 22 is a classifier that classifies according to the characteristics of the short-time prediction error signal output from the short-term predictor 21, 23a, 23b Is an interlocking switch that switches processing by the output of the classifier 22, 24
Is a long-term predictor that performs long-term prediction with the short-term prediction error signal from the short-term predictor 21, and 25 is a constant envelope that detects the amplitude envelope of the signal with the short-time error signal from the short-term predictor 21. A constant enveloper that performs the quantization, 26 is a quantizer that quantizes the long-term prediction error signal from the long-term predictor 24 or the amplitude envelope information from the constant envelope 25, and 27 is a multiplexer, The multiplexer 27 outputs the output signal a from the short-term predictor 21, the output signal b from the classifier 22, the output signal c from the long-term predictor 24, the output signal d from the constant enveloper 25, and the quantizer 26. The output signal e from is multiplexed and the transmission data is output. The encoder 34 is configured by the ones to which the above reference numerals 21 to 26 are attached.

一方、第１図（ｂ）において、28は符号器34からの伝
送データを信号ａ〜ｅに分離する分類器、29は信号ｅか
ら信号列を逆量子化する逆量子化器、30a、30bは信号ｂ
により処理を切換える連動スイッチ、31は信号ｃから長
時間合成を行う長時間合成器、32は信号ｄから振幅包絡
を再生する包絡再生器、33は信号ａから短時間合成を行
う合成器である。そして、上記の符号28〜33を付したも
のによって復号器35を構成している。On the other hand, in FIG. 1B, 28 is a classifier for separating the transmission data from the encoder 34 into signals a to e, 29 is a dequantizer for dequantizing a signal sequence from the signal e, and 30a, 30b. Is the signal b
A reference numeral 31 is a long-time synthesizer for performing a long-time synthesis from the signal c, 32 is an envelope regenerator for reproducing an amplitude envelope from the signal d, and 33 is a synthesizer for performing a short-time synthesis from the signal a. . And, the decoder 35 is constituted by the above-mentioned reference numerals 28 to 33.

第２図（ａ）に示すような音声信号が、A/D変換され
て一定の時間長毎（以下、フレームという）に符号器34
に入力すると、短時間予測器21では各フレーム毎に近接
サンプル間の相関が除去され、予測フィルタ係数を示す
信号ａと短時間予測誤差が求まる。The audio signal as shown in FIG. 2 (a) is A / D-converted, and the encoder 34 is arranged at constant time intervals (hereinafter referred to as frames).
When input to, the short-term predictor 21 removes the correlation between adjacent samples for each frame, and obtains the signal a indicating the prediction filter coefficient and the short-term prediction error.

分類器22では短時間予測器21の出力である予測誤差を
用いて、第２図（ｂ）に示すように、定常部（イ）と過
渡部（ロ）の２種に分類される。この分類器22で分類さ
れた結果に従って連動スイッイ23a、23bを切換え、長時
間予測器24と定包絡化器25のいずれかを選択する。The classifier 22 uses the prediction error output from the short-term predictor 21 to classify into two types, a stationary part (a) and a transient part (b), as shown in FIG. 2 (b). The interlocked switches 23a and 23b are switched according to the result of classification by the classifier 22, and either the long-term predictor 24 or the constant enveloper 25 is selected.

定常部（イ）においては、短時間予測器21の出力信号
は実線位置にセットされた連動スイッチ23aを介して長
時間予測器24に入力され、この長時間予測器24により、
ピッチ周期間の相関を除去する。長時間予測器24からは
長時間予測フィルタ係数ｃと長時間予測誤差信号が出力
される。長時間予測フィルタ係数ｃは多重化器27に入力
されて伝送データとなる。長時間予測誤差信号は実線位
置にセットされた連動スイッチ23bを介して量子化器26
に入力される。In the steady part (b), the output signal of the short-term predictor 21 is input to the long-term predictor 24 via the interlocking switch 23a set to the solid line position.
Remove the correlation between pitch periods. The long-term predictor 24 outputs a long-term prediction filter coefficient c and a long-term prediction error signal. The long-term prediction filter coefficient c is input to the multiplexer 27 and becomes transmission data. The long-term prediction error signal is sent to the quantizer 26 via the interlocking switch 23b set to the solid line position.
Is input to

一方、過渡部（ロ）においては、短時間予測器21の出
力信号は点線位置に切換えられた連動スイッチ23aを介
して定包絡化器25に入力され、この高次定包絡化器25に
より振幅包絡ｄを検出する。ここでは例えば、高次多項
式により振幅包絡を近似し、この多項式の係数を振幅包
絡情報ｄとすればよい。この振幅包絡情報ｄは多重化器
27に入力され、伝送データとなる。また、短期予測誤差
信号は定包絡化され、点線位置にセットされた連動スイ
ッチ23bを介して量子化器26に入力される。On the other hand, in the transient section (b), the output signal of the short-time predictor 21 is input to the constant enveloper 25 via the interlocking switch 23a switched to the dotted line position, and the high-order constant enveloper 25 amplitude Envelope d is detected. Here, for example, the amplitude envelope may be approximated by a high-order polynomial, and the coefficient of this polynomial may be used as the amplitude envelope information d. This amplitude envelope information d is a multiplexer
It is input to 27 and becomes transmission data. The short-term prediction error signal is constant-enveloped and input to the quantizer 26 via the interlocking switch 23b set at the dotted line position.

量子化器26では、長時間予測器24あるいは定包絡化器
25からの入力信号に対して、例えばベクトル量子化を用
いて量子化を行う。この入力信号は相関が除去されてい
るか、あるいは定包絡化されているので、ベクトル量子
化に適した信号となっており、効率の良い量子化を行う
ことができる。The quantizer 26 includes a long-term predictor 24 or a constant enveloper.
The input signal from 25 is quantized using, for example, vector quantization. This input signal has a correlation removed or a constant envelope, so that it is a signal suitable for vector quantization, and efficient quantization can be performed.

一方、復号器35においては、符号器34から伝送された
伝送データ（ａ〜ｅ）を分離器28により分離し、逆量子
化器29によって逆量子化された信号ｅに対して、長時間
合成器31あるいは包絡再生器32により短時間予測誤差信
号を符号化し、短時間合成器33により、音声信号を各フ
レーム毎に符号化する。このように上記実施例によれ
ば、分類器22の分類出力により、符号化の処理を変える
ことができるため、符号化効率を改善することができる
という利点を有する。On the other hand, in the decoder 35, the transmission data (a to e) transmitted from the encoder 34 is separated by the separator 28, and the signal e dequantized by the dequantizer 29 is combined for a long time. The short-term prediction error signal is encoded by the device 31 or the envelope regenerator 32, and the audio signal is encoded by each frame by the short-time synthesizer 33. As described above, according to the above-described embodiment, since the coding process can be changed by the classification output of the classifier 22, there is an advantage that the coding efficiency can be improved.

発明の効果以上説明したように、本発明によれば、音声の特性を
利用して定常部と過渡部に分類し、この分類出力により
符号化の処理を変えるように構成したので、音声の特性
に従った適切な符号化処理ができ、符号化効率を改善す
ることができるという効果を有する。EFFECTS OF THE INVENTION As described above, according to the present invention, the characteristics of the voice are classified into the stationary part and the transient part by utilizing the characteristics of the voice, and the encoding process is changed by the classification output. According to this, there is an effect that an appropriate encoding process can be performed and the encoding efficiency can be improved.

[Brief description of drawings]

第１図は本発明の一実施例における音声符号化装置の概
略ブロック図、第２図は音声信号の入力信号および出力
信号の説明図、第３図は従来の音声符号化装置の概略ブ
ロック図、第４図は従来の予測器の概略ブロック図、第
５図は従来の量子化器の概略ブロック図である。 21……短時間予測器、22……分類器、23a,23b……切換
えスイッチ、24……長時間予測器、25……定包絡化器、
26……量子化器、27……多重化器。FIG. 1 is a schematic block diagram of a speech coding apparatus according to an embodiment of the present invention, FIG. 2 is an explanatory diagram of input signals and output signals of speech signals, and FIG. 3 is a schematic block diagram of a conventional speech coding apparatus. FIG. 4 is a schematic block diagram of a conventional predictor, and FIG. 5 is a schematic block diagram of a conventional quantizer. 21 ... Short-time predictor, 22 ... Classifier, 23a, 23b ... Changeover switch, 24 ... Long-term predictor, 25 ... Constant enveloper,
26 ... Quantizer, 27 ... Multiplexer.

Claims

(57) [Claims]

1. A short-time predictor for performing a short-time prediction of a speech signal to remove a correlation between adjacent sample values, and a short-time prediction error signal output from the short-time predictor to a stationary part and a transient part. A classifier for classifying, a long-time predictor that removes the correlation between pitch periods, a constant envelop that detects an amplitude envelope and constant-envelopes, and a quantum output of the long-term predictor or the constant-envelope And a quantizer for converting the output of the short-term predictor to the long-term predictor or the constant envelope quantizer according to the output of the classifier, and the output is quantized by the quantizer Encoding device.