JPH01233498A

JPH01233498A - speech encoding device

Info

Publication number: JPH01233498A
Application number: JP63060139A
Authority: JP
Inventors: Koji Okazaki; 岡崎　晃二; Takashi Ota; 恭士大田; Fumio Amano; 文雄天野; Shigeyuki Umigami; 重之海上
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1988-03-14
Filing date: 1988-03-14
Publication date: 1989-09-19

Abstract

(57)【要約】本公報は電子出願前の出願データであるた
め要約のデータは記録されません。(57) [Summary] This bulletin contains application data before electronic filing, so abstract data is not recorded.

Description

【発明の詳細な説明】〔概要〕音声の高能率符号化等に用いる音声符号化装置に関し。[Detailed description of the invention] 〔overview〕 Regarding a speech encoding device used for high-efficiency encoding of speech, etc.

ピッチ切出し方法および帯域制限方法を用いて伝送ビッ
トレートの低減を図りつつ、これらの方法の欠点である
符号化遅延の増大および再生音のこちりを抑制すること
を目的とし。The purpose of this invention is to reduce the transmission bit rate by using a pitch cutting method and a band limiting method, while suppressing the disadvantages of these methods, such as increased encoding delay and distortion of reproduced sound.

音声信号のピッチ周期を検出するピッチ検出手段と、ピ
ッチ検出手段で検出されたピッチ周期に基づき音声波形
を複数ピッチ分サンプルして該複数ピッチ分の波形から
１ピッチ分の波形を発生するピッチ波形発生手段と、ピ
ッチ波形発生手段で発生された１ピフチ波形の周波数帯
域を帯域制限する帯域制限手段と、帯域制限手段で帯域
制限された音声波形を符号化する符号化手段とを具備し
。Pitch detection means for detecting the pitch period of an audio signal; and a pitch waveform that samples the audio waveform for a plurality of pitches based on the pitch period detected by the pitch detection means and generates a waveform for one pitch from the waveform for the plurality of pitches. The apparatus includes a generating means, a band limiting means for band-limiting the frequency band of the 1-pitch waveform generated by the pitch waveform generating means, and an encoding means for encoding the audio waveform band-limited by the band limiting means.

ピッチ検出手段で検出されたピッチ周期の大きさに応じ
てピッチ波形発生手段での複数ピッチ分の波形の数およ
び帯域制限手段による制限帯域幅を変更するように構成
される。The number of waveforms for a plurality of pitches in the pitch waveform generating means and the limited bandwidth by the band limiting means are changed in accordance with the size of the pitch period detected by the pitch detecting means.

[Industrial application field]

本発明は音声の高能率符号化等に用いる音声符号化装置
に関する。The present invention relates to a speech encoding device used for high-efficiency encoding of speech.

音声符号化装置では、音声信号を低ビツトレートで符号
化しつつ、再生側で聴感上の違和感なく元の音声を再生
できることが必要とされている。An audio encoding device is required to be able to encode an audio signal at a low bit rate while reproducing the original audio without causing any audible discomfort on the playback side.

[Conventional technology]

高能率符号化の１手法として、音声信号からＮピッチ分
の波形をサンプルしてこれらＮピッチ分の音声波形から
１ピッチ分の音声波形を作成し。One method of high-efficiency encoding is to sample N-pitch waveforms from an audio signal and create a 1-pitch audio waveform from these N-pitch audio waveforms.

これを符号化して受信側に伝送し、受信側では受信信号
を復号後、Ｎ回繰り返すことによって元のＮピッチ分の
音声信号を再生するピッチ切出し方法が知られている。A pitch extraction method is known in which the received signal is encoded and transmitted to the receiving side, and the receiving side decodes the received signal and repeats it N times to reproduce the original N pitches worth of audio signal.

この方法によれば、伝送ビットレートは全ての音声波形
を伝音する場合に比べて１／Ｎに低減することができる
。According to this method, the transmission bit rate can be reduced to 1/N compared to the case where all audio waveforms are transmitted.

また高能率符号化の他の手法として、音声信号を帯域制
限することによってサンプリング周波数を低減し、それ
によって低ビツトレート化を図る方法も知られている。As another method of high-efficiency encoding, a method is also known in which the sampling frequency is reduced by band-limiting the audio signal, thereby lowering the bit rate.

すなわち音声信号の帯域を１／Ｍニ１ｌｆｆｌし、１／
Ｍのサンプリング周波数でダウンサンプリングすること
によって伝送ビットレートを帯域制限を行わない場合に
比べて１／Ｍに低減するものである。In other words, the band of the audio signal is reduced to 1/M by 1lffl, and 1/
By downsampling at a sampling frequency of M, the transmission bit rate is reduced to 1/M compared to the case where no band limitation is performed.

[Problem that the invention seeks to solve]

複数ピッチ波形から１ピツチ波形を生成する前者のピッ
チ切出し方法は、符号化遅延でか低音時において大きく
なり過ぎるという問題点がある。The former pitch extraction method, which generates a one-pitch waveform from a plurality of pitch waveforms, has a problem in that the pitch becomes too large at bass frequencies, probably due to encoding delay.

すなわち、送信側の符号化遅延では、ピッチ周期をＴ、
１ピツ千波形を切り出す元の複数ピッチ波形のサンプル
波形の数をＮとすると一般にτ＝２Ｎ−Ｔとなる。いまピッチ周期の最大値Ｔｍａｘを２Ｑｍｓｅ
ｃ。That is, in the encoding delay on the transmitting side, the pitch period is T,
If the number of sample waveforms of the multi-pitch waveform from which the 1-pitch waveform is cut out is N, then generally τ=2N-T. Now set the maximum pitch period Tmax to 2Qmse
c.

サンプル波形の数Ｎを６とすると、最大符号化遅延τｗ
ａｘは２４０　ｍ５ｅｃとなり、この大きさは通話のた
めには実用上支障がある。したがってサンプル波形の数
Ｎの大きさは最大ピッチ周期によって制限され、このた
め充分に低ビツトレート化を図れない。When the number N of sample waveforms is 6, the maximum encoding delay τw
The ax is 240 m5ec, and this size poses a practical problem for telephone calls. Therefore, the size of the number N of sample waveforms is limited by the maximum pitch period, and therefore a sufficiently low bit rate cannot be achieved.

音声信号の帯域制限を行う後者の方法は、帯域制限され
た音声信号を受信側で再生した場合、聴感上、音がこも
ってしまうという問題点がある。The latter method of band-limiting the audio signal has a problem in that when the band-limited audio signal is reproduced on the receiving side, the sound becomes muffled to the auditory sense.

したがって本発明の目的は、ピッチ切出し方法および帯
域制限方法を用いて伝送ビットレートの低減を図りつつ
、これらの方法の欠点である符号化遅延の増大および再
生音のこもりを抑制した音声符号化装置を提供すること
にある。Therefore, an object of the present invention is to provide an audio encoding device that reduces the transmission bit rate by using a pitch cutting method and a band limiting method, while suppressing the disadvantages of these methods, such as an increase in encoding delay and muffled reproduced sound. Our goal is to provide the following.

[Means for solving problems]

第１図は本発明に係る音声符号化装置の原理を説明する
ブロック図である。FIG. 1 is a block diagram illustrating the principle of a speech encoding device according to the present invention.

本発明に係る音声符号化装置は、音声信号のピッチ周期
Ｔを検出するピッチ検出手段ｌと、ピッチ検出手段１で
検出されたピッチ周期Ｔに基づき音声信号を複数ピッチ
分サンプルしてＮピッチ分の音声波形から１ピッチ分の
波形を発生するピッチ波形発生手段２と、ピッチ波形発
生手段２で発生された１ピツチ波形の周波数帯域を１／
Ｍに帯域制限する帯域制限手段３と、帯域制限手段３で
帯域制限された音声波形を符号化する符号化手段４とを
具備し、ピッチ検出手段１で検出されたピッチ周期Ｔの
大きさに応じてピッチ波形発生手段２でのピッチ波形の
サンプル数Ｎおよび帯域制限手段３による帯域制限の割
合Ｍを変更するように構成される。The speech encoding device according to the present invention includes a pitch detection means 1 for detecting the pitch period T of the speech signal, and a plurality of pitches of the speech signal sampled based on the pitch period T detected by the pitch detection means 1 and N pitches. The pitch waveform generating means 2 generates a waveform for one pitch from the audio waveform of , and the frequency band of the one pitch waveform generated by the pitch waveform generating means 2 is
It is equipped with a band limiting means 3 that limits the band to M, and an encoding means 4 that encodes the audio waveform whose band has been band limited by the band limiting means 3. The number of samples N of the pitch waveform in the pitch waveform generating means 2 and the rate M of band limiting by the band limiting means 3 are changed accordingly.

[Effect]

通常１人間の音声のピッチ周期は８０）１ｚ以上であり
、イントネーションによりたまにこれより低くなること
がある程度である。よって符号化遅延でか問題となるピ
ッチ周期Ｔの長い音声は大部分イントネーションが低い
場合に出現することになるが、このようなイントネーシ
ョンが低い音声に対しては送信側で周波数帯域の制限を
行っても、受信側の再生音声は聴感上こもった音には聞
こえず。Normally, the pitch period of one person's voice is 80)1z or more, and it may sometimes be lower than this depending on intonation. Therefore, speech with a long pitch period T, which causes problems due to encoding delay, will mostly appear when the intonation is low, but for such speech with low intonation, the frequency band should be limited on the transmitting side. However, the playback audio on the receiving side does not sound muffled to the auditory sense.

帯域制限による影響は実用上少ない。Bandwidth limitations have little effect in practical terms.

そこでこのような聴感上の特性を利用して符号化ビア）
レートを低減しつつ、符号化遅延を短くしかつ再生音の
こちりをなくした音声符号化を行う。すなわちピッチ周
期Ｔの長い音声に対してはピッチ波形発生手段１でピッ
チ波形サンプル数Ｎを減らして符号化遅延τが太き（な
ることを防ぎつつ、ピッチ波形サンプル数Ｎを減らした
ことによるビットレートの増大を、帯域制限手段３で音
声波形の帯域を１／Ｍに制限してビットレートを１／Ｍ
に低減することにより相殺する。このように帯域制限を
行ってもピッチ周期の長い音声であるから再生側では帯
域制限による影響は聴感上あまり判らない。Therefore, using these auditory characteristics, the coded via)
To perform audio encoding that reduces the rate, shortens the encoding delay, and eliminates the distortion of reproduced sound. In other words, for speech with a long pitch period T, the pitch waveform generating means 1 reduces the number of pitch waveform samples N to increase the encoding delay τ (bits due to the reduction in the number of pitch waveform samples N) To increase the rate, limit the audio waveform band to 1/M using the band limiting means 3 to reduce the bit rate to 1/M.
offset by reducing the Even if the band is limited in this manner, since the voice has a long pitch period, the effect of the band limit is not very perceptible on the playback side.

ピッチ周期Ｔの短い音声に対してはピッチ波形発生信号
２でピッチ波形サンプル数Ｎを増加してビットレートを
低減するとともに、帯域制限信号３での帯域制限の程度
を緩和して再生音声がこもった音となることを防ぐ。For audio with a short pitch period T, the pitch waveform generation signal 2 increases the number of pitch waveform samples N to reduce the bit rate, and the band restriction signal 3 relaxes the degree of band restriction to make the reproduced audio muffled. This prevents the sound from becoming too loud.

このように本発明では、ピッチ波形サンプル数Ｎと帯域
制限率１／Ｍとをピッチ周期Ｔに応じて制御しており、
Ｔが大きいときはピッチ波形サンプルＮを小さくして符
号化遅延τをより短＜シ。In this way, in the present invention, the number N of pitch waveform samples and the band limiting rate 1/M are controlled according to the pitch period T.
When T is large, the pitch waveform sample N is made small to shorten the encoding delay τ.

その代わりにＭを大きくすることで、符号化圧縮率１／
Ｌ＝１／ＮＭをほぼ一定に保つとともに。Instead, by increasing M, the encoding compression rate is 1/
While keeping L=1/NM almost constant.

再生音の品質を聴感上、帯域制限を行っていない時と同
等なものとしている。The quality of the reproduced sound is aurally equivalent to when no band limitation is applied.

例えば、ピッチ周期Ｔ＝Ｏ〜１２．５ｍ５ｅｃのときサ
ンプル数Ｎ−６，帯域制限率１／Ｍ＝１とし。For example, when the pitch period T=0 to 12.5 m5ec, the number of samples is N-6 and the band limit rate is 1/M=1.

一方、ピッチ周期Ｔ　＝　１２．５〜２０ｍ５ｅｃのと
きサンプル数Ｎ−３，帯域制限率１−／Ｍ＝１／２とな
るようにサンプル数Ｎおよび帯域制限率１／Ｍをピッチ
周期Ｔに応じて変えた場合、前者の場合には符号化遅延
の最大値ｒ　ｗａｘは２　Ｘ１２．５Ｘ　６　＝１５０
ｍｓｅｃ　＋後者の場合には最大符号化遅延τｌ１ａｘ
は２　Ｘ２０Ｘ　３　＝１２０　ｍ５ｅｃとなり、符号
化遅延は最大で１５０　ｍ５ｅｃ程度となり、実用上問
題とならない程度とすることができる。On the other hand, when the pitch period T = 12.5 to 20 m5ec, the number of samples N and the band restriction rate 1/M are set according to the pitch period T so that the number of samples N-3 and the band restriction rate 1-/M = 1/2. In the former case, the maximum value of encoding delay r wax is 2 x 12.5 x 6 = 150
msec + maximum encoding delay τl1ax in the latter case
is 2 x 20 x 3 = 120 m5ec, and the encoding delay is about 150 m5ec at maximum, which can be set to a level that does not pose a problem in practice.

〔Example〕

以下９図面を参照して本発明の詳細な説明する。 The present invention will be described in detail below with reference to nine drawings.

本発明に係る実施例の符号化部が第２図に示される。第
２図において、音声信号Ｓはピッチ抽出回路１０および
１／Ｎ切出し回路１１に入力される。An encoding unit according to an embodiment of the present invention is shown in FIG. In FIG. 2, the audio signal S is input to a pitch extraction circuit 10 and a 1/N extraction circuit 11.

ピッチ抽出回路１０は入力音声波形のピッチ周期を抽出
する回路であり、抽出されたピッチ周期Ｔは１／Ｎ切出
し回路１１および切替え回路１５に送られるとともに伝
送路を介して復号化部に伝送される。The pitch extraction circuit 10 is a circuit that extracts the pitch period of the input speech waveform, and the extracted pitch period T is sent to the 1/N extraction circuit 11 and the switching circuit 15, and is also transmitted to the decoding section via the transmission line. Ru.

１／Ｎ切出し回路１１はＮピッチ分の入力音声波形から
１ピツチ分の音声波形を作る回路であり。The 1/N extraction circuit 11 is a circuit that generates a 1-pitch audio waveform from an N-pitch input audio waveform.

ピッチ抽出回路１０で抽出されるピッチ周期Ｔが１５ｍ
５ｅｃ以上の場合にはＮ＝３すなわち３ピツチ分の音声
波形から１ピツチの波形を作成し、ピッチ周期Ｔ　＜　
１５ｍ５ｅｃのときにはＮ＝６すなわち６ピツチ分の音
声波形から１ピッチの音声波形を作成する。The pitch period T extracted by the pitch extraction circuit 10 is 15 m.
In the case of 5ec or more, create a 1-pitch waveform from N=3, that is, 3-pitch audio waveform, and pitch period T <
In the case of 15m5ec, N=6, that is, a 1-pitch audio waveform is created from 6 pitches' worth of audio waveforms.

１／Ｎ切出し回路１１で発生された１ピツチ波形は次に
帯域分割フィルタ１２に入力される。帯域分割フィルタ
１２はＯ〜４ｋＨｚの帯域幅の入力音声信号ＳをＯ〜２
　ｋ　Ｈｚの低域信号Ｓｔと２に〜４ｋＨｚの高域信号
ＳＨとに分割してそれぞれ符号器１３と１４に送出して
符号化を行っており、これらの低域信号ＳＬおよび高域
信号ＳＬ１は元の音声信号のサンプリング信号の１７２
にダウンサンプリングされる。The 1-pitch waveform generated by the 1/N extraction circuit 11 is then input to the band division filter 12. The band division filter 12 divides the input audio signal S with a bandwidth of O~4kHz into O~2
The low frequency signal St of kHz and the high frequency signal SH of 2 to 4 kHz are divided and sent to encoders 13 and 14 for encoding, and these low frequency signal SL and high frequency signal SL1 is the sampling signal of the original audio signal.
is downsampled to.

符号器１３からの低域信号ＳＬはそのまま伝送路に送出
され、符号器１４からの高域信号ＳＨは切替え回路１５
を介して伝送路に送出される。切替え回路１５はピッチ
抽出回路１０からピッチ周期Ｔ情報を受け＋　Ｔ＜１５
ｍ５ｅｃの時には閉じられていて符号器１４の高域信号
Ｓｏを伝送路へ送出し、一方、Ｔ≧１５ｍ５ｅｃの時に
は開かれて符号器１４の高域信号ＳＨの伝送路への送出
をしゃ断するように構成されている。The low frequency signal SL from the encoder 13 is sent as is to the transmission path, and the high frequency signal SH from the encoder 14 is sent to the switching circuit 15.
It is sent out to the transmission path via. The switching circuit 15 receives pitch period T information from the pitch extraction circuit 10 + T<15
When m5ec, it is closed and sends out the high frequency signal So of the encoder 14 to the transmission path, while when T≧15m5ec, it is opened and cuts off the sending of the high frequency signal SH of the encoder 14 to the transmission path. It is composed of

このようにこの実施例では符号化部における帯域制限方
式として帯域分割符号化方式、すなわち人力を高域成分
と低域成分とに分割し各帯域の信号を独立に符号化する
方式を利用しており、この特番帯域の信号はその帯域幅
に応じてダウンサンプリングされている。。In this way, this embodiment uses a band division coding method as a band limiting method in the encoding section, that is, a method in which human power is divided into high frequency components and low frequency components and the signals of each band are independently encoded. The signal of this special number band is downsampled according to its bandwidth. .

本発明に係る実施例の復号化部が第３図に示される。第
３図において、符号化部から伝送路を介して送られてき
た低域信号ＳＬは復号器２０に入力され、また高域信号
３Ｍは切替え器２４を介して復号器２１に入力される。A decoding section of an embodiment according to the present invention is shown in FIG. In FIG. 3, a low frequency signal SL sent from the encoding section via a transmission path is input to a decoder 20, and a high frequency signal 3M is input to a decoder 21 via a switch 24.

さらにピッチ周期Ｔ情報は切替え器２４およびＮ回繰返
し回路２３に入力される。Further, the pitch period T information is input to the switch 24 and the N-times repeat circuit 23.

切替え器２４はピッチ周期Ｔに応じて切り替えられる回
路であり、Ｔ＜１５ｍ５ｅｃのとき伝送路側に切り替え
られて伝送路からの高域信号ＳＲを復号器２１に入力さ
せ、Ｔ≧１５ｍ５ｅｃでは伝送路からの高域信号ＳＨの
復号器２Ｉへの入力をしゃ断するように構成されている
。The switch 24 is a circuit that is switched according to the pitch period T, and when T<15m5ec, it is switched to the transmission line side and inputs the high frequency signal SR from the transmission line to the decoder 21, and when T≧15m5ec, it is switched to the transmission line side and inputs the high frequency signal SR from the transmission line to the decoder 21. It is configured to cut off input of the high frequency signal SH to the decoder 2I.

復号器２０および２Ｉの各出力信号は帯域合成フィルタ
２２に入力されて合成され、その合成信号はＮ回繰返し
回路２３に入力される。Ｎ回繰返し回路２３は帯域合成
フィルタ２２からの復号音声波形をピッチ周期Ｔに基づ
きＮ回繰り返して再生音声を作成する回路である。The output signals of the decoders 20 and 2I are input to a band synthesis filter 22 and combined, and the combined signal is input to an N-times repeating circuit 23. The N-times repeating circuit 23 is a circuit that repeats the decoded audio waveform from the band synthesis filter 22 N times based on the pitch period T to create reproduced audio.

実施例システムの動作が以下に説明される。まず符号化
部において、入力音声信号Ｓがピッチ抽出回路１０およ
び１／Ｎ切出し回路１１に入力され。The operation of the example system is described below. First, in the encoding section, the input audio signal S is input to the pitch extraction circuit 10 and the 1/N extraction circuit 11.

ピッチ抽出回路１０で音声信号Ｓのピッチ周期Ｔが抽出
される。いまこの抽出されたピッチ周期ＴがＴ＜１５ｍ
５ｅｃであるとする。１／Ｎ切出し回路１１はこのピッ
チ周期Ｔに基づき、Ｔ＜１５ｍ５ｅｃであるので入力音
声信号を６ピツチ分サンプリングしてこの６ピツチ分の
波形から１ピツチの音声波形を生成して出力する。この
１／Ｎ切出し回路１１からの１ピツチ分の音声波形は帯
域分割フィルタ１２に入力されて低域信号ＳＬと高域信
号ＳＨとに分割され、ｌ／２にダウンサンプリングしつ
つ符号器１３と１４で符号化される。切替え器１５はピ
ッチ周期Ｔ＜１５ｍ５ｅｃであるから閉じられており、
したがって符号器１３および１４からの低域信号ＳＬお
よび高域信号Ｓｏは共に伝送路を介して復号化部に伝送
される。A pitch extraction circuit 10 extracts the pitch period T of the audio signal S. Now, this extracted pitch period T is T<15m
Suppose that it is 5ec. Based on this pitch period T, the 1/N extraction circuit 11 samples the input audio signal by 6 pitches and generates and outputs a 1-pitch audio waveform from the 6-pitch waveform since T<15m5ec. The 1 pitch audio waveform from the 1/N extraction circuit 11 is input to the band division filter 12 where it is divided into a low frequency signal SL and a high frequency signal SH. 14. The switch 15 is closed because the pitch period T<15m5ec.
Therefore, both the low frequency signal SL and the high frequency signal So from encoders 13 and 14 are transmitted to the decoding section via the transmission path.

一方、ピッチ抽出回路１０で抽出されたピッチ周期Ｔが
Ｔ≧１５ｍ５ｅｃである場合、１／Ｎ切出し回路１１は
音声信号Ｓを３ピツチ分サンプリングしてこの３ピツチ
分の音声波形から１ピツチ分の音声波形を発生する。こ
の音声波形は前述同様に帯域分割フィルタ１２で低域信
号ＳＬおよび高域信号ＳＨに分割されて符号器１３およ
び１４で符号化されるが。On the other hand, when the pitch period T extracted by the pitch extraction circuit 10 is T≧15m5ec, the 1/N extraction circuit 11 samples the audio signal S for 3 pitches and uses the audio waveform for 3 pitches to extract 1 pitch from the audio waveform for 3 pitches. Generates audio waveform. This audio waveform is divided into a low frequency signal SL and a high frequency signal SH by the band division filter 12 and encoded by encoders 13 and 14, as described above.

Ｔ≧１５ｍ５ｅｃでは切替え器１５が開かれているので
。Since the switch 15 is open when T≧15m5ec.

符号器１４からの高域信号Ｓｏは伝送路に送出されない
。The high frequency signal So from the encoder 14 is not sent out to the transmission path.

このようにピッチ周期ＴがＴ≧１５ｍ５ｅｃの時には１
／Ｎ切出し回路１１でのピッチ波形サンプル数ＮがＴ＜
１５ｍ５ｅｃの時の半分となるので、この１／Ｎ切出し
回路１１における符号化圧縮率は半分に減るが、音声信
号Ｓのうち帯域分割フィルタ１２で分割した低域信号Ｓ
ｔ、のみしか復号化部に送出しないのでビットレートを
半分にでき、したがって伝送路に送出される信号の符号
化圧縮率は結局。In this way, when the pitch period T is T≧15m5ec, 1
/N The number of pitch waveform samples N in the extraction circuit 11 is T<
Since it is half of that of 15m5ec, the encoding compression rate in this 1/N extraction circuit 11 is reduced by half, but the low frequency signal S divided by the band division filter 12 of the audio signal S is
Since only t is sent to the decoding unit, the bit rate can be halved, and therefore the encoding compression rate of the signal sent to the transmission path is ultimately .

ピッチ周期ＴがＴ＜１５ｍ５ｅｃの時と同じになる。The pitch period T is the same as when T<15 m5ec.

すなわち、ピッチ波形サンプル数をＮとし、１／Ｍに帯
域制限して１／Ｍにダウンサンプリングしたものとする
と、圧縮率１／Ｌ＝１／　（Ｎ−Ｍ）はピッチ周期Ｔに
かかわらず常に一定である。In other words, if the number of pitch waveform samples is N, and the band is limited to 1/M and downsampled to 1/M, then the compression ratio 1/L = 1/ (N - M) is always constant regardless of the pitch period T. constant.

復号化部ではＴ＜１５ｍ５ｅｃでは切替え器２４は伝送
路側に接続されており、したがって伝送路を介して伝送
されてきた低域信号Ｓｔ、および高域信号ＳＨが復号器
２０および２１に入力されて復号され。In the decoding section, when T<15m5ec, the switch 24 is connected to the transmission line side, and therefore the low frequency signal St and the high frequency signal SH transmitted via the transmission channel are input to the decoders 20 and 21. decrypted.

その後、帯域合成フィルタ２２で合成されてその合成信
号がＮ回繰返し回路２３に入力される。Ｎ回繰返し回路
２３はこの合成信号の波形を６回繰り返して再生信号を
発生する。Thereafter, the signals are synthesized by the band synthesis filter 22 and the synthesized signal is inputted to the repeating circuit 23 N times. The N-time repeat circuit 23 repeats the waveform of this composite signal six times to generate a reproduced signal.

Ｔ≧１５　ｍ　ｓｅｃでは伝送路からの低域信号Ｓｔの
みが復号器２０で復号されて帯域合成フィルタ２２を介
してＮ回繰返し回路２３に入力され、Ｎ回繰返し回路２
３では合成信号波形を３回繰り返して再生信号を発生す
る。When T≧15 m sec, only the low-frequency signal St from the transmission path is decoded by the decoder 20 and inputted to the N-times repeating circuit 23 via the band synthesis filter 22, and the N-times repeating circuit 2
3, the synthesized signal waveform is repeated three times to generate a reproduced signal.

本発明の実施にあたっては種々の変形形態が可能である
。上述の実施例では符号化部における帯域制限方法とし
て帯域分割符号化方式を用いたが。Various modifications are possible in implementing the invention. In the above-described embodiment, a band division encoding method was used as a band limiting method in the encoding section.

勿論これに限らず９例えば離散フーリエ変換（ＤＦＴ）
を用いることもできる。すなわち入力音声信号に対して
離散フーリエ変換を行って線スベクトルを抽出し、ピッ
チ周期Ｔに応じてこの線スペクトルのうちの高周波成分
を除去することにより帯域制服を行うことができる。こ
の場合も圧縮率がほぼ一定となるように線スペクトルの
高域成分を落としていくことになる。Of course, the invention is not limited to this.9 For example, discrete Fourier transform (DFT)
You can also use That is, by performing discrete Fourier transform on the input audio signal to extract a line spectrum, and removing high frequency components of this line spectrum according to the pitch period T, band uniformity can be performed. In this case as well, the high frequency components of the line spectrum are reduced so that the compression ratio remains approximately constant.

〔Effect of the invention〕

本発明によれば、音声信号の符号化に際して符号化遅延
が大きくなり過ぎたりあるいは再生側において再生音が
こもった音になったりすることを防止しつつ、低ビツト
レートによる音声符号化を行うことができる。According to the present invention, it is possible to perform audio encoding at a low bit rate while preventing the encoding delay from becoming too large or the reproduced sound becoming muffled on the playback side when encoding the audio signal. can.

[Brief explanation of the drawing]

第１図は本発明に係る原理説明図。第２図は本発明に係る実施例の符号化部のブロック図、
および第３図は本発明に係る実施例の復号化部のブロック図で
ある。図において。１−ピッチ検出手段２−ピッチ波形発生手段３・−帯域制限手段４・・−符号化手段ＩＯ・・・ピッチ抽出回路１１・−１／Ｎ切出し回路１２・−・帯域分割フィルタ１３、１４−一符号器１５、２４−・切替え器２０、２１・・−復号器２２−・−帯域合成フィルタ２３−Ｎ回繰返し回路４１−千ご　日月１＝イ禾う　Ｓ　　理　序う第１図、４又ヂｒ日月の裏施侍１］のＴｆ号イご邪第２図Ａ脅１月の×夛馴列のイ夏号イこ侶ｐ第３図FIG. 1 is a diagram explaining the principle of the present invention. FIG. 2 is a block diagram of an encoding unit according to an embodiment of the present invention;
and FIG. 3 is a block diagram of a decoding section of an embodiment according to the present invention. In fig. 1 - Pitch detection means 2 - Pitch waveform generation means 3 - Band limiting means 4 - Encoding means IO - Pitch extraction circuit 11 - 1/N extraction circuit 12 - Band division filters 13, 14 - - Encoder 15, 24 - Switcher 20, 21 - Decoder 22 - Band synthesis filter 23 - N-times repeat circuit 41 4-way dir Sun Moon no Ura Sai Samurai 1] Tf number Igoya Figure 2

Claims

[Claims] 1. Pitch detection means for detecting the pitch period of an audio signal (
1), and pitch waveform generating means (2) which samples the audio signal for a plurality of pitches based on the pitch cycle detected by the pitch detection means (1) and generates a waveform for one pitch from the audio waveform for the plurality of pitches. ), band limiting means (3) for band limiting the frequency band of the one pitch waveform generated by the pitch waveform generating means (2), and encoding the audio waveform band limited by the band limiting means (3). the number of waveform samples in the pitch waveform generating means (2) and the band limiting means according to the size of the pitch period detected by the pitch detecting means (1); (3) A speech encoding device configured to change the limited bandwidth according to (3).