CN101662288B

CN101662288B - Method, device and system for encoding and decoding audios

Info

Publication number: CN101662288B
Application number: CN2008101191706A
Authority: CN
Inventors: 张德明; 李海婷; 张立斌; 霍克·克鲁格; 本特·凯瑟; 皮特·瓦里
Original assignee: Huawei Technologies Co Ltd
Current assignee: Guangdong Gaohang Intellectual Property Operation Co ltd; Wei Fang
Priority date: 2008-08-28
Filing date: 2008-08-28
Publication date: 2012-07-04
Anticipated expiration: 2028-08-28
Also published as: CN101662288A; WO2010022661A1

Abstract

An audio encoding and decoding method, device, and system, the method specifically includes: extracting time-domain envelope parameters, frequency-domain envelope parameters, pitch parameters, and harmonic interval parameters used to characterize an audio signal; parameters, frequency domain envelope parameters, pitch parameters and harmonic interval parameters are encoded and then transmitted to the decoder. The embodiment of the present invention adopts a set of parameters including time-domain envelope parameters, frequency-domain envelope parameters, pitch parameters, and harmonic interval parameters, which reduces the number of parameters required for encoding and reduces the time required for encoding using parameters. The required number of bits can achieve the purpose of encoding the signal with fewer bits.

Description

Audio encoding and decoding method and device, system

技术领域 technical field

本发明涉及音频编码、解码技术领域，尤其涉及参数音频编码、解码的方法及装置、系统。The present invention relates to the technical field of audio coding and decoding, in particular to a method, device and system for parametric audio coding and decoding.

背景技术 Background technique

音频信号通常指人耳可以听到的频率在20Hz到20KHz的声波，数字音频信号是指经过模数转换后的音频信号。从模拟到数字的转换包含了以指定的采样率进行数字采样，以及以指定的分辨率对时域离散信号进行标量量化的过程。An audio signal generally refers to a sound wave with a frequency of 20 Hz to 20 KHz that can be heard by the human ear, and a digital audio signal refers to an audio signal after analog-to-digital conversion. The conversion from analog to digital involves digital sampling at a specified sampling rate and scalar quantization of time-domain discrete signals at a specified resolution.

音频编码，通常是指消除音频信号中的统计冗余和感知不敏感的编码方法RIRAC(消除音频信号中的统计冗余和感知不敏感，Redundancy and Irrelevancy Removal AudioCoding)，例如变换域编码。音频编码可以用一个较低的码率来表征信号，但同时编码噪声也会被引入到信号中。利用人耳听觉系统的掩蔽效应，在对音频信号进行频域和时域整形后这些噪声将很难或不被听到。利用这种消除音频信号中的统计冗余和感知不敏感的音频编码方法，可以用较高的比特数获得较高质量的编码性能，但是当带宽不稳定时，采用这种编码方法的音频质量下降非常明显。Audio coding generally refers to the coding method RIRAC (Redundancy and Irrelevancy Removal AudioCoding) that eliminates statistical redundancy and perceptual insensitivity in audio signals, such as transform domain coding. Audio coding can use a lower bit rate to represent the signal, but at the same time coding noise will be introduced into the signal. Using the masking effect of the human auditory system, these noises will be difficult or inaudible after frequency and time domain shaping of the audio signal. Using this audio coding method that eliminates statistical redundancy and perceptual insensitivity in audio signals, higher-quality coding performance can be obtained with a higher number of bits, but when the bandwidth is unstable, the audio quality of this coding method The decline is very noticeable.

相对于上述消除音频信号中的统计冗余和感知不敏感的音频编码方法，利用参数对音频编码是一种利用简洁的参数描述来表征信号的方法，通过这种方法可以用更低的编码速率获得较高的编码质量，其中，参数可以是包含信号的时域和频域特征的一组参数。由于这样一组参数可以用较少的比特数来表示，因此利用参数对音频编码的方法非常适用于低速率传输机制。在将参数描述传输至解码端之后，解码端可以跟据这些参数重构音频信号。目前利用参数对音频信号进行编码的方法主要有：Compared with the above-mentioned audio coding methods that eliminate statistical redundancy and perceptual insensitivity in audio signals, using parameters to encode audio is a method that uses concise parameter descriptions to characterize signals. This method can use lower coding rates. Higher coding quality is obtained, wherein the parameters may be a group of parameters including time domain and frequency domain characteristics of the signal. Since such a group of parameters can be represented by a small number of bits, the method of encoding audio using parameters is very suitable for low-rate transmission mechanisms. After transmitting the parameter description to the decoder, the decoder can reconstruct the audio signal according to these parameters. At present, the methods for encoding audio signals using parameters mainly include:

现有技术一Existing technology one

这种方法是将MPEG-4音频标准中的先进音频编码(Advanced Audio Coding，简称AAC)、频带复制(Spectral Band Replication，简称SBR)和参数立体声(Parametric Stereo，简称PS)3种技术结合来对音频信号进行编码。This method combines the advanced audio coding (Advanced Audio Coding, referred to as AAC), frequency band replication (Spectral Band Replication, referred to as SBR) and parametric stereo (Parametric Stereo, referred to as PS) in the MPEG-4 audio standard. The audio signal is encoded.

现有技术二Prior art two

这种方法主要是利用各种模型，例如谐波模型、暂态模型、单谱线模型和噪声模型等对音频信号进行分析，提取相应的模型参数，在合成端利用这些模型参数还原音频信号。This method mainly uses various models, such as harmonic model, transient model, single spectral line model and noise model, to analyze the audio signal, extract the corresponding model parameters, and use these model parameters to restore the audio signal at the synthesis end.

发明人在实现本发明的过程中，发现现有技术中至少存在如下问题：In the process of realizing the present invention, the inventor found that at least the following problems existed in the prior art:

实际应用中，现有技术一在对音频信号进行编码的过程中一般需要用较多的编码比特数才能获得较大的编码带宽，这就对传输信道的带宽提出了很高的要求；在信道带宽较小的情况下，会影响采用该技术进行编码的音频质量。In practical application, in the process of encoding the audio signal in the prior art, it generally needs to use more encoding bits to obtain a larger encoding bandwidth, which puts forward very high requirements on the bandwidth of the transmission channel; In the case of small bandwidth, it will affect the audio quality encoded by this technology.

现有技术二的方法中，需要在编码端用较多的参数来对信号进行描述，才能在解码端获得较高质量的合成音频信号；因此，实际应用中采用这种技术时需要传输的比特数也较多；当信道的传输能力进一步降低时，会影响采用该技术进行编码的音频质量。In the method of prior art 2, it is necessary to use more parameters to describe the signal at the encoding end in order to obtain a higher-quality synthetic audio signal at the decoding end; therefore, the bits that need to be transmitted when this technology is used in practical applications The number is also large; when the transmission capacity of the channel is further reduced, it will affect the audio quality encoded by this technology.

发明内容 Contents of the invention

本发明的实施例提供了一种音频编码、解码方法及装置、系统，可以在应用中降低编码时所需要的比特数，从而实现用更少的比特数对信号进行编码。同时，本发明实施例还提供了一种对音频信号进行分频带编码、解码处理方法和装置，能够在分频带编解码音频信号的过程中实现用更少的比特数对信号进行编码。Embodiments of the present invention provide an audio encoding and decoding method, device, and system, which can reduce the number of bits required for encoding in applications, thereby realizing encoding signals with fewer bits. At the same time, the embodiment of the present invention also provides a sub-band encoding and decoding processing method and device for an audio signal, which can encode the signal with fewer bits in the process of encoding and decoding the audio signal in the sub-band.

一种音频编码方法，包括：An audio encoding method comprising:

提取用于表征音频信号的时域包络参数、频域包络参数、音调参数和谐波间隔参数；将所述时域包络参数、频域包络参数、音调参数和谐波间隔参数编码后，传输给解码端。Extracting time domain envelope parameters, frequency domain envelope parameters, pitch parameters and harmonic interval parameters used to characterize the audio signal; encoding the time domain envelope parameters, frequency domain envelope parameters, pitch parameters and harmonic interval parameters After that, it is transmitted to the decoder.

一种音频编码装置，包括：An audio encoding device, comprising:

参数提取单元，用于提取用于表征音频信号的时域包络参数、频域包络参数、音调参数和谐波间隔参数；A parameter extraction unit is used to extract time-domain envelope parameters, frequency-domain envelope parameters, pitch parameters and harmonic interval parameters used to characterize audio signals;

发送单元，用于将所述时域包络参数、频域包络参数、音调参数和谐波间隔参数编码后，传输给解码端。The sending unit is configured to encode the time-domain envelope parameters, frequency-domain envelope parameters, pitch parameters and harmonic interval parameters, and transmit them to the decoding end.

一种音频解码方法，包括：An audio decoding method, comprising:

对收到的数据进行解码，得到用于表征音频信号的时域包络参数、频域包络参数、音调参数和谐波间隔参数；根据所述时域包络参数、频域包络参数、音调参数和谐波间隔参数，合成音频信号。Decoding the received data to obtain time domain envelope parameters, frequency domain envelope parameters, pitch parameters and harmonic interval parameters used to characterize the audio signal; according to the time domain envelope parameters, frequency domain envelope parameters, The pitch parameter and the harmonic interval parameter synthesize the audio signal.

一种音频解码装置，包括：An audio decoding device, comprising:

解码单元，用于对收到的数据进行解码，得到用于表征音频信号的时域包络参数、频域包络参数、音调参数和谐波间隔参数；The decoding unit is used to decode the received data to obtain time-domain envelope parameters, frequency-domain envelope parameters, pitch parameters and harmonic interval parameters used to characterize the audio signal;

合成单元，用于根据所述时域包络参数、频域包络参数、音调参数和谐波间隔参数，合成音频信号。The synthesizing unit is used for synthesizing the audio signal according to the time domain envelope parameter, frequency domain envelope parameter, pitch parameter and harmonic interval parameter.

一种音频编解码系统，包括：An audio codec system comprising:

编码装置，用于提取用于表征音频信号的时域包络参数、频域包络参数、音调参数和谐波间隔参数；对所述时域包络参数、频域包络参数、音调参数和谐波间隔参数编码后，发送至解码装置；An encoding device, configured to extract time-domain envelope parameters, frequency-domain envelope parameters, tone parameters and harmonic interval parameters used to characterize audio signals; for the time-domain envelope parameters, frequency-domain envelope parameters, tone parameters and After the harmonic interval parameter is encoded, it is sent to the decoding device;

解码装置，用于对所述编码装置发送来的数据进行解码，得到所述时域包络参数、频域包络参数、音调参数和谐波间隔参数；根据所述时域包络参数、频域包络参数、音调参数和谐波间隔参数合成音频信号。A decoding device, configured to decode the data sent by the encoding device to obtain the time-domain envelope parameter, frequency-domain envelope parameter, tone parameter and harmonic interval parameter; according to the time-domain envelope parameter, frequency-domain envelope parameter, The audio signal is synthesized using domain envelope parameters, pitch parameters, and harmonic interval parameters.

一种编码处理方法，包括：An encoding processing method comprising:

当用分频带的方式对音频信号进行编码时，若当前频带的音频信号的谱信号与前一个频带的音频信号的谱信号相似，则提取用于表征音频信号的时域包络参数和频域包络参数，并将所述时域包络参数和频域包络参数编码后发送，同时发送表示当前频带的音频信号的谱信号与前一个频带的音频信号的谱信号相似的信息；若当前频带的音频信号的谱信号与前一个频带的音频信号的谱信号不相似，则提取用于表征音频信号的时域包络参数、频域包络参数、音调参数和谐波间隔参数，并将所述时域包络参数、频域包络参数、音调参数和谐波间隔参数编码后发送，同时发送表示当前频带的音频信号的谱信号与前一个频带的音频信号的谱信号相似的信息。When the audio signal is encoded in the way of frequency division, if the spectral signal of the audio signal in the current frequency band is similar to the spectral signal of the audio signal in the previous frequency band, the time domain envelope parameters and frequency domain parameters used to characterize the audio signal are extracted. Envelope parameters, and encode the time-domain envelope parameters and frequency-domain envelope parameters to send, and at the same time send information indicating that the spectrum signal of the audio signal in the current frequency band is similar to the spectrum signal of the audio signal in the previous frequency band; if the current The spectral signal of the audio signal of the frequency band is not similar to the spectral signal of the audio signal of the previous frequency band, then the time domain envelope parameters, frequency domain envelope parameters, tone parameters and harmonic interval parameters used to characterize the audio signal are extracted, and The time-domain envelope parameters, frequency-domain envelope parameters, pitch parameters and harmonic interval parameters are encoded and sent, and at the same time, information indicating that the spectrum signal of the audio signal in the current frequency band is similar to the spectrum signal of the audio signal in the previous frequency band is sent.

一种编码处理装置，包括：An encoding processing device, comprising:

判断单元，用于判断当前频带的音频信号的谱信号与前一个频带的音频信号的谱信号是否相似；A judging unit, configured to judge whether the spectral signal of the audio signal in the current frequency band is similar to the spectral signal of the audio signal in the previous frequency band;

编码单元，用于根据所述判断单元得到的判断结果信息，在当前频带的音频信号的谱信号与前一个频带的音频信号的谱信号相似时，提取用于表征音频信号的时域包络参数和频域包络参数；或者，在当前频带的音频信号的谱信号与前一个频带的音频信号间的谱信号不相似时，提取用于表征音频信号的时域包络参数、频域包络参数、音调参数和谐波间隔参数；The coding unit is used to extract the time-domain envelope parameters used to characterize the audio signal when the spectral signal of the audio signal in the current frequency band is similar to the spectral signal of the audio signal in the previous frequency band according to the judgment result information obtained by the judgment unit and frequency domain envelope parameters; or, when the spectral signal of the audio signal of the current frequency band is not similar to the spectral signal between the audio signal of the previous frequency band, the time domain envelope parameters and the frequency domain envelope used to characterize the audio signal are extracted parameter, pitch parameter and harmonic interval parameter;

传输单元，用于发送所述判断单元得到的当前频带的音频信号的谱信号与前一个频带的音频信号间的谱信号相似的信息，对所述编码单元提取的所述音频信号的时域包络参数和频域包络参数进行编码后发送；或者，发送所述判断单元得到的当前频带的音频信号的谱信号与前一个频带的音频信号间的谱信号不相似的信息，对所述编码单元提取的音频信号的时域包络参数、频域包络参数、音调参数和谐波间隔参数进行编码后发送。a transmission unit, configured to send the similarity information between the spectral signal of the audio signal in the current frequency band obtained by the judging unit and the spectral signal of the audio signal in the previous frequency band, and transmit the time-domain packet of the audio signal extracted by the encoding unit Envelope parameters and frequency domain envelope parameters are encoded and then sent; or, sending the information that the spectrum signal of the audio signal of the current frequency band obtained by the judgment unit is not similar to the spectrum signal between the audio signal of the previous frequency band, and the coded The time-domain envelope parameters, frequency-domain envelope parameters, pitch parameters and harmonic interval parameters of the audio signal extracted by the unit are encoded and then sent.

一种解码处理方法，包括：A decoding processing method, comprising:

接收编码端发送的数据，若接收到表示当前频带的音频信号的谱信号与前一个频带的音频信号的谱信号相似的信息，根据用于表征音频信号的时域包络参数和频域包络参数合成音频信号，其中，所述时域包络参数和频域包络参数是从接收到的数据中解码得到；Receive the data sent by the encoding end, if the information indicating that the spectral signal of the audio signal in the current frequency band is similar to the spectral signal of the audio signal in the previous frequency band is received, according to the time domain envelope parameters and frequency domain envelope used to characterize the audio signal parametrically synthesized audio signals, wherein the time-domain envelope parameters and frequency-domain envelope parameters are decoded from received data;

若接收到表示当前频带的音频信号的谱信号与前一个频带的音频信号间的谱信号不相似的信息，根据用于表征音频信号的时域包络参数、频域包络参数、音调参数和谐波间隔参数合成音频信号，其中，所述时域包络参数、频域包络参数、音调参数和谐波间隔参数是从接收到的数据中解码得到。If information indicating that the spectral signal of the audio signal of the current frequency band is not similar to the spectral signal of the audio signal of the previous frequency band is received, according to the time domain envelope parameters, frequency domain envelope parameters, tone parameters and Synthesizing an audio signal with harmonic interval parameters, wherein the time domain envelope parameters, frequency domain envelope parameters, pitch parameters and harmonic interval parameters are obtained by decoding from received data.

一种解码处理装置，其特征在于，包括：A decoding processing device is characterized in that it comprises:

接收信息单元，用于接收表示当前频带的音频信号的谱信号与前一个频带的音频信号的谱信号相似的信息，并对收到的数据解码得到用于表征音频信号的时域包络参数和频域包络参数；或者，接收表示当前频带的音频信号的谱信号与前一个频带的音频信号的谱信号不相似的信息，并对收到的数据解码得到用于表征音频信号的时域包络参数、频域包络参数、音调参数和谐波间隔参数；The receiving information unit is used to receive information indicating that the spectral signal of the audio signal in the current frequency band is similar to the spectral signal of the audio signal in the previous frequency band, and decode the received data to obtain the time-domain envelope parameters and Frequency-domain envelope parameters; or, receiving information indicating that the spectral signal of the audio signal in the current frequency band is not similar to the spectral signal of the audio signal in the previous frequency band, and decoding the received data to obtain a time-domain packet for representing the audio signal Envelope parameters, frequency domain envelope parameters, pitch parameters and harmonic interval parameters;

解码单元，用于根据所述接收信息单元接收的所述相似的信息，以及所述时域包络参数和频域包络参数，合成音频信号；或者，根据所述不相似的信息，以及所述时域包络参数、频域包络参数、音调参数和谐波间隔参数，合成音频信号。a decoding unit, configured to synthesize an audio signal according to the similar information received by the receiving information unit, and the time-domain envelope parameter and the frequency-domain envelope parameter; or, according to the dissimilar information, and the The time domain envelope parameter, the frequency domain envelope parameter, the tone parameter and the harmonic interval parameter are described to synthesize the audio signal.

由上述本发明的实施例提供的技术方案可以看出，相对于现有技术的基于一定模型的参数音频编码技术，本发明实施例采用的包含时域包络参数、频域包络参数、音调参数和谐波间隔参数的一组参数，减少了编码时需要的参数的个数，同时降低了使用参数进行编码时所需要的比特数，可以实现用更少的比特数对信号进行编码的目的。It can be seen from the technical solutions provided by the above-mentioned embodiments of the present invention that, compared with the parametric audio coding technology based on a certain model in the prior art, the embodiments of the present invention adopt time-domain envelope parameters, frequency-domain envelope parameters, pitch A set of parameters of parameters and harmonic interval parameters reduces the number of parameters required for encoding and reduces the number of bits required for encoding using parameters, which can achieve the purpose of encoding signals with fewer bits .

附图说明 Description of drawings

图1为本发明实施例提供的音频编码方法流程示意图；FIG. 1 is a schematic flowchart of an audio encoding method provided by an embodiment of the present invention;

图2为本发明实施例提供的音频解码方法流程示意图；FIG. 2 is a schematic flowchart of an audio decoding method provided by an embodiment of the present invention;

图3为本发明实施例的编码处理方法流程示意图；FIG. 3 is a schematic flowchart of an encoding processing method according to an embodiment of the present invention;

图4为本发明实施例的解码处理方法流程示意图；FIG. 4 is a schematic flowchart of a decoding processing method according to an embodiment of the present invention;

图5为本发明实施例二在编码端的处理过程示意图；FIG. 5 is a schematic diagram of the processing process at the encoding end in Embodiment 2 of the present invention;

图6为本发明实施例二在解码端的处理过程示意图；FIG. 6 is a schematic diagram of the processing process at the decoding end in Embodiment 2 of the present invention;

图7为本发明实施例提供的音频编码装置结构示意图；FIG. 7 is a schematic structural diagram of an audio encoding device provided by an embodiment of the present invention;

图8为本发明实施例提供的音频解码装置结构示意图；FIG. 8 is a schematic structural diagram of an audio decoding device provided by an embodiment of the present invention;

图9为本发明实施例提供的音频编解码系统结构示意图；FIG. 9 is a schematic structural diagram of an audio codec system provided by an embodiment of the present invention;

图10为木发明实施例提供的编码处理装置结构示意图；Fig. 10 is a schematic structural diagram of an encoding processing device provided by an embodiment of the invention;

图11为本发明实施例提供的解码处理装置结构示意图；FIG. 11 is a schematic structural diagram of a decoding processing device provided by an embodiment of the present invention;

图12为本发明实施例提供的解码单元结构示意图。FIG. 12 is a schematic structural diagram of a decoding unit provided by an embodiment of the present invention.

具体实施方式 Detailed ways

为了在现有音频编码基础上用更低的编码速率获得更大的编码带宽，并获得更高的编码质量，本发明实施例提供一种音频编码方法，具体可以提取用于表征音频信号的时域包络参数、频域包络参数、音调参数和谐波间隔参数；将所述时域包络参数、频域包络参数、音调参数和谐波间隔参数编码后，传输给解码端。In order to obtain a larger encoding bandwidth with a lower encoding rate and obtain higher encoding quality on the basis of existing audio encoding, an embodiment of the present invention provides an audio encoding method, which can specifically extract time Domain envelope parameters, frequency domain envelope parameters, pitch parameters and harmonic interval parameters; after encoding the time domain envelope parameters, frequency domain envelope parameters, pitch parameters and harmonic interval parameters, transmit them to the decoding end.

进一步的，当所述音频信号的谐波间隔与第一谐波偏移量的值不同时，提取所述音频信号的第一谐波偏移量参数，并对其编码后传输给所述解码端。Further, when the harmonic interval of the audio signal is different from the value of the first harmonic offset, extract the first harmonic offset parameter of the audio signal, encode it and transmit it to the decoder end.

图1是本发明实施例的音频编码方法流程示意图，下面将结合图1对本发明实施例的音频编码方法进行介绍。如图1所示，具体可以包括：FIG. 1 is a schematic flowchart of an audio coding method according to an embodiment of the present invention. The audio coding method according to this embodiment of the present invention will be described below in conjunction with FIG. 1 . As shown in Figure 1, the details may include:

11：提取需要进行编码处理的音频信号的时域包络参数；具体的，可以通过计算音频信号的子帧能量来得到信号的时域包络，也可以将信号变换到频域(或变换域)之后提取自回归(AR，Auto Regressive)模型参数来表征信号的时域包络；11: Extract the time-domain envelope parameters of the audio signal that needs to be encoded; specifically, the time-domain envelope of the signal can be obtained by calculating the sub-frame energy of the audio signal, or the signal can be transformed into the frequency domain (or transform domain ) to extract the autoregressive (AR, Auto Regressive) model parameters to characterize the time domain envelope of the signal;

12：提取音频信号的频域包络参数；具体的，可以通过计算频域(或变换域)下的子带能量得到信号的频域包络，也可以在时域提取信号的白回归模型参数来表征信号的频域包络；12: Extract the frequency domain envelope parameters of the audio signal; specifically, the frequency domain envelope of the signal can be obtained by calculating the subband energy in the frequency domain (or transform domain), or the white regression model parameters of the signal can be extracted in the time domain To characterize the frequency domain envelope of the signal;

13：提取音频信号的音调参数；音调参数表征了音频信号中谐波信号与噪声信号之间的比例；音调参数的表示方法有多种，可以是自相关函数的最大值与最小值之比；13: Extract the tone parameter of the audio signal; the tone parameter represents the ratio between the harmonic signal and the noise signal in the audio signal; there are many ways to express the tone parameter, which can be the ratio of the maximum value to the minimum value of the autocorrelation function;

14：提取音频信号的谐波间隔(PG，Pitch Grid)参数；谐波间隔参数表征了信号的不同谐波之间的间隔；具体可以通过峰值提取方法估计出谐波间隔参数；14: Extract the harmonic interval (PG, Pitch Grid) parameter of the audio signal; the harmonic interval parameter characterizes the interval between different harmonics of the signal; specifically, the harmonic interval parameter can be estimated by the peak extraction method;

15：提取第一谐波偏移量参数(P0，Pitch Offset)；具体的，可以根据谐波间隔参数，估计第一谐波偏移量参数，并将该第一谐波偏移量参数编码传输；第一谐波偏移量参数表征了音频信号第一个谐波的位置；需要指出的是，若第一谐波偏移量的值等于谐波间隔，则该步骤可以省略；也就是当所述音频信号的谐波间隔与第一谐波偏移量的值不同时，提取所述音频信号的第一谐波偏移量参数；15: Extract the first harmonic offset parameter (P0, Pitch Offset); specifically, the first harmonic offset parameter can be estimated according to the harmonic interval parameter, and the first harmonic offset parameter can be encoded Transmission; the first harmonic offset parameter represents the position of the first harmonic of the audio signal; it should be pointed out that if the value of the first harmonic offset is equal to the harmonic interval, this step can be omitted; that is When the harmonic interval of the audio signal is different from the value of the first harmonic offset, extracting the first harmonic offset parameter of the audio signal;

对上述时域包络参数，频域包络参数，音调参数，谐波间隔参数和第一谐波偏移量参数编码后(也可以量化后编码)，将其输出。After the above-mentioned time-domain envelope parameter, frequency-domain envelope parameter, pitch parameter, harmonic interval parameter and first harmonic offset parameter are coded (or quantized and then coded), they are output.

需要指出的是，上述音调参数，谐波间隔参数和第一谐波偏移量参数可以但不限于在频域(或变换域)计算得到，例如还可以在时域计算得到。并且，获取上述各参数的顺序不唯一，即不论以何种顺序，只要获取上述音频信号的时域包络参数，频域包络参数，音调参数，谐波间隔参数和第一谐波偏移量参数即可。It should be pointed out that the above pitch parameter, harmonic interval parameter and first harmonic offset parameter can be calculated in frequency domain (or transform domain), for example, can also be calculated in time domain. Moreover, the order of obtaining the above parameters is not unique, that is, no matter in what order, as long as the time domain envelope parameters, frequency domain envelope parameters, pitch parameters, harmonic interval parameters and first harmonic offset of the above audio signal are obtained Quantitative parameters are enough.

上述内容描述了本发明实施例的音频编码方法流程，通过上述方法，可以用包含时域包络参数，频域包络参数，音调参数，谐波间隔参数和第一谐波偏移量参数的一组参数，或用包含时域包络参数，频域包络参数，音调参数和谐波间隔参数的一组参数，来表征音频信号。相对于现有技术的基于一定模型的参数音频编码技术，本发明实施例采用的一组参数，减少了编码时需要的参数的个数，同时降低了使用参数进行编码时所需要的比特数；从而解决了传统的RIRAC编码方法比特数较高的问题；同时，与现有的参数音频编码算法相比，由于本发明实施例的这组参数可以用更少的比特数进行编码，从而进一步降低信号的编码速率，并且当信道的传输能力一定时，由于本发明的编码比特数较低，因此能够编码具有更高带宽的信号，实现了用更低的编码速率获得更大的编码带宽及更高的编码质量。The above content describes the audio coding method flow of the embodiment of the present invention. Through the above method, the parameters including time domain envelope parameters, frequency domain envelope parameters, pitch parameters, harmonic interval parameters and first harmonic offset parameters can be used. A set of parameters, or a set of parameters including time-domain envelope parameters, frequency-domain envelope parameters, pitch parameters and harmonic interval parameters, to characterize an audio signal. Compared with the parametric audio coding technology based on a certain model in the prior art, the set of parameters adopted in the embodiment of the present invention reduces the number of parameters required for coding, and simultaneously reduces the number of bits required for coding using parameters; Thereby solving the problem that the number of bits of the traditional RIRAC coding method is higher; Simultaneously, compared with the existing parametric audio coding algorithm, because this group of parameters of the embodiment of the present invention can be encoded with less number of bits, thereby further reducing The encoding rate of the signal, and when the transmission capacity of the channel is constant, because the number of encoding bits of the present invention is low, it can encode a signal with a higher bandwidth, and realizes obtaining a larger encoding bandwidth and a higher encoding bandwidth with a lower encoding rate. High encoding quality.

本发明实施例还提供了一种音频解码方法，具体可以包括：对收到的数据进行解码，得到用于表征音频信号的时域包络参数、频域包络参数、音调参数和谐波间隔参数；根据所述时域包络参数、频域包络参数、音调参数和谐波间隔参数，合成音频信号。The embodiment of the present invention also provides an audio decoding method, which may specifically include: decoding the received data to obtain time-domain envelope parameters, frequency-domain envelope parameters, pitch parameters and harmonic intervals used to characterize the audio signal Parameters; synthesize an audio signal according to the time domain envelope parameters, frequency domain envelope parameters, pitch parameters and harmonic interval parameters.

进一步的，还包括：对收到的包含第一谐波偏移量参数的数据进行解码，得到用于表征所述音频信号的第一谐波偏移量参数。Further, the method further includes: decoding the received data including the first harmonic offset parameter to obtain the first harmonic offset parameter used to characterize the audio signal.

所述合成音频信号的步骤包括：The step of synthesizing audio signal comprises:

根据所述谐波间隔参数得到谐波信号；或当所述音频信号的谐波间隔与第一谐波偏移量参数不同时，根据所述谐波间隔参数和所述第一谐波偏移量参数，得到谐波信号；Obtain a harmonic signal according to the harmonic interval parameter; or when the harmonic interval of the audio signal is different from the first harmonic offset parameter, according to the harmonic interval parameter and the first harmonic offset Quantitative parameters, get the harmonic signal;

根据所述音调参数，调整谐波信号与噪声信号之间的比例；并根据调整后的谐波信号与噪声信号，得到重建的谱信号；adjusting the ratio between the harmonic signal and the noise signal according to the tone parameter; and obtaining a reconstructed spectrum signal according to the adjusted harmonic signal and noise signal;

根据所述频域包络参数和时域包络参数对所述重建的谱信号进行处理得到合成音频信号。Processing the reconstructed spectrum signal according to the frequency domain envelope parameter and the time domain envelope parameter to obtain a synthesized audio signal.

图2是本发明实施例提供的音频解码方法流程示意图，下面将结合图2对本发明实施例的音频解码方法进行介绍。如图2所示，具体可以包括：FIG. 2 is a schematic flowchart of an audio decoding method provided by an embodiment of the present invention. The audio decoding method of the embodiment of the present invention will be introduced below in conjunction with FIG. 2 . As shown in Figure 2, the details may include:

21：对收到的数据进行解码，得到用于表征音频信号的时域包络参数、频域包络参数、音调参数和谐波间隔参数，当音频信号的谐波间隔与第一谐波偏移量的值不同时，还包括第一谐波偏移量参数；21: Decode the received data to obtain the time-domain envelope parameters, frequency-domain envelope parameters, pitch parameters and harmonic interval parameters used to characterize the audio signal. When the harmonic interval of the audio signal is offset from the first harmonic When the value of the displacement is different, the first harmonic offset parameter is also included;

22：根据谐波间隔参数，得到谐波信号(当所述音频信号的谐波间隔与第一谐波偏移量参数不同时，根据谐波间隔参数和第一谐波偏移量参数，得到谐波信号；否则第一谐波偏移量的值等于谐波间隔的值)；该谐波结构可以由具有随机相位的谐波表示，其中第一谐波偏移量参数确定了第一个谐波的位置，各个谐波的间隔由谐波间隔参数决定；该谐波结构即为谐波信号；22: Obtain the harmonic signal according to the harmonic interval parameter (when the harmonic interval of the audio signal is different from the first harmonic offset parameter, according to the harmonic interval parameter and the first harmonic offset parameter, obtain harmonic signal; otherwise the value of the first harmonic offset is equal to the value of the harmonic interval); this harmonic structure can be represented by harmonics with random phases, where the first harmonic offset parameter determines the first The position of the harmonic, the interval of each harmonic is determined by the harmonic interval parameter; the harmonic structure is the harmonic signal;

23：产生噪声信号，例如，可以由一个随机数产生器产生噪声信号；23: Generate a noise signal, for example, a noise signal can be generated by a random number generator;

24：根据音调参数的值调整谐波信号与噪声信号之间的比例；并根据调整后的谐波信号与噪声信号，得到重建的谱信号；24: Adjust the ratio between the harmonic signal and the noise signal according to the value of the tone parameter; and obtain a reconstructed spectrum signal according to the adjusted harmonic signal and noise signal;

25：根据频域包络参数对所述重建的谱信号进行频域整形处理，得到频域整形后的信号；例如，可以根据解码出的子带能量包络对重建的谱信号进行去归一化处理后得到频域整形后的信号；25: Perform frequency domain shaping processing on the reconstructed spectral signal according to the frequency domain envelope parameters to obtain a frequency domain shaped signal; for example, denormalize the reconstructed spectral signal according to the decoded subband energy envelope After processing, the signal after frequency domain shaping is obtained;

26：根据时域包络参数对所述频域整形后的信号进行时域整形处理，得到最终的合成音频信号；例如，可以根据解码出的子帧能量包络对频域整形后的信号变换到时域以后再进行去归一化处理后，得到最终的合成音频信号。26: Perform time-domain shaping processing on the frequency-domain shaped signal according to the time-domain envelope parameters to obtain a final synthesized audio signal; for example, the frequency-domain shaped signal may be transformed according to the decoded subframe energy envelope After going to the time domain and then performing denormalization processing, the final synthesized audio signal is obtained.

需要指出的是频域整形和时域整形的顺序不唯一，也可以先根据时域包络参数对所述重建的谱信号进行时域整形处理，再根据频域包络参数对整形后的谱信号进行频域整形处理，得到最终的合成音频信号。It should be pointed out that the order of frequency-domain shaping and time-domain shaping is not unique, and the reconstructed spectrum signal can also be processed in time-domain according to the time-domain envelope parameters first, and then the shaped spectrum can be processed according to the frequency-domain envelope parameters. The signal is subjected to frequency-domain shaping processing to obtain the final synthesized audio signal.

上述内容描述了本发明实施例的音频解码方法流程，通过本发明实施例提供的包含用于表征音频信号的时域包络参数、频域包络参数、音调参数、谐波间隔参数和第一谐波偏移量参数的一组参数，可以实现利用更少的比特数来合成音频信号，且该音频信号质量较高；并且，当音频信号的谐波结构明显时，解码得到的音频质量更佳。The above content describes the audio decoding method flow of the embodiment of the present invention. The embodiment of the present invention provides time-domain envelope parameters, frequency-domain envelope parameters, tone parameters, harmonic interval parameters and first A set of parameters of the harmonic offset parameters can realize the synthesis of audio signals with fewer bits, and the audio signal quality is higher; and, when the harmonic structure of the audio signal is obvious, the audio quality obtained by decoding is better good.

为便于对本发明实施例的理解，下面将对本发明实施例的编码、解码具体实现方案进行详细的描述。To facilitate the understanding of the embodiments of the present invention, the specific implementation solutions of encoding and decoding of the embodiments of the present invention will be described in detail below.

实施例一Embodiment one

本实施例中，编码端分别提取了音频信号的时域包络参数、频域包络参数、音调参数和谐波间隔参数，由于本实施例中音频信号的谐波间隔与第一谐波偏移量参数相同，因此省略了提取第一谐波偏移量参数的步骤；解码端收到上述参数后，根据上述各参数进行解码，得到合成音频信号。In this embodiment, the encoder extracts the time-domain envelope parameters, frequency-domain envelope parameters, pitch parameters and harmonic interval parameters of the audio signal respectively. Since the harmonic interval of the audio signal and the first harmonic deviation The offset parameters are the same, so the step of extracting the first harmonic offset parameter is omitted; after receiving the above parameters, the decoder performs decoding according to the above parameters to obtain a synthesized audio signal.

编码端的实施过程具体可以包括：The implementation process at the encoding end may specifically include:

(1)：提取信号的时域包络参数：例如，采用计算音频信号的子帧能量来得到信号的时域包络参数，可以计算信号的子帧能量包络{temp_env(0)，temp(1)，……，temp(N-1)}，其中N为子帧个数，设帧长为15ms，子帧长度为3ms，则N＝5；对此子帧能量包络进行量化，即得到时域包络参数，进一步的可以对该时域包络参数进行编码；同时可以利用量化后的时域包络对信号进行时域归一化处理；(1): Extract the time-domain envelope parameters of the signal: For example, by calculating the sub-frame energy of the audio signal to obtain the time-domain envelope parameters of the signal, the sub-frame energy envelope {temp_env(0), temp( 1),..., temp(N-1)}, where N is the number of subframes, if the frame length is 15ms, and the subframe length is 3ms, then N=5; the energy envelope of this subframe is quantized, namely The time-domain envelope parameter is obtained, and the time-domain envelope parameter can be further encoded; at the same time, the quantized time-domain envelope can be used to perform time-domain normalization processing on the signal;

当然，实际应用中也可以将信号变换到频域(或变换域)之后提取自回归(AR，AutoRegressive)模型参数来表征信号的时域包络；Of course, in practical applications, the signal can also be transformed into the frequency domain (or transform domain) and then the autoregressive (AR, AutoRegressive) model parameters can be extracted to characterize the time domain envelope of the signal;

(2)：提取音频信号的频域包络参数；例如，在时域提取信号的自回归模型参数来表征信号的频域包络时，在时域计算得到信号的自回归模型参数{α₀，α₁，……，α_M-1}，其中M为自回归模型的阶数，进一步的可以对该自回归模型参数进行量化、编码和传输；同时根据量化后的自回归模型参数进行滤波，得到残差信号err(n)；(2): Extract the frequency domain envelope parameters of the audio signal; for example, when the autoregressive model parameters of the signal are extracted in the time domain to characterize the frequency domain envelope of the signal, the autoregressive model parameters {α ₀ of the signal are calculated in the time domain _. _{_} , get the residual signal err(n);

具体应用中，还可以通过计算频域(或变换域)下的子带能量得到信号的频域包络参数；In specific applications, the frequency domain envelope parameters of the signal can also be obtained by calculating the subband energy in the frequency domain (or transform domain);

(3)：提取音频信号的音调参数；音调参数表征了音频信号中谐波信号与噪声信号之间的比例；音调参数的表示方法有多种，可以是自相关函数的最大值与最小值之比，例如T＝max(ACF(k₀))/min(ACF(k₀))，也可以是其它表现形式，只要可以表征谐波与噪声之间的比例关系即可；其中，自相关函数ACF(k₀)的计算可以利用FFT变化与逆FFT变换得到，例如，对(2)中的残差信号err(n)进行FFT变换，得到频域信号S(k)＝FFT(err(n))，并进一步得到自相关函数ACF(k₀)＝IFFT(|FFT(S(k))|²)；当然，也可以直接计算得到，例如 $ACF (k_{0}) = Σ_{k = 0}^{L - 1} S (k) S (k + k_{0}),$ 其中L为编码带宽范围内频域变换系数的个数；此外，还可以使用平均幅度差函数(AMDF，Average Mean Difference Function)来修正自相关函数；(3): Extract the tone parameter of the audio signal; the tone parameter represents the ratio between the harmonic signal and the noise signal in the audio signal; there are many ways to express the tone parameter, which can be between the maximum value and the minimum value of the autocorrelation function The ratio, such as T=max(ACF(k ₀ ))/min(ACF(k ₀ )), can also be in other forms, as long as the proportional relationship between harmonic and noise can be represented; among them, the autocorrelation function The calculation of ACF(k ₀ ) can be obtained by FFT transformation and inverse FFT transformation. For example, FFT transformation is performed on the residual signal err(n) in (2), and the frequency domain signal S(k)=FFT(err(n )), and further obtain the autocorrelation function ACF(k ₀ )=IFFT(|FFT(S(k))| ² ); of course, it can also be calculated directly, for example $ACF (k_{0}) = Σ_{k = 0}^{L - 1} S (k) S (k + k_{0}),$ Where L is the number of frequency-domain transform coefficients within the encoding bandwidth; in addition, the average amplitude difference function (AMDF, Average Mean Difference Function) can also be used to correct the autocorrelation function;

(4)：提取音频信号的谐波间隔(PG，Pitch Grid)参数；谐波间隔参数表征了信号的不同谐波之间的间隔；具体可以通过峰值提取方法估计出谐波间隔参数的整数部分，例如通过PG＝arg max(ACF(k₀))计算得到谐波间隔参数的整数部分；谐波间隔的分数值可以内插自相关函数ACF(k₀)以后通过峰值提取的方法获得；具体地，可以只在先获得的整数谐波间隔附近进行自相关函数的内插计算，并在内插后的自相关函数中搜索出谐波间隔的分数值；为了获得更好的性能，可以对得到的谐波间隔参数进一步修正后再进行编码传输，以抑制倍频和分数频的产生；例如，将求得的当前帧的谐波间隔PG与前一帧的谐波间隔old_PG进行比较，如果当前帧的谐波间隔与前一帧谐波间隔之间的比值小于某个域值(如0.1)且ACF(old_PG)>0.95ACF(PG)，则用前一帧的谐波间隔代替本帧求得的谐波间隔PG＝old_PG；(4): Extract the harmonic interval (PG, Pitch Grid) parameter of the audio signal; the harmonic interval parameter characterizes the interval between different harmonics of the signal; specifically, the integer part of the harmonic interval parameter can be estimated by the peak extraction method , for example, the integer part of the harmonic interval parameter can be obtained by calculating PG=arg max(ACF(k ₀ )); the fractional value of the harmonic interval can be obtained by the method of peak extraction after interpolating the autocorrelation function ACF(k ₀ ); specifically Therefore, the interpolation calculation of the autocorrelation function can be performed only near the integer harmonic interval obtained first, and the fractional value of the harmonic interval can be searched in the interpolated autocorrelation function; in order to obtain better performance, the The obtained harmonic interval parameters are further modified and then encoded and transmitted to suppress the generation of multiplied and fractional frequencies; for example, compare the obtained harmonic interval PG of the current frame with the harmonic interval old_PG of the previous frame, if The ratio between the harmonic interval of the current frame and the harmonic interval of the previous frame is less than a certain threshold value (such as 0.1) and ACF(old_PG)>0.95ACF(PG), then the harmonic interval of the previous frame is used instead of this frame The obtained harmonic interval PG=old_PG;

(5)：由于本实施例中第一谐波偏移量的值等于谐波间隔，该步骤可以省略；但在第一谐波偏移量的值不等于谐波间隔时，提取第一谐波偏移量参数时具体可以：根据谐波间隔参数，估计第一谐波偏移量参数，并将该第一谐波偏移量参数编码传输；第一谐波偏移量参数表征了音频信号第一个谐波的位置；需要指出的是，若第一谐波偏移量的值等于谐波间隔，则该步骤可以省略；也就是当所述音频信号的谐波间隔与第一谐波偏移量的值不同时，提取所述音频信号的第一谐波偏移量参数；(5): Since the value of the first harmonic offset in this embodiment is equal to the harmonic interval, this step can be omitted; but when the value of the first harmonic offset is not equal to the harmonic interval, extract the first harmonic Specifically, when using the harmonic offset parameter, the first harmonic offset parameter can be estimated according to the harmonic interval parameter, and the first harmonic offset parameter is encoded and transmitted; the first harmonic offset parameter represents the audio The position of the first harmonic of the signal; it should be pointed out that if the value of the first harmonic offset is equal to the harmonic interval, then this step can be omitted; that is, when the harmonic interval of the audio signal is the same as the first harmonic When the values of the wave offset are different, the first harmonic offset parameter of the audio signal is extracted;

将上述时域包络参数，频域包络参数，音调参数和谐波间隔参数编码后(或量化后输出)输出。当然，如果(5)没有被省略，则第一谐波偏移量参数也将被编码、传输。Encode (or output after quantization) the above time domain envelope parameters, frequency domain envelope parameters, pitch parameters and harmonic interval parameters. Of course, if (5) is not omitted, the first harmonic offset parameter will also be encoded and transmitted.

需要指出的是，上述音调参数，谐波间隔参数和第一谐波偏移量参数可以但不限于在频域(或变换域)计算得到，例如还可以在时域计算得到。并且，获取上述各参数的顺序不唯一，即不论以何种顺序，只要获取上述音频信号的时域包络参数，频域包络参数，音调参数，谐波间隔参数和第一谐波偏移量参数即可；It should be pointed out that the above pitch parameter, harmonic interval parameter and first harmonic offset parameter can be calculated in frequency domain (or transform domain), for example, can also be calculated in time domain. Moreover, the order of obtaining the above parameters is not unique, that is, no matter in what order, as long as the time domain envelope parameters, frequency domain envelope parameters, pitch parameters, harmonic interval parameters and first harmonic offset of the above audio signal are obtained Quantitative parameters can be;

对应的，解码端对收到的数据解码，得到用于表征音频信号的时域包络参数、频域包络参数、音调参数和谐波间隔参数后，合成音频信号。当然，若编码端(5)没有被省略，则解码端解码得到的参数还包括第一谐波偏移量参数。Correspondingly, the decoding end decodes the received data to obtain time-domain envelope parameters, frequency-domain envelope parameters, pitch parameters and harmonic interval parameters used to characterize the audio signal, and synthesize the audio signal. Of course, if the encoding end (5) is not omitted, the parameters decoded by the decoding end also include the first harmonic offset parameter.

解码端实施解码的具体处理过程可以包括：The specific process of decoding at the decoding end may include:

(6)：对收到的数据进行解码，得到用于表征音频信号的时域包络参数、频域包络参数、音调参数和谐波间隔参数；当然，若编码端音频信号的谐波间隔与第一谐波偏移量的值不同时，还得到第一谐波偏移量参数；(6): Decode the received data to obtain the time-domain envelope parameters, frequency-domain envelope parameters, pitch parameters and harmonic interval parameters used to characterize the audio signal; of course, if the harmonic interval of the audio signal at the encoding end When it is different from the value of the first harmonic offset, the first harmonic offset parameter is also obtained;

(7)：根据谐波间隔参数得到谐波信号；该谐波结构可以由具有随机相位的谐波表示，其中第一个谐波的位置等于谐波间隔的值，各个谐波的间隔也由谐波间隔参数决定；该谐波结构即为谐波信号；具体的，例如：从起始频点开始按照谐波间隔参数(PG)表示的谐波间隔将具有随机相位的谐波以脉冲的形式放置于信号带宽范围内相应的频点，从而产生谐波信号buf_pulses(k)，例如 $buf_pulses (k) = h (k) * \underset{\underset{0 \leq n * PG \leq L - 1}{n &Element; Z}}{Σ} δ (k - (n * PG)),$ 其中h(k)表示具有随机相位的谐波；(7): The harmonic signal is obtained according to the harmonic interval parameter; the harmonic structure can be represented by harmonics with random phases, where the position of the first harmonic is equal to the value of the harmonic interval, and the interval of each harmonic is also given by The harmonic interval parameter is determined; the harmonic structure is the harmonic signal; specifically, for example: starting from the starting frequency point according to the harmonic interval expressed by the harmonic interval parameter (PG), the harmonics with random phases will be pulsed The form is placed at the corresponding frequency point within the signal bandwidth range, thereby generating a harmonic signal buf_pulses(k), for example $buf_pulses (k) = h (k) * \underset{\underset{0 \leq no * PG \leq L - 1}{no &Element; Z}}{Σ} δ (k - (no * PG)),$ where h(k) denotes a harmonic with random phase;

需要说明的是，若解码端若还收到了第一谐波偏移量参数，解码端则可以根据谐波间隔参数和第一谐波偏移量参数，得到谐波信号；该谐波结构可以由具有随机相位的谐波表示，其中第一谐波偏移量参数确定了第一个谐波的位置，各个谐波的间隔由谐波间隔参数决定；该谐波结构即为谐波信号。具体的具体的，例如，第一谐波偏移量参数(P0)为第一个脉冲的位置，从第一个脉冲位置开始按照谐波间隔参数(PG)表示的谐波间隔将具有随机相位的谐波以脉冲的形式放置于信号带宽范围内相应的频点，从而产生谐波信号buf_pulses(k)，例如 $buf_pulses (k) = h (k) * \underset{\underset{0 \leq PO + n * PG \leq L - 1}{n &Element; Z}}{Σ} δ (k - (PO + n * PG)),$ 其中h(k)表示具有随机相位的谐波；It should be noted that if the decoding end also receives the first harmonic offset parameter, the decoding end can obtain the harmonic signal according to the harmonic interval parameter and the first harmonic offset parameter; the harmonic structure can be Represented by harmonics with random phases, where the First Harmonic Offset parameter determines the position of the first harmonic, and the spacing of individual harmonics is determined by the Harmonic Interval parameter; this harmonic structure is the harmonic signal. Specifically, for example, the first harmonic offset parameter (P0) is the position of the first pulse, and the harmonic interval represented by the harmonic interval parameter (PG) from the first pulse position will have a random phase The harmonics of are placed in the corresponding frequency points within the signal bandwidth in the form of pulses, thereby generating the harmonic signal buf_pulses(k), for example $buf_pulses (k) = h (k) * \underset{\underset{0 \leq PO + no * PG \leq L - 1}{no &Element; Z}}{Σ} δ (k - (PO + no * PG)),$ where h(k) denotes a harmonic with random phase;

(8)：产生噪声信号，例如，可以由一个随机数产生器产生噪声信号buf_noise(k)；(8): Generate noise signal, for example, noise signal buf_noise(k) can be generated by a random number generator;

(9)：根据音调参数的值调整谐波信号与噪声信号之间的比例；并根据调整后的谐波信号与噪声信号，得到重建的谱信号；具体的调整可以有多种，例如：先分别计算谐波信号与噪声信号的能量，记作enerP和enerN，再计算调整因子β₁＝1-T和 $β_{2} = \sqrt{\frac{enerP}{enerN}} * T,$ 其中T是音调参数；并得到修正后的重建谱信号 $\hat{S} (k) = β_{1} buf_pulses (k) + β_{2} buf_noise (k);$ 通过逆FFT变换将重建的谱信号变换到时域，记作

(9): adjust the ratio between the harmonic signal and the noise signal according to the value of the tone parameter; and obtain the reconstructed spectral signal according to the adjusted harmonic signal and noise signal; there are many kinds of specific adjustments, for example: first Calculate the energy of the harmonic signal and the noise signal respectively, denoted as enerP and enerN, and then calculate the adjustment factor β ₁ =1-T and

β_{2} = \sqrt{\frac{enerP}{ener N}} * T,

Where T is the tone parameter; and get the reconstructed spectral signal after correction

\hat{S} (k) = β_{1} buf_pulses (k) + β_{2} buf_noise (k);

Transform the reconstructed spectral signal to the time domain by inverse FFT transformation, denoted as

(10)：根据频域包络参数对所述重建的谱信号进行频域整形处理，得到频域整形后的信号；例如，根据解码得到的自回归模型参数，对信号

进行逆滤波，得到频域整形后的信号

(10): Perform frequency-domain shaping processing on the reconstructed spectrum signal according to the frequency-domain envelope parameters to obtain a signal after frequency-domain shaping; for example, according to the autoregressive model parameters obtained by decoding, the signal is

Perform inverse filtering to obtain the signal after frequency domain shaping

(11)：根据时域包络参数对所述频域整形后的信号进行时域整形处理，得到最终的合成音频信号；例如，可以根据解码出的子帧能量包络对信号

进行去归一化处理后，得到最终的合成音频信号。(11): Perform time-domain shaping processing on the frequency-domain shaped signal according to the time-domain envelope parameters to obtain the final synthesized audio signal; for example, the signal can be processed according to the decoded sub-frame energy envelope

After denormalization processing, the final synthesized audio signal is obtained.

相对于现有技术的基于一定模型的参数音频编码技术，本发明实施例采用的一组参数，减少了编码时需要的参数的个数，同时降低了使用参数进行编码时所需要的比特数；从而解决了现有编码方法比特数较高的问题；同时，与现有的参数音频编码算法相比，由于本发明实施例的这组参数可以用更少的比特数进行编码，从而进一步降低信号的编码速率，并且当信道的传输能力一定时，由于本发明的编码比特数较低，因此能够编码具有更高带宽的信号，实现了用更低的编码速率获得更大的编码带宽及更高的编码质量。同时在解码端可以实现利用更少的比特数来合成音频信号，且该音频信号质量较高；并且，当音频信号的谐波结构明显时，解码得到的音频质量更佳。Compared with the parametric audio coding technology based on a certain model in the prior art, the set of parameters adopted in the embodiment of the present invention reduces the number of parameters required for coding, and simultaneously reduces the number of bits required for coding using parameters; Thereby solving the problem that the number of bits of the existing coding method is higher; at the same time, compared with the existing parametric audio coding algorithm, because this group of parameters in the embodiment of the present invention can be coded with less number of bits, thereby further reducing the signal coding rate, and when the transmission capacity of the channel is constant, because the number of coding bits of the present invention is low, it can code a signal with a higher bandwidth, and achieve a larger coding bandwidth and a higher coding rate with a lower coding rate. encoding quality. At the same time, the audio signal can be synthesized with fewer bits at the decoding end, and the quality of the audio signal is high; and, when the harmonic structure of the audio signal is obvious, the audio quality obtained through decoding is better.

本发明实施例还提供了一种编码处理方法，具体可以包括：当用分频带的方式对音频信号进行编码时，若当前频带的音频信号的谱信号与前一个频带的音频信号的谱信号相似，则提取用于表征音频信号的时域包络参数和频域包络参数，并将所述时域包络参数和频域包络参数编码后发送，同时发送表示当前频带的音频信号的谱信号与前一个频带的音频信号的谱信号相似的信息；若当前频带的音频信号的谱信号与前一个频带的音频信号的谱信号不相似，则提取用于表征音频信号的时域包络参数、频域包络参数、音调参数和谐波间隔参数，并将所述时域包络参数、频域包络参数、音调参数和谐波间隔参数编码后发送，同时发送表示当前频带的音频信号的谱信号与前一个频带的音频信号的谱信号相似的信息。An embodiment of the present invention also provides an encoding processing method, which may specifically include: when encoding an audio signal by frequency division, if the spectral signal of the audio signal in the current frequency band is similar to the spectral signal of the audio signal in the previous frequency band , then extract the time-domain envelope parameters and frequency-domain envelope parameters used to characterize the audio signal, encode the time-domain envelope parameters and frequency-domain envelope parameters and send them, and at the same time send the spectrum representing the audio signal in the current frequency band Information that the signal is similar to the spectral signal of the audio signal of the previous frequency band; if the spectral signal of the audio signal of the current frequency band is not similar to the spectral signal of the audio signal of the previous frequency band, extract the time domain envelope parameters used to characterize the audio signal , frequency domain envelope parameters, tone parameters and harmonic interval parameters, and encode the time domain envelope parameters, frequency domain envelope parameters, tone parameters and harmonic interval parameters, and send an audio signal representing the current frequency band at the same time Information that the spectral signal of the audio signal is similar to that of the audio signal of the previous frequency band.

具体的，所述表示当前频带的音频信号的谱信号与前一个频带的音频信号的谱信号相似或不相似的信息，具体可以用编码模式参数表示；所述编码模式参数，用于指示解码端在当前频带的音频信号的谱信号与前一个频带的音频信号的谱信号相似时，根据所述音频信号的时域包络参数和频域包络参数，对当前频带的音频信号进行解码；或者指示解码端在当前频带的音频信号的谱信号与前一个频带的音频信号的谱信号不相似时，根据所述音频信号的时域包络参数、频域包络参数、音调参数和谐波间隔参数，对当前频带的音频信号进行解码。Specifically, the information indicating that the spectral signal of the audio signal in the current frequency band is similar or dissimilar to the spectral signal of the audio signal in the previous frequency band can be specifically represented by a coding mode parameter; the coding mode parameter is used to indicate that the decoding end When the spectral signal of the audio signal of the current frequency band is similar to the spectral signal of the audio signal of the previous frequency band, decoding the audio signal of the current frequency band according to the time domain envelope parameter and the frequency domain envelope parameter of the audio signal; or Indicates that when the spectral signal of the audio signal in the current frequency band is not similar to the spectral signal of the audio signal in the previous frequency band, according to the time domain envelope parameter, frequency domain envelope parameter, pitch parameter and harmonic interval of the audio signal Parameters to decode the audio signal in the current frequency band.

进一步的，若当前频带的音频信号的谱信号与前一个频带的音频信号的谱信号不相似，且当所述音频信号的谐波间隔与第一谐波偏移量的值不同时，提取所述音频信号的第一谐波偏移量参数；并将所述第一谐波偏移量参数传输给解码端。而且，若当前频带的音频信号的谱信号与前一个频带的音频信号的谱信号相似时，还可以提取所述音频信号的音调参数，并将所述音调参数传输给解码端。Further, if the spectral signal of the audio signal in the current frequency band is not similar to the spectral signal of the audio signal in the previous frequency band, and when the harmonic interval of the audio signal is different from the value of the first harmonic offset, the extracted The first harmonic offset parameter of the audio signal; and transmit the first harmonic offset parameter to the decoding end. Moreover, if the spectral signal of the audio signal in the current frequency band is similar to the spectral signal of the audio signal in the previous frequency band, the tone parameters of the audio signal may also be extracted and transmitted to the decoding end.

相应的，本发明实施例还提供了一种解码处理方法，具体可以包括：接收编码端发送的数据，若接收到表示当前频带的音频信号的谱信号与前一个频带的音频信号的谱信号相似的信息，根据用于表征音频信号的时域包络参数和频域包络参数合成音频信号，其中，所述时域包络参数和频域包络参数是从接收到的数据中解码得到；若接收到表示当前频带的音频信号的谱信号与前一个频带的音频信号间的谱信号不相似的信息，根据用于表征音频信号的时域包络参数、频域包络参数、音调参数和谐波间隔参数合成音频信号，其中，所述时域包络参数、频域包络参数、音调参数和谐波间隔参数是从接收到的数据中解码得到。Correspondingly, the embodiment of the present invention also provides a decoding processing method, which may specifically include: receiving the data sent by the encoding end, if the received spectral signal representing the audio signal of the current frequency band is similar to the spectral signal of the audio signal of the previous frequency band information, synthesizing the audio signal according to the time-domain envelope parameters and frequency-domain envelope parameters used to characterize the audio signal, wherein the time-domain envelope parameters and the frequency-domain envelope parameters are obtained by decoding from the received data; If information indicating that the spectral signal of the audio signal of the current frequency band is not similar to the spectral signal of the audio signal of the previous frequency band is received, according to the time domain envelope parameters, frequency domain envelope parameters, tone parameters and Synthesizing an audio signal with harmonic interval parameters, wherein the time domain envelope parameters, frequency domain envelope parameters, pitch parameters and harmonic interval parameters are obtained by decoding from received data.

具体的，根据接收到的编码模式参数，确定所述当前频带的音频信号的谱信号与前一个频带的音频信号间的谱信号相似或不相似；若当前频带的音频信号的谱信号与前一个频带的音频信号间的谱信号相似，则根据所述收到的用于表征音频信号的时域包络参数和频域包络参数合成音频信号；若当前频带的音频信号的谱信号与前一个频带的音频信号间的谱信号不相似，则根据所述收到的时域包络参数、频域包络参数、音调参数和谐波间隔参数，合成音频信号。Specifically, according to the received encoding mode parameter, it is determined whether the spectral signal of the audio signal of the current frequency band is similar or dissimilar to the spectral signal of the audio signal of the previous frequency band; if the spectral signal of the audio signal of the current frequency band is similar to the previous The spectral signals between the audio signals of the frequency bands are similar, then according to the received time domain envelope parameters and frequency domain envelope parameters used to characterize the audio signals, the audio signals are synthesized; If the spectral signals of the audio signals in the frequency bands are not similar, the audio signals are synthesized according to the received time-domain envelope parameters, frequency-domain envelope parameters, pitch parameters and harmonic interval parameters.

若当前频带的音频信号的谱信号与前一个频带的音频信号的谱信号不相似，所述收到的时域包络参数、频域包络参数、音调参数和谐波间隔参数，还可以包括：所述音频信号的第一谐波偏移量参数；若当前频带的音频信号的谱信号与前一个频带的音频信号间的谱信号相似，所述收到的用于表征音频信号的时域包络参数和频域包络参数，还可以包括：用于表征所述音频信号的音调参数。If the spectral signal of the audio signal of the current frequency band is not similar to the spectral signal of the audio signal of the previous frequency band, the received time domain envelope parameter, frequency domain envelope parameter, tone parameter and harmonic interval parameter may also include : The first harmonic offset parameter of the audio signal; if the spectral signal of the audio signal of the current frequency band is similar to the spectral signal of the audio signal of the previous frequency band, the received time domain used to characterize the audio signal The envelope parameter and the frequency domain envelope parameter may also include: a tone parameter used to characterize the audio signal.

图3是本发明实施例的编码处理方法流程示意图，下面将结合图3对本发明实施例的编码处理方法进行介绍。如图3所示，具体可以包括：FIG. 3 is a schematic flowchart of an encoding processing method according to an embodiment of the present invention, and the encoding processing method according to this embodiment of the present invention will be introduced below with reference to FIG. 3 . As shown in Figure 3, the details may include:

31：当用分频带的方式对音频信号进行编码时，判断当前频带的音频信号的谱信号与前一个频带的音频信号的谱信号是否相似；具体的可以通过确定编码模式参数CM来表示是否相似；例如具体地，可以先计算当前频带信号谱与前一个频带信号谱之间的互相关，以确定当前频带谐波结构与前一个频带谐波结构之间的相似性；当互相关大于某一域值时，可以判定为当前频带谐波结构与前一个频带谐波结构之间是相似，将CM置为1，否则将CM置为0；并且当前频带信号谱与前一个频带信号谱之间相似时，可以不再提取下面的音调参数、谐波间隔参数和第一谐波偏移量参数；31: When the audio signal is encoded by frequency division, judge whether the spectral signal of the audio signal in the current frequency band is similar to the spectral signal of the audio signal in the previous frequency band; specifically, it can be expressed by determining the encoding mode parameter CM ; For example, the cross-correlation between the current frequency band signal spectrum and the previous frequency band signal spectrum can be calculated first to determine the similarity between the current frequency band harmonic structure and the previous frequency band harmonic structure; when the cross-correlation is greater than a certain threshold value, it can be determined that the current frequency band harmonic structure is similar to the previous frequency band harmonic structure, and CM is set to 1, otherwise CM is set to 0; and the current frequency band signal spectrum and the previous frequency band signal spectrum When similar, the following pitch parameters, harmonic interval parameters and first harmonic offset parameters may no longer be extracted;

32：若相似，则提取用于表征音频信号的时域包络参数和频域包络参数；若不相似，则提取用于表征音频信号的时域包络参数、频域包络参数、音调参数和谐波间隔参数；也就是说，若当前频带信号谱与前一个频带信号谱之间相似时，可以不提取音频信号的音调参数、谐波间隔参数和第一谐波偏移量参数；具体的，提取上述各参数的方法可以如下：32: If similar, extract the time domain envelope parameters and frequency domain envelope parameters used to characterize the audio signal; if not, extract the time domain envelope parameters, frequency domain envelope parameters, and pitch used to characterize the audio signal parameters and harmonic interval parameters; that is, if the current frequency band signal spectrum is similar to the previous frequency band signal spectrum, the pitch parameter, harmonic interval parameter and first harmonic offset parameter of the audio signal may not be extracted; Specifically, the method for extracting the above parameters may be as follows:

提取时域包络参数；例如可以通过计算当前频带信号的子帧能量包络和全局增益因子gain，并根据这两组值判断信号是稳态信号或是瞬态信号；若是稳态信号，则对全局增益因子gain进行量化，将得到的量化值作为时域包络参数；如果是瞬态信号，则对子帧能量包络进行量化，将得到的量化值作为时域包络参数；并根据时域包络参数对当前频带信号进行时域归一化处理，得到时域归一化后的信号；Extract time-domain envelope parameters; for example, by calculating the subframe energy envelope and global gain factor gain of the current frequency band signal, and judging whether the signal is a steady-state signal or a transient signal according to these two values; if it is a steady-state signal, then Quantize the global gain factor gain, and use the obtained quantization value as the time domain envelope parameter; if it is a transient signal, quantize the subframe energy envelope, and use the obtained quantization value as the time domain envelope parameter; and according to The time domain envelope parameter performs time domain normalization processing on the current frequency band signal to obtain the time domain normalized signal;

提取频域包络参数；例如对时域归一化以后的信号进行MDCT(修正的离散余弦变换，Modified Discrete Cosine Transform)变换后得到了一组MDCT系数，即时域归一化以后该频带对应的频域信号，对该频域信号处理时将这组频域信号分为N个子带，提取每个子带的子代能量并量化，得到一组量化后的频域包络，即为频域包络参数；根据频域包络参数对频域信号进行频域归一化处理，得到频域归一化后的信号；Extract frequency domain envelope parameters; for example, after MDCT (modified discrete cosine transform, Modified Discrete Cosine Transform) transformation is performed on the signal after time domain normalization, a set of MDCT coefficients is obtained, which corresponds to the frequency band after instant domain normalization For frequency domain signals, when processing the frequency domain signals, divide the group of frequency domain signals into N subbands, extract and quantize the sub-generation energy of each subband, and obtain a set of quantized frequency domain envelopes, which is the frequency domain envelope Envelope parameter; Carry out frequency domain normalization processing to frequency domain signal according to frequency domain envelope parameter, obtain the signal after frequency domain normalization;

提取音调参数；具体的，可以直接在MDCT域进行参数提取；为了进一步提高编码器的性能，也可以不直接在MDCT域进行参数提取，而是根据原始频域信号计算伪谱信号，并根据此伪谱信号计算音调参数；音调参数可以通过自相关函数的最大值与最小值之间的比值表示，其中最大值和最小值的获取是在期望的范围内或者是在对谐波间隔参数计算有益的范围内进行的；Extract pitch parameters; specifically, parameter extraction can be performed directly in the MDCT domain; in order to further improve the performance of the encoder, it is also possible not to directly perform parameter extraction in the MDCT domain, but to calculate the pseudo-spectrum signal according to the original frequency domain signal, and according to this Pseudospectral signal calculation pitch parameters; pitch parameters can be represented by the ratio between the maximum value and the minimum value of the autocorrelation function, where the maximum and minimum values are obtained in the expected range or are beneficial to the calculation of harmonic interval parameters carried out within the scope of

提取谐波间隔参数PG；高频带信号的谐波间隔参数，通常是在频域(或变换域)下提取的；谐波间隔的整数值可以通过峰值提取方法由自相关函数估计出来，谐波间隔的分数值可以通过峰值提取的方法由内插的自相关函数估计出来；也可以只在求得的整数谐波间隔附近进行自相关函数的内插计算，之后通过峰值提取的方法获得谐波间隔的分数值；Extract the harmonic interval parameter PG; the harmonic interval parameter of the high-frequency band signal is usually extracted in the frequency domain (or transform domain); the integer value of the harmonic interval can be estimated by the autocorrelation function by the peak extraction method, and the harmonic interval The fractional value of the wave interval can be estimated from the interpolated autocorrelation function by the method of peak extraction; the interpolation calculation of the autocorrelation function can also be performed only near the obtained integer harmonic interval, and then the harmonic can be obtained by the method of peak extraction fractional value of the wave interval;

提取第一谐波偏移量参数，例如根据谐波间隔，估计第一谐波偏移量参数P0；具体的可以在谐波间隔范围内，即[0，PG]范围内，将第一谐波分量分别置于不同偏移位置，并按谐波间隔依次放置其它谐波，并计算由此产生的谱与伪谱之间的相关性，相关性最大的偏移位置即所求的第一谐波偏移量；同时，第一谐波偏移量参数也可以用来进一步修正谐波间隔参数的估计值，从而达到更优的参数提取效果；需要指出的是，若第一谐波偏移量的值始终等于谐波间隔，则该步骤可以省略；Extract the first harmonic offset parameter, for example, estimate the first harmonic offset parameter P0 according to the harmonic interval; specifically, within the range of the harmonic interval, that is, within the range of [0, PG], the first harmonic The wave components are placed at different offset positions, and other harmonics are placed in sequence according to the harmonic interval, and the correlation between the resulting spectrum and the pseudo spectrum is calculated. The offset position with the greatest correlation is the first Harmonic offset; at the same time, the first harmonic offset parameter can also be used to further modify the estimated value of the harmonic interval parameter, so as to achieve a better parameter extraction effect; it should be pointed out that if the first harmonic offset If the value of the displacement is always equal to the harmonic interval, this step can be omitted;

33：将表示当前频带的音频信号的谱信号与前一个频带的音频信号的谱信号相似或不相似的信息发送；例如将编码模式参数编码后发送；并将提取的参数编码后发送；具体的，当CM等于1时，包含编码模式参数、时域包络参数和频域包络参数的一组参数将会被量化或编码，并传输到解码端；当CM等于0时，包含了编码模式参数、时域包络参数、频域包络参数、音调参数和谐波间隔参数的一组参数，将会被量化、编码，并传输到解码端；33: Send information indicating that the spectral signal of the audio signal in the current frequency band is similar or dissimilar to the spectral signal of the audio signal in the previous frequency band; for example, encode the encoding mode parameters and send them; and send the extracted parameters after encoding; specifically , when CM is equal to 1, a set of parameters including encoding mode parameters, time domain envelope parameters and frequency domain envelope parameters will be quantized or encoded, and transmitted to the decoder; when CM is equal to 0, encoding mode is included A set of parameters, including time domain envelope parameters, frequency domain envelope parameters, pitch parameters and harmonic interval parameters, will be quantized, coded, and transmitted to the decoder;

需要指出的是当CM等于1时，传输到解码端的参数还可以包括音调参数；当CM等于0时，若第一谐波偏移量的值不等于谐波间隔，则还要传输第一谐波偏移量参数。It should be pointed out that when CM is equal to 1, the parameters transmitted to the decoding end can also include tone parameters; when CM is equal to 0, if the value of the first harmonic offset is not equal to the harmonic interval, the first harmonic must be transmitted Wave offset parameter.

对应的，解码端根据收到的上述包含编码模式参数、时域包络参数和频域包络参数的一组参数，或收到上述包含编码模式参数、时域包络参数、频域包络参数、音调参数和谐波间隔参数的一组参数，合成音频信号。Correspondingly, the decoder receives a set of parameters including the encoding mode parameters, time domain envelope parameters, and frequency domain envelope parameters, or receives the above-mentioned parameters including encoding mode parameters, time domain envelope parameters, and frequency domain envelope parameters. A set of parameters, a pitch parameter, and a harmonic interval parameter, synthesizes an audio signal.

需要指出的是，如果编码端在CM等于1时还传输了音调参数，相应的解码端也要接收音调参数；如果编码端在CM等于0时还传输了第一谐波偏移量参数，相应的解码端也要接收第一谐波偏移量参数。It should be pointed out that if the encoding end also transmits the tone parameter when CM is equal to 1, the corresponding decoding end also needs to receive the tone parameter; if the encoding end also transmits the first harmonic offset parameter when CM is equal to 0, the corresponding The decoding end of also needs to receive the first harmonic offset parameter.

图4是本发明实施例的解码处理方法流程示意图；如图4所示，解码处理的具体处理过程如图4所示，具体可以包括：Fig. 4 is a schematic flow chart of the decoding processing method according to an embodiment of the present invention;

41：接收到表示当前频带的音频信号的谱信号与前一个频带的音频信号间的谱信号相似的信息，或不相似的信息；例如根据接收到的数据，解码出编码模式参数CM，根据该编码模式参数CM，即可确定是否相似；41: Receive information indicating that the spectral signal of the audio signal in the current frequency band is similar to the spectral signal of the audio signal in the previous frequency band, or information that is not similar; for example, according to the received data, the encoding mode parameter CM is decoded, according to the The encoding mode parameter CM can determine whether they are similar;

42：当前频带的音频信号的谱信号与前一个频带的音频信号间的谱信号相似时，根据对收到数据解码得到的用于表征音频信号的时域包络参数和频域包络参数，合成音频信号；不相似时，根据对收到数据解码得到的用于表征音频信号的时域包络参数、频域包络参数、音调参数和谐波间隔参数，合成音频信号；42: When the spectral signal of the audio signal in the current frequency band is similar to the spectral signal of the audio signal in the previous frequency band, according to the time-domain envelope parameters and frequency-domain envelope parameters used to characterize the audio signal obtained by decoding the received data, Synthesizing the audio signal; when not similar, synthesizing the audio signal according to the time domain envelope parameters, frequency domain envelope parameters, pitch parameters and harmonic interval parameters used to characterize the audio signal obtained by decoding the received data;

具体的，重建谱信号时：Specifically, when reconstructing the spectral signal:

若当前频带信号谱与前一个频带信号谱之间相似，例如CM等于1，则可以采用谱复制的方式用前一个频带的谱信号作为当前频带重建的谱信号；当然也可以采用不同于谱复制的方式重建谱信号；如果编码端CM等于1时还传输了音调参数也可以从码流中解码出音调参数，采用谱复制的方式通过前一个频带的谱重建当前频带的谱信号；具体的，可以根据音调参数，对前一个频带的谱信号做整形，得到整形后的重建谱信号，将整形后的谱信号作为当前频带重建的谱信号；If the signal spectrum of the current frequency band is similar to the signal spectrum of the previous frequency band, for example, CM is equal to 1, the spectral signal of the previous frequency band can be used as the spectral signal of the current frequency band reconstruction in the way of spectrum replication; of course, different spectrum replication can also be used Reconstruct the spectral signal in the same way; if the encoding terminal CM is equal to 1, the pitch parameter is also transmitted, and the pitch parameter can also be decoded from the code stream, and the spectral signal of the current frequency band is reconstructed through the spectrum of the previous frequency band by means of spectrum copying; specifically, According to the tone parameter, the spectral signal of the previous frequency band can be shaped to obtain the reconstructed spectral signal after shaping, and the shaped spectral signal can be used as the reconstructed spectral signal of the current frequency band;

若当前频带信号谱与前一个频带信号谱之间的不相关，例如CM等于0，则从码流中解码出音调参数、谐波间隔参数和第一谐波偏移量参数，根据所述谐波间隔参数得到谐波信号；或根据所述谐波间隔参数和第一谐波偏移量参数，得到谐波信号；根据所述音调参数，调整谐波信号与噪声信号之间的比例；并根据调整后的谐波信号与噪声信号，得到重建的谱信号；即使用基于音调参数、谐波间隔参数和第一谐波偏移量参数的人工重建方法来重建高频带的谱信号；需要说明的是，当编码的码流中没有传输第一谐波偏移量参数时，解码端第一谐波偏移量参数等于谐波间隔参数。If there is no correlation between the signal spectrum of the current frequency band and the signal spectrum of the previous frequency band, for example, CM is equal to 0, the tone parameter, harmonic interval parameter and first harmonic offset parameter are decoded from the code stream, and according to the harmonic Harmonic signals are obtained from the wave interval parameter; or according to the harmonic interval parameter and the first harmonic offset parameter, the harmonic signal is obtained; according to the pitch parameter, the ratio between the harmonic signal and the noise signal is adjusted; and According to the adjusted harmonic signal and noise signal, the reconstructed spectral signal is obtained; that is, the spectral signal of the high frequency band is reconstructed by using an artificial reconstruction method based on the pitch parameter, the harmonic interval parameter and the first harmonic offset parameter; It is noted that when the first harmonic offset parameter is not transmitted in the coded code stream, the first harmonic offset parameter at the decoding end is equal to the harmonic interval parameter.

根据解码出的频域包络对重建的谱信号进行频域整形，例如进行频域去归一化处理，并将整形后的谱信号变换到时域；可以通过逆MDCT变换，也可以通过逆FFT变换将修整后的谱信号变换到时域，但是必须与编码端采用的变换方法相对应；Perform frequency domain shaping on the reconstructed spectral signal according to the decoded frequency domain envelope, for example, perform frequency domain denormalization processing, and transform the shaped spectral signal into the time domain; it can be transformed by inverse MDCT or by inverse The FFT transformation transforms the trimmed spectral signal into the time domain, but it must correspond to the transformation method used by the encoding end;

根据解码出的时域包络参数进行时域整形处理，例如时域去归一化处理，得到参数音频解码出的高频信号；得到合成的音频信号。Perform time-domain shaping processing, such as time-domain denormalization processing, according to the decoded time-domain envelope parameters, to obtain high-frequency signals decoded from parametric audio; and obtain synthesized audio signals.

需要说明的是，上述频域整形与时域整形的顺序不唯一，即也可以先对重建的谱信号进行时域整形，再进行频域整形。例如：根据所述频域包络参数对所述重建的谱信号进行频域整形处理，得到频域整形后的信号，根据所述时域包络参数对频域整形后的信号进行时域整形处理，得到合成音频信号；或者，根据所述时域包络参数对所述重建的谱信号进行时域整形处理，得到时域整形后的信号，根据所述频域包络参数对时域整形后的信号进行频域整形处理，得到合成音频信号。It should be noted that the order of frequency-domain shaping and time-domain shaping is not unique, that is, time-domain shaping may be performed on the reconstructed spectrum signal first, and then frequency-domain shaping may be performed. For example: performing frequency-domain shaping processing on the reconstructed spectral signal according to the frequency-domain envelope parameters to obtain a frequency-domain shaped signal, and performing time-domain shaping on the frequency-domain shaped signal according to the time-domain envelope parameters processing to obtain a synthesized audio signal; or, perform time-domain shaping processing on the reconstructed spectrum signal according to the time-domain envelope parameters to obtain a time-domain shaped signal, and perform time-domain shaping according to the frequency-domain envelope parameters The final signal is subjected to frequency-domain shaping processing to obtain a synthetic audio signal.

上述内容描述了当用分频带的方式对音频信号进行编码时，判断当前频带的音频信号的谱信号与前一个频带的音频信号的谱信号是否相似，当不相似时提取包含时域包络参数、频域包络参数、音调参数、谐波间隔参数和第一谐波偏移量参数的一组参数，当相似时仅提取包含时域包络参数、频域包络参数和音调参数的一组参数，也可以是仅提取包含时域包络参数和频域包络参数的一组参数，由于本发明实施例减少了编码时需要的参数的个数，同时降低了使用参数进行编码时所需要的比特数；也有效地利用了信号不同频带之间谱的相似性进一步降低了编码速率，获得更大的编码带宽。解码端根据上述参数能够在分频带解码音频信号的过程中实现针对不同信号的特征采用不同的谱信号重建方法，对信号特征的适应性更强，可以对不同信号获得同样高的合成质量。The above content describes that when the audio signal is encoded by frequency division, it is judged whether the spectral signal of the audio signal in the current frequency band is similar to the spectral signal of the audio signal in the previous frequency band, and when they are not similar, the time domain envelope parameters are extracted , frequency domain envelope parameters, tone parameters, harmonic interval parameters and a set of parameters of the first harmonic offset parameters, when similar, only extract a set of time domain envelope parameters, frequency domain envelope parameters and tone parameters group parameters, or only extract a group of parameters including time-domain envelope parameters and frequency-domain envelope parameters, because the embodiment of the present invention reduces the number of parameters required for encoding, and at the same time reduces the number of parameters required for encoding. The number of bits required; it also effectively utilizes the similarity of the spectrum between different frequency bands of the signal to further reduce the coding rate and obtain a larger coding bandwidth. According to the above parameters, the decoding end can implement different spectral signal reconstruction methods according to the characteristics of different signals in the process of decoding audio signals in frequency bands, which is more adaptable to signal characteristics and can obtain the same high synthesis quality for different signals.

实施例二Embodiment two

在该实施例中，在编码端将输入的音频信号分为高频带信号和低频带信号，并分别对高频带信号和低频带信号进行编码处理。In this embodiment, the input audio signal is divided into a high frequency band signal and a low frequency band signal at the encoding end, and encoding processing is performed on the high frequency band signal and the low frequency band signal respectively.

图5是本发明实施例二在编码端的处理过程示意图，如图5所示，编码处理过程包括：Fig. 5 is a schematic diagram of the processing process at the encoding end of Embodiment 2 of the present invention. As shown in Fig. 5, the encoding process includes:

51：对输入的音频信号进行滤波分析；设输入的音频信号的采样率为32KHz，处理帧长为20ms；对输入的信号进行分频带、下采样处理后，对应于0~8kHz频带的信号有320个采样点，对应于8~16kHz频带的信号有320个采样点；51: Filter and analyze the input audio signal; set the sampling rate of the input audio signal to 32KHz, and the processing frame length is 20ms; after the input signal is divided into frequency bands and down-sampled, the signals corresponding to the 0~8kHz frequency band are 320 sampling points, corresponding to 8~16kHz frequency band signal has 320 sampling points;

52：0~8kHz频带内的信号通过核心编码进行编码处理，具体应用中，核心编码可以通过G.729.1编解码器完成，也可以通过其它宽带信号编解码器完成编码，即无论采用何种编码方式，能够对0~8kHz频带内的信号进行编码即可；并输出低频信号的比特流，即输出码流；52: Signals in the 0~8kHz frequency band are coded by core codes. In specific applications, core codes can be completed by G.729.1 codecs or by other broadband signal codecs, that is, no matter what codes are used In this way, it is enough to encode the signal in the 0~8kHz frequency band; and output the bit stream of the low frequency signal, that is, the output code stream;

53：对8~16kHz频带内的信号，例如时域信号{y_hi(0)，y_hi(1)，……，y_hi(319)}，采用本发明实施例提供的编码处理方法进行参数音频编码：这里高频带即为编码处理方法中所述的当前频带，低频带即为所述的前一个频带；当高频信号的谱与低频信号的谱不具有相似性时，提取包括时域包络参数、频域包络参数、音调参数、谐波间隔参数、第一谐波偏移量参数及编码模式参数的一组参数；当具有相似性时，仅提取包括时域包络参数、频域包络参数、音调参数及编码模式参数，也可以仅提取包括时域包络参数、频域包络参数及编码模式参数的一组参数；具体处理过程可以包括：53: For signals in the 8-16kHz frequency band, such as time-domain signals {y_hi(0), y_hi(1), ..., y_hi(319)}, use the encoding processing method provided by the embodiment of the present invention to perform parametric audio encoding: Here the high-frequency band is the current frequency band described in the encoding processing method, and the low-frequency band is the previous frequency band; when the spectrum of the high-frequency signal is not similar to the spectrum of the low-frequency signal, extracting the parameter, frequency domain envelope parameter, tone parameter, harmonic interval parameter, first harmonic offset parameter and a set of parameters of coding mode parameter; when there is similarity, only extract including time domain envelope parameter, frequency domain Envelope parameters, tone parameters and encoding mode parameters can also only extract a group of parameters including time domain envelope parameters, frequency domain envelope parameters and encoding mode parameters; the specific processing process can include:

(1)确定编码模式参数CM；具体地，可以先计算低频带信号谱与高频带信号谱之间的互相关，以确定低频带谐波结构与高频带谐波结构之间的相似性；当互相关大于某一域值时，可以判定为低频带谐波结构与高频带谐波结构之间是相似，将CM置为1，并采用谱复制整形的方式通过低频带的谱信号重建高频带的谱信号；或通过其他不同于谱复制的方式重建谱信号；当互相关小于等于所述域值时，则判定低频带谐波结构与高频带谐波结构之间是不相似的，将CM置为0，并根据参数人工重建出高频带的谱信号；当然在实际的应用中也可以采用一种简单的方式来进行编码模式判定，即当谐波间隔PG小于某一域值时，将CM置为1；否则置为0；(1) Determine the coding mode parameter CM; specifically, the cross-correlation between the low-band signal spectrum and the high-band signal spectrum can be calculated first to determine the similarity between the low-band harmonic structure and the high-band harmonic structure ; When the cross-correlation is greater than a certain threshold value, it can be judged that the harmonic structure of the low frequency band and the harmonic structure of the high frequency band are similar, set CM to 1, and use the spectral copy shaping method to pass the spectral signal of the low frequency band Reconstruct the spectral signal of the high frequency band; or reconstruct the spectral signal by other methods different from the spectral replication; when the cross-correlation is less than or equal to the threshold value, it is determined that the harmonic structure of the low frequency band and the harmonic structure of the high frequency band are inconsistent Similarly, set CM to 0, and artificially reconstruct the spectral signal in the high frequency band according to the parameters; of course, in practical applications, a simple method can also be used to determine the coding mode, that is, when the harmonic interval PG is less than a certain When a domain value, set CM to 1; otherwise set to 0;

(2)计算信号的子帧能量包络{temp_env(0)，temp(1)，……，temp(N-1)}和全局增益因子gain，在本实施例中N＝8；并根据这两组值判断信号是稳态信号或是瞬态信号；若是稳态信号，则对全局增益因子gain进行量化，将得到的量化值作为时域包络参数，并进行编码写入码流；如果是瞬态信号，则对子帧能量包络进行量化，将得到的量化值作为时域包络参数，并进行编码写入码流；并根据时域包络参数对8~16kHz频带信号进行时域归一化处理，得到时域归一化后的信号；(2) Calculate the subframe energy envelope {temp_env (0), temp (1), ..., temp (N-1)} and the global gain factor gain of the signal, N=8 in this embodiment; and according to this Two sets of values determine whether the signal is a steady-state signal or a transient signal; if it is a steady-state signal, the global gain factor gain is quantized, and the obtained quantized value is used as a time-domain envelope parameter, and encoded and written into the code stream; if If it is a transient signal, the energy envelope of the subframe is quantized, and the obtained quantized value is used as the time domain envelope parameter, and is encoded and written into the code stream; Domain normalization processing to obtain the time domain normalized signal;

(3)时域归一化后的信号经过MDCT(修正的离散余弦变换，Modified Discrete CosineTransform)变换(例如640点)后得到了一组MDCT系数，即该频带对应的频域信号{y_swb(0)，y_swb(1)，……，y_swb(319)}，由于超宽带编码器只要求处理8~14kHz频带内的信号，所以对频域信号仅处理{y_swb(0)，y_swb(1)，……，y_swb(239)}部分；处理时将这组频域信号分为N个子带，提取每个子带的子代能量并量化，得到一组量化后的频域包络{spec_env(0)，spec_env(1)，……，spec_env(N-1)}，即为8~14kHz频带内的频域包络参数；(3) After the time-domain normalized signal is transformed by MDCT (modified discrete cosine transform, Modified Discrete CosineTransform) (for example, 640 points), a set of MDCT coefficients is obtained, that is, the frequency domain signal corresponding to the frequency band {y_swb(0 ), y_swb(1),..., y_swb(319)}, since the UWB coder only requires to process signals in the 8~14kHz frequency band, it only processes {y_swb(0), y_swb(1), ..., y_swb(239)} part; during processing, this group of frequency domain signals is divided into N subbands, and the offspring energy of each subband is extracted and quantized to obtain a set of quantized frequency domain envelope {spec_env(0) , spec_env(1),..., spec_env(N-1)}, which is the frequency domain envelope parameter in the 8~14kHz frequency band;

由于对于宽带核心编码器G.729.1，7~8kHz部分信号已不在其处理范围内，为了确保在解码端解码信号频谱的连续性，还需要提取7~8kHz部分的信号的特征参数；由于G.729.1编码器对4~8kHz的信号进行了MDCT变换(例如320点)，对应的频域信号{y_wb(0)，y_wb(1)，……，y_wb(159)}，其中7~8kHz对应的频域信号为{y_wb(120)，y_wb(121)，……，y_wb(159)}，将其分为M个子带，提取每个子带的频域包络并量化，得到一组7~8kHz频带内的量化后的频域包络{spec_env_extra(0)，spec_env_extra(1)，……，spec_env_extra(M-1)}，与8~14kHz频带内的频域包络参数一起组成整个的频域包络参数；这组包络经过编码可以传输到解码端；在本实施例中N＝15，M＝3；As for the broadband core encoder G.729.1, the 7~8kHz part of the signal is no longer within its processing range, in order to ensure the continuity of the decoded signal spectrum at the decoder, it is also necessary to extract the characteristic parameters of the 7~8kHz part of the signal; due to G. The 729.1 encoder performs MDCT transformation (for example, 320 points) on the 4~8kHz signal, and the corresponding frequency domain signal {y_wb(0), y_wb(1),...,y_wb(159)}, where 7~8kHz corresponds to The frequency domain signal is {y_wb(120), y_wb(121),...,y_wb(159)}, which is divided into M subbands, and the frequency domain envelope of each subband is extracted and quantized to obtain a set of 7~8kHz The quantized frequency domain envelope in the frequency band {spec_env_extra(0), spec_env_extra(1), ..., spec_env_extra(M-1)}, together with the frequency domain envelope parameters in the 8~14kHz frequency band, form the entire frequency domain Envelope parameters; this group of envelopes can be transmitted to the decoding end after encoding; in this embodiment, N=15, M=3;

(4)提取音调参数；具体的，可以直接在MDCT域进行参数提取；为了进一步提高编码器的性能，也可以不直接在MDCT域进行参数提取，而是根据原始频域信号{y_swb(0)，y_swb(1)，……，y_swb(239)}计算伪谱信号，并根据此伪谱信号计算音调参数；(4) Extract tone parameters; specifically, parameter extraction can be performed directly in the MDCT domain; in order to further improve the performance of the encoder, it is also possible not to directly perform parameter extraction in the MDCT domain, but according to the original frequency domain signal {y_swb(0) , y_swb(1), ..., y_swb(239)} calculate the pseudo-spectral signal, and calculate the tone parameter according to the pseudo-spectral signal;

具体的伪谱信号S(k)＝{S(0)，S(1)，……，S(239)}可以按照下面的公式计算：Concrete pseudo spectrum signal S(k)={S(0), S(1),..., S(239)} can be calculated according to the following formula:

$S S ((k k)) = = \{\begin{matrix} \sqrt{y the y__{swb swb}^{22} ((00)) + + y the y__{swb swb}^{22} ((11))},, k k = = 00 \\ \sqrt{y the y__{swb swb}^{22} ((239239)) + + y the y__{swb swb}^{22} ((238238))},, k k = = 239239 \\ \sqrt{y the y__{swb swb}^{22} ((k k)) + + {((y the y__swb swb ((k k + + 11)) - - y the y__swb swb ((k k + + 11))))}^{22}},, otherwise otherwise \end{matrix},,,$

当然也可以通过其它方法，如对原始频域信号直接取绝对值得到的{|y_swb(0)|，|y_swb(1)|，……，|y_swb(239)|}进行计算；接着计算自相关函数ACF(k₀)，自相关函数可以由伪谱信号通过频域计算得到，例如ACF(k₀)＝IFFT(|FFT(S(k))|²)，其中FFT为快速傅立叶变换，IFFT为其逆变换；此外，也可以直接计算得到，例如 $ACF (k_{0}) = Σ_{k = 0}^{239} S (k) S (k + k_{0});$ 另外，还可以使用平均幅度差函数(AMDF)来增强自相关函数；Of course, other methods can also be used, such as {|y_swb(0)|, |y_swb(1)|, ..., |y_swb(239)|} obtained by directly taking the absolute value of the original frequency domain signal; then calculate from Correlation function ACF(k ₀ ), the autocorrelation function can be calculated from the pseudo-spectral signal in the frequency domain, for example, ACF(k ₀ )=IFFT(|FFT(S(k))| ² ), where FFT is Fast Fourier Transform, IFFT is its inverse transformation; in addition, it can also be directly calculated, for example $ACF (k_{0}) = Σ_{k = 0}^{239} S (k) S (k + k_{0});$ In addition, the average amplitude difference function (AMDF) can also be used to enhance the autocorrelation function;

音调参数可以通过自相关函数的最大值与最小值之间的比值表示，例如T＝max(ACF(k₀))/min(ACF(k₀))，其中最大值和最小值的获取是在期望的范围内或者是在对谐波间隔参数计算有益的范围内进行的；The tone parameter can be represented by the ratio between the maximum value and the minimum value of the autocorrelation function, such as T=max(ACF(k ₀ ))/min(ACF(k ₀ )), where the maximum value and the minimum value are obtained at within the expected range or within the beneficial range for the calculation of harmonic spacing parameters;

(5)根据ACF(k₀)，估计谐波间隔参数PG；高频带信号的谐波间隔参数，通常是在频域(或变换域)下提取的；谐波间隔的整数值可以通过峰值提取方法由自相关函数估计出来，例如根据PG＝argmax(ACF(k₀))获得，其中最大值的获取可以是限定在一个期望的范围内或者是感兴趣的范围内进行的，谐波间隔的分数值可以在适当地内插自相关函数ACF(k₀)之后，通过峰值提取的方法获得；也可以只在求得的整数谐波间隔附近进行自相关函数的内插计算，之后通过峰值提取的方法获得谐波间隔的分数值；(5) According to ACF(k ₀ ), estimate the harmonic interval parameter PG; the harmonic interval parameter of the high-frequency band signal is usually extracted in the frequency domain (or transform domain); the integer value of the harmonic interval can be obtained by the peak The extraction method is estimated by the autocorrelation function, for example, according to PG=argmax(ACF(k ₀ )), where the maximum value can be obtained within a desired range or a range of interest, and the harmonic interval The fractional value of can be obtained by peak extraction method after interpolating the autocorrelation function ACF(k ₀ ) appropriately; it is also possible to interpolate the autocorrelation function only around the obtained integer harmonic interval, and then extract the peak value The method to obtain the fractional value of the harmonic interval;

(6)还可以对估计的谐波间隔参数值进行修正，以抑制倍频和分数频的产生；例如，将求得的当前帧的谐波间隔PG与前一帧的谐波间隔old_PG进行比较，如果当前帧的谐波间隔与前一帧谐波间隔之间的比值小于某个域值(如0.1)且ACF(old_PG)>0.95ACF(PG)，则用前一帧的谐波间隔代替本帧求得的谐波间隔PG＝old_PG；(6) The estimated harmonic interval parameter value can also be corrected to suppress the generation of multiplier and fractional frequency; for example, compare the obtained harmonic interval PG of the current frame with the harmonic interval old_PG of the previous frame , if the ratio between the harmonic interval of the current frame and the harmonic interval of the previous frame is less than a certain threshold (such as 0.1) and ACF(old_PG)>0.95ACF(PG), replace it with the harmonic interval of the previous frame Harmonic interval PG obtained in this frame=old_PG;

(7)根据谐波间隔，估计第一谐波偏移量参数P0；例如，具体的可以在谐波间隔范围内，即[0，PG]范围内，将第一谐波分量分别置于不同偏移位置，并按谐波间隔依次放置其它谐波，并计算由此产生的谱与伪谱之间的相关性，相关性最大的偏移位置即所求的第一谐波偏移量，例如

其中

表示向下取整；需要指出的是，实际上谐波间隔参数与第一谐波偏移量参数之间也存在着一定程度上的相关性，因此可以通过谐波间隔参数估计出高频带信号的第一谐波偏移量参数；同时，第一谐波偏移量参数也可以用来进一步修正谐波间隔参数的估计值，从而达到更优的参数提取效果；(7) Estimate the first harmonic offset parameter P0 according to the harmonic interval; for example, specifically, within the range of the harmonic interval, that is, within the range of [0, PG], the first harmonic component can be placed in different Offset the position, and place other harmonics in sequence according to the harmonic interval, and calculate the correlation between the resulting spectrum and the pseudo-spectrum, the offset position with the greatest correlation is the first harmonic offset sought, For example

in

Indicates rounding down; it should be pointed out that there is actually a certain degree of correlation between the harmonic interval parameter and the first harmonic offset parameter, so the high frequency band can be estimated by the harmonic interval parameter The first harmonic offset parameter of the signal; at the same time, the first harmonic offset parameter can also be used to further modify the estimated value of the harmonic interval parameter, so as to achieve a better parameter extraction effect;

(8)当CM等于1时，包含编码模式参数、时域包络参数、频域包络参数、及音调参数的一组参数将会被量化或编码，并传输到解码端(即传输高频参数比特流)；当CM等于0时，包含了编码模式参数、时域包络参数、频域包络参数、音调参数、谐波间隔参数及第一谐波偏移量参数的一组参数，将会被量化或编码，并传输到解码端(即传输高频参数比特流)；(8) When CM is equal to 1, a group of parameters including encoding mode parameters, time domain envelope parameters, frequency domain envelope parameters, and tone parameters will be quantized or encoded, and transmitted to the decoding end (that is, the transmission of high frequency Parameter bit stream); when CM is equal to 0, a group of parameters including encoding mode parameters, time domain envelope parameters, frequency domain envelope parameters, tone parameters, harmonic interval parameters and first harmonic offset parameters, will be quantized or encoded, and transmitted to the decoding end (that is, transmit high-frequency parameter bit stream);

需要指出的是，当CM等于1时，也可以只将包含编码模式参数、时域包络参数和频域包络参数的一组参数量化或编码，并传输到解码端；It should be pointed out that when CM is equal to 1, only a set of parameters including encoding mode parameters, time domain envelope parameters and frequency domain envelope parameters may be quantized or encoded, and transmitted to the decoding end;

54：当完成高频带信号的参数音频编码后，可以根据所剩的编码比特数选择是否利用可选择的RIRAC音频编码对参数音频编码后的高频信号进行增强；本实施例采用的增强方式是对高频带信号在MDCT域进行变换编码；当然也可以选用其它方式对参数音频编码后的高频信号进行增强，如对高频带原始信号与高频带音频编码后的残差信号进行变换编码等；并传输高频增强比特流。54: After completing the parametric audio coding of the high-frequency band signal, it is possible to choose whether to use the optional RIRAC audio coding to enhance the high-frequency signal after parametric audio coding according to the number of remaining coding bits; the enhancement method adopted in this embodiment It is to transform and encode the high-frequency band signal in the MDCT domain; of course, other methods can also be used to enhance the high-frequency signal after parametric audio coding, such as the high-band original signal and the residual signal after high-band audio coding. Transform coding, etc.; and transmit high-frequency enhanced bit streams.

对应的，解码端收到上述低频比特流、高频参数比特流、高频增强比特流之后，进行解码，并合成音频信号；图6是本发明实施例二在解码端的处理过程示意图，如图6所示，解码具体处理过程可以包括：Correspondingly, after receiving the above-mentioned low-frequency bit stream, high-frequency parameter bit stream, and high-frequency enhanced bit stream, the decoder performs decoding and synthesizes an audio signal; FIG. As shown in 6, the specific decoding process may include:

61：0~8kHz频带内的信号合成通过核心解码完成；61: The signal synthesis in the 0~8kHz frequency band is completed through core decoding;

62：8~16kHz频带内的信号合成则通过参数音频解码完成；具体处理包括：(1)根据接收到的数据，解码出编码模式参数CM；62: The signal synthesis in the 8~16kHz frequency band is completed through parameter audio decoding; the specific processing includes: (1) decoding the encoding mode parameter CM according to the received data;

(2)从数据中解码出时域包络参数、频域包络参数；(2) Decoding time domain envelope parameters and frequency domain envelope parameters from the data;

(3)若CM等于1，则可以从收到的数据中解码出音调参数，采用谱复制整形的方式通过低频带的谱重建高频带的谱信号，或通过其它不同于谱复制的方式重建谱信号；例如具体的，可以根据音调参数，对通过核心解码得到的低频带信号的谱信号做整形，将整形后的谱信号作为重建的高频带谱信号；(3) If CM is equal to 1, the tone parameters can be decoded from the received data, and the spectrum signal of the high frequency band can be reconstructed through the spectrum of the low frequency band by means of spectrum copy shaping, or reconstructed by other methods different from spectrum copy Spectral signal; for example, specifically, according to the tone parameter, the spectral signal of the low-band signal obtained through core decoding can be shaped, and the shaped spectral signal can be used as the reconstructed high-band spectral signal;

需要指出的是，当编码端在CM等于1时没有传输音调参数，则解码端将核心解码得到的低频带的谱信号直接作为重建的高频带谱信号；It should be pointed out that when the encoding end does not transmit tone parameters when CM is equal to 1, the decoding end will directly use the low-band spectral signal obtained by core decoding as the reconstructed high-band spectral signal;

若CM等于0，则可以从收到的数据中解码出音调参数、谐波间隔参数和第一谐波偏移量参数，使用基于音调参数、谐波间隔参数和第一谐波偏移量参数的人工重建方法来重建高频带的谱信号；谱信号的重建方法基于谐波信号加噪声信号；具体地，具有随机相位的谐波以脉冲的形式被置于频域范围内的某些频点之上，从而重建谐波信号，其中脉冲的间隔由谐波间隔参数决定，第一个脉冲的位置可以根据第一谐波偏移量得到；噪声信号可以由一个随机数产生器获得；根据音调参数T的值，调整谐波信号与噪声信号之间的比例；并将调整后的谐波信号与噪声信号相加，得到重建的谱信号；具体的调整可以有多种，例如：先分别计算谐波信号与噪声信号的能量，记作enerP和enerN，再计算调整因子β₁＝1-T和 $β_{2} = \sqrt{\frac{enerP}{enerN}} * T,$ 并得到重建的谱信号 $\hat{S} (k) = β_{1} buf_pulses (k) + β_{2} buf_noise (k);$ If CM is equal to 0, the tone parameter, harmonic interval parameter and first harmonic offset parameter can be decoded from the received data, and the parameters based on the tone parameter, harmonic interval parameter and first harmonic offset can be used The manual reconstruction method to reconstruct the spectral signal of the high frequency band; the reconstruction method of the spectral signal is based on the harmonic signal plus the noise signal; specifically, the harmonic with random phase is placed in the form of pulses at certain frequencies in the frequency domain Point above, thereby reconstructing the harmonic signal, where the pulse interval is determined by the harmonic interval parameter, the position of the first pulse can be obtained according to the first harmonic offset; the noise signal can be obtained by a random number generator; according to The value of the tone parameter T adjusts the ratio between the harmonic signal and the noise signal; and adds the adjusted harmonic signal and the noise signal to obtain the reconstructed spectral signal; there are various specific adjustments, for example: first separate Calculate the energy of the harmonic signal and the noise signal, denoted as enerP and enerN, and then calculate the adjustment factor β ₁ =1-T and $β_{2} = \sqrt{\frac{enerP}{ener N}} * T,$ and get the reconstructed spectral signal $\hat{S} (k) = β_{1} buf_pulses (k) + β_{2} buf_noise (k);$

(4)根据解码出的频域包络对重建的谱信号进行频域整形，例如频域去归一化处理，并将整形后的谱信号变换到时域；例如，可以通过逆MDCT变化，也可以通过逆FFT变换将修整后的谱信号变换到时域；(4) Perform frequency-domain shaping on the reconstructed spectral signal according to the decoded frequency-domain envelope, such as frequency-domain denormalization processing, and transform the shaped spectral signal into the time domain; for example, it can be changed by inverse MDCT, The trimmed spectral signal can also be transformed to the time domain by inverse FFT transformation;

(5)根据解码出的时域包络参数进行时域整形处理，例如时域去归一化处理，得到解码出的高频信号；(5) Perform time-domain shaping processing according to the decoded time-domain envelope parameters, such as time-domain denormalization processing, to obtain decoded high-frequency signals;

需要说明的是，在时域和频域去归一化处理中，还可以对时域包络和频域包络进行一种可选择的平滑滤波处理。如果高频带的谱信号是按照人工重建的方式进行的，一旦谐波被放置到错误的子带中，此时去归一化所用的将是错误的包络因子。若谐波位置出现轻微的偏差，就会引入一定程度的失真，使用平滑滤波可以减轻这种失真。具体地，如果在接近子带边界的附近有一个非常强的音调成分，那么就可以用内插后的子带能量包络因子进行频域去归一化处理；然后将得到的信号变换到时域，再由自适应的子帧能量包络(ATE)在时域内插出时域增益函数；这个时域增益函数最后可以被用来对时域信号进行去归一化处理；It should be noted that, in the denormalization process of the time domain and the frequency domain, an optional smoothing filter process can also be performed on the time domain envelope and the frequency domain envelope. If the spectral signal in the high frequency band is artificially reconstructed, once the harmonics are placed in the wrong subband, the wrong envelope factor will be used for denormalization. If there is a slight deviation in the harmonic position, a certain degree of distortion will be introduced, which can be mitigated by smoothing filtering. Specifically, if there is a very strong tonal component near the subband boundary, then the interpolated subband energy envelope factor can be used for frequency domain denormalization; then the obtained signal is transformed into the time domain, and then interpolate the time domain gain function in the time domain by the adaptive subframe energy envelope (ATE); this time domain gain function can finally be used to denormalize the time domain signal;

63：在62完成高频带信号的解码后，可以根据接收到的数据中所剩的比特数选择是否对编码后的高频信号进行增强，具体的方法与编码端采用的增强方式相对应，这里不再赘述；63: After completing the decoding of the high-frequency signal at 62, you can choose whether to enhance the encoded high-frequency signal according to the number of bits left in the received data. The specific method corresponds to the enhancement method adopted by the encoding end. I won't go into details here;

64：将0~8kHz频带的合成信号，与8~16kHz频带的合成信号通过QMF合成滤波，即可得到最终的32kHz采样率的合成音频信号。64: Combine the synthesized signal in the 0~8kHz frequency band and the synthesized signal in the 8~16kHz frequency band through QMF synthesis filtering to obtain the final synthesized audio signal with a sampling rate of 32kHz.

实施例二中，在将音频信号分为低频带信号和高频带信号的情况下，对其中的高频带信号进行参数编码、解码处理，即采用编码模式参数指示利用表征信号的包含时域包络、频域包络、音调、谐波间隔和第一谐波偏移量的一组参数来完成编解码，或者利用表征信号的包含时域包络、频域包络和音调的一组参数，来完成编解码。本发明实施例采用的一组参数，减少了编码时需要的参数的个数，同时降低了使用参数进行编码时所需要的比特数；从而解决了现有编码方法比特数较高的问题；同时，与现有的参数音频编码算法相比，由于本发明实施例的这组参数可以用更少的比特数进行编码，从而进一步降低信号的编码速率，并且当信道的传输能力一定时，由于木发明的编码比特数较低，因此能够编码具有更高带宽的信号，实现了用更低的编码速率获得更大的编码带宽及更高的编码质量。同时在解码端可以实现利用更少的比特数来合成音频信号，且该音频信号质量较高；并且，当音频信号的谐波结构明显时，解码得到的音频质量更佳。In the second embodiment, when the audio signal is divided into a low frequency band signal and a high frequency band signal, parametric encoding and decoding are performed on the high frequency band signal, that is, the encoding mode parameter is used to indicate the use of the included time domain of the representative signal Envelope, frequency domain envelope, tone, harmonic interval and first harmonic offset to complete the encoding and decoding, or use a set of time domain envelope, frequency domain envelope and tone to characterize the signal parameters to complete the encoding and decoding. A set of parameters adopted in the embodiment of the present invention reduces the number of parameters required for encoding, and simultaneously reduces the number of bits required for encoding using parameters; thereby solving the problem of high bit numbers in existing encoding methods; at the same time , compared with the existing parametric audio coding algorithm, because this group of parameters in the embodiment of the present invention can be coded with fewer bits, thereby further reducing the coding rate of the signal, and when the transmission capacity of the channel is constant, because the wooden The invention has a low number of coding bits, so it can code a signal with a higher bandwidth, and achieve a larger coding bandwidth and higher coding quality with a lower coding rate. At the same time, the audio signal can be synthesized with fewer bits at the decoding end, and the quality of the audio signal is high; and, when the harmonic structure of the audio signal is obvious, the audio quality obtained through decoding is better.

实施例三Embodiment Three

相对于实施例二采用了先提取时域包络参数后提取频域包络参数的方法，实施例三则采用了先提取频域包络参数的方法来实现编码(以实施例三中的音频信号与分频带方法，与实施例二中的相同为例)。Compared with Embodiment 2, which adopts the method of firstly extracting the time-domain envelope parameters and then extracting the frequency-domain envelope parameters, Embodiment 3 adopts the method of first extracting the frequency-domain envelope parameters to realize encoding (based on the audio frequency domain in Embodiment 3). The signal and frequency division method are the same as those in Embodiment 2 as an example).

本实施例中，在编码端对高频带信号处理的过程具体可以包括：In this embodiment, the process of processing the high frequency band signal at the encoding end may specifically include:

(1)：按照实施例二中编码端的(1)中的方法确定编码模式参数CM；(1): Determine the encoding mode parameter CM according to the method in (1) of the encoding end in Embodiment 2;

(2)：8~16kHz频带内的时域信号经过MDCT变换后得到了一组MDCT系数，由于超宽带部分仅处理8~14kHz频带内的信号，所以对频域信号仅处理{y_swb(0)，y_swb(1)，……，y_swb(239)}部分；对于核心编码，7~8kHz部分信号已不在其处理范围之内，为了确保在解码端解码信号频谱的连续性，在编码端需要提取7~8kHz部分MDCT变换域信号{y_wb(120)，y_wb(121)，……，y_wb(159)}；(2): The time domain signal in the 8~16kHz frequency band is transformed by MDCT to obtain a set of MDCT coefficients. Since the ultra-wideband part only processes the signal in the 8~14kHz frequency band, only {y_swb(0) is processed for the frequency domain signal , y_swb(1),...,y_swb(239)} part; for the core code, the 7~8kHz part of the signal is no longer within its processing range, in order to ensure the continuity of the decoded signal spectrum at the decoder, it is necessary to extract 7~8kHz partial MDCT transform domain signal {y_wb(120), y_wb(121),...,y_wb(159)};

(3)：对7~14kHz频带内的MDCT系数进行分带，并计算各自的子带能量，作为频域包络参数，并对其量化后编码传输；(3): The MDCT coefficients in the 7~14kHz frequency band are divided into bands, and the respective sub-band energies are calculated as frequency domain envelope parameters, and quantized and encoded for transmission;

(4)：对7~14kHz频带内的MDCT系数进行频域归一化处理，并根据频域归一化以后的MDCT系数提取线性预测系数，作为时域包络参数，并对这组线性预测系数量化后编码传输；(4): Perform frequency domain normalization processing on the MDCT coefficients in the 7~14kHz frequency band, and extract linear prediction coefficients according to the MDCT coefficients after frequency domain normalization, as time domain envelope parameters, and perform linear prediction on this group The coefficients are quantized and then encoded and transmitted;

(5)：对于频域归一化的MDCT系数进行线性预测滤波，得到MDCT域的线性预测残差；(5): Perform linear prediction filtering on the MDCT coefficients normalized in the frequency domain to obtain the linear prediction residual in the MDCT domain;

(6)：按照实施例二中编码端53的(4)~(8)中的方法提取出高频信号的音调参数、谐波间隔参数以及第一谐波偏移量参数；当编码模式为1时，只传输编码模式参数、时域包络参数、频域包络参数和音调参数到解码端；当编码模式为0时，则将编码模式参数、时域包络参数、频域包络参数、音调参数、谐波间隔参数和第一谐波偏移量参数一起传输到解码端；(6): According to the method in (4)～(8) of encoding terminal 53 in embodiment two, extract the pitch parameter, harmonic interval parameter and first harmonic offset parameter of high-frequency signal; When encoding mode is When 1, only the encoding mode parameter, time domain envelope parameter, frequency domain envelope parameter and tone parameter are transmitted to the decoder; when the encoding mode is 0, the encoding mode parameter, time domain envelope parameter, frequency domain envelope parameters, tone parameters, harmonic interval parameters and first harmonic offset parameters are transmitted to the decoding end together;

对应的，解码端对高频带信号的处理的过程具体可以包括：Correspondingly, the process of processing the high frequency band signal at the decoding end may specifically include:

(7)：根据接收到的码流，解码出编码模式参数CM；并从码流中解码出时域包络参数、频域包络参数；具体地，可以通过码书查找获得量化后的线性预测系数，即时域包络参数；以便于随后根据该获得的线性预测系数进行时域整形处理；通过码书查找获得量化后的子带能量，即频域包络参数；以便于随后根据该获得的子带能量进行频域整形处理；(7): According to the received code stream, decode the coding mode parameter CM; and decode the time-domain envelope parameters and frequency-domain envelope parameters from the code stream; specifically, the quantized linearity can be obtained by searching the codebook The prediction coefficient is an envelope parameter in the instant domain; in order to perform time-domain shaping processing according to the obtained linear prediction coefficient; the quantized sub-band energy is obtained through codebook search, that is, the envelope parameter in the frequency domain; frequency-domain shaping processing of the sub-band energy;

(8)：按照实施例二中解码端62中的(3)中的方法重建高频带的谱信号；(8): reconstruct the spectral signal of the high frequency band according to the method in (3) in the decoding end 62 in the second embodiment;

(9)：使重建的高频带谱信号通过线性预测逆滤波器，也即相当于对重建的高频带谱信号进行时域整形处理；(9): Make the reconstructed high-band spectrum signal pass through the linear prediction inverse filter, which is equivalent to performing time-domain shaping processing on the reconstructed high-band spectrum signal;

(10)：根据量化后的子带能量，对重建的高频带谱信号进行频域整形处理；(10): according to the quantized sub-band energy, perform frequency-domain shaping processing on the reconstructed high-frequency band spectrum signal;

(11)：通过逆MDCT变换，将整形后的高频带谱信号变换到时域，得到最终的高频带合成信号。(11): Through the inverse MDCT transformation, transform the shaped high frequency band spectrum signal into the time domain to obtain the final high frequency band composite signal.

由上述描述可知，实施例三采用了先提取频域包络参数的方法来实现编码，由于获取上述各参数的顺序不唯一，即不论以何种顺序，只要获取上述音频信号的编码模式参数、时域包络参数，频域包络参数，音调参数，谐波间隔参数和第一谐波偏移量参数即可。本发明实施例采用的一组参数，减少了编码时需要的参数的个数，同时降低了使用参数进行编码时所需要的比特数；从而解决了现有编码方法比特数较高的问题；同时，与现有的参数音频编码算法相比，由于本发明实施例的这组参数可以用更少的比特数进行编码，从而进一步降低信号的编码速率，并且当信道的传输能力一定时，由于本发明的编码比特数较低，因此能够编码具有更高带宽的信号，实现了用更低的编码速率获得更大的编码带宽及更高的编码质量。同时在解码端可以实现利用更少的比特数来合成音频信号，且该音频信号质量较高；并且，当音频信号的谐波结构明显时，解码得到的音频质量更佳。It can be seen from the above description that Embodiment 3 adopts the method of extracting the frequency-domain envelope parameters first to implement encoding. Since the order of obtaining the above-mentioned parameters is not unique, that is, no matter in what order, as long as the above-mentioned audio signal encoding mode parameters, Time domain envelope parameters, frequency domain envelope parameters, pitch parameters, harmonic interval parameters and first harmonic offset parameters are enough. A set of parameters adopted in the embodiment of the present invention reduces the number of parameters required for encoding, and simultaneously reduces the number of bits required for encoding using parameters; thereby solving the problem of high bit numbers in existing encoding methods; at the same time , compared with the existing parametric audio coding algorithm, since this group of parameters in the embodiment of the present invention can be coded with fewer bits, thereby further reducing the coding rate of the signal, and when the transmission capacity of the channel is constant, due to the The invention has a low number of coding bits, so it can code a signal with a higher bandwidth, and achieve a larger coding bandwidth and higher coding quality with a lower coding rate. At the same time, the audio signal can be synthesized with fewer bits at the decoding end, and the quality of the audio signal is high; and, when the harmonic structure of the audio signal is obvious, the audio quality obtained through decoding is better.

本发明实施例还提供了相应的音频编码装置，其结构如图7所示，具体实现结构可以包括：The embodiment of the present invention also provides a corresponding audio encoding device, the structure of which is shown in Figure 7, and the specific implementation structure may include:

参数提取单元71，用于提取用于表征音频信号的时域包络参数、频域包络参数、音调参数和谐波间隔参数；当所述音频信号的谐波间隔与第一谐波偏移量的值不同时，还用于提取用于表征所述音频信号的第一谐波偏移量参数，并传送至发送单元；A parameter extraction unit 71, configured to extract time-domain envelope parameters, frequency-domain envelope parameters, pitch parameters and harmonic interval parameters used to characterize the audio signal; when the harmonic interval of the audio signal is offset from the first harmonic When the value of the quantity is different, it is also used to extract the first harmonic offset parameter used to characterize the audio signal, and transmit it to the sending unit;

发送单元72，用于将所述时域包络参数、频域包络参数、音调参数和谐波间隔参数编码后，传输给解码端，具体的，例如：对所述时域包络参数、频域包络参数、音调参数和谐波间隔参数，进行编码后，传输给解码端；或者用于将所述时域包络参数、频域包络参数、音调参数、谐波间隔参数和第一谐波偏移量参数编码后传输给解码端。The sending unit 72 is configured to encode the time-domain envelope parameters, frequency-domain envelope parameters, pitch parameters and harmonic interval parameters, and transmit them to the decoding end. Specifically, for example: the time-domain envelope parameters, The frequency domain envelope parameters, tone parameters and harmonic interval parameters are encoded and then transmitted to the decoder; or used to transmit the time domain envelope parameters, frequency domain envelope parameters, tone parameters, harmonic interval parameters and the first A harmonic offset parameter is encoded and transmitted to the decoding end.

本发明实施例还提供了相应的音频解码装置，其结构如图8所示，具体实现结构可以包括：The embodiment of the present invention also provides a corresponding audio decoding device, the structure of which is shown in Figure 8, and the specific implementation structure may include:

解码单元81，用于对收到的数据进行解码，得到用于表征音频信号的时域包络参数、频域包络参数、音调参数和谐波间隔参数；还用于对收到的包含第一谐波偏移量参数的数据进行解码，得到用于表征所述音频信号的第一谐波偏移量参数；The decoding unit 81 is used to decode the received data to obtain time-domain envelope parameters, frequency-domain envelope parameters, pitch parameters and harmonic interval parameters used to characterize the audio signal; Decoding the data of a harmonic offset parameter to obtain a first harmonic offset parameter used to characterize the audio signal;

合成单元82，用于根据时域包络参数、频域包络参数、音调参数和谐波间隔参数；或者时域包络参数、频域包络参数、音调参数、谐波间隔参数和第一谐波偏移量参数，合成音频信号；具体可以包括：Synthesizing unit 82, is used for according to time domain envelope parameter, frequency domain envelope parameter, tone parameter and harmonic interval parameter; Or time domain envelope parameter, frequency domain envelope parameter, tone parameter, harmonic interval parameter and first Harmonic offset parameter, synthesized audio signal; specific can include:

谐波重建子单元821，用于根据所述谐波间隔参数，得到谐波信号；或当所述用于表征音频信号的谐波间隔与第一谐波偏移量不同时，根据所述谐波间隔参数和所述第一谐波偏移量参数，得到谐波信号；The harmonic reconstruction subunit 821 is configured to obtain a harmonic signal according to the harmonic interval parameter; or when the harmonic interval used to characterize the audio signal is different from the first harmonic offset, according to the harmonic wave interval parameter and the first harmonic offset parameter to obtain a harmonic signal;

谱信号重建子单元822，用于根据所述音调参数，调整所述谐波重建子单元821得到的谐波信号与噪声信号之间的比例；并根据调整后的谐波信号与噪声信号，得到重建的谱信号；The spectral signal reconstruction subunit 822 is configured to adjust the ratio between the harmonic signal and the noise signal obtained by the harmonic reconstruction subunit 821 according to the tone parameter; and obtain reconstructed spectral signal;

整形子单元823，用于根据所述频域包络参数和时域包络参数对所述谱信号重建子单元822重建的谱信号进行处理，得到合成音频信号；例如：根据所述频域包络参数对所述重建的谱信号进行频域整形处理，得到频域整形后的信号，根据所述时域包络参数对频域整形后的信号进行时域整形处理，得到所述合成音频信号；或者，根据所述时域包络参数对所述重建的谱信号进行时域整形处理，得到时域整形后的信号，根据所述频域包络参数对时域整形后的信号进行频域整形处理，得到所述合成音频信号。The shaping subunit 823 is configured to process the spectral signal reconstructed by the spectral signal reconstruction subunit 822 according to the frequency domain envelope parameter and the time domain envelope parameter to obtain a synthesized audio signal; for example: according to the frequency domain envelope Carry out frequency-domain shaping processing on the reconstructed spectrum signal by the envelope parameters to obtain a signal after frequency-domain shaping, and perform time-domain shaping processing on the signal after frequency-domain shaping according to the time-domain envelope parameters to obtain the synthesized audio signal or, performing time-domain shaping processing on the reconstructed spectrum signal according to the time-domain envelope parameters to obtain a signal after time-domain shaping, and performing frequency-domain processing on the signal after time-domain shaping according to the frequency-domain envelope parameters shaping process to obtain the synthesized audio signal.

本发明实施例还提供了相应的音频编解码系统，其结构如图9所示，具体实现结构可以包括：The embodiment of the present invention also provides a corresponding audio codec system, the structure of which is shown in Figure 9, and the specific implementation structure may include:

编码装置91，用于提取用于表征音频信号的时域包络参数、频域包络参数、音调参数和谐波间隔参数；对所述用于表征音频信号的时域包络参数、频域包络参数、音调参数和谐波间隔参数编码后，发送至解码装置；具体可以包括：The encoding device 91 is used to extract time-domain envelope parameters, frequency-domain envelope parameters, pitch parameters and harmonic interval parameters used to characterize audio signals; Envelope parameters, tone parameters and harmonic interval parameters are encoded and then sent to the decoding device; the details may include:

参数提取单元911，用于提取音频信号的时域包络参数、频域包络参数、音调参数和谐波间隔参数；当所述音频信号的谐波间隔与第一谐波偏移量的值不同时，还用于提取所述音频信号的第一谐波偏移量参数；The parameter extraction unit 911 is used to extract the time domain envelope parameter, frequency domain envelope parameter, tone parameter and harmonic interval parameter of the audio signal; when the harmonic interval of the audio signal and the value of the first harmonic offset At the same time, it is also used to extract the first harmonic offset parameter of the audio signal;

发送单元912，用于将所述时域包络参数、频域包络参数、音调参数和谐波间隔参数；或者所述时域包络参数、频域包络参数、音调参数、谐波间隔参数和第一谐波偏移量参数，编码后传输给解码装置；The sending unit 912 is configured to send the time-domain envelope parameter, frequency-domain envelope parameter, tone parameter, and harmonic interval parameter; or the time-domain envelope parameter, frequency-domain envelope parameter, tone parameter, and harmonic interval The parameter and the first harmonic offset parameter are encoded and transmitted to the decoding device;

解码装置92，用于对所述编码装置发送来的数据进行解码，得到所述时域包络参数、频域包络参数、音调参数和谐波间隔参数；根据所述时域包络参数、频域包络参数、音调参数和谐波间隔参数合成音频信号；具体可以包括：The decoding device 92 is configured to decode the data sent by the encoding device to obtain the time domain envelope parameter, frequency domain envelope parameter, tone parameter and harmonic interval parameter; according to the time domain envelope parameter, Frequency domain envelope parameters, tone parameters and harmonic interval parameters synthesize audio signals; specifically, it may include:

解码单元921，用于对收到的数据进行解码，得到所述时域包络参数、频域包络参数、音调参数和谐波间隔参数，或者所述时域包络参数、频域包络参数、音调参数、谐波间隔参数和第一谐波偏移量参数；The decoding unit 921 is configured to decode the received data to obtain the time-domain envelope parameters, frequency-domain envelope parameters, pitch parameters and harmonic interval parameters, or the time-domain envelope parameters and frequency-domain envelope parameters parameter, pitch parameter, harmonic spacing parameter and first harmonic offset parameter;

合成单元922，用于根据所述时域包络参数、频域包络参数、音调参数和谐波间隔参数，或者所述时域包络参数、频域包络参数、音调参数、谐波间隔参数和第一谐波偏移量参数，合成音频信号。Synthesizing unit 922, configured to use the time domain envelope parameter, frequency domain envelope parameter, tone parameter and harmonic interval parameter, or the time domain envelope parameter, frequency domain envelope parameter, tone parameter, harmonic interval parameter and first harmonic offset parameter, synthesized audio signal.

本发明实施例还提供了相应的编码处理装置，其结构如图10所示，具体实现结构可以包括：The embodiment of the present invention also provides a corresponding encoding processing device, the structure of which is shown in Figure 10, and the specific implementation structure may include:

判断单元101，用于判断当前频带的音频信号的谱信号与前一个频带的音频信号的谱信号是否相似；具体的，可以用编码模式参数的值来表示是否相似的信息；Judging unit 101, configured to judge whether the spectral signal of the audio signal in the current frequency band is similar to the spectral signal of the audio signal in the previous frequency band; specifically, the value of the encoding mode parameter can be used to indicate whether the information is similar;

编码单元102，用于根据所述判断单元101得到的判断结果信息，在当前频带的音频信号的谱信号与前一个频带的音频信号的谱信号相似时，提取用于表征音频信号的时域包络参数和频域包络参数，还用于提取音调参数；或者，在当前频带的音频信号的谱信号与前一个频带的音频信号间的谱信号不相似时，提取用于表征音频信号的时域包络参数、频域包络参数、音调参数和谐波间隔参数；在所述音频信号的谐波间隔与第一谐波偏移量的值不同时，还用于提取所述音频信号的第一谐波偏移量参数；The encoding unit 102 is configured to extract the time-domain packets used to characterize the audio signal when the spectral signal of the audio signal in the current frequency band is similar to the spectral signal of the audio signal in the previous frequency band according to the judgment result information obtained by the judgment unit 101 Envelope parameters and frequency domain envelope parameters are also used to extract tone parameters; or, when the spectral signal of the audio signal in the current frequency band is not similar to the spectral signal between the audio signal in the previous frequency band, extract the time used to characterize the audio signal Domain envelope parameters, frequency domain envelope parameters, pitch parameters and harmonic interval parameters; when the harmonic interval of the audio signal is different from the value of the first harmonic offset, it is also used to extract the audio signal First Harmonic Offset parameter;

传输单元103，用于发送所述判断单元101得到的当前频带的音频信号的谱信号与前一个频带的音频信号间的谱信号相似的信息，例如将编码模式参数编码后发送；还用于对所述编码单元提取的所述音频信号的时域包络参数和频域包络参数(还可以包括音调参数)进行编码后发送；或者，发送所述判断单元得到的当前频带的音频信号的谱信号与前一个频带的音频信号间的谱信号不相似的信息，对所述编码单元提取的音频信号的时域包络参数、频域包络参数、音调参数和谐波间隔参数(还可以包括第一谐波偏移量参数)进行编码后发送。The transmission unit 103 is used to send information similar to the spectral signal between the spectral signal of the audio signal of the current frequency band obtained by the judging unit 101 and the spectral signal of the audio signal of the previous frequency band, such as encoding the encoding mode parameter and sending it; The time-domain envelope parameters and frequency-domain envelope parameters (may also include tone parameters) of the audio signal extracted by the encoding unit are encoded and sent; or, the spectrum of the audio signal in the current frequency band obtained by the judging unit is sent Information about the spectral signal dissimilarity between the signal and the audio signal of the previous frequency band, the time domain envelope parameter, frequency domain envelope parameter, pitch parameter and harmonic interval parameter of the audio signal extracted by the encoding unit (may also include The first harmonic offset parameter) is encoded and sent.

本发明实施例还提供了相应的解码处理装置，其结构如图11所示，具体实现结构可以包括：The embodiment of the present invention also provides a corresponding decoding processing device, the structure of which is shown in Figure 11, and the specific implementation structure may include:

接收信息单元111，用于接收表示当前频带的音频信号的谱信号与前一个频带的音频信号的谱信号相似的信息，并对收到的数据解码得到用于表征音频信号的时域包络参数和频域包络参数；或者，接收表示当前频带的音频信号的谱信号与前一个频带的音频信号的谱信号不相似的信息，并对收到的数据解码得到用于表征音频信号的时域包络参数、频域包络参数、音调参数和谐波间隔参数；还用于对包含第一谐波偏移量参数的数据解码，得到用于表征音频信号的第一谐波偏移量参数；具体的，接收信息单元111可以根据接收到的编码模式参数，确定所述当前频带的音频信号的谱信号与前一个频带的音频信号间的谱信号相似或不相似；The receiving information unit 111 is used to receive information indicating that the spectral signal of the audio signal in the current frequency band is similar to the spectral signal of the audio signal in the previous frequency band, and decode the received data to obtain time-domain envelope parameters used to characterize the audio signal and frequency domain envelope parameters; or, receiving information indicating that the spectral signal of the audio signal of the current frequency band is not similar to the spectral signal of the audio signal of the previous frequency band, and decoding the received data to obtain the time domain used to characterize the audio signal Envelope parameters, frequency domain envelope parameters, pitch parameters and harmonic interval parameters; also used to decode the data containing the first harmonic offset parameter to obtain the first harmonic offset parameter used to characterize the audio signal Specifically, the receiving information unit 111 may determine whether the spectral signal of the audio signal of the current frequency band is similar or dissimilar to the spectral signal of the audio signal of the previous frequency band according to the received encoding mode parameter;

解码单元112，用于根据所述接收信息单元111接收的所述相似的信息，以及所述用于表征音频信号的时域包络参数和频域包络参数，合成音频信号；或者，根据所述不相似的信息，以及所述用于表征音频信号的时域包络参数、频域包络参数、音调参数和谐波间隔参数，合成音频信号；具体的：The decoding unit 112 is configured to synthesize an audio signal according to the similar information received by the receiving information unit 111, and the time-domain envelope parameters and frequency-domain envelope parameters used to characterize the audio signal; or, according to the The above dissimilar information, and the time domain envelope parameters, frequency domain envelope parameters, pitch parameters and harmonic interval parameters used to characterize the audio signal, synthesize the audio signal; specifically:

当接收到当前频带的音频信号的谱信号与前一个频带的音频信号的谱信号相似的信息，所述解码单元112，具体如图12所示，可以包括：When receiving information that the spectral signal of the audio signal in the current frequency band is similar to the spectral signal of the audio signal in the previous frequency band, the decoding unit 112, specifically as shown in FIG. 12 , may include:

重建子单元121：用于重建谱信号，得到重建的谱信号；Reconstruction subunit 121: used to reconstruct the spectrum signal to obtain the reconstructed spectrum signal;

第二整形子单元122：用于根据所述音调参数，对所述重建子单元121重建的谱信号进行整形处理，得到整形后的重建谱信号；The second shaping subunit 122: for performing shaping processing on the spectral signal reconstructed by the reconstruction subunit 121 according to the pitch parameter, to obtain the reconstructed spectral signal after shaping;

第一整形子单元123：用于根据所述频域包络参数和时域包络参数对所述重建的谱信号(或整形后的谱信号)进行处理得到合成音频信号；例如：根据所述频域包络参数和时域包络参数对所述第二整形子单元整形处理后的重建的谱信号进行处理，包括：根据所述频域包络参数对所述重建的谱信号进行频域整形处理，得到频域整形后的信号；根据所述时域包络参数对频域整形后的信号进行时域整形处理，得到所述合成音频信号；或，根据所述时域包络参数对所述重建的谱信号进行时域整形处理，得到时域整形后的信号；根据所述频域包络参数对时域整形后的信号进行频域整形处理，得到所述合成音频信号；The first shaping subunit 123: used to process the reconstructed spectral signal (or shaped spectral signal) according to the frequency domain envelope parameter and time domain envelope parameter to obtain a synthesized audio signal; for example: according to the The frequency-domain envelope parameter and the time-domain envelope parameter process the reconstructed spectrum signal after the shaping process by the second shaping subunit, including: performing frequency-domain processing on the reconstructed spectrum signal according to the frequency-domain envelope parameter. Shaping processing to obtain a frequency-domain shaped signal; performing time-domain shaping processing on the frequency-domain shaped signal according to the time-domain envelope parameter to obtain the synthesized audio signal; or, according to the time-domain envelope parameter. performing time-domain shaping processing on the reconstructed spectrum signal to obtain a time-domain shaping signal; performing frequency-domain shaping processing on the time-domain shaping signal according to the frequency domain envelope parameters to obtain the synthesized audio signal;

当接收到当前频带的音频信号的谱信号与前一个频带的音频信号的谱信号不相似的信息，所述解码单元112，具体如图12所示，可以包括：When receiving information that the spectral signal of the audio signal in the current frequency band is not similar to the spectral signal of the audio signal in the previous frequency band, the decoding unit 112, specifically as shown in FIG. 12 , may include:

谐波重建子单元124，用于根据所述谐波间隔参数，得到谐波信号；或根据所述谐波间隔参数和第一谐波偏移量参数，得到谐波信号；The harmonic reconstruction subunit 124 is configured to obtain a harmonic signal according to the harmonic interval parameter; or obtain a harmonic signal according to the harmonic interval parameter and the first harmonic offset parameter;

谱信号重建子单元125，用于根据所述音调参数，调整谐波信号与噪声信号之间的比例，并根据调整后的谐波信号与噪声信号，得到重建的谱信号；The spectral signal reconstruction subunit 125 is configured to adjust the ratio between the harmonic signal and the noise signal according to the tone parameter, and obtain a reconstructed spectral signal according to the adjusted harmonic signal and noise signal;

第三整形子单元126，用于根据所述频域包络参数和时域包络参数对所述重建的谱信号进行处理得到合成音频信号。例如，根据所述频域包络参数对所述重建的谱信号进行频域整形处理，得到频域整形后的信号；根据所述时域包络参数对频域整形后的信号进行时域整形处理，得到所述合成音频信号；或，根据所述时域包络参数对所述重建的谱信号进行时域整形处理，得到时域整形后的信号；根据所述频域包络参数对时域整形后的信号进行频域整形处理，得到合成音频信号。The third shaping subunit 126 is configured to process the reconstructed spectrum signal according to the frequency domain envelope parameter and the time domain envelope parameter to obtain a synthesized audio signal. For example, performing frequency domain shaping processing on the reconstructed spectral signal according to the frequency domain envelope parameters to obtain a frequency domain shaped signal; performing time domain shaping on the frequency domain shaped signal according to the time domain envelope parameters Processing to obtain the synthesized audio signal; or, performing time-domain shaping processing on the reconstructed spectrum signal according to the time-domain envelope parameters to obtain a signal after time-domain shaping; The signal after domain shaping is subjected to frequency domain shaping processing to obtain a synthetic audio signal.

上述各个本发明实施例可以但不限于应用于音频编解码设备中。The various embodiments of the present invention above can be applied to audio codec devices, but are not limited to.

综上所述，本发明各实施例和现有技术中相比，由于本发明实施例采用包含时域包络参数、频域包络参数、音调参数和谐波间隔参数(还可以包括第一谐波偏移量参数)的一组参数，来表征音频信号，在对音频信号编码时可以实现在现有基础上降低了使用参数进行编码时所需要的比特数，可以用更少的比特数对信号进行编码，进一步降低信号的编码速率，从而用更低的编码速率获得更大的编码带宽以及更高的编码质量，特别是对谐波结构明显的信号，采用本发明实施例可以获得很好的编码质量。同时本发明实施例提供的编码、解码处理技术方案中，当用分频带的方式对音频信号进行编码时，判断当前频带的音频信号的谱信号与前一个频带的音频信号的谱信号是否相似，当不相似时提取包含时域包络参数、频域包络参数、音调参数和谐波间隔参数(还可以包括第一谐波偏移量参数)的一组参数，当相似时仅提取包含时域包络参数、频域包络参数(还可以包括音调参数)的一组参数，有效地利用了信号不同频带之间谱的相似性进一步降低了编码速率，获得更大的编码带宽。解码端根据上述参数能够在分频带解码音频信号的过程中实现针对不同信号的特征采用不同的谱信号重建方法，对信号特征的适应性更强，可以对不同信号获得同样高的合成质量。换句话说，当信道的传输能力一定时，由于本发明的编码比特数较低，因此能够编码具有更高带宽的信号。由于从听觉上讲信号的带宽越大获得听觉感受越好，因此当信道的传输能力一定时，本发明提供的方法可以获得更高的编码带宽及更高的合成质量。并且本发明实施例提供的一种对音频信号进行分频带编码、解码处理的技术方案，能够在分频带编解码音频信号的过程中实现用更低的编码速率获得更大的编码带宽，获得更高的编码质量。To sum up, compared with the prior art, each embodiment of the present invention adopts the parameters including time-domain envelope parameter, frequency-domain envelope parameter, pitch parameter and harmonic interval parameter (may also include the first Harmonic Offset Parameter) is a set of parameters to represent the audio signal. When encoding the audio signal, it can reduce the number of bits required for encoding using parameters on the existing basis, and can use fewer bits. Encode the signal to further reduce the encoding rate of the signal, thereby obtaining a larger encoding bandwidth and higher encoding quality with a lower encoding rate, especially for signals with obvious harmonic structures, the embodiment of the present invention can obtain a lot of Good encoding quality. At the same time, in the encoding and decoding processing technical solutions provided by the embodiments of the present invention, when the audio signal is encoded in a frequency-divided manner, it is judged whether the spectral signal of the audio signal of the current frequency band is similar to the spectral signal of the audio signal of the previous frequency band, When dissimilar, extract a set of parameters including time domain envelope parameters, frequency domain envelope parameters, pitch parameters and harmonic interval parameters (may also include the first harmonic offset parameter), and when similar, only extract the parameters containing A set of parameters including domain envelope parameters and frequency domain envelope parameters (which may also include tone parameters) effectively utilizes the similarity of spectra between different frequency bands of the signal to further reduce the coding rate and obtain a larger coding bandwidth. According to the above parameters, the decoding end can implement different spectral signal reconstruction methods according to the characteristics of different signals in the process of decoding audio signals in frequency bands, which is more adaptable to signal characteristics and can obtain the same high synthesis quality for different signals. In other words, when the transmission capability of the channel is constant, the present invention can encode a signal with a higher bandwidth because the number of encoding bits is lower. In terms of auditory perception, the larger the bandwidth of the signal, the better the auditory experience. Therefore, when the transmission capacity of the channel is constant, the method provided by the present invention can obtain higher coding bandwidth and higher synthesis quality. In addition, the embodiment of the present invention provides a technical solution for sub-band encoding and decoding processing of audio signals, which can achieve a larger encoding bandwidth with a lower encoding rate and obtain a higher encoding bandwidth during the process of encoding and decoding audio signals in sub-bands. High encoding quality.

以上所述，仅为本发明较佳的具体实施方式，但本发明的保护范围并不局限于此，任何熟悉本技术领域的技术人员在本发明揭露的技术范围内，可轻易想到的变化或替换，都应涵盖在本发明的保护范围之内。因此，本发明的保护范围应该以权利要求的保护范围为准。The above is only a preferred embodiment of the present invention, but the scope of protection of the present invention is not limited thereto. Any person skilled in the art within the technical scope disclosed in the present invention can easily think of changes or Replacement should be covered within the protection scope of the present invention. Therefore, the protection scope of the present invention should be determined by the protection scope of the claims.

Claims

1. An audio coding method, characterized in that, comprising:

Extract time-domain envelope parameters, frequency-domain envelope parameters, pitch parameters and harmonic interval parameters used to characterize the audio signal, and the pitch parameters represent the ratio between the harmonic signal and the noise signal in the audio signal;

After encoding the time-domain envelope parameters, frequency-domain envelope parameters, pitch parameters and harmonic interval parameters, transmit them to the decoding end;

The method further includes: when the harmonic interval of the audio signal is different from the value of the first harmonic offset, further extracting the first harmonic offset parameter used to characterize the audio signal, and comparing After encoding, it is transmitted to the decoding end, and the first harmonic offset parameter represents the position of the first harmonic of the audio signal.

2. An audio coding device, characterized in that, comprising:

A parameter extraction unit is used to extract time-domain envelope parameters, frequency-domain envelope parameters, tone parameters and harmonic interval parameters used to characterize the audio signal, and the tone parameters represent the ratio between the harmonic signal and the noise signal in the audio signal;

A sending unit, configured to encode the time-domain envelope parameters, frequency-domain envelope parameters, pitch parameters and harmonic interval parameters, and transmit them to the decoding end;

The parameter extraction unit is also used for:

When the harmonic interval of the audio signal is different from the value of the first harmonic offset, the first harmonic offset parameter used to characterize the audio signal is further extracted and sent to the sending unit, the first A harmonic offset parameter characterizes the position of the first harmonic of the audio signal;

The sending unit is further configured to encode the first harmonic offset parameter and transmit it to the decoding end.

3. An audio decoding method, characterized in that, comprising:

Decode the received data to obtain the time-domain envelope parameters, frequency-domain envelope parameters, pitch parameters and harmonic interval parameters used to characterize the audio signal. The pitch parameter represents the gap between the harmonic signal and the noise signal in the audio signal. Proportion;

According to the time-domain envelope parameter, the frequency-domain envelope parameter, the pitch parameter and the harmonic interval parameter, synthesize an audio signal, and the step of synthesizing the audio signal includes:

obtaining a harmonic signal according to the harmonic interval parameter;

adjusting the ratio between the harmonic signal and the noise signal according to the tone parameter; obtaining a reconstructed spectrum signal according to the adjusted harmonic signal and noise signal;

processing the reconstructed spectrum signal according to the frequency domain envelope parameter and the time domain envelope parameter to obtain a synthesized audio signal;

The method further includes: decoding the received data containing the first harmonic offset parameter to obtain the first harmonic offset parameter used to characterize the audio signal, the first harmonic offset parameter The position of the first harmonic of the audio signal is characterized, and the steps of synthesizing the audio signal include:

obtaining a harmonic signal according to the harmonic interval parameter and the first harmonic offset parameter;

Processing the reconstructed spectrum signal according to the frequency domain envelope parameter and the time domain envelope parameter to obtain a synthesized audio signal.

4. The method according to claim 3, further comprising: when the harmonic interval used to characterize the audio signal is different from the first harmonic offset parameter, receiving The data of the wave offset parameter is decoded to obtain the first harmonic offset parameter used to characterize the audio signal.

5. The method according to claim 3 or 4, wherein said processing the reconstructed spectrum signal according to said frequency domain envelope parameter and time domain envelope parameter to obtain a synthetic audio signal comprises:

performing frequency domain shaping processing on the reconstructed spectral signal according to the frequency domain envelope parameters to obtain a frequency domain shaped signal, and performing time domain shaping processing on the frequency domain shaped signal according to the time domain envelope parameters, obtaining the synthesized audio signal;

or,

performing time domain shaping processing on the reconstructed spectrum signal according to the time domain envelope parameters to obtain a time domain shaped signal, and performing frequency domain shaping processing on the time domain shaped signal according to the frequency domain envelope parameters, The synthesized audio signal is obtained.

6. An audio decoding device, characterized in that, comprising:

The decoding unit is used to decode the received data to obtain time-domain envelope parameters, frequency-domain envelope parameters, tone parameters and harmonic interval parameters used to represent the audio signal, and the tone parameters represent the harmonic signal and harmonic interval parameters in the audio signal. The ratio between the noise signal;

A synthesis unit, for synthesizing an audio signal according to the time-domain envelope parameter, the frequency-domain envelope parameter, the pitch parameter and the harmonic interval parameter; the synthesis unit includes:

A harmonic reconstruction subunit, configured to obtain a harmonic signal according to the harmonic interval parameter; or when the harmonic interval used to characterize the audio signal is different from the first harmonic offset, according to the harmonic interval parameter and the first harmonic offset parameter to obtain a harmonic signal;

The spectral signal reconstruction subunit is used to adjust the ratio between the harmonic signal and the noise signal obtained by the harmonic reconstruction subunit according to the tone parameter; and obtain the reconstructed spectral signal;

A shaping subunit, configured to process the reconstructed spectrum signal according to the frequency-domain envelope parameter and the time-domain envelope parameter, to obtain a synthesized audio signal;

The decoding unit is also used for:

Decoding the received data containing the first harmonic offset parameter to obtain the first harmonic offset parameter used to characterize the audio signal, the first harmonic offset parameter characterizes the first harmonic offset parameter of the audio signal position of the harmonic.

7. An audio codec system, characterized in that, comprising:

The encoding device according to claim 2;

And the decoding device as claimed in claim 6.

8. An encoding processing method, characterized in that, comprising:

When the audio signal is encoded in the way of frequency division, if the spectral signal of the audio signal in the current frequency band is similar to the spectral signal of the audio signal in the previous frequency band, the time domain envelope parameters and frequency domain parameters used to characterize the audio signal are extracted. Envelope parameters, encoding the time-domain envelope parameters and frequency-domain envelope parameters, and sending information indicating that the spectral signal of the audio signal of the current frequency band is similar to the spectral signal of the audio signal of the previous frequency band;

If the spectral signal of the audio signal of the current frequency band is not similar to the spectral signal of the audio signal of the previous frequency band, the time domain envelope parameter, frequency domain envelope parameter, pitch parameter and harmonic interval parameter used to characterize the audio signal are extracted, Encoding the time-domain envelope parameters, frequency-domain envelope parameters, pitch parameters and harmonic interval parameters and sending them, and simultaneously sending a signal indicating that the spectrum signal of the audio signal of the current frequency band is similar to the spectrum signal of the audio signal of the previous frequency band Information, the tone parameter characterizes the ratio between the harmonic signal and the noise signal in the audio signal;

The information representing the similarity or dissimilarity between the spectral signal of the audio signal in the current frequency band and the spectral signal of the audio signal in the previous frequency band is specifically represented by a coding mode parameter.

9. The method according to claim 8, characterized in that, the method further comprises:

The encoding mode parameter is used to indicate that when the spectral signal of the audio signal in the current frequency band is similar to the spectral signal of the audio signal in the previous frequency band, the decoding end will use the time domain envelope parameter and the frequency domain envelope parameter of the audio signal When decoding the audio signal of the current frequency band, and indicating that the spectral signal of the audio signal of the current frequency band at the decoding end is not similar to the spectral signal of the audio signal of the previous frequency band, according to the time domain envelope parameter used to characterize the audio signal , frequency domain envelope parameter, pitch parameter and harmonic interval parameter, to decode the audio signal of the current frequency band.

10. The method according to claim 8, characterized in that, the method further comprises: when extracting time-domain envelope parameters, frequency-domain envelope parameters, pitch parameters and harmonic interval parameters used to characterize audio signals , when the harmonic interval of the audio signal is different from the value of the first harmonic offset, further extracting the first harmonic offset parameter of the audio signal, encoding it and sending it.

11. The method according to claim 8, characterized in that, the method further comprises: when extracting the time-domain envelope parameters and frequency-domain envelope parameters used to characterize the audio signal, also extracting the time-domain envelope parameters of the audio signal Tone parameters, encoded and sent.

12. An encoding processing device, comprising:

A judging unit, configured to judge whether the spectral signal representing the audio signal of the current frequency band is similar to the spectral signal of the audio signal of the previous frequency band;

The coding unit is used to extract the time-domain envelope parameters used to characterize the audio signal when the spectral signal of the audio signal in the current frequency band is similar to the spectral signal of the audio signal in the previous frequency band according to the judgment result information obtained by the judgment unit and frequency domain envelope parameters; or, when the spectral signal of the audio signal of the current frequency band is not similar to the spectral signal between the audio signal of the previous frequency band, the time domain envelope parameters and the frequency domain envelope used to characterize the audio signal are extracted Parameters, tone parameters and harmonic interval parameters, the tone parameters characterize the ratio between the harmonic signal and the noise signal in the audio signal; the spectral signal representing the audio signal of the current frequency band is similar to the spectral signal of the audio signal of the previous frequency band or Dissimilar information, specifically expressed by encoding mode parameters;

a transmission unit, configured to send the similarity information between the spectral signal of the audio signal in the current frequency band obtained by the judging unit and the spectral signal of the audio signal in the previous frequency band, and transmit the time-domain packet of the audio signal extracted by the encoding unit Envelope parameters and frequency domain envelope parameters are encoded and then sent; or, sending the information that the spectrum signal of the audio signal of the current frequency band obtained by the judgment unit is not similar to the spectrum signal between the audio signal of the previous frequency band, and the coded The time-domain envelope parameters, frequency-domain envelope parameters, pitch parameters and harmonic interval parameters of the audio signal extracted by the unit are encoded and then sent.

13. The encoding processing device according to claim 12, wherein:

The encoding unit is configured to extract time-domain envelope parameters, frequency-domain envelope parameters, and When pitch parameters and harmonic interval parameters are used, it is also used to:

When the harmonic interval of the audio signal is different from the value of the first harmonic offset, extracting the first harmonic offset parameter of the audio signal;

The transmission unit is further configured to encode the first harmonic offset parameter and send it.

14. The encoding processing device according to claim 12, wherein:

When the encoding unit performs the extraction of time-domain envelope parameters and frequency-domain envelope parameters used to characterize the audio signal when the spectral signal of the audio signal in the current frequency band is similar to the spectral signal of the audio signal in the previous frequency band, Also used for: extracting tone parameters of the audio signal;

The transmission unit is further configured to encode the tone parameter and send it.

15. A decoding processing method, characterized in that, comprising:

Receive the data sent by the encoding end, if the information indicating that the spectral signal of the audio signal in the current frequency band is similar to the spectral signal of the audio signal in the previous frequency band is received, according to the time domain envelope parameters and frequency domain envelope used to characterize the audio signal parametrically synthesized audio signals, wherein the time-domain envelope parameters and frequency-domain envelope parameters are decoded from received data;

If information indicating that the spectral signal of the audio signal of the current frequency band is not similar to the spectral signal of the audio signal of the previous frequency band is received, according to the time domain envelope parameters, frequency domain envelope parameters, tone parameters and Synthesizing an audio signal with harmonic interval parameters, wherein the time-domain envelope parameter, frequency-domain envelope parameter, pitch parameter and harmonic interval parameter are decoded from received data, and the pitch parameter represents the harmonic signal in the audio signal The ratio to the noise signal; the information indicating whether the spectral signal of the audio signal in the current frequency band is similar or not similar to the spectral signal of the audio signal in the previous frequency band is specifically represented by a coding mode parameter.

16. The method according to claim 15, wherein the information indicating that the spectral signal of the audio signal of the current frequency band is similar or dissimilar to the spectral signal of the audio signal of the previous frequency band is specifically represented by a coding mode parameter, so The methods described include:

According to the received coding mode parameter, it is determined whether the spectral signal of the audio signal of the current frequency band is similar or not similar to the spectral signal of the audio signal of the previous frequency band.

17. The method according to claim 16, wherein if the spectral signal of the audio signal of the current frequency band is similar to the spectral signal between the audio signal of the previous frequency band, then the step of synthesizing the audio signal comprises:

Reconstructing the spectrum signal; processing the reconstructed spectrum signal according to the frequency domain envelope parameter and the time domain envelope parameter to obtain a synthesized audio signal.

18. The method according to claim 17, wherein the reconstructed spectral signal comprises:

The spectrum signal is reconstructed by means of spectrum replication.

19. The method according to claim 17, wherein the data sent by the encoding end further comprises: a tone parameter for characterizing the audio signal;

After the reconstructed spectral signal, it also includes:

According to the tone parameter, the reconstructed spectrum signal is shaped to obtain the shaped reconstructed spectrum signal.

20. The method according to claim 15, wherein the data sent by the encoding end further includes: a first harmonic offset parameter.

21. The method according to claim 20, wherein if the spectral signal of the audio signal of the current frequency band is not similar to the spectral signal of the audio signal of the previous frequency band, then the step of synthesizing the audio signal comprises:

obtaining a harmonic signal according to the harmonic interval parameter; or obtaining a harmonic signal according to the harmonic interval parameter and the first harmonic offset parameter;

adjusting the ratio between the harmonic signal and the noise signal according to the tone parameter; and obtaining a reconstructed spectrum signal according to the adjusted harmonic signal and noise signal;

22. The method according to claim 17 or 21, wherein said processing the reconstructed spectrum signal according to said frequency domain envelope parameter and time domain envelope parameter to obtain a synthetic audio signal comprises:

performing frequency domain shaping processing on the reconstructed spectral signal according to the frequency domain envelope parameters to obtain a frequency domain shaped signal, and performing time domain shaping processing on the frequency domain shaped signal according to the time domain envelope parameters, obtain a synthetic audio signal;

or,

performing time domain shaping processing on the reconstructed spectrum signal according to the time domain envelope parameters to obtain a time domain shaped signal, and performing frequency domain shaping processing on the time domain shaped signal according to the frequency domain envelope parameters, A synthesized audio signal is obtained.

23. A decoding processing device, characterized in that it comprises:

The receiving information unit is used to receive information indicating that the spectral signal of the audio signal in the current frequency band is similar to the spectral signal of the audio signal in the previous frequency band, and decode the received data to obtain the time-domain envelope parameters and Frequency-domain envelope parameters; or, receiving information indicating that the spectral signal of the audio signal in the current frequency band is not similar to the spectral signal of the audio signal in the previous frequency band, and decoding the received data to obtain a time-domain packet for representing the audio signal An envelope parameter, a frequency domain envelope parameter, a tone parameter and a harmonic interval parameter, the tone parameter characterizes the ratio between the harmonic signal and the noise signal in the audio signal; Information about the similarity or dissimilarity of the spectral signal of the audio signal, specifically expressed by the encoding mode parameter;

a decoding unit, configured to synthesize an audio signal according to the similar information received by the receiving information unit, and the time-domain envelope parameter and the frequency-domain envelope parameter; or, according to the dissimilar information, and the The time domain envelope parameter, the frequency domain envelope parameter, the tone parameter and the harmonic interval parameter are described to synthesize the audio signal.

24. The device according to claim 23, wherein when receiving information that the spectral signal of the audio signal in the current frequency band is similar to the spectral signal of the audio signal in the previous frequency band, the decoding unit specifically includes:

Reconstruction subunit: used to reconstruct spectral signals;

The first shaping subunit is configured to process the reconstructed spectrum signal according to the frequency domain envelope parameter and the time domain envelope parameter to obtain a synthesized audio signal.

25. The device according to claim 24, wherein the receiving information unit is further configured to decode the received data containing tone parameters, and obtain tone parameters used to characterize the audio signal;

The decoding unit also includes:

The second shaping subunit: for performing shaping processing on the spectral signal reconstructed by the reconstruction subunit according to the pitch parameter, to obtain the reconstructed spectral signal after shaping;

The first shaping subunit processes the reconstructed spectral signal after shaping by the second shaping subunit according to the frequency domain envelope parameter and the time domain envelope parameter to obtain the synthesized audio signal.

26. The device according to claim 23, wherein when receiving information that the spectral signal of the audio signal in the current frequency band is not similar to the spectral signal of the audio signal in the previous frequency band, the received information unit When the data is decoded to obtain the time domain envelope parameters, frequency domain envelope parameters, pitch parameters and harmonic interval parameters used to characterize the audio signal, the data containing the first harmonic offset parameter is also received and decoded to obtain the The first harmonic offset parameter of the audio signal.

27. The device according to claim 26, wherein the decoding unit comprises:

The harmonic reconstruction subunit is used to obtain a harmonic signal according to the harmonic interval parameter; or obtain a harmonic signal according to the harmonic interval parameter and the first harmonic offset parameter;

The spectral signal reconstruction subunit is used to adjust the ratio between the harmonic signal and the noise signal according to the tone parameter, and obtain a reconstructed spectral signal according to the adjusted harmonic signal and noise signal;

The third shaping subunit is configured to process the reconstructed spectrum signal according to the frequency domain envelope parameter and the time domain envelope parameter to obtain a synthesized audio signal.