[go: up one dir, main page]

CN119068886B - An audio data stream compression method based on OFDM modulation - Google Patents

An audio data stream compression method based on OFDM modulation

Info

Publication number
CN119068886B
CN119068886B CN202411122852.8A CN202411122852A CN119068886B CN 119068886 B CN119068886 B CN 119068886B CN 202411122852 A CN202411122852 A CN 202411122852A CN 119068886 B CN119068886 B CN 119068886B
Authority
CN
China
Prior art keywords
signal
audio signal
subcarrier
crosstalk
window function
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202411122852.8A
Other languages
Chinese (zh)
Other versions
CN119068886A (en
Inventor
张启
张萌
徐常华
余建庚
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Academy Of Aerospace Science Technology And Communications Technology Co ltd
Original Assignee
Academy Of Aerospace Science Technology And Communications Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Academy Of Aerospace Science Technology And Communications Technology Co ltd filed Critical Academy Of Aerospace Science Technology And Communications Technology Co ltd
Priority to CN202411122852.8A priority Critical patent/CN119068886B/en
Publication of CN119068886A publication Critical patent/CN119068886A/en
Application granted granted Critical
Publication of CN119068886B publication Critical patent/CN119068886B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/032Quantisation or dequantisation of spectral components
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L27/00Modulated-carrier systems
    • H04L27/26Systems using multi-frequency codes
    • H04L27/2601Multicarrier modulation systems
    • H04L27/2626Arrangements specific to the transmitter only
    • H04L27/2627Modulators
    • H04L27/2628Inverse Fourier transform modulators, e.g. inverse fast Fourier transform [IFFT] or inverse discrete Fourier transform [IDFT] modulators
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L27/00Modulated-carrier systems
    • H04L27/26Systems using multi-frequency codes
    • H04L27/2601Multicarrier modulation systems
    • H04L27/2647Arrangements specific to the receiver only
    • H04L27/2649Demodulators
    • H04L27/265Fourier transform demodulators, e.g. fast Fourier transform [FFT] or discrete Fourier transform [DFT] demodulators

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Mathematical Physics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Discrete Mathematics (AREA)
  • Noise Elimination (AREA)

Abstract

本发明公开了一种基于OFDM调制的音频数据流压缩方法,涉及信号技术领域,包括步骤S1:对原始音频信号使用多通道编码进行失真降低处理;步骤S2:使用离散余弦变换逐帧处理采样信号并按不同通道输出DCT系数,将原始音频信号分成若干个子载波信号;步骤S3:对子载波信号进行逆离散傅里叶变换,将频域数据转换为时域OFDM符号并进行第二次多通道编码;步骤S4:预设参考音频信号并设置相匹配的参考串扰衰减比值,将时域的OFDM符号转换至修正音频信号;步骤S5:计算修正音频信号的修正串扰衰减比值并与参考串扰衰减比值进行比对。本发明能够降低由于编码不一致造成的串扰,具有在使用多通道音频编码时有效兼顾控制声道间串扰的优点和有益效果。

The present invention discloses an audio data stream compression method based on OFDM modulation, which relates to the field of signal technology, including step S1: using multi-channel coding to reduce distortion of the original audio signal; step S2: using discrete cosine transform to process the sampled signal frame by frame and output DCT coefficients according to different channels, thereby dividing the original audio signal into a plurality of subcarrier signals; step S3: performing inverse discrete Fourier transform on the subcarrier signal, converting the frequency domain data into time domain OFDM symbols and performing a second multi-channel coding; step S4: presetting a reference audio signal and setting a matching reference crosstalk attenuation ratio, converting the time domain OFDM symbols into a modified audio signal; step S5: calculating the modified crosstalk attenuation ratio of the modified audio signal and comparing it with the reference crosstalk attenuation ratio. The present invention can reduce crosstalk caused by inconsistent coding and has the advantages and beneficial effects of effectively taking into account the control of crosstalk between channels when using multi-channel audio coding.

Description

Audio data stream compression method based on OFDM modulation
Technical Field
The invention relates to the technical field of signals, in particular to an audio data stream compression method based on OFDM modulation.
Background
In modern audio processing techniques, multichannel audio coding is widely used for transmission and storage of high dynamic range audio. This encoding method provides rich spatial sense and sound quality by decomposing an audio signal into a plurality of channels. However, in high dynamic range (HIGH DYNAMIC RANGE, HDR) audio processing, multi-channel coding suffers from dynamic range compression distortion. Dynamic range compression distortion refers to the fact that when processing audio signals with a wide dynamic range, the dynamic range of the signal is compressed due to the limitations of coding techniques, resulting in loss of detail and layering of the audio. Currently, multichannel audio coding is mainly used in the prior art to reduce dynamic range compression distortion. At present, when the existing multichannel audio coding method is used, inter-channel crosstalk is easy to occur in high dynamic range audio, signal aliasing and interference among channels can obviously influence the spatial positioning and the overall tone quality of the audio, and the specific phenomenon is that the audio contents of all channels are mutually influenced in the coding or decoding process, so that the original independent channel information is mixed together, and the spatial sense and the definition of the audio are damaged. At present, a method for reducing dynamic range compression distortion capable of obviously reducing inter-channel crosstalk during high dynamic range processing is lacking in the existing multi-channel audio coding technology.
Disclosure of Invention
The invention provides an audio data stream compression method based on OFDM modulation, which solves the problem that more inter-channel crosstalk is easy to cause when multi-channel audio coding is used for reducing dynamic range compression distortion in a high dynamic range in the prior art.
The invention is realized by the following technical scheme:
An audio data stream compression method based on OFDM modulation, the method comprising:
Step S1, sampling a high dynamic range part of an original audio signal, obtaining a sampling signal, preprocessing the sampling signal for noise removal and quantization conversion, primarily reducing the dynamic range compression distortion of the sampling signal, and dividing the sampling signal into a plurality of channels by using a multi-channel coding mode for further distortion reduction processing;
s2, dividing the sampling signal of each channel into a plurality of frames, processing the sampling signal frame by using discrete cosine transform, outputting DCT coefficients according to different channels, dividing an original audio signal into a plurality of subcarrier signals by using a frequency division multiplexing modulation method, and placing the subcarrier signals in a frequency domain, and setting a quantization allocation strategy for randomly allocating all DCT coefficients to all subcarrier signals;
S3, performing inverse discrete Fourier transform on DCT coefficients on subcarrier signals, so as to convert frequency domain data of original audio signals represented by the DCT coefficients into time domain OFDM symbols, adding cyclic prefix to each OFDM symbol, and performing second multi-channel coding on each OFDM symbol in a mode of independently coding each symbol;
S4, presetting a reference audio signal representing a specification, setting a matched reference crosstalk attenuation ratio, converting an OFDM symbol of a time domain into a DCT coefficient representing frequency domain data by using a frequency division multiplexing demodulation method, converting the DCT coefficient by using an inverse discrete cosine transform, and marking a conversion result as a corrected audio signal;
And S5, calculating a corrected crosstalk attenuation ratio of the corrected audio signal and comparing the corrected crosstalk attenuation ratio with a reference crosstalk attenuation ratio, judging that the signal is corrected when the corrected crosstalk attenuation ratio is smaller than the reference crosstalk attenuation ratio, and re-executing the step S2 when the corrected crosstalk attenuation ratio is larger than the reference crosstalk attenuation ratio.
Due to limitations of coding techniques, the dynamic range of the signal is compressed, resulting in loss of detail and layering of the audio. Currently, multichannel audio coding is mainly used in the prior art to reduce dynamic range compression distortion. At present, when the existing multichannel audio coding method is used, inter-channel crosstalk is easy to occur in high dynamic range audio, signal aliasing and interference among channels can obviously influence the spatial positioning and the overall tone quality of the audio, and the specific phenomenon is that the audio contents of all channels are mutually influenced in the coding or decoding process, so that the original independent channel information is mixed together, and the spatial sense and the definition of the audio are damaged. At present, a method for reducing dynamic range compression distortion capable of obviously reducing inter-channel crosstalk during high dynamic range processing is lacking in the existing multi-channel audio coding technology. Based on the above, the invention provides an audio data stream compression method based on OFDM modulation, which solves the problem that more inter-channel crosstalk is easy to cause when multi-channel audio coding is used for reducing dynamic range compression distortion in a high dynamic range in the prior art.
Further, as a possible implementation mode, the frequency division multiplexing modulation method includes the steps of using a frequency domain window function for subcarrier signals to smooth subcarrier spectrums and dynamically adjust subcarrier intervals, obtaining channel frequency response of each subcarrier signal based on frequency domain analysis, identifying spectrum spreading caused by multipath effects based on the channel frequency response, then converting all subcarrier signals to the frequency domain by using DFT conversion, carrying out frequency domain filtering on each subcarrier signal, and compensating spectrum data of the subcarrier signals after filtering to spectrum spreading caused by the multipath effects.
Further, as a possible implementation manner, the process of using the frequency domain window function includes applying a Blackman window function to the frequency domain data of each subcarrier for windowing, adjusting the intervals of the subcarriers based on the characteristics of the Blackman window function, setting a width critical value for the main lobe width of the Blackman window, and when the window function width of each subcarrier is lower than the width critical value, doubling the subcarrier interval width of the subcarrier signal.
Further, as a possible embodiment, let the sample index amount of the window function of the subcarrier be denoted as N, let the width of the window function be denoted as N, let the blackman window function value be denoted as ω, the window function value at the nth sample point be denoted as ω (N), the constant term be denoted as a, the first coefficient be denoted as B, the second coefficient be denoted as C,
The computational formula of the blackman window function is set as:,
wherein the constant term a is used to ensure that the value of the window function is not equal to zero at the end points of the beginning and end of the window,
The first coefficient B and the second coefficient C are used for adjusting the main lobe and side lobe characteristics of the window function.
Further, the compensating process of spectrum spreading further comprises adjusting the spectrum based on the channel frequency response by setting a compensation filter for adjusting the spectrum to reduce distortion.
Further, the setting method of the width critical value is to construct a linear relation between the spectrum efficiency and the width critical value, and dynamically adjust the width critical value according to the spectrum efficiency.
Further, the corrected crosstalk attenuation ratio of the corrected audio signal is set to be a ratio between the crosstalk amplitude of the corrected audio signal and the main lobe amplitude of the corrected audio signal.
Further, the reference crosstalk attenuation ratio is calculated based on a spectral leakage value representing a component size of a desired spectral component of the original audio signal that is spread to a non-target frequency and a crosstalk component value representing an energy amount caused by occurrence of overlapping interference of the original audio signal.
Further, the spectral leakage value is set as a ratio of side lobe energy of the original audio signal to main lobe energy of the reference audio signal, and the crosstalk component value is set as a ratio of total crosstalk component energy of the original audio signal to total signal energy of the reference audio signal.
Further, the cyclic prefix adjusts for length based on multipath effects and delay spread dynamics of the channel.
Compared with the prior art, the method adopts frequency division multiplexing modulation in multi-channel coding, divides the signal into a plurality of sub-carriers, uses DCT coefficient of frequency domain to further process, can independently compress and optimize dynamic range on each channel, carries out secondary multi-channel independent coding on each OFDM symbol, can reduce crosstalk caused by inconsistent coding, and has the advantage and beneficial effect of effectively controlling inter-channel crosstalk when multi-channel audio coding is used.
Drawings
The accompanying drawings, which are included to provide a further understanding of embodiments of the application and are incorporated in and constitute a part of this specification, illustrate embodiments of the application and together with the description serve to explain the principles of the application. In the drawings:
FIG. 1 is a flow chart of the present invention.
Detailed Description
For the purpose of making apparent the objects, technical solutions and advantages of the present invention, the present invention will be further described in detail with reference to the following examples and the accompanying drawings, wherein the exemplary embodiments of the present invention and the descriptions thereof are for illustrating the present invention only and are not to be construed as limiting the present invention.
Examples
As shown in fig. 1, the present embodiment is an audio data stream compression method based on OFDM modulation, which solves the problem in the prior art that more inter-channel crosstalk is easily caused when multi-channel audio coding is used in a high dynamic range to reduce dynamic range compression distortion.
The invention is realized by the following technical scheme:
An audio data stream compression method based on OFDM modulation, the method comprising:
Step S1, sampling a high dynamic range part of an original audio signal, obtaining a sampling signal, preprocessing the sampling signal for noise removal and quantization conversion, primarily reducing the dynamic range compression distortion of the sampling signal, and dividing the sampling signal into a plurality of channels by using a multi-channel coding mode for further distortion reduction processing;
s2, dividing the sampling signal of each channel into a plurality of frames, processing the sampling signal frame by using discrete cosine transform, outputting DCT coefficients according to different channels, dividing an original audio signal into a plurality of subcarrier signals by using a frequency division multiplexing modulation method, and placing the subcarrier signals in a frequency domain, and setting a quantization allocation strategy for randomly allocating all DCT coefficients to all subcarrier signals;
S3, performing inverse discrete Fourier transform on DCT coefficients on subcarrier signals, so as to convert frequency domain data of original audio signals represented by the DCT coefficients into time domain OFDM symbols, adding cyclic prefix to each OFDM symbol, and performing second multi-channel coding on each OFDM symbol in a mode of independently coding each symbol;
S4, presetting a reference audio signal representing a specification, setting a matched reference crosstalk attenuation ratio, converting an OFDM symbol of a time domain into a DCT coefficient representing frequency domain data by using a frequency division multiplexing demodulation method, converting the DCT coefficient by using an inverse discrete cosine transform, and marking a conversion result as a corrected audio signal;
And S5, calculating a corrected crosstalk attenuation ratio of the corrected audio signal and comparing the corrected crosstalk attenuation ratio with a reference crosstalk attenuation ratio, judging that the signal is corrected when the corrected crosstalk attenuation ratio is smaller than the reference crosstalk attenuation ratio, and re-executing the step S2 when the corrected crosstalk attenuation ratio is larger than the reference crosstalk attenuation ratio.
The original audio signal is the target processing audio signal, and the multi-channel coding process is the main mode for reducing distortion in the prior art. The use of Discrete Cosine Transform (DCT) to process the sampled signal frame by frame aims at the different frequency components in the audio signal being processed independently, which transform processes the spectral characteristics of the signal separately, helping to reduce cross-talk between the different frequency components during encoding and thus reduce inter-channel interference, while the frame by frame DCT processing of the sampled signal allows independent optimisation of the data for each frame. This framing process allows the crosstalk problem for each frame to be handled separately, helping to optimize the separation of signals between channels at the frame level, thereby reducing crosstalk. The original audio signal is divided into a plurality of subcarrier signals by using a frequency division multiplexing modulation method and is arranged in a frequency domain, so that a plurality of signals are allowed to be transmitted simultaneously in the same frequency spectrum resource, and each signal occupies a different frequency band. This approach increases the efficiency of spectrum utilization, allowing the system to transmit more information within a limited bandwidth while maintaining the independence of each channel signal. Converting frequency domain data of an original audio signal represented by DCT coefficients into time domain OFDM symbols for reducing frequency domain interference, signal interference and crosstalk in the frequency domain may be mitigated after conversion into the time domain because the processing and transmission characteristics of the time domain signal are different from the frequency domain. The inverse OFDM and inverse DCT operations are performed to convert the time domain signal back to the frequency domain and recover the original audio signal, which can effectively correct the interference caused by the crosstalk in the transmission process and further reduce the crosstalk between the multiple channels through proper demodulation and decoding steps. The reference audio signal refers to a standard audio signal for comparing, calibrating or evaluating the quality of other audio signals, i.e. an audio signal representing an industry specification.
The high dynamic range portion of the original audio signal is sampled, the sampled signal is pre-processed for noise removal and quantization conversion to reduce dynamic range compression distortion, and multi-channel coding is used to divide the sampled signal into multiple channels to further reduce distortion. The audio signal is cleaned and optimized for more efficient subsequent processing. Sampling of the high dynamic range portion ensures that the details of the audio signal are preserved. Noise removal and quantization conversion help to improve signal quality, and multi-channel coding provides more flexibility for subsequent processing. The sampled signal of each channel is divided into frames, the DCT is applied frame by frame and DCT coefficients are output for converting the audio signal from the time domain to the frequency domain, typically for data compression and removing redundant information. The division of the frames helps to gradually process the audio data, and improves the processing accuracy and efficiency.
More, the original audio signal is divided into several sub-carrier signals, placed in the frequency domain, and the quantization allocation strategy randomly allocates the DCT coefficients to the sub-carrier signals. The frequency division multiplexing technology allows signals to be transmitted on different subcarriers simultaneously, thereby improving the bandwidth utilization. The quantization allocation strategy is used to allocate quantization results of signals to different subcarriers or frequency ranges, and common quantization allocation strategies may include uniform quantization allocation, linear quantization allocation, and gradient quantization allocation. As a specific application, in a specific implementation, it is preferable that a quantization allocation policy is set based on gradient quantization allocation, the frequencies of subcarrier signals and DCT coefficients are arranged from high to low, all frequency ranges are equally divided into a plurality of frequency gradient range sections, subcarrier signals and DCT coefficients in the same gradient range section are allocated correspondingly and randomly, and at least one DCT coefficient is allocated on each subcarrier signal. The width of the frequency gradient range interval is comprehensively considered based on the sparsity of DCT coefficients and the subcarrier width. The quantization allocation strategy enables each subcarrier to bear different DCT coefficients, and frequency domain utilization is optimized. The DCT coefficients are converted into time domain OFDM symbols, and a cyclic prefix is added to each OFDM symbol for converting the data of the frequency domain back to the time domain, so as to prepare for OFDM modulation, and the cyclic prefix is helpful for reducing the interference among channel symbols and improving the robustness of the system. And then performing second multi-channel coding on each OFDM symbol for further crosstalk optimization. DCT coefficients of the time domain OFDM symbols converted back to the frequency domain are used for converting the time domain signals back to the frequency domain, the inverse DCT operation is prepared, the inverse discrete cosine transform restores the frequency domain data to the time domain, and the corrected audio signals are obtained. And (2) calculating a corrected crosstalk attenuation ratio of the corrected audio signal, comparing the corrected crosstalk attenuation ratio with a reference crosstalk attenuation ratio, judging whether the signal is corrected according to the ratio, and returning to the step (S2) if the signal is not qualified, so as to evaluate the corrected audio quality. If the correction is not standard, the process returns and reprocesses to ensure that the final audio signal reaches the desired quality.
Further, as a possible implementation mode, the frequency division multiplexing modulation method includes the steps of using a frequency domain window function for subcarrier signals to smooth subcarrier spectrums and dynamically adjust subcarrier intervals, obtaining channel frequency response of each subcarrier signal based on frequency domain analysis, identifying spectrum spreading caused by multipath effects based on the channel frequency response, then converting all subcarrier signals to the frequency domain by using DFT conversion, carrying out frequency domain filtering on each subcarrier signal, and compensating spectrum data of the subcarrier signals after filtering to spectrum spreading caused by the multipath effects.
The frequency domain window function helps to smooth the frequency spectrum of the subcarriers, reducing spectral leakage effects. The spectrum leakage refers to leakage of frequency domain signal energy from one frequency component to the other, which can lead to signal interference and crosstalk. The use of the simultaneous window function may adjust the subcarrier spacing to accommodate different spectral environments. Such dynamic adjustment helps to optimize spectrum utilization, reduce interference between adjacent subcarriers, and improve overall modulation efficiency. By analyzing the channel frequency response, the spectrum change condition of each subcarrier in the actual transmission process can be known. The channel frequency response reveals the attenuation and frequency selective distortion of the signal during transmission. Multipath effects typically cause spectral spreading, i.e., signal overlap and interference due to multiple path transmissions. This effect will widen the spectrum and increase interference between signals. Identifying these effects is a precondition for processing and compensating for spectral spreading. The frequency domain filtering of each subcarrier signal can effectively reduce or compensate for the spread of spectrum due to multipath effects. In particular implementations, the filter design may optimize the signal based on the characteristics of multipath effects, thereby reducing the impact of spectral spreading on signal quality.
Further, as a possible implementation manner, the process of using the frequency domain window function includes applying a Blackman window function to the frequency domain data of each subcarrier for windowing, adjusting the intervals of the subcarriers based on the characteristics of the Blackman window function, setting a width critical value for the main lobe width of the Blackman window, and when the window function width of each subcarrier is lower than the width critical value, doubling the subcarrier interval width of the subcarrier signal.
The blackman window function is a windowing function that reduces spectral leakage and improves the accuracy of frequency domain analysis. The shape of which typically has a large main lobe and a plurality of side lobes. The spectrum leakage can be effectively reduced, and the spectrum data can be smoothed. Compared to other window functions, such as hanning and hamming windows, the sidelobe attenuation of the blackman window is more pronounced, helping to reduce the sidelobe interference of the spectrum. By applying the blackman window function to the frequency domain data of the sub-carriers, the frequency domain leakage effect can be reduced, and the resolution of the frequency spectrum and the definition of the signal can be improved. The main lobe width of the blackman window directly affects the subcarrier spacing in the frequency domain, and in order to avoid spectral overlap between subcarriers, the subcarrier spacing needs to be adjusted according to the characteristics of the window function so that it is large enough to maintain effective isolation in the frequency domain. The main lobe width of the blackman window determines the resolution of the subcarriers in the frequency domain. Setting the width threshold helps to adjust the subcarrier spacing according to the actual window function width. When the main lobe width of the blackman window is below the set width threshold, it is indicated that the spectrum interference between the subcarriers is small. In order to further optimize the system performance, the subcarrier spacing needs to be increased to prevent interference caused by spectrum leakage or multipath effect, and the dynamic adjustment method can adapt to actual requirements in different frequency domain environments, so that a lower interference level and higher signal quality are ensured in the frequency domain processing process of subcarrier signals.
Further, the sample index amount of the window function of the subcarrier is represented as N, the width of the window function is represented as N, the value of the blackman window function is represented as ω, the value of the window function at the nth sample point is represented as ω (N), the constant term is represented as a, the first coefficient is represented as B, the second coefficient is represented as C,
The computational formula of the blackman window function is set as:,
wherein the constant term a is used to ensure that the value of the window function is not equal to zero at the end points of the beginning and end of the window,
The first coefficient B and the second coefficient C are used for adjusting the main lobe and side lobe characteristics of the window function.
The constant term a is used to ensure that the value of the window function at the beginning and ending endpoints of the window is not zero. The method is beneficial to reducing the influence of the window function on the signal edge, avoiding the sudden interruption of the signal at the boundary of the window, reducing the frequency spectrum leakage, reducing the frequency spectrum artifact generated by the window function at the signal edge by enabling the boundary value of the window function to be close to zero, and improving the accuracy of frequency spectrum analysis. The first coefficient B is used for adjusting the width and the shape of the main lobe of the window function. The width of the main lobe influences the width of the main peak of the window function in the frequency domain, thereby influencing the resolution of the frequency spectrum, increasing the B value can lead the main lobe to be wider and improve the resolution of the frequency spectrum, but the amplitude of the side lobe can be increased. Optimizing the value of B can find a balance between main lobe width and side lobe attenuation. The second coefficient C is used to adjust the sidelobe characteristics of the window function. The C value is added to help reduce the amplitude of the side lobe, make the side lobe decay faster, improve the frequency spectrum dynamic range of the window function, and help reduce noise and interference in the frequency spectrum. In operation, the window function value ω (n) at each sample point is calculated according to the formula. When the Blackman window function is applied in the frequency domain, the endpoint value of the window function is ensured to be non-zero, the frequency spectrum artifact is reduced, and the coefficients B and C are adjusted to optimize the main lobe and side lobe characteristics of the window function. This affects spectral resolution and sidelobe interference and the application of the Blackman window function to the frequency domain data. By selecting proper A, B, C values, the smoothing of the spectrum data can be optimized, the spectrum leakage and interference can be reduced, and the system performance can be improved.
Further, as a possible implementation manner, the compensating process of spectrum expansion further comprises setting a compensating filter based on the channel frequency response to adjust the spectrum, wherein the compensating filter is used for adjusting the spectrum to reduce distortion, the setting method of the width critical value is that a linear relation between the spectrum efficiency and the width critical value is constructed, the width critical value is dynamically adjusted according to the frequency spectrum efficiency, and the corrected crosstalk attenuation ratio of the corrected audio signal is set to be the ratio between the crosstalk amplitude of the corrected audio signal and the main lobe amplitude of the corrected audio signal.
The compensation filter is designed to adjust the frequency spectrum according to the channel frequency response. The function of such a filter is to correct for spectral spread due to multipath effects or other channel characteristics, ensuring that the spectrum of the signal is restored to the desired shape. The main purpose of the compensation filter is to reduce the distortion caused by the spread spectrum. By appropriate filtering, the signal spectrum can be tuned to a form that is more nearly ideal, thereby reducing interference and distortion due to spectral spreading. In practice, the filter is preferably designed to take into account the specific frequency response of the channel in order to accurately compensate for spectral shifts and spreads of the signal. The effective compensation filter can improve the quality of the signal and the overall performance of the system. The width critical value is used for determining whether the main lobe width of the Blackman window function is enough, when the spectrum efficiency is high, the spectrum overlap of the subcarriers is smaller, a smaller width critical value is needed, and when the spectrum efficiency is low, the spectrum overlap of the subcarriers is larger, a larger width critical value is needed. The modified crosstalk attenuation ratio is used to evaluate the crosstalk effect of the modified audio signal. The degree of crosstalk in the corrected signal can be quantified by calculating the ratio of the crosstalk amplitude to the main lobe amplitude, and the smaller the corrected crosstalk attenuation ratio is, the better the correction effect is, and the crosstalk influence is smaller. The ratio is used to determine the effectiveness of the correction process and to determine whether further adjustments are needed. By setting the ratio and comparing it with a reference value, the success of the correction process can be evaluated. If the ratio is not as expected, the description modification process needs to be optimized or re-executed. The embodiment can effectively reduce the distortion caused by spectrum expansion and improve the quality of the audio signal. The compensation filter corrects the spectral distortion, dynamically adjusts the width threshold to optimize spectral efficiency, and corrects the crosstalk attenuation ratio for evaluating the correction effect.
Further, as a possible implementation manner, the reference crosstalk attenuation ratio is calculated based on a spectral leakage value and a crosstalk component value, wherein the spectral leakage value represents a component size of an expected spectral component of an original audio signal extending to a non-target frequency, the crosstalk component value represents an energy size caused by overlapping interference of the original audio signal, the spectral leakage value is set to be a ratio of side lobe energy of the original audio signal to main lobe energy of the reference audio signal, the crosstalk component value is set to be a ratio of total energy of crosstalk components of the original audio signal to total energy of signals of the reference audio signal, and the cyclic prefix adjusts the length according to multipath effects and delay extension dynamics of a channel.
The spectral leakage value represents a component size of a desired spectral component of the original audio signal spread to non-target frequencies. The leakage degree of the signal in the frequency domain is estimated by calculating the ratio of the side lobe energy of the original signal to the main lobe energy of the reference audio signal, wherein the side lobe energy refers to the energy of the signal in the frequency range outside the main lobe, and the lower the leakage value is, the less the frequency spectrum leakage is. The crosstalk component value is used for quantifying the energy caused by overlapping interference by calculating the ratio of the total energy of the crosstalk component to the total energy of the signal, and the smaller the crosstalk component value is, the less the interference is. The length of the cyclic prefix should be dynamically adjusted according to multipath effects and delay spread of the channel. Multipath refers to the arrival of a signal at a receiver through multiple paths during propagation, resulting in a time domain spread of the signal. Delay spread refers to the time delay that a signal appears at the receiving end. In a specific implementation, the reference crosstalk attenuation ratio may be set as a weighted average of the spectrum leakage value and the crosstalk component value or other comprehensive indicators, and is used to evaluate the overall crosstalk processing effect.
The foregoing description of the embodiments has been provided for the purpose of illustrating the general principles of the invention, and is not meant to limit the scope of the invention, but to limit the invention to the particular embodiments, and any modifications, equivalents, improvements, etc. that fall within the spirit and principles of the invention are intended to be included within the scope of the invention.

Claims (10)

1.一种基于OFDM调制的音频数据流压缩方法,其特征在于,该方法包括:1. A method for compressing an audio data stream based on OFDM modulation, characterized in that the method comprises: 步骤S1:对原始音频信号的高动态范围部分进行采样并获得采样信号,对采样信号进行用于噪声去除和量化转换的预处理,对采样信号的动态范围压缩失真进行初步降低,并使用多通道编码的方式将采样信号分成多个通道进行进一步失真降低处理;Step S1: sampling the high dynamic range portion of the original audio signal to obtain a sampled signal, preprocessing the sampled signal for noise removal and quantization conversion, preliminarily reducing the dynamic range compression distortion of the sampled signal, and using multi-channel coding to divide the sampled signal into multiple channels for further distortion reduction processing; 步骤S2:将每个通道的采样信号划分成若干帧,使用离散余弦变换逐帧处理采样信号并按不同通道输出DCT系数,使用频分复用调制方法将原始音频信号分成若干个子载波信号并置于频域中,设置量化分配策略用以将全部DCT系数随机分配到全部子载波信号上;Step S2: Divide the sampled signal of each channel into several frames, use discrete cosine transform to process the sampled signal frame by frame and output DCT coefficients according to different channels, use frequency division multiplexing modulation method to divide the original audio signal into several subcarrier signals and place them in the frequency domain, and set a quantization allocation strategy to randomly allocate all DCT coefficients to all subcarrier signals; 步骤S3:对子载波信号上的DCT系数进行逆离散傅里叶变换,用以将DCT系数所表示的原始音频信号的频域数据转换为时域OFDM符号,对每个OFDM符号添加循环前缀,并以各符号独立编码的方式对每个OFDM符号进行第二次多通道编码;Step S3: performing an inverse discrete Fourier transform on the DCT coefficients on the subcarrier signal to convert the frequency domain data of the original audio signal represented by the DCT coefficients into time domain OFDM symbols, adding a cyclic prefix to each OFDM symbol, and performing a second multi-channel encoding on each OFDM symbol in a manner where each symbol is independently encoded; 步骤S4:预设表示规范的参考音频信号并设置相匹配的参考串扰衰减比值,使用频分复用解调方法将时域的OFDM符号转换至表示频域数据的DCT系数,再进行逆离散余弦变换将DCT系数进行转换并将转换结果标注为修正音频信号;Step S4: Preset a reference audio signal representing a standard and set a matching reference crosstalk attenuation ratio, use a frequency division multiplexing demodulation method to convert the time-domain OFDM symbols into DCT coefficients representing frequency-domain data, then perform an inverse discrete cosine transform to convert the DCT coefficients and label the conversion result as a modified audio signal; 步骤S5:计算修正音频信号的修正串扰衰减比值并与参考串扰衰减比值进行比对,若修正串扰衰减比值小于参考串扰衰减比值时判定信号完成修正,若修正串扰衰减比值大于参考串扰衰减比值时则重新执行步骤S2。Step S5: Calculate the corrected crosstalk attenuation ratio of the corrected audio signal and compare it with the reference crosstalk attenuation ratio. If the corrected crosstalk attenuation ratio is less than the reference crosstalk attenuation ratio, it is determined that the signal correction is completed. If the corrected crosstalk attenuation ratio is greater than the reference crosstalk attenuation ratio, re-execute step S2. 2.根据权利要求1所述的一种基于OFDM调制的音频数据流压缩方法,其特征在于,所述频分复用调制方法的过程包括:针对子载波信号使用频域窗函数,用以平滑子载波频谱并动态调整子载波间隔;基于频域分析得到每个子载波信号的信道频率响应,基于信道频率响应识别多径效应引起的频谱扩展;后使用DFT变换将全部子载波信号转至频域,并对每个子载波信号进行频域滤波,将滤波之后的子载波信号的频谱数据补偿至由多径效应引起的频谱扩展。2. According to claim 1, a method for compressing an audio data stream based on OFDM modulation is characterized in that the process of the frequency division multiplexing modulation method includes: using a frequency domain window function for the subcarrier signal to smooth the subcarrier spectrum and dynamically adjust the subcarrier spacing; obtaining the channel frequency response of each subcarrier signal based on frequency domain analysis, and identifying the spectrum expansion caused by the multipath effect based on the channel frequency response; then using DFT transformation to transfer all subcarrier signals to the frequency domain, and performing frequency domain filtering on each subcarrier signal, and compensating the spectrum data of the filtered subcarrier signal to the spectrum expansion caused by the multipath effect. 3.根据权利要求2所述的一种基于OFDM调制的音频数据流压缩方法,其特征在于,所述使用频域窗函数的过程包括:将布莱克曼窗函数应用于每个子载波的频域数据用以加窗处理,基于布莱克曼窗函数特性调整子载波的间隔,给布莱克曼窗的主瓣宽度设定宽度临界值,当每个子载波的窗函数宽度低于宽度临界值时,该子载波信号增加一倍的子载波间隔宽度。3. The audio data stream compression method based on OFDM modulation according to claim 2 is characterized in that the process of using the frequency domain window function includes: applying the Blackman window function to the frequency domain data of each subcarrier for windowing processing, adjusting the subcarrier spacing based on the characteristics of the Blackman window function, and setting a width critical value for the main lobe width of the Blackman window. When the window function width of each subcarrier is lower than the width critical value, the subcarrier signal is doubled in subcarrier spacing width. 4.根据权利要求3所述的一种基于OFDM调制的音频数据流压缩方法,其特征在于,设子载波的窗函数的样本索引量表示为n,设窗函数的宽度表示为N,布莱克曼窗函数值表示为ω,第n个样本点上的窗函数值表示为ω(n),常数项表示为A,第一系数表示为B,第二系数表示为C,4. The audio data stream compression method based on OFDM modulation according to claim 3, characterized in that, the sample index of the window function of the subcarrier is represented as n, the width of the window function is represented as N, the Blackman window function value is represented as ω, the window function value at the nth sample point is represented as ω(n), the constant term is represented as A, the first coefficient is represented as B, and the second coefficient is represented as C. 则所述布莱克曼窗函数的计算式设置为:Then the calculation formula of the Blackman window function is set as: , 其中所述常数项A用于确保窗函数的值在窗的开始和结束的端点不等于零,The constant term A is used to ensure that the value of the window function is not equal to zero at the beginning and end of the window. 所述第一系数B和第二系数C均用于调整窗函数的主瓣和旁瓣特性。The first coefficient B and the second coefficient C are both used to adjust the main lobe and side lobe characteristics of the window function. 5.根据权利要求2所述的一种基于OFDM调制的音频数据流压缩方法,其特征在于,所述频谱扩展的补偿过程还包括基于信道频率响应设置补偿滤波器来调整频谱,所述补偿滤波器用于将频谱调整为降低失真。5. The audio data stream compression method based on OFDM modulation according to claim 2 is characterized in that the compensation process of the spectrum expansion also includes setting a compensation filter based on the channel frequency response to adjust the spectrum, and the compensation filter is used to adjust the spectrum to reduce distortion. 6.根据权利要求3所述的一种基于OFDM调制的音频数据流压缩方法,其特征在于,所述宽度临界值的设定方法为,构建频谱效率于宽度临界值的线性关系,根据频谱效率的高低来动态调整宽度临界值。6. The audio data stream compression method based on OFDM modulation according to claim 3 is characterized in that the width threshold is set by establishing a linear relationship between spectral efficiency and the width threshold, and dynamically adjusting the width threshold according to the level of spectral efficiency. 7.根据权利要求1所述的一种基于OFDM调制的音频数据流压缩方法,其特征在于,所述修正音频信号的修正串扰衰减比值设为修正音频信号的串扰幅度与修正音频信号的主瓣幅度之间的比值。7. The audio data stream compression method based on OFDM modulation according to claim 1 is characterized in that the corrected crosstalk attenuation ratio of the corrected audio signal is set to the ratio between the crosstalk amplitude of the corrected audio signal and the main lobe amplitude of the corrected audio signal. 8.根据权利要求1所述的一种基于OFDM调制的音频数据流压缩方法,其特征在于,所述参考串扰衰减比值基于频谱泄露值和串扰成分值来计算获得,其中所述频谱泄露值表示原始音频信号的期望频谱分量扩展到非目标频率的分量大小,所述串扰成分值表示原始音频信号出现重叠干扰导致的能量大小。8. According to the audio data stream compression method based on OFDM modulation according to claim 1, it is characterized in that the reference crosstalk attenuation ratio is calculated based on the spectrum leakage value and the crosstalk component value, wherein the spectrum leakage value represents the size of the component of the desired spectrum component of the original audio signal extended to the non-target frequency, and the crosstalk component value represents the energy size caused by overlapping interference in the original audio signal. 9.根据权利要求8所述的一种基于OFDM调制的音频数据流压缩方法,其特征在于,所述频谱泄露值设置为原始音频信号的旁瓣能量与参考音频信号的主瓣能量的比值;所述串扰成分值设置为原始音频信号的串扰成分总能量与参考音频信号的信号总能量的比值。9. The audio data stream compression method based on OFDM modulation according to claim 8 is characterized in that the spectrum leakage value is set to the ratio of the sidelobe energy of the original audio signal to the mainlobe energy of the reference audio signal; and the crosstalk component value is set to the ratio of the total crosstalk component energy of the original audio signal to the total signal energy of the reference audio signal. 10.根据权利要求1所述的一种基于OFDM调制的音频数据流压缩方法,其特征在于,所述循环前缀根据信道的多径效应和时延扩展动态来调整长度。10. The audio data stream compression method based on OFDM modulation according to claim 1, characterized in that the length of the cyclic prefix is dynamically adjusted according to the multipath effect and delay spread of the channel.
CN202411122852.8A 2024-08-15 2024-08-15 An audio data stream compression method based on OFDM modulation Active CN119068886B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202411122852.8A CN119068886B (en) 2024-08-15 2024-08-15 An audio data stream compression method based on OFDM modulation

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202411122852.8A CN119068886B (en) 2024-08-15 2024-08-15 An audio data stream compression method based on OFDM modulation

Publications (2)

Publication Number Publication Date
CN119068886A CN119068886A (en) 2024-12-03
CN119068886B true CN119068886B (en) 2025-10-03

Family

ID=93630899

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202411122852.8A Active CN119068886B (en) 2024-08-15 2024-08-15 An audio data stream compression method based on OFDM modulation

Country Status (1)

Country Link
CN (1) CN119068886B (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102025678A (en) * 2009-09-11 2011-04-20 华为技术有限公司 Channel estimation method, device and related detection system
CN102752253A (en) * 2011-12-22 2012-10-24 南京邮电大学 Method for inhibiting inter-carrier interference of orthogonal frequency division multiplexing (OFDM) system by time-frequency domain combined processing

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102804673B (en) * 2010-03-12 2015-12-02 瑞典爱立信有限公司 Method and apparatus for multiple access in a wireless communication network using DCT-OFDM
US8989088B2 (en) * 2011-01-07 2015-03-24 Integrated Device Technology Inc. OFDM signal processing in a base transceiver system
CN104217726A (en) * 2014-09-01 2014-12-17 东莞中山大学研究院 A lossless audio compression coding method and its decoding method
CN107454030B (en) * 2017-07-17 2020-03-17 科大智能电气技术有限公司 Power line broadband carrier semi-parallel transmitter and implementation method thereof
CN107566311B (en) * 2017-07-31 2020-02-18 南京邮电大学 Transmission method of RB F-OFDM system based on resource block filtering
CN112688896B (en) * 2020-12-22 2022-09-06 中南民族大学 Orthogonal frequency division multiplexing modulation system and frequency domain spreading non-orthogonal active interference cancellation method
CN117014732A (en) * 2023-08-11 2023-11-07 航天科工通信技术研究院有限责任公司 C-band wireless image transmission system

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102025678A (en) * 2009-09-11 2011-04-20 华为技术有限公司 Channel estimation method, device and related detection system
CN102752253A (en) * 2011-12-22 2012-10-24 南京邮电大学 Method for inhibiting inter-carrier interference of orthogonal frequency division multiplexing (OFDM) system by time-frequency domain combined processing

Also Published As

Publication number Publication date
CN119068886A (en) 2024-12-03

Similar Documents

Publication Publication Date Title
US7313519B2 (en) Transient performance of low bit rate audio coding systems by reducing pre-noise
US7088791B2 (en) Systems and methods for improving FFT signal-to-noise ratio by identifying stage without bit growth
EP2313884B1 (en) Parametric stereo conversion system and method
RU2550549C2 (en) Signal processing device and method and programme
EP2057809B1 (en) Transmission methods and apparatuses for cancelling inter-carrier interference
JP5312680B2 (en) Method and apparatus for adjusting channel delay parameters of multi-channel signals
RU2636697C1 (en) Device and method for coding
DK2615855T3 (en) Binaural signal enhancement system
CN109698712B (en) Narrow band satellite communication system
CN101076007A (en) Method for cancelling interference realized in frequency region and used in WCDMA straight-station system
GB2463231A (en) Embedding a watermark in an audio signal and applying gain to it
JP2012522255A (en) Audio signal classification method and apparatus
CN119068886B (en) An audio data stream compression method based on OFDM modulation
KR101436047B1 (en) Apparatus and method for reducing bit for digial to analog conversion in frequency division multiple access system
EP2104095A1 (en) A method and an apparatus for adjusting quantization quality in encoder and decoder
US8949114B2 (en) Method and arrangement for estimating the quality degradation of a processed signal
KR20070081381A (en) Automatic Gain Control Apparatus and Method in Wireless Communication System of Orthogonal Frequency Division Multiple Access
US9177566B2 (en) Noise suppression method and apparatus
CN105610748B (en) A kind of channel-equalization method of frequency segmentation
US12191834B2 (en) Method and unit for performing dynamic range control
JP4612511B2 (en) Receiving apparatus and receiving method
US20070147227A1 (en) Method of coding data, decoding method, transmitter and receiver
JP2009099084A (en) Conversion device
US7826561B2 (en) Single sideband voice signal tuning method
CN111277245B (en) Design method of low-order sub-band filter for filtering OFDM system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant