CN113270105B - A Voice-like Data Transmission Method Based on Hybrid Modulation - Google Patents
A Voice-like Data Transmission Method Based on Hybrid Modulation Download PDFInfo
- Publication number
- CN113270105B CN113270105B CN202110548729.2A CN202110548729A CN113270105B CN 113270105 B CN113270105 B CN 113270105B CN 202110548729 A CN202110548729 A CN 202110548729A CN 113270105 B CN113270105 B CN 113270105B
- Authority
- CN
- China
- Prior art keywords
- data
- frequency
- voice
- frequency band
- symbols
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/20—Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Digital Transmission Methods That Use Modulated Carrier Waves (AREA)
Abstract
本发明公开了一种基于混合调制的类语音数据传输方法,包括以下步骤:将输入数据流进行串并转换,得到固定长度的分段数据流;在声码器语音通道的最佳传输带宽内对频段进行划分,对不同的频段采用不同的调制方式,以达到最小的传输误码率,并且每个频段的误码率相同;通过快速傅里叶逆变换得到数据符号;将空白符号、同步头符号和数据符号一起组帧得到类语音信号帧;在接收端接收类语音信号帧并经过解调得到数据流。通过本发明可以实现将数据调制在分频段的多个频点的幅度或相位参数上,产生类语音信号,具有一般语音的时频特性,能通过带有声码器的语音信道进行传输,可广泛用于端到端语音信道的保密通信。
The invention discloses a voice-like data transmission method based on hybrid modulation, comprising the following steps: performing serial-parallel conversion on an input data stream to obtain a segmented data stream of fixed length; within the optimal transmission bandwidth of a voice channel of a vocoder The frequency bands are divided, and different modulation methods are used for different frequency bands to achieve the minimum transmission bit error rate, and the bit error rate of each frequency band is the same; the data symbols are obtained by inverse fast Fourier transform; the blank symbols, synchronization The header symbol and the data symbol are framed together to obtain a voice-like signal frame; the receiving end receives the voice-like signal frame and demodulates it to obtain a data stream. Through the present invention, data can be modulated on the amplitude or phase parameters of multiple frequency points in the sub-band to generate a voice-like signal, which has the time-frequency characteristics of general voice, can be transmitted through a voice channel with a vocoder, and can be widely used. Secure communication for end-to-end voice channels.
Description
技术领域technical field
本发明属于设计类语音调制的技术领域,尤其涉及一种基于混合调制的类语音数据传输方法。The invention belongs to the technical field of designing speech-like modulation, and in particular relates to a speech-like data transmission method based on hybrid modulation.
背景技术Background technique
语音通信是日常信息交流不可缺少的方式,对手机、电话机的窃听事件时有发生。手机被安装木马之后,就很容易通过无线网络窃听用户的通话;对电话语音的监听则可以通过对电话线路进行并线的方式达到。近些年出现的网络电话即VoIP业务,虽然能够通过IP包加密的方式实现一定程度的安全传输,但语音在IP分组内仍为明文形式,无法实现端到端的安全传输。因此,如果能够在语音传输前就对语音进行加密,那么就可以从根源上杜绝在后续语音处理、传输过程中因第三方窃听而造成的语音信息泄露。Voice communication is an indispensable way of daily information exchange, and the eavesdropping of mobile phones and telephones occurs frequently. After the Trojan horse is installed on the mobile phone, it is easy to eavesdrop on the user's call through the wireless network; the monitoring of the telephone voice can be achieved by paralleling the telephone lines. The VoIP service that has appeared in recent years can achieve a certain degree of secure transmission through IP packet encryption, but the voice is still in plaintext in the IP packet, which cannot achieve end-to-end secure transmission. Therefore, if the voice can be encrypted before the voice is transmitted, the leakage of voice information caused by third-party eavesdropping during the subsequent voice processing and transmission can be prevented from the root cause.
然而,语音信号经数字加密后得到的数据,通常不再具备语音特征,这将会对语音信道传输这些数据造成困难。因为在大多数语音传输系统中,为了节省传输信道带宽,会对语音进行压缩编码;传统语音信道采用的声码器多采用混合参数编码算法对语音进行处理,通过提取语音的模型参数进行传输。而加密后的语音数据由于不具备语音信号的基本特征,通过声码器时便会提取不到语音信号基本参数,或者将数据当作噪音处理,这样会造成数据信息的丢失,接收端就无法根据接收到的加密数据解密得到正确语音。However, the data obtained by the digital encryption of the voice signal usually no longer have the characteristics of voice, which will cause difficulties for the voice channel to transmit these data. Because in most voice transmission systems, in order to save the bandwidth of the transmission channel, the voice will be compressed and encoded; the vocoder used in the traditional voice channel mostly uses the mixed parameter coding algorithm to process the voice, and transmits by extracting the model parameters of the voice. Since the encrypted voice data does not have the basic characteristics of the voice signal, the basic parameters of the voice signal will not be extracted when passing through the vocoder, or the data will be treated as noise, which will cause the loss of data information, and the receiving end will not be able to. Decrypt the received encrypted data to obtain the correct voice.
因此,对于语音信号经数字加密得到的数据,在发送之前还需要进行处理,使其具备语音特征,能够顺利通过语音信道。这与调制的目的相似,调制技术是将数字信号变换成一定形式,以适应信道传输。一般的调制方法是改变一个正弦载波信号,用载波的典型特征:振幅、频率和相位等来承载数字数据。然而,大部分传统的数据调制解调器不能用于使用了语音编解码压缩技术的系统。语音编码的主要目的是减少表示语音所需的比特,同时在解码恢复时仍然保持合格的语音质量,使得人耳听起来和原始语音几乎一样。因此,声码器会限定其所传输的语音数据的最佳频带范围。同时由于人耳对于声音频率的感知是非线性的,并且声码器信道对于不同频段的响应也有所不同,无差别对待载波频率的调制方式可能会使得调制解调器输出的承载数据的频率、相位和振幅特征由于抽样波形前后的不同或载波频率与声码器信道的不适配性而发生较大的畸变。因此,大部分调制解调器不能直接应用在使用语音编解码器的信道上,需要重新设计一种调制解调方案,能够将数字信号变换成具备一定时频特性的语音信号,即类语音调制解调方案。Therefore, the data obtained by digital encryption of the voice signal needs to be processed before being sent, so that it has voice characteristics and can pass through the voice channel smoothly. This is similar to the purpose of modulation. The modulation technique is to transform the digital signal into a certain form to suit the channel transmission. The general modulation method is to change a sinusoidal carrier signal to carry digital data with the typical characteristics of the carrier: amplitude, frequency and phase. However, most conventional data modems cannot be used in systems that use speech codec compression techniques. The main purpose of speech coding is to reduce the number of bits required to represent speech, while still maintaining acceptable speech quality when decoding and recovering, so that the human ear sounds almost the same as the original speech. Therefore, the vocoder defines the optimal frequency band for the speech data it transmits. At the same time, since the human ear's perception of sound frequency is non-linear, and the response of the vocoder channel to different frequency bands is also different, the indiscriminate treatment of the modulation method of the carrier frequency may make the frequency, phase and amplitude characteristics of the data carried by the modem output. Larger distortion occurs due to the difference before and after the sampling waveform or the incompatibility between the carrier frequency and the vocoder channel. Therefore, most modems cannot be directly applied to the channel using the voice codec. It is necessary to redesign a modulation and demodulation scheme, which can convert the digital signal into a voice signal with certain time-frequency characteristics, that is, a voice-like modulation and demodulation scheme. .
发明内容SUMMARY OF THE INVENTION
本发明目的在于提供一种基于混合调制的类语音数据传输方法,以解决现有调制解调器不能直接应用在使用语音编解码器的信道上,数字信号不能变换成具备一定时频特性的语音信号的技术问题。The purpose of the present invention is to provide a voice-like data transmission method based on hybrid modulation, so as to solve the problem that the existing modem cannot be directly applied to the channel using the voice codec, and the digital signal cannot be converted into a voice signal with certain time-frequency characteristics. question.
为解决上述技术问题,本发明的具体技术方案如下:For solving the above-mentioned technical problems, the concrete technical scheme of the present invention is as follows:
一种基于混合调制的类语音数据传输方法,包括以下步骤:A voice-like data transmission method based on hybrid modulation, comprising the following steps:
步骤1、对输入的低速数据流进行串并转换,得到n段固定长度的分段数据流{d1,1,d1,2,…,d1,k;d2,1,d2,2,...,d2,k;......;dn,1,dn,2,…,dn,k},每段数据流包含的数据比特量用k表示;
步骤2、选取声码器通道的传输频带[fL,fH],其中fL为声码器允许通过的最低频率,fH为声码器允许通过的最高频率。根据调制方式的不同将声码器通道的传输频带[fL,fH]划分成m个频段,每个频段包含的频点数为Nl个,其中l=1,2,...,m,相邻频点之间的频率间隔为Δf,Δf≥1/T,其中T为一个数据符号周期;Step 2. Select the transmission frequency band [f L , f H ] of the vocoder channel, where f L is the lowest frequency allowed to pass by the vocoder, and f H is the highest frequency allowed to pass by the vocoder. According to different modulation methods, the transmission frequency band [f L , f H ] of the vocoder channel is divided into m frequency bands, each frequency band contains N l frequency points, where l=1, 2,...,m , the frequency interval between adjacent frequency points is Δf, Δf≥1/T, where T is a data symbol period;
步骤3、在每个频段内,根据该频段的调制方式将数据映射成频点的相位和幅度度参数,构造数据符号的频谱{X(p)};Step 3. In each frequency band, map the data into the phase and amplitude parameters of the frequency point according to the modulation mode of the frequency band, and construct the spectrum {X(p)} of the data symbol;
步骤4、将构造好的数据符号的频域数据进行快速傅里叶反变换,得到数据符号的时域波形{x(q)},将n个数据符号与空白符号和同步符号一起组帧得到类语音信号,每一组完整地包含了空白符号、同步符号和数据符号的组合被称作一帧类语音信号帧;Step 4. Perform inverse fast Fourier transform on the frequency-domain data of the constructed data symbols to obtain the time-domain waveform {x(q)} of the data symbols, and frame n data symbols together with blank symbols and synchronization symbols to obtain Voice-like signal, each group of which completely includes blank symbols, synchronization symbols and data symbols is called a frame of voice-like signal frames;
步骤5、在解调端接收类语音信号,进行帧同步、类语音解调操作,得到数据流。Step 5: Receive the voice-like signal at the demodulation end, perform frame synchronization and voice-like demodulation operations, and obtain a data stream.
进一步的,用来划分频段的步骤是让每个频段内选用调制方式,使得每个频段承载的数据,在传输后获得相同的误码率;Further, the step used to divide the frequency band is to select the modulation mode in each frequency band, so that the data carried by each frequency band can obtain the same bit error rate after transmission;
所述每个频段的调制方式的选择,由每个频点调制的最大相位数N决定,满足:The selection of the modulation mode of each frequency band is determined by the maximum number of phases N modulated at each frequency point, which satisfies:
N<fs/(M·f(i))N<f s /(M·f(i))
其中fs为信号采样率,f(i)为频点频率,i=1,2,...,Nl,M为采样点偏移数;Where f s is the signal sampling rate, f(i) is the frequency of the frequency point, i=1, 2,..., N l , and M is the number of sampling point offsets;
当fs和M一定时,传输频带被划分成m个频段,每个频段内的频点调制的相位数小于N。When f s and M are fixed, the transmission frequency band is divided into m frequency bands, and the number of phases of frequency modulation in each frequency band is less than N.
进一步的,所述步骤3中,设每个频点的相位可承载j比特数据,则可以映射成2j个可选相位参数;数据与幅度的映射采用频段内每两个频点为一组的相对幅度调制方式,每个频段通过幅度调制承载的数据量为Nl/2比特,Nl为每个频段包含的频点数,其中l=1,2,...,m。Further, in the step 3, if the phase of each frequency point can carry j-bit data, it can be mapped into 2 j optional phase parameters; the mapping of data and amplitude adopts every two frequency points in the frequency band as a group. The relative amplitude modulation method of , the amount of data carried by each frequency band through amplitude modulation is N l /2 bits, and N l is the number of frequency points included in each frequency band, where l=1, 2, ..., m.
进一步的,所述步骤4中,信号帧的帧结构包括nb个空白符号、一个同步头符号和n个数据符号,每个数据符号承载k比特数据;进行保密通信时,信号帧内还包含对应的密钥流的密钥序号。Further, in the step 4, the frame structure of the signal frame includes n b blank symbols, a synchronization header symbol and n data symbols, and each data symbol carries k-bit data; when conducting confidential communication, the signal frame also contains The key sequence number of the corresponding key stream.
本发明的一种基于混合调制的类语音数据传输方法,具有以下优点:A kind of speech-like data transmission method based on hybrid modulation of the present invention has the following advantages:
1、本发明基于分频段多参数的混合调制方式得到的类语音信号具有一般语音信号的时频特性,频率范围在声码器最佳传输频带范围内,能够顺利通过声码器信道,具有良好抵抗语音压缩和解压缩非线性操作的能力;1. The speech-like signal obtained by the present invention based on the hybrid modulation method of sub-band and multi-parameter has the time-frequency characteristics of general speech signals, and the frequency range is within the optimal transmission frequency band range of the vocoder, which can smoothly pass through the vocoder channel and has good performance. The ability to resist non-linear operations of speech compression and decompression;
2、本发明将频谱空间进行多段划分,每段内选择最佳调制方式,使得每个频段可以获得相同的最低误码率,同时最大化数据符号承载比特量;2. The present invention divides the spectrum space into multiple segments, and selects the best modulation mode in each segment, so that each frequency band can obtain the same minimum bit error rate, while maximizing the bit amount of data symbols carried;
3、本发明成帧和解帧速度快,不影响低速数据的传输,调制解调延时小,可以实现实时传输。3. The framing and de-framing speed of the present invention is fast, the transmission of low-speed data is not affected, the modulation and demodulation delay is small, and real-time transmission can be realized.
附图说明Description of drawings
图1为本发明的基于混合调制的类语音数据传输方法流程示意图;1 is a schematic flowchart of a method for transmitting voice-like data based on hybrid modulation of the present invention;
图2为本发明的单个数据符号的幅度谱示意图;Fig. 2 is the amplitude spectrum schematic diagram of the single data symbol of the present invention;
图3为本发明的单个数据符号的相位谱示意图;Fig. 3 is the phase spectrum schematic diagram of the single data symbol of the present invention;
图4为本发明的信号帧的帧结构格式示意图;4 is a schematic diagram of a frame structure format of a signal frame of the present invention;
图5为本发明的同步符号的自相关特性示意图。FIG. 5 is a schematic diagram of the autocorrelation characteristic of the synchronization symbol of the present invention.
具体实施方式Detailed ways
为了更好地了解本发明的目的、结构及功能,下面结合附图,对本发明一种基于混合调制的类语音数据传输方法做进一步详细的描述。In order to better understand the purpose, structure and function of the present invention, a hybrid modulation-based speech-like data transmission method of the present invention will be described in further detail below with reference to the accompanying drawings.
如图1所示,为本发明提出的一种基于混合调制的类语音数据传输方法的整体流程示意图,该方法具体包括以下步骤:As shown in FIG. 1, it is a schematic overall flow diagram of a hybrid modulation-based voice data transmission method proposed by the present invention, and the method specifically includes the following steps:
步骤1、串并转换:对输入的低速数据流进行串并转换,得到n段固定长度的分段数据流{d1,1,d1,2,...,d1,k;d2,1,d2,2,...,d2,k;......;dn,1,dn,2,...,dn,k},每段数据流包含的比特量用k表示,k=30比特;
步骤2、频段划分:选取声码器通道的最佳传输频带[fL,fH],其中fL为声码器允许通过的最低频率,fH为声码器允许通过的最高频率。根据可选用的调制方式的不同将声码器通道的传输频带[fL,fH]划分成m个频段,每个频段包含的频点数为Nl个,其中l=1,2,...,m,相邻频点之间的频率间隔为Δf,不小于一个数据符号周期T的倒数,即:Δf≥1/T;Step 2, frequency band division: select the optimal transmission frequency band [f L , f H ] of the vocoder channel, where f L is the lowest frequency allowed by the vocoder, and f H is the highest frequency allowed by the vocoder. The transmission frequency band [f L , f H ] of the vocoder channel is divided into m frequency bands according to the different modulation modes that can be selected, and each frequency band contains N l frequency points, where l=1, 2, .. ., m, the frequency interval between adjacent frequency points is Δf, which is not less than the reciprocal of a data symbol period T, that is: Δf≥1/T;
具体的,常见的声码器通道最佳传输频带为[300Hz,3400Hz],在此频带范围内选择多个有效频点,设有效频点数为Nf=30个,初始频率为fL=300Hz,最高频率为fH=3400Hz,频率间隔Δf=100Hz,则每个频点频率为f(i):Specifically, the best transmission frequency band of a common vocoder channel is [300Hz, 3400Hz], select multiple effective frequency points within this frequency band, set the number of effective frequency points to be N f =30, and the initial frequency to be f L =300Hz , the highest frequency is f H = 3400Hz, and the frequency interval Δf = 100Hz, then the frequency of each frequency point is f(i):
f(i)=f0+i*Δf,i=0,1,2,...,30-1f(i)=f 0 +i*Δf, i=0, 1, 2, ..., 30-1
其中f0为起始频率,一般等于fL;i为频点位置,Δf为相邻频点的频率间隔。Where f 0 is the starting frequency, generally equal to f L ; i is the frequency point position, and Δf is the frequency interval between adjacent frequency points.
每个数据帧在类语音调制后包含每个频点对应的子载波之和。Each data frame contains the sum of subcarriers corresponding to each frequency point after speech-like modulation.
进一步的,设单个频点对应的子载波每周期采样点数为nT=fs/f(i),fs代表采样率,单采样点偏移所导致的相位偏移量为 Further, let the number of sampling points per cycle of the subcarrier corresponding to a single frequency point be n T =f s /f(i), f s represents the sampling rate, and the phase offset caused by the offset of a single sampling point is
设接收端在进行帧同步时发生M点的采样点偏移,频点f(i)对应的子载波可调制的相位数为N,则解调无误码的条件为:Assuming that the sampling point offset of M points occurs when the receiving end performs frame synchronization, and the number of modulated phases of the subcarrier corresponding to the frequency point f(i) is N, the conditions for demodulation error-free are:
代入前述公式,可得每个频点可调制的相位数N满足:Substituting into the above formula, it can be obtained that the number of phases N that can be modulated at each frequency point satisfies:
N<fs/(M·f(i))N<f s /(M·f(i))
即,每个频点对应的子载波可调制的相位数与该频点频率和帧同步的精确程度有关,当采样率fs和采样点偏移数M一定时,可以根据f(i)的不同将频谱划分为多个频段,每个频段根据可调制相位数的不同选用不同的多进制相移键控(N-Phase Shift Keying,N-PSK)调制方式,当可调相位数N小于2时,不再采用相位调制,即高频段考虑幅移键控等其它调制方式。That is, the number of modulated phases of the subcarrier corresponding to each frequency point is related to the frequency of the frequency point and the accuracy of frame synchronization. When the sampling rate f s and the number of sampling point offsets M are constant, the frequency of The spectrum is divided into multiple frequency bands, and each frequency band uses a different multi-ary phase shift keying (N-Phase Shift Keying, N-PSK) modulation method according to the number of adjustable phases. When the adjustable phase number N is less than 2, phase modulation is no longer used, that is, other modulation methods such as amplitude shift keying are considered in the high frequency band.
M=2时,传输频带理论上可被划分成四个频段,为降低算法复杂度同时兼具传输效果,实际应用中划分三个频段:[300Hz,1000Hz]、[1100Hz,1600Hz]、[1700Hz,3200Hz]。图2、图3给出了该情况下某个数据符号的幅频和相频示意图,频段一包含的频点数N1=8个频点,采用正交相移键控(Quadrature Phase Shift Keying,QPSK)调制方式,则频段一内承载数据量S1(f)=2*N1=16比特;频段二包含的频点数N2=6个频点,混合采用分组相对幅度调制和二进制相移键控(Binary Phase Shift Keying,BPSK)调制方式,则频段二内承载数据量S2(f)=N2=6比特;频段三包含的频点数N3=16个频点,只采用分组相对幅度调制方式,每个频点相位保持为0,则频段三内承载数据量S3(f)=N3/2=8比特。三个频段包含的数据量之和应满足:When M=2, the transmission frequency band can be divided into four frequency bands in theory. In order to reduce the complexity of the algorithm and have the transmission effect at the same time, in practical application, three frequency bands are divided: [300Hz, 1000Hz], [1100Hz, 1600Hz], [1700Hz] , 3200Hz]. Fig. 2 and Fig. 3 are schematic diagrams of amplitude frequency and phase frequency of a certain data symbol in this case. The number of frequency points included in
S(f)=S1(f)+S2(f)+S3(f)=2*N1+N2+N3/2=30 (比特)S(f)=S 1 (f)+S 2 (f)+S 3 (f)=2*N 1 +N 2 +N 3 /2=30 (bits)
至此,每段数据流包含的k=30比特数据量被分配到了其对应的数据符号的频谱上。So far, the data amount of k=30 bits contained in each data stream is allocated to the spectrum of its corresponding data symbol.
步骤3、数据-频谱参数映射:在每个频段内,根据该频段的调制方式将数据映射成频点的相位和幅度参数,构造数据符号的频谱{X(p)};Step 3, data-spectrum parameter mapping: in each frequency band, according to the modulation mode of the frequency band, the data is mapped into the phase and amplitude parameters of the frequency point, and the spectrum {X(p)} of the data symbol is constructed;
步骤4、类语音生成:将构造好的数据符号的频域数据进行快速傅里叶反变换,得到数据符号的时域波形{x(q)},将n个数据符号与空白符号和同步符号一起组帧得到类语音信号,每一组完整地包含了空白符号、同步符号和数据符号的组合被称作一帧类语音信号帧;Step 4. Voice-like generation: Inverse fast Fourier transform is performed on the frequency domain data of the constructed data symbols to obtain the time domain waveform {x(q)} of the data symbols, and n data symbols are combined with blank symbols and synchronization symbols. Group frames together to obtain speech-like signals, and each group that completely includes blank symbols, synchronization symbols and data symbols is called a frame of speech-like signal frames;
具体的,参照图4的示意,为每个类语音信号的帧结构格式,其由nb=6个空白符号、1个同步头符号和n=32个数据符号组成,每个信号帧共包含nb+n+1=39个符号,除同步头符号外,每个符号的时间为T=10毫秒,同步头时长为20毫秒,因此每一信号帧时间为Tall=T*(nb+n)+2*T=400毫秒。Specifically, referring to the schematic diagram of FIG. 4 , it is the frame structure format of each speech-like signal, which consists of n b =6 blank symbols, 1 synchronization header symbol and n = 32 data symbols, and each signal frame contains a total of n b +n+1=39 symbols, except for the sync header symbol, the time of each symbol is T=10 ms, and the duration of the sync header is 20 ms, so the time of each signal frame is T all =T*(n b +n)+2*T=400 milliseconds.
进一步的,数据帧传输的数据量为S(t)=n*k=960比特,因此总的数据传输速率v(t)为:Further, the amount of data transmitted by the data frame is S(t)=n*k=960 bits, so the total data transmission rate v(t) is:
进一步的,所述同步符号应选用具有强自相关性的定长时间序列,比如chirp信号,它时间长度为2*T=20毫秒,瞬时频率在[300Hz,3400Hz]范围内随着时间的变化而变化。参照图5的示意,即同步符号的自相关特性示意图,只有在非滞后非超前的时刻“0”,同步符号才表现出强相关性,其它时刻相关性很弱;根据这一特性,接收端可以利用相同的同步符号采样数据进行滑动相关,根据设定的相关阈值来精确定位接收的类语音信号中数据符号的位置。Further, the synchronization symbol should be a fixed time sequence with strong autocorrelation, such as chirp signal, whose time length is 2*T=20 milliseconds, and the instantaneous frequency changes with time in the range of [300Hz, 3400Hz]. and change. Referring to the schematic diagram of FIG. 5, that is, the schematic diagram of the autocorrelation characteristic of the synchronization symbol, only at the non-lag and non-advanced time "0", the synchronization symbol shows strong correlation, and the correlation at other times is very weak; according to this characteristic, the receiving end The same synchronization symbol sample data can be used to perform sliding correlation, and the position of the data symbol in the received speech-like signal can be precisely located according to the set correlation threshold.
步骤5、类语音解调:在解调端接收类语音信号,进行帧同步、类语音解调操作,得到低速数据流。Step 5. Voice-like demodulation: receive a voice-like signal at the demodulation end, perform frame synchronization and voice-like demodulation operations, and obtain a low-speed data stream.
具体的,在解调端测得的每个频段内数据传输的误码率为:频段一5‰、频段二5‰、频段三5‰,三个频段达到了相同的最低误码率,使得整体误码率最小,为5‰。Specifically, the bit error rate of data transmission in each frequency band measured at the demodulation end is:
本发明提出的基于混合调制的类语音数据传输方法,可以用于传输加密后的语音数据,这样在将可懂的语音变为不可懂语音的同时,还保留了语音特性,这样加密后的语音数据就可以顺利通过语音压缩编码通道进行传输。The voice-like data transmission method based on hybrid modulation proposed by the present invention can be used to transmit encrypted voice data, so that when intelligible voice is changed into incomprehensible voice, the voice characteristics are also retained. The data can be transmitted smoothly through the voice compression coding channel.
可以理解,本发明是通过一些实施例进行描述的,本领域技术人员知悉的,在不脱离本发明的精神和范围的情况下,可以对这些特征和实施例进行各种改变或等效替换。另外,在本发明的教导下,可以对这些特征和实施例进行修改以适应具体的情况及材料而不会脱离本发明的精神和范围。因此,本发明不受此处所公开的具体实施例的限制,所有落入本申请的权利要求范围内的实施例都属于本发明所保护的范围内。It can be understood that the present invention is described by some embodiments, and those skilled in the art know that various changes or equivalent substitutions can be made to these features and embodiments without departing from the spirit and scope of the present invention. In addition, in the teachings of this invention, these features and embodiments may be modified to adapt a particular situation and material without departing from the spirit and scope of the invention. Therefore, the present invention is not limited by the specific embodiments disclosed herein, and all embodiments falling within the scope of the claims of the present application fall within the protection scope of the present invention.
Claims (4)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202110548729.2A CN113270105B (en) | 2021-05-20 | 2021-05-20 | A Voice-like Data Transmission Method Based on Hybrid Modulation |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202110548729.2A CN113270105B (en) | 2021-05-20 | 2021-05-20 | A Voice-like Data Transmission Method Based on Hybrid Modulation |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN113270105A CN113270105A (en) | 2021-08-17 |
| CN113270105B true CN113270105B (en) | 2022-05-10 |
Family
ID=77231922
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202110548729.2A Active CN113270105B (en) | 2021-05-20 | 2021-05-20 | A Voice-like Data Transmission Method Based on Hybrid Modulation |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN113270105B (en) |
Families Citing this family (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN114550716A (en) * | 2022-02-10 | 2022-05-27 | 北京沃东天骏信息技术有限公司 | A method and device for controlling equipment based on sound signal |
| CN115514603B (en) * | 2022-08-10 | 2025-04-18 | 紫金山实验室 | Voice-like modulation and demodulation method and device |
Citations (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101026836A (en) * | 2007-04-06 | 2007-08-29 | 东南大学 | Anti vocoder compressed end-to-end voice encryption device and method |
| CN102103859A (en) * | 2011-01-11 | 2011-06-22 | 东南大学 | Methods and devices for coding and decoding digital audio signals |
| WO2013141638A1 (en) * | 2012-03-21 | 2013-09-26 | 삼성전자 주식회사 | Method and apparatus for high-frequency encoding/decoding for bandwidth extension |
| CN104123946A (en) * | 2006-07-31 | 2014-10-29 | 高通股份有限公司 | Systemand method for including identifier with packet associated with speech signal |
| CN104185173A (en) * | 2013-05-23 | 2014-12-03 | 东南大学常州研究院 | Enhanced variable rate code resistant voice end-to-end encryption and decryption method |
| CN109525282A (en) * | 2017-09-18 | 2019-03-26 | 丰郅(上海)新能源科技有限公司 | Realize the device and method of data transmission |
| CN109545233A (en) * | 2018-06-20 | 2019-03-29 | Mimi听力技术有限责任公司 | System and method for handling audio signal to be reset on audio devices |
| CN112614498A (en) * | 2020-12-16 | 2021-04-06 | 东南大学 | Method for modulating low-speed data into voice-like signal |
Family Cites Families (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2020154367A1 (en) * | 2019-01-23 | 2020-07-30 | Sound Genetics, Inc. | Systems and methods for pre-filtering audio content based on prominence of frequency content |
-
2021
- 2021-05-20 CN CN202110548729.2A patent/CN113270105B/en active Active
Patent Citations (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN104123946A (en) * | 2006-07-31 | 2014-10-29 | 高通股份有限公司 | Systemand method for including identifier with packet associated with speech signal |
| CN101026836A (en) * | 2007-04-06 | 2007-08-29 | 东南大学 | Anti vocoder compressed end-to-end voice encryption device and method |
| CN102103859A (en) * | 2011-01-11 | 2011-06-22 | 东南大学 | Methods and devices for coding and decoding digital audio signals |
| WO2013141638A1 (en) * | 2012-03-21 | 2013-09-26 | 삼성전자 주식회사 | Method and apparatus for high-frequency encoding/decoding for bandwidth extension |
| CN104185173A (en) * | 2013-05-23 | 2014-12-03 | 东南大学常州研究院 | Enhanced variable rate code resistant voice end-to-end encryption and decryption method |
| CN109525282A (en) * | 2017-09-18 | 2019-03-26 | 丰郅(上海)新能源科技有限公司 | Realize the device and method of data transmission |
| CN109545233A (en) * | 2018-06-20 | 2019-03-29 | Mimi听力技术有限责任公司 | System and method for handling audio signal to be reset on audio devices |
| CN112614498A (en) * | 2020-12-16 | 2021-04-06 | 东南大学 | Method for modulating low-speed data into voice-like signal |
Non-Patent Citations (1)
| Title |
|---|
| 一种抗RPE-LTP声码器压缩的端到端数据传输方法;陈立全 等;《中国工程科学》;20071231;第9卷(第18期);第81-85页 * |
Also Published As
| Publication number | Publication date |
|---|---|
| CN113270105A (en) | 2021-08-17 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN113726704A (en) | Frequency shift chirp spread spectrum modulation and demodulation method based on grouping | |
| US7787550B2 (en) | Combined frame alignment and timing recovery in digital subscriber line (DSL) communications systems | |
| DE112007000123B4 (en) | A modem for communicating data over a voice channel of a communication system | |
| CN101321146B (en) | Peak-to-average ratio restraining method and device in multi-carrier orthogonal frequency division multiplexing system | |
| CN113270105B (en) | A Voice-like Data Transmission Method Based on Hybrid Modulation | |
| JP2004147345A5 (en) | ||
| JP2014526201A (en) | Method and apparatus for reducing PAPR in a filter bank multi-carrier system | |
| CN101026836A (en) | Anti vocoder compressed end-to-end voice encryption device and method | |
| CN101340417A (en) | Improved Iterative PTS Method for Reducing Peak-to-Average Ratio in OFDM System | |
| CN112614498B (en) | A method of modulating low-speed data into speech-like signals | |
| US20090022214A1 (en) | Optimized Short Initialization After Low Power Mode for Digital Subscriber Line Communications | |
| CN110011760B (en) | Full-duplex multi-carrier near-field communication method based on sound waves | |
| CN105376189A (en) | Acoustic communication method and system | |
| US20040218521A1 (en) | Method for data communication between a single-carrier system and a multi-carrier system | |
| CN107426126A (en) | Frequency modulation modulates permanent envelope ofdm system | |
| CN101110805B (en) | Receiving and dispatching method and system based on orthogonal frequency division multiplexing | |
| CN105450312A (en) | Acoustic wave communication sending method and device | |
| CN107438041A (en) | A kind of method and device for sending signal and reception signal | |
| Biancucci et al. | Secure data and voice transmission over GSM voice channel: Applications for secure communications | |
| Rashidi et al. | Data mapping onto speech-like signal to transmission over the gsm voice channel | |
| CN110166398B (en) | Frequency offset OFDM transmission method | |
| Chen et al. | A r-ask ofdm modulation and demodulation scheme against vocoder compression | |
| CN109346090A (en) | A method of penetrating vocoder | |
| CN109039967A (en) | A kind of solution of the f-OFDM system ISI based on ICI from deleting technique | |
| Lin et al. | FBMC/COQAM: an enabler for cognitive radio |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant |