[go: up one dir, main page]

WO2018161429A1 - Noise detection method, and terminal apparatus - Google Patents

Noise detection method, and terminal apparatus Download PDF

Info

Publication number
WO2018161429A1
WO2018161429A1 PCT/CN2017/083765 CN2017083765W WO2018161429A1 WO 2018161429 A1 WO2018161429 A1 WO 2018161429A1 CN 2017083765 W CN2017083765 W CN 2017083765W WO 2018161429 A1 WO2018161429 A1 WO 2018161429A1
Authority
WO
WIPO (PCT)
Prior art keywords
frequency
noise
audio signal
frame
terminal device
Prior art date
Application number
PCT/CN2017/083765
Other languages
French (fr)
Chinese (zh)
Inventor
张健
张海宏
陶蓓
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司 filed Critical 华为技术有限公司
Priority to CN201780026318.9A priority Critical patent/CN109074814B/en
Publication of WO2018161429A1 publication Critical patent/WO2018161429A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M9/00Arrangements for interconnection not involving centralised switching
    • H04M9/08Two-way loud-speaking telephone systems with means for conditioning the signal, e.g. for suppressing echoes for one or both directions of traffic
    • H04M9/082Two-way loud-speaking telephone systems with means for conditioning the signal, e.g. for suppressing echoes for one or both directions of traffic using echo cancellers
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/02Constructional features of telephone sets
    • H04M1/19Arrangements of transmitters, receivers, or complete sets to prevent eavesdropping, to attenuate local noise or to prevent undesired transmission; Mouthpieces or receivers specially adapted therefor
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D30/00Reducing energy consumption in communication networks
    • Y02D30/70Reducing energy consumption in communication networks in wireless communication networks

Definitions

  • the present application relates to the field of communication systems and the field of voice processing, and in particular, to a noise detection method and a terminal device.
  • Time Division Duplexing (TDD) noise and board vibration are "current sounds" caused by intermittently emitting large currents in mobile phones.
  • the Power Amplifier (PA) of the Time Division Multiple Access (TDMA) mobile phone of the Global System for Mobile communication (GSM) system transmits a large power every 4.616 ms.
  • the current causes the electro-acoustic device to receive interference and demodulate the TDD noise.
  • the battery voltage drops at the same frequency, acting on the ceramic capacitor, causing mechanical vibration and passing through the main board to form a plate vibration.
  • the input signal x(t) is first transformed from the time domain to the spectral domain to obtain the spectral signal S(f), and secondly, the spectral signal S(f) is transformed from the spectral domain to the cepstrum domain.
  • the threshold determines that the input signal is TDD noise; if the spectral signal C(q 1 ) corresponding to the frequency q 1 of the input signal in the cepstrum domain is not greater than a preset threshold, it is determined that the input signal is not TDD noise.
  • the cepstrum domain cannot quantitatively indicate the size of the input signal due to the characteristics of the cepstrum domain itself. Therefore, the existing detection technique can only judge that the input signal is TDD noise, or the input signal is not TDD noise. However, when it is determined that the input signal is TDD noise, the TDD noise level cannot be determined.
  • the embodiment of the present application provides a noise detecting method and a terminal device for detecting a noise energy amount in a signal.
  • the first aspect of the present application provides a noise detection method, including:
  • the amplitude spectrum of each frame signal is calculated according to the first formula; secondly, the noise frequency is determined according to the candidate noise frequency distribution, wherein the candidate noise frequency distribution is obtained by performing cepstrum analysis on each frame signal; finally, in determining After the amplitude spectrum and the noise frequency of each frame signal, and correspondingly calculating the noise energy value of each frame signal according to the above noise frequency and amplitude spectrum, it can be understood that the above noise energy value can represent the noise level in each frame signal.
  • the noise detection method further includes:
  • each frequency of the amplitude value of each frame of the audio signal exceeds a preset threshold is determined as each target frequency, wherein the frequency search interval is a frequency interval in the cepstrum domain;
  • Each target frequency having the largest geometric mean value corresponding to the amplitude value in each frame of the audio signal is determined as each candidate noise frequency.
  • the target frequency is first determined in each frame of the audio signal, and then the target in each frame of the audio signal The frequency is filtered out of the candidate noise frequency. In this way, each candidate noise frequency is effectively selected from each frame of the audio signal.
  • the noise method before calculating the amplitude value spectrum of each frame of the audio signal according to the first formula, the noise method further includes:
  • the sampled signal is subjected to framing and windowing to obtain at least two frames of audio signals.
  • the framing operation and windowing processing of the sampled signal can improve the performance of the algorithm for calculating the audio signal, and can also obtain the TDD noise or the sound of the board vibration over time. The relationship of change.
  • the noise detecting method before the sampling signal is subjected to framing and windowing to obtain at least two or more audio signals, the noise detecting method further includes:
  • the input audio signal is N-times up-sampled to obtain a sampled signal, where N is a positive integer not less than 2.
  • the input audio signal is sampled by using N times of interpolation and upsampling, so that the noise frequency is far away from the low frequency interference, thereby improving the accuracy of the noise detection detection.
  • the preset calculation method includes a loudness calculation method.
  • the loudness calculation method can be used to calculate the noise energy size efficiently and accurately.
  • the embodiment of the present application provides a terminal device, where the terminal device has the function of implementing the behavior of the terminal device in the foregoing method embodiment.
  • This function can be implemented in hardware or in hardware by executing the corresponding software.
  • the hardware or software includes one or more modules corresponding to the functions described above.
  • an embodiment of the present application provides a terminal device, including: a processor, a memory, a bus, a transmitter, and a receiver; the memory is configured to store a computer to execute an instruction, and the processor is connected to the memory through the bus.
  • the processor executes the computer-executed instruction stored in the memory to cause the terminal device to perform the noise detecting method according to any one of the above first aspects.
  • an embodiment of the present application provides a computer readable storage medium, configured to be stored as the foregoing terminal device.
  • the computer software instructions used, when run on a computer, cause the computer to perform the noise detection method of any of the above first aspects.
  • an embodiment of the present application provides a computer program product comprising instructions, which when executed on a computer, enable the computer to perform the noise detecting method of any of the above first aspects.
  • FIG. 1 is a schematic diagram of an embodiment of a noise detecting method in an embodiment of the present application
  • FIG. 2 is a schematic diagram of an embodiment of a terminal device according to an embodiment of the present application.
  • FIG. 3 is a schematic diagram of another embodiment of a terminal device according to an embodiment of the present application.
  • FIG. 4 is a schematic diagram of another embodiment of a terminal device according to an embodiment of the present application.
  • the embodiment of the present application provides a noise detecting method and a terminal device for detecting a noise energy amount in a signal.
  • the noise detecting method in the embodiment of the present application is mainly used for detecting TDD noise and board vibration in a mobile phone, and of course, can also be used for noises of other harmonic forms, and the present application does not impose any limitation.
  • the noise detection method in the embodiment of the present application will be described below in conjunction with a specific embodiment.
  • Embodiment 1 As shown in FIG. 1 , an embodiment of a noise detecting method in an embodiment of the present application includes:
  • the input audio signal is subjected to N times of interpolation upsampling to obtain a sampling signal, where N is a positive integer greater than or equal to 2.
  • the input audio signal is X
  • the output sampling signal is Z.
  • the N-time interpolation upsampling of the input audio signal does not change the amplitude value of the input audio signal, and actually increases the sampling frequency by N times.
  • the present application does not impose any restrictions on other sampling schemes that can achieve the same technical effects.
  • the sampling signal is performed.
  • the framing operation and the windowing process result in at least two frames of audio signals.
  • W can be a common window function (such as a rectangular window, a triangular window, a Hanning window, a Hamming window, and One of the Gaussian windows, etc., or other newly designed window functions, is not limited in this application.
  • the framing operation can not only improve the performance of the algorithm for calculating the audio signal, but also the relationship between the TDD noise or the amplitude of the plate vibration sound over time.
  • the framing operation does not change the amplitude value of the input audio signal, and the windowing process mainly mitigates the spectrum leakage problem.
  • the amplitude spectrum of each frame of the audio signal is calculated according to the first formula, and the amplitude spectrum of each frame of the audio signal is obtained.
  • M i abs (fft ( Y i)); expression of M i
  • the cepstrum of each frame of the audio signal is calculated according to the second formula, and the cepstrum of each frame of the audio signal is obtained.
  • C i represents an ith frame audio signal in the cepstrum domain
  • C i can be expressed as:
  • C i real(ifft(log(M i )));
  • log represents the logarithm;
  • ift represents the fast inverse Fourier transform, real represents the real number.
  • each candidate noise frequency in the frequency search interval is a frequency range corresponding to the cepstrum domain, and the frequency search interval thereof.
  • the specific determination manner may be preset according to the experience of the prophet, or may be determined according to different mobile phone systems, and the application does not impose any restrictions.
  • determining each candidate noise frequency within the frequency search interval is:
  • the frequency search interval in the cepstrum domain is determined according to the experience of the prophet. It should be noted that there are many amplitude values in the cepstrum domain, but the amplitude values in the cepstrum domain cannot quantitatively represent the audio signal of each frame. Size
  • the preset threshold is used to select each frequency of each frame signal that exceeds the preset threshold as the target frequency, and the preset threshold is substantially a preset amplitude value (for example, the maximum amplitude in the cepstrum domain may be specifically
  • the value of the preset threshold can be set according to the actual application scenario, and the present application does not impose any restrictions on this;
  • each target frequency selected from each frame signal is used as the fundamental frequency f base , and the second harmonic frequency f 2 and the third harmonic frequency f 3 corresponding to the fundamental frequency are sequentially calculated to obtain a fundamental frequency f base .
  • the three frequencies are calculated according to the geometric square value formula, and the geometric square value p(f base ) of the corresponding amplitude value in the cepstrum domain, wherein, Ground, the geometric squared formula can be:
  • Other forms of geometric squared formulas are also possible, and no limitation is imposed on this application.
  • the target frequency with the largest geometric square value in each of the calculated audio signals is used as the candidate noise frequency of the current frame.
  • the noise frequency is determined from each candidate noise frequency.
  • a possible way to determine the noise frequency from each candidate noise frequency is: counting the number of occurrences of each of the candidate noise frequencies, and then the most occurrences, and the total number of occurrences of all frequencies
  • the frequency of the number of times (frames) exceeds the preset threshold is determined as the noise frequency, and there is a frequency f t , and the number of occurrences accounts for 60% of the number of occurrences of all frequencies (one candidate frequency per frame, that is, the total number of frames). Greater than 50% threshold. That is, the noise is considered to be TDD noise or plate vibration, and f t is its noise frequency.
  • the present application does not impose any limitation. If the number of times without any candidate frequency exceeds the preset threshold, it can be judged that this noise is not TDD noise or plate vibration, that is, it is not necessary to calculate its energy value.
  • the noise energy value corresponding to the noise frequency in each frame of the audio signal is calculated according to the noise frequency, the amplitude spectrum, and a preset calculation method, wherein the noise energy value is represented by The size of the TDD noise or the plate vibration sound, if the noise energy value is higher, the TDD noise or the plate vibration sound is larger; if the noise energy value is lower, the TDD noise or the plate vibration sound is smaller.
  • one possible way to calculate the noise energy value is to calculate the noise energy value of the TDD noise or the plate vibration sound by using the loudness calculation method. At the same time, the energy value corresponding to the second harmonic frequency f 2 and the third harmonic frequency f 3 corresponding to the noise frequency can be calculated. In addition, for other calculation methods, the application does not impose any restrictions.
  • the corresponding amplitude spectrum is obtained in advance by calculating each frame signal, and then the noise frequency is determined by the candidate noise frequency distribution obtained by performing cepstrum analysis on each frame signal, and finally, according to the above amplitude spectrum and the above The noise frequency is calculated correspondingly to obtain the noise energy value of each frame signal. Therefore, the embodiment of the present application can effectively detect the noise energy amount in each frame signal.
  • an embodiment of a terminal device in this embodiment of the present application includes:
  • a first calculating module 201 configured to calculate an amplitude spectrum of each frame of the audio signal according to the first formula
  • the first determining module 202 is configured to determine a noise frequency according to each candidate noise frequency, where each candidate noise frequency is obtained by performing cepstrum analysis on each frame of the audio signal;
  • a second calculating module 203 configured to calculate, according to the noise frequency, the amplitude spectrum, and a preset calculation method The noise energy value of each frame of the audio signal.
  • the terminal device further includes: a second determining module 304, a third calculating module 305, and a third determining module 306; wherein each module function is as follows:
  • the second determining module 304 is configured to determine, in the preset frequency search interval, each frequency of the amplitude value of each frame of the audio signal exceeding a preset threshold as each target frequency;
  • a third calculating module 305 configured to calculate, as a fundamental frequency, a fundamental frequency, a second harmonic frequency of the fundamental frequency, and a geometric mean value of the amplitude value respectively corresponding to the third harmonic frequency of the fundamental frequency;
  • the third determining module 306 is configured to determine each target frequency corresponding to the maximum geometric mean as each candidate noise frequency.
  • the terminal device further includes: a first processing module 307, wherein the first processing module 307 is configured to perform framing and windowing processing on the sampling signal, Obtain at least two frames of audio signals.
  • the terminal device further includes: a second processing module 308, wherein the second processing module 308 is configured to perform N times of insertion of the input audio signal. Sampling, the sampled signal is obtained, wherein the N is a positive integer not less than 2.
  • the corresponding amplitude spectrum is obtained in advance by calculating each frame signal, and then the noise frequency is determined by the candidate noise frequency distribution obtained by performing cepstrum analysis on each frame signal, and finally, according to the above amplitude spectrum and the above The noise frequency is calculated correspondingly to obtain the noise energy value of each frame signal. Therefore, the embodiment of the present application can effectively detect the noise energy amount in each frame signal.
  • the foregoing embodiment 2 describes the terminal device in the embodiment of the present application in detail from the aspect of the virtual functional device.
  • the following describes the terminal device in the embodiment of the present application from the aspect of the physical structure, which may be specifically as follows:
  • the third embodiment as shown in FIG. 4, another embodiment of the terminal device in the embodiment of the present application includes: a receiver 401, a transmitter 402, a processor 403, a memory 404, and a bus 405.
  • the memory 404 can include read only memory and random access memory and provides instructions and data to the processor 403.
  • a portion of the memory 404 may also include a non-volatile random access memory (English name: Non-Volatile Random Access Memory, English abbreviation: NVRAM).
  • the memory 404 stores the following elements, executable modules or data structures, or a subset thereof, or an extended set thereof:
  • Operation instructions including various operation instructions for implementing various operations
  • Operating system Includes a variety of system programs for implementing various basic services and handling hardware-based tasks.
  • the processor 403 in the embodiment of the present application may be used to perform operations corresponding to the first communication network element in the foregoing embodiment, and may include the following operations:
  • each candidate noise frequency is obtained by performing cepstrum analysis on each frame of the audio signal
  • the amplitude spectrum is calculated by a preset calculation method to obtain a noise energy value of each frame of the audio signal.
  • the processor 403 may be configured to: determine, in a preset frequency search interval, each frequency in the audio signal of each frame that exceeds a preset threshold as each target frequency;
  • each target frequency as a fundamental frequency, calculating a fundamental frequency, a second harmonic frequency of the fundamental frequency, and a geometric mean value of the amplitude value corresponding to the third harmonic frequency of the fundamental frequency;
  • Each target frequency corresponding to the maximum geometric mean is determined as each candidate noise frequency.
  • the processor 403 is configured to perform the following steps: performing N-times up-sampling on the input audio signal to obtain the sampling signal, where the N is a positive integer not less than 2;
  • the sampled signal is subjected to framing and windowing to obtain at least two frames of audio signals.
  • the processor 403 controls the operation of the first communication network element, and the processor 403 may also be referred to as a central processing unit (English full name: Central Processing Unit, English abbreviation: CPU).
  • Memory 404 can include read only memory and random access memory and provides instructions and data to processor 403. A portion of memory 404 may also include NVRAM.
  • the components of the first communication network element are coupled together by a bus system 405.
  • the bus system 405 may include a power bus, a control bus, a status signal bus, and the like in addition to the data bus. However, for clarity of description, various buses are labeled as bus system 405 in the figure.
  • Processor 403 may be an integrated circuit chip with signal processing capabilities. In the implementation process, each step of the foregoing method may be completed by an integrated logic circuit of hardware in the processor 403 or an instruction in a form of software.
  • the processor 403 may be a general-purpose processor, a digital signal processor (English name: Digital Signal Processing, English abbreviation: DSP), an application specific integrated circuit (English name: Application Specific Integrated Circuit, English abbreviation: ASIC), ready-made programmable Gate array (English name: Field-Programmable Gate Array, English abbreviation: FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components.
  • the methods, steps, and logical block diagrams disclosed in the embodiments of the present application can be implemented or executed.
  • the general purpose processor may be a microprocessor or the processor or any conventional processor or the like.
  • the steps of the method disclosed in the embodiments of the present application may be directly implemented by the hardware decoding processor, or may be performed by a combination of hardware and software modules in the decoding processor.
  • the software module can be located in a conventional storage medium such as random access memory, flash memory, read only memory, programmable read only memory or electrically erasable programmable memory, registers, and the like.
  • the storage medium is located in memory 404, and processor 403 reads the information in memory 404 and, in conjunction with its hardware, performs the steps of the above method.
  • FIG. 4 The related description of FIG. 4 can be understood by referring to the related description and effect of the method part of FIG. 1, and no further description is made here.
  • the disclosed system, apparatus, and method may be implemented in other manners.
  • the device embodiments described above are merely illustrative.
  • the division of the unit is only a logical function division.
  • there may be another division manner for example, multiple units or components may be combined or Can be integrated into another system, or some features can be ignored or not executed.
  • the mutual coupling or direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection through some interface, device or unit, and may be in an electrical, mechanical or other form.
  • the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
  • each functional unit in each embodiment of the present application may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit.
  • the above integrated unit can be implemented in the form of hardware or in the form of a software functional unit.
  • the integrated unit if implemented in the form of a software functional unit and sold or used as a standalone product, may be stored in a computer readable storage medium.
  • a computer readable storage medium A number of instructions are included to cause a computer device (which may be a personal computer, server, or network device, etc.) to perform all or part of the steps of the methods described in various embodiments of the present application.
  • the foregoing storage medium includes: a U disk, a mobile hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disk, and the like. .

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Human Computer Interaction (AREA)
  • Quality & Reliability (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
  • Telephone Function (AREA)
  • Noise Elimination (AREA)

Abstract

A noise detection method, and terminal apparatus, configured to detect a noise energy level in a signal. The noise detection method comprises: computing, according to a first formula, an amplitude spectrum of each audio signal frame (103); determining, according to candidate noise frequencies, a noise frequency (106), said candidate noise frequencies being obtained by means of performing cepstral analysis on said audio signal frame (104, 105); and computing, according to the noise frequency, the amplitude spectrum, and a predetermined computing algorithm, a value of a noise energy level of said audio signal frame (107).

Description

一种噪声检测方法及终端设备Noise detecting method and terminal device
本申请要求于2017年03月07日提交中国专利局、申请号为201710131996.3、发明名称为“一种时分双工噪声检测方法和设备”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。The present application claims priority to Chinese Patent Application No. 200910131996.3, entitled "A Time-Division Duplex Noise Detection Method and Apparatus", filed on March 7, 2017, the entire contents of which are incorporated by reference. In this application.
技术领域Technical field
本申请涉及通信系统领域以及语音处理领域,尤其涉及一种噪声检测方法及终端设备。The present application relates to the field of communication systems and the field of voice processing, and in particular, to a noise detection method and a terminal device.
背景技术Background technique
时分双工(Time Division Duplexing,TDD)噪声和板振都是由于手机间歇性发射大电流引起的“电流声”。具体来说,以全球移动通信系统(Global System for Mobile communication,GSM)的时分多址(Time Division Multiple Access,TDMA)制式手机的功率放大器(Power Amplifier,PA)大约每隔4.616ms都会发射一个大电流,使得电声器件收到干扰,解调出TDD噪声。与此同时,电池电压以同样的频率跌落,作用在陶瓷电容上,导致其出现机械振动并通过主板传递,形成板振声音。Time Division Duplexing (TDD) noise and board vibration are "current sounds" caused by intermittently emitting large currents in mobile phones. Specifically, the Power Amplifier (PA) of the Time Division Multiple Access (TDMA) mobile phone of the Global System for Mobile communication (GSM) system transmits a large power every 4.616 ms. The current causes the electro-acoustic device to receive interference and demodulate the TDD noise. At the same time, the battery voltage drops at the same frequency, acting on the ceramic capacitor, causing mechanical vibration and passing through the main board to form a plate vibration.
现有检测技术,首先,将输入信号x(t)先由时域变换至频谱域得到频谱信号S(f),其次,再将上述频谱信号S(f)由频谱域变换至倒谱域得到倒谱信号C(q),最后,在倒谱域中找到TDD噪声对应的频率q1(如设信号采样频率为fs,TDD噪声频率为217Hz,则倒谱域的频率的计算公式为:q1=fs/(2*217)),并设置预置阈值来判断输入信号是否为TDD噪声,若输入信号在倒谱域中频率q1对应的频谱信号C(q1)大于预置阈值,则确定输入信号是TDD噪声;若输入信号在倒谱域中频率q1对应的频谱信号C(q1)不大于预置阈值,则确定输入信号不是TDD噪声。In the existing detection technique, first, the input signal x(t) is first transformed from the time domain to the spectral domain to obtain the spectral signal S(f), and secondly, the spectral signal S(f) is transformed from the spectral domain to the cepstrum domain. The cepstrum signal C(q), finally, find the frequency q 1 corresponding to the TDD noise in the cepstrum domain (if the signal sampling frequency is f s and the TDD noise frequency is 217 Hz, the frequency of the cepstrum domain is calculated as: q 1 = f s / (2 * 217)), and setting a preset threshold value to determine whether the input signal is a noise TDD, if the input signal in the frequency domain corresponding to q cepstral spectrum signal C 1 (q 1) greater than the preset The threshold determines that the input signal is TDD noise; if the spectral signal C(q 1 ) corresponding to the frequency q 1 of the input signal in the cepstrum domain is not greater than a preset threshold, it is determined that the input signal is not TDD noise.
在现有检测技术中,由于倒谱域本身的特性导致倒谱域无法定量表示出输入信号的大小,因此,通过现有检测技术只能判断出输入信号是TDD噪声,或输入信号不是TDD噪声,而当确定输入信号是TDD噪声时,仍然无法确定出其TDD噪声大小。In the existing detection technology, the cepstrum domain cannot quantitatively indicate the size of the input signal due to the characteristics of the cepstrum domain itself. Therefore, the existing detection technique can only judge that the input signal is TDD noise, or the input signal is not TDD noise. However, when it is determined that the input signal is TDD noise, the TDD noise level cannot be determined.
发明内容Summary of the invention
本申请实施例提供了一种噪声检测方法及终端设备,用于检测信号中噪声能量大小。The embodiment of the present application provides a noise detecting method and a terminal device for detecting a noise energy amount in a signal.
第一方面本申请实施例提供了一种噪声检测方法,包括:The first aspect of the present application provides a noise detection method, including:
首先,根据第一公式计算每帧信号的幅度谱;其次,根据候选噪声频率分布确定噪声频率,其中,上述候选噪声频率分布为通过对每帧信号进行倒谱分析之后得到的;最后,在确定每帧信号的幅度谱和噪声频率之后,进而根据上述噪声频率和幅度谱进行相应的计算得到每帧信号的噪声能量值,可以理解的是,上述噪声能量值可以表征每帧信号中噪声大小。First, the amplitude spectrum of each frame signal is calculated according to the first formula; secondly, the noise frequency is determined according to the candidate noise frequency distribution, wherein the candidate noise frequency distribution is obtained by performing cepstrum analysis on each frame signal; finally, in determining After the amplitude spectrum and the noise frequency of each frame signal, and correspondingly calculating the noise energy value of each frame signal according to the above noise frequency and amplitude spectrum, it can be understood that the above noise energy value can represent the noise level in each frame signal.
从本申请实施例第一方面提供的技术方案中,可以看出本申请实施例具有以下优点:In the technical solution provided by the first aspect of the embodiment of the present application, it can be seen that the embodiment of the present application has the following advantages:
通过对每帧信号进行计算预先得到相应的幅度谱,进而通过由对每帧信号进行倒谱分 析得到的候选噪声频率分布,确定出噪声频率,最终,根据上述幅度谱和上述噪声频率进行相应的计算得到每帧信号的噪声能量值,因此,本申请实施例可以有效地检测出每帧信号中噪声能量大小。在一种可能的设计中,在第一方面的第一种可能的实现方式中,在根据各候选噪声频率确定噪声频率之前,该噪声检测方法还包括:The corresponding amplitude spectrum is obtained in advance by calculating each frame signal, and then by performing cepstrum analysis on each frame signal. The obtained candidate noise frequency distribution is determined, and the noise frequency is determined. Finally, the noise energy value of each frame signal is obtained according to the amplitude spectrum and the noise frequency. Therefore, the embodiment of the present application can effectively detect each frame signal. The amount of noise energy. In a possible design, in a first possible implementation manner of the first aspect, before the noise frequency is determined according to each candidate noise frequency, the noise detection method further includes:
在预置的频率搜索区间内,将每帧音频信号中幅度值超过预设阈值的各频率确定为各目标频率,其中,频率搜索区间为倒谱域中的频率区间;In the preset frequency search interval, each frequency of the amplitude value of each frame of the audio signal exceeds a preset threshold is determined as each target frequency, wherein the frequency search interval is a frequency interval in the cepstrum domain;
将每个目标频率作为基频,计算基频,基频的二次谐波频率,和所述基频的三次谐波频率分别对应的幅度值的几何平均值;Calculating a geometric mean value of the amplitude value corresponding to the fundamental frequency, the second harmonic frequency of the fundamental frequency, and the third harmonic frequency of the fundamental frequency, respectively, using each target frequency as a fundamental frequency;
将每帧音频信号中幅度值对应的几何平均值最大的各目标频率确定为各候选噪声频率。Each target frequency having the largest geometric mean value corresponding to the amplitude value in each frame of the audio signal is determined as each candidate noise frequency.
其次,在第一方面的第一种可能的实现方式中,在倒谱域中预置的频率搜索区间内,先将每帧音频信号中确定出目标频率,再从每帧音频信号中的目标频率筛选出候选噪声频率。这样,有效地将各候选噪声频率从每帧音频信号中选择出来。Secondly, in the first possible implementation manner of the first aspect, in the frequency search interval preset in the cepstrum domain, the target frequency is first determined in each frame of the audio signal, and then the target in each frame of the audio signal The frequency is filtered out of the candidate noise frequency. In this way, each candidate noise frequency is effectively selected from each frame of the audio signal.
在一种可能的设计中,在第一方面的第二种可能的实现方式中,在根据第一公式计算每帧音频信号的幅度值谱之前,该噪声方法还包括:In a possible design, in a second possible implementation manner of the first aspect, before calculating the amplitude value spectrum of each frame of the audio signal according to the first formula, the noise method further includes:
将采样信号进行分帧和加窗处理,得到至少两帧音频信号。The sampled signal is subjected to framing and windowing to obtain at least two frames of audio signals.
其次,在第一方面的第二种可能的实现方式中,对采样信号进行分帧操作和加窗处理既可以提高计算音频信号的算法性能,还可以得出TDD噪声或板振声音大小随时间的变化关系。Secondly, in the second possible implementation manner of the first aspect, the framing operation and windowing processing of the sampled signal can improve the performance of the algorithm for calculating the audio signal, and can also obtain the TDD noise or the sound of the board vibration over time. The relationship of change.
在一种可能的设计中,在第一方面的第三种可能的实现方式中,在将采样信号进行分帧和加窗处理,得到至少两帧以上音频信号之前,该噪声检测方法还包括:In a possible design, in a third possible implementation manner of the first aspect, before the sampling signal is subjected to framing and windowing to obtain at least two or more audio signals, the noise detecting method further includes:
将输入音频信号进行N倍的插零上采样,得到采样信号,其中,N为不小于2的正整数。The input audio signal is N-times up-sampled to obtain a sampled signal, where N is a positive integer not less than 2.
其次,在第一方面的第三种可能的实现方式中,使用N倍的插零上采样对输入音频信号进行采样处理,可以使得噪声频率远离低频的干扰,从而提高噪声检测检测的准确性Secondly, in the third possible implementation manner of the first aspect, the input audio signal is sampled by using N times of interpolation and upsampling, so that the noise frequency is far away from the low frequency interference, thereby improving the accuracy of the noise detection detection.
在一种可能的设计中,在第一方面的第四种可能的实现方式中,上述预设计算方法包括响度计算方法。In a possible design, in a fourth possible implementation manner of the first aspect, the preset calculation method includes a loudness calculation method.
其次,在第一方面的第四种可能的实现方式中,使用响度计算方法可以有效并准确地将噪声能量大小计算出来。Secondly, in the fourth possible implementation of the first aspect, the loudness calculation method can be used to calculate the noise energy size efficiently and accurately.
第二方面,本申请实施例提供一种终端设备,该终端设备具有实现上述方法实施例中终端设备行为的功能。该功能可以通过硬件实现,也可以通过硬件执行相应的软件实现。该硬件或软件包括一个或多个与上述功能相对应的模块。In a second aspect, the embodiment of the present application provides a terminal device, where the terminal device has the function of implementing the behavior of the terminal device in the foregoing method embodiment. This function can be implemented in hardware or in hardware by executing the corresponding software. The hardware or software includes one or more modules corresponding to the functions described above.
第三方面,本申请实施例提供一种终端设备,包括:处理器、存储器、总线、发射器和接收器;该存储器用于存储计算机执行指令,该处理器与该存储器通过该总线连接,当该终端设备运行时,该处理器执行该存储器存储的该计算机执行指令,以使该终端设备执行如上述第一方面任意一项的噪声检测方法。In a third aspect, an embodiment of the present application provides a terminal device, including: a processor, a memory, a bus, a transmitter, and a receiver; the memory is configured to store a computer to execute an instruction, and the processor is connected to the memory through the bus. When the terminal device is in operation, the processor executes the computer-executed instruction stored in the memory to cause the terminal device to perform the noise detecting method according to any one of the above first aspects.
第四方面,本申请实施例提供了一种计算机可读存储介质,用于储存为上述终端设备 所用的计算机软件指令,当其在计算机上运行时,使得计算机可以执行上述第一方面中任意一项的噪声检测方法。In a fourth aspect, an embodiment of the present application provides a computer readable storage medium, configured to be stored as the foregoing terminal device. The computer software instructions used, when run on a computer, cause the computer to perform the noise detection method of any of the above first aspects.
第五方面,本申请实施例提供了一种包含指令的计算机程序产品,当其在计算机上运行时,使得计算机可以执行上述第一方面中任意一项的噪声检测方法。In a fifth aspect, an embodiment of the present application provides a computer program product comprising instructions, which when executed on a computer, enable the computer to perform the noise detecting method of any of the above first aspects.
另外,第二方面至第五方面中任一种设计方式所带来的技术效果可参见第一方面中不同设计方式所带来的技术效果,此处不再赘述。In addition, the technical effects brought by the design mode of any one of the second aspect to the fifth aspect can be referred to the technical effects brought by different design modes in the first aspect, and details are not described herein again.
附图说明DRAWINGS
图1为本申请实施例中噪声检测方法的一个实施例示意图;1 is a schematic diagram of an embodiment of a noise detecting method in an embodiment of the present application;
图2为本申请实施例中终端设备的一个实施例示意图;2 is a schematic diagram of an embodiment of a terminal device according to an embodiment of the present application;
图3为本申请实施例中终端设备的另一个实施例示意图;3 is a schematic diagram of another embodiment of a terminal device according to an embodiment of the present application;
图4为本申请实施例中终端设备的另一个实施例示意图。FIG. 4 is a schematic diagram of another embodiment of a terminal device according to an embodiment of the present application.
具体实施方式detailed description
本申请实施例提供了一噪声检测方法及终端设备,用于检测信号中噪声能量大小。The embodiment of the present application provides a noise detecting method and a terminal device for detecting a noise energy amount in a signal.
本申请的说明书和权利要求书及上述附图中的术语“第一”、“第二”、“第三”、“第四”等(如果存在)是用于区别类似的对象,而不必用于描述特定的顺序或先后次序。应该理解这样使用的数据在适当情况下可以互换,以便这里描述的实施例能够以除了在这里图示或描述的内容以外的顺序实施。此外,术语“包括”和“具有”以及他们的任何变形,意图在于覆盖不排他的包含,例如,包含了一系列步骤或单元的过程、方法、系统、产品或设备不必限于清楚地列出的那些步骤或单元,而是可包括没有清楚地列出的或对于这些过程、方法、产品或设备固有的其它步骤或单元。The terms "first", "second", "third", "fourth", etc. (if present) in the specification and claims of the present application and the above figures are used to distinguish similar objects without having to use To describe a specific order or order. It is to be understood that the data so used may be interchanged where appropriate so that the embodiments described herein can be implemented in a sequence other than what is illustrated or described herein. In addition, the terms "comprises" and "comprises" and "the" and "the" are intended to cover a non-exclusive inclusion, for example, a process, method, system, product, or device that comprises a series of steps or units is not necessarily limited to Those steps or units may include other steps or units not explicitly listed or inherent to such processes, methods, products or devices.
本申请实施例中噪声检测方法主要用于检测手机中TDD噪声和板振声音,当然,还可以用于其他谐波形式类似的噪声,对此本申请不做任何限制。The noise detecting method in the embodiment of the present application is mainly used for detecting TDD noise and board vibration in a mobile phone, and of course, can also be used for noises of other harmonic forms, and the present application does not impose any limitation.
为了便于理解本申请实施例中噪声检测方法,下面将结合具体的实施例来对本申请实施例中噪声检测方法进行说明。In order to facilitate the understanding of the noise detection method in the embodiment of the present application, the noise detection method in the embodiment of the present application will be described below in conjunction with a specific embodiment.
实施例一,如图1所示,本申请实施例中噪声检测方法的一个实施例,包括:Embodiment 1 As shown in FIG. 1 , an embodiment of a noise detecting method in an embodiment of the present application includes:
101、将输入音频信号进行N倍的插值上采样,得到采样信号。101. Perform an N-time interpolation upsampling on the input audio signal to obtain a sampling signal.
本实施例中,可选地,对输入音频信号进行N倍的插值上采样,得到采样信号,其中,N为大于或等于2的正整数。具体可以为:设输入音频信号为X,输出的采样信号为Z,根据上采样原理可以得到X和Z之间的关系,可以用关系式:Z=upsample(X,N)表示,其中,需要说明的是upsample表示上采样函数。In this embodiment, optionally, the input audio signal is subjected to N times of interpolation upsampling to obtain a sampling signal, where N is a positive integer greater than or equal to 2. Specifically, the input audio signal is X, and the output sampling signal is Z. According to the upsampling principle, the relationship between X and Z can be obtained, which can be represented by a relationship: Z=upsample(X, N), where It is stated that upsample represents the upsampling function.
其次,还需要说明的是,对输入音频信号进行N倍的插值上采样不会改变该输入音频信号的幅度值大小,实际上是将采样频率提高为原来的N倍。此外,对于其他能达到相同技术效果的采样方案,本申请不做任何限制。Secondly, it should be noted that the N-time interpolation upsampling of the input audio signal does not change the amplitude value of the input audio signal, and actually increases the sampling frequency by N times. In addition, the present application does not impose any restrictions on other sampling schemes that can achieve the same technical effects.
102、将采样信号进行分帧和加窗处理,得到至少两帧音频信号。102. Perform framing and windowing on the sampled signal to obtain at least two frames of audio signals.
本实施例中,可选地,在将输入信号进行采样,得到采样信号之后,将采样信号进行 分帧操作和加窗处理,得到至少两帧音频信号。In this embodiment, optionally, after sampling the input signal to obtain a sampling signal, the sampling signal is performed. The framing operation and the windowing process result in at least two frames of audio signals.
可选地,分帧操作可具体为:以Z(k)表示采样信号Z中第k个点,设每帧音频信号的长度为l,每两帧音频信号之间的移动步长为s,则第i帧音频信号Zi为Z中的l个点,其中,Zi可以表示为:Zi=[z*(1+(i-1)*s),z*(2+(i-1)*s),......,z*(l+(i-1)*s)],l,i均为正整数,s为大于0且小于l的正整数,其中,s的常用典型值为l/2,l/4或l/8等。Optionally, the framing operation may be specifically: indicating a kth point in the sampling signal Z by Z(k), setting a length of each frame of the audio signal to be 1, and a moving step size between each two frames of the audio signal is s, Then the ith frame audio signal Z i is 1 point in Z, where Z i can be expressed as: Z i =[z*(1+(i-1)*s), z*(2+(i- 1) *s), ..., z*(l+(i-1)*s)], l, i are positive integers, s is a positive integer greater than 0 and less than l, where s Typical values are usually l/2, l/4 or l/8.
可选地,加窗处理可具体为:设加窗函数为W,则W可表示为:Wi=[W(1),W(2),.....,W(l)],其中需要说明的是,窗函数W可根据实际需要进行选择,对此本申请不做限制,加窗函数W具体可为常用窗函数(如矩形窗,三角窗,汉宁窗,海明窗和高斯窗等)中的一种,或者也可以是其他新设计的窗函数,对此本申请也不做任何限制。Optionally, the windowing process may be specifically: if the windowing function is W, then W can be expressed as: W i =[W(1), W(2), . . . , W(l)], It should be noted that the window function W can be selected according to actual needs, and the application does not limit the window function W, which can be a common window function (such as a rectangular window, a triangular window, a Hanning window, a Hamming window, and One of the Gaussian windows, etc., or other newly designed window functions, is not limited in this application.
可选地,在经过上述分帧操作和加窗处理之后,得到的至少两帧音频信号Yi可表示为:Yi=[W(1)*Zi(1),W(2)*Zi(2),......,W(l)*Zi(l)]。Optionally, after the framing operation and the windowing process, the obtained at least two frames of audio signals Y i may be expressed as: Y i =[W(1)*Z i (1), W(2)*Z i (2), ..., W(l) * Z i (l)].
其次,需要说明的是,分帧操作既可以提高计算音频信号的算法性能,还可以得出TDD噪声或板振声音大小随时间的变化关系。此外,分帧操作均不会改变输入音频信号的幅度值大小,加窗处理主要是缓解频谱泄露问题。Secondly, it should be noted that the framing operation can not only improve the performance of the algorithm for calculating the audio signal, but also the relationship between the TDD noise or the amplitude of the plate vibration sound over time. In addition, the framing operation does not change the amplitude value of the input audio signal, and the windowing process mainly mitigates the spectrum leakage problem.
103、根据第一公式计算每帧音频信号的幅度谱。103. Calculate an amplitude spectrum of each frame of the audio signal according to the first formula.
本实施例中,在经过分帧操作和加窗处理得到至少两帧音频信号之后,根据第一公式对每帧音频信号的幅度谱进行计算,并得到每帧音频信号的幅度谱。In this embodiment, after at least two frames of audio signals are obtained through the framing operation and the windowing process, the amplitude spectrum of each frame of the audio signal is calculated according to the first formula, and the amplitude spectrum of each frame of the audio signal is obtained.
可选地,在一种可能的计算方式中,以Mi表示频谱域中第i帧音频信号,则Mi可表示为:Mi=abs(fft(Yi));Mi的表达式中abs表示取绝对值;fft表示快速傅里叶变换。Alternatively, in one possible method of calculation in order to M i represents the i-th frame of the audio signal in the spectral domain, then M i can be expressed as: M i = abs (fft ( Y i)); expression of M i The middle abs represents the absolute value; the fft represents the fast Fourier transform.
104、根据第二公式计算每帧音频信号的倒谱。104. Calculate a cepstrum of each frame of the audio signal according to the second formula.
本实施例中,在计算得到每帧音频信号的幅度谱之后,根据第二公式对每帧音频信号的倒谱进行计算,并得到每帧音频信号的倒谱。In this embodiment, after calculating the amplitude spectrum of each frame of the audio signal, the cepstrum of each frame of the audio signal is calculated according to the second formula, and the cepstrum of each frame of the audio signal is obtained.
可选地,在一种可能的计算方式中,以Ci表示倒谱域中第i帧音频信号,则Ci可表示为:Ci=real(ifft(log(Mi)));其中,log表示取对数;ifft表示快速逆傅里叶变换,real表示取实数。Optionally, in a possible calculation manner, C i represents an ith frame audio signal in the cepstrum domain, and C i can be expressed as: C i =real(ifft(log(M i ))); , log represents the logarithm; ift represents the fast inverse Fourier transform, real represents the real number.
105、在频率搜索区间内确定各候选噪声频率。105. Determine each candidate noise frequency in the frequency search interval.
本实施例中,可选地,在计算得到每帧音频信号的倒谱之后,在频率搜索区间内确定各候选噪声频率,其中,频率搜索区间为倒谱域对应的频率范围,其频率搜索区间的具体确定方式可以是根据先知经验预先设置,也可以是根据不同的手机制式进行确定,对此本申请不做任何限制。In this embodiment, optionally, after calculating the cepstrum of each frame of the audio signal, determining each candidate noise frequency in the frequency search interval, wherein the frequency search interval is a frequency range corresponding to the cepstrum domain, and the frequency search interval thereof The specific determination manner may be preset according to the experience of the prophet, or may be determined according to different mobile phone systems, and the application does not impose any restrictions.
可选地,在频率搜索区间内确定各候选噪声频率的一种可能的实现方式为:Optionally, one possible implementation of determining each candidate noise frequency within the frequency search interval is:
首先,根据先知经验确定倒谱域中的频率搜索区间,其中,需要说明的是,在倒谱域中会存在许多幅度值,但是倒谱域中的幅度值是无法定量表示出每帧音频信号大小的;First, the frequency search interval in the cepstrum domain is determined according to the experience of the prophet. It should be noted that there are many amplitude values in the cepstrum domain, but the amplitude values in the cepstrum domain cannot quantitatively represent the audio signal of each frame. Size
其次,通过预设阈值去选择出每帧信号中超过该预设阈值的各频率作为各目标频率,预设阈值实质上是一个预先设定的幅度值(如具体可以为倒谱域中最大幅度值的0.1倍),对于预设阈值的具体大小可根据实际应用场景进行设定,本申请对此不做任何限制; Secondly, the preset threshold is used to select each frequency of each frame signal that exceeds the preset threshold as the target frequency, and the preset threshold is substantially a preset amplitude value (for example, the maximum amplitude in the cepstrum domain may be specifically The value of the preset threshold can be set according to the actual application scenario, and the present application does not impose any restrictions on this;
再次,将从每帧信号中选择出来的各目标频率作为基频fbase,并依次计算出该基频对应的二次谐波频率f2和三次谐波频率f3,得到基频fbase,二次谐波频率f2和三次谐波频率f3之后,再根据几何平方值公式计算出三者频率,在倒谱域中对应幅度值的几何平方值p(fbase),其中,可选地,几何平方值公式可以是:
Figure PCTCN2017083765-appb-000001
也可以是其他形式的几何平方值公式,对此本申请不做任何限制。
Again, each target frequency selected from each frame signal is used as the fundamental frequency f base , and the second harmonic frequency f 2 and the third harmonic frequency f 3 corresponding to the fundamental frequency are sequentially calculated to obtain a fundamental frequency f base . After the second harmonic frequency f 2 and the third harmonic frequency f 3 , the three frequencies are calculated according to the geometric square value formula, and the geometric square value p(f base ) of the corresponding amplitude value in the cepstrum domain, wherein, Ground, the geometric squared formula can be:
Figure PCTCN2017083765-appb-000001
Other forms of geometric squared formulas are also possible, and no limitation is imposed on this application.
最后,在将计算得到的每帧音频信号中几何平方值最大的目标频率作为本帧的候选噪声频率。Finally, the target frequency with the largest geometric square value in each of the calculated audio signals is used as the candidate noise frequency of the current frame.
106、根据各候选噪声频率确定噪声频率。106. Determine a noise frequency according to each candidate noise frequency.
本实施例中,在搜索并确定各噪声频率之后,在从各候选噪声频率中确定出噪声频率。In this embodiment, after searching and determining each noise frequency, the noise frequency is determined from each candidate noise frequency.
可选地,一种可能的从各候选噪声频率中确定出噪声频率的方式为:统计各候选噪声频率中每种噪声频率的出现次数,进而将出现次数最多,且出现次数所占所有频率总次数(帧数)的比例超过预设门限的频率确定为噪声频率,有频率ft,出现次数占所有频率出现次数(每帧都有1个候选频率,即总数为帧数)的60%,大于门限50%。即认为该噪声为TDD噪声或者板振,ft为其噪声频率,当然,也可以是其他可能的确定方式,对此本申请不做任何限制。如果没有任何一个候选频率的次数超过预设门限,可以判断这个噪声并不是TDD噪声或者板振声音,即不必再计算其能量值。Optionally, a possible way to determine the noise frequency from each candidate noise frequency is: counting the number of occurrences of each of the candidate noise frequencies, and then the most occurrences, and the total number of occurrences of all frequencies The frequency of the number of times (frames) exceeds the preset threshold is determined as the noise frequency, and there is a frequency f t , and the number of occurrences accounts for 60% of the number of occurrences of all frequencies (one candidate frequency per frame, that is, the total number of frames). Greater than 50% threshold. That is, the noise is considered to be TDD noise or plate vibration, and f t is its noise frequency. Of course, it may be other possible determination manners, and the present application does not impose any limitation. If the number of times without any candidate frequency exceeds the preset threshold, it can be judged that this noise is not TDD noise or plate vibration, that is, it is not necessary to calculate its energy value.
107、根据噪声频率,幅度谱和预设计算方法进行计算得到每帧音频信号的噪声能量值。107. Calculate the noise energy value of each frame of the audio signal according to the noise frequency, the amplitude spectrum, and the preset calculation method.
本实施例中,在根据各候选噪声频率确定噪声频率之后,根据噪声频率,幅度谱和预设计算方法计算出每帧音频信号中的噪声频率对应的噪声能量值,其中,该噪声能量值表示TDD噪声或板振声音的大小,若噪声能量值越高,则TDD噪声或板振声音越大;若噪声能量值越低,则TDD噪声或板振声音越小。In this embodiment, after determining the noise frequency according to each candidate noise frequency, the noise energy value corresponding to the noise frequency in each frame of the audio signal is calculated according to the noise frequency, the amplitude spectrum, and a preset calculation method, wherein the noise energy value is represented by The size of the TDD noise or the plate vibration sound, if the noise energy value is higher, the TDD noise or the plate vibration sound is larger; if the noise energy value is lower, the TDD noise or the plate vibration sound is smaller.
可选地,一种可能的计算噪声能量值的方式为:通过使用响度计算方法计算TDD噪声或板振声音的噪声能量值。同时还可以将噪声频率对应的二次谐波频率f2和三次谐波频率f3对应的能量值计算出来。此外,对于其他计算方式,本申请不做任何限制。Alternatively, one possible way to calculate the noise energy value is to calculate the noise energy value of the TDD noise or the plate vibration sound by using the loudness calculation method. At the same time, the energy value corresponding to the second harmonic frequency f 2 and the third harmonic frequency f 3 corresponding to the noise frequency can be calculated. In addition, for other calculation methods, the application does not impose any restrictions.
本实施例中,通过对每帧信号进行计算预先得到相应的幅度谱,进而通过由对每帧信号进行倒谱分析得到的候选噪声频率分布,确定出噪声频率,最终,根据上述幅度谱和上述噪声频率进行相应的计算得到每帧信号的噪声能量值,因此,本申请实施例可以有效地检测出每帧信号中噪声能量大小。In this embodiment, the corresponding amplitude spectrum is obtained in advance by calculating each frame signal, and then the noise frequency is determined by the candidate noise frequency distribution obtained by performing cepstrum analysis on each frame signal, and finally, according to the above amplitude spectrum and the above The noise frequency is calculated correspondingly to obtain the noise energy value of each frame signal. Therefore, the embodiment of the present application can effectively detect the noise energy amount in each frame signal.
上述实施例一对本申请中噪声检测方法进行了详细说明,为了便于理解本申请实施例中终端设备,下面将结合具体的实施例对本申请实施例中终端设备进行说明。The foregoing embodiment is a detailed description of the noise detecting method in the present application. To facilitate the understanding of the terminal device in the embodiment of the present application, the terminal device in the embodiment of the present application will be described below in conjunction with a specific embodiment.
实施例二,如图2所示,本申请实施例中终端设备的一个实施例,包括:Embodiment 2 As shown in FIG. 2, an embodiment of a terminal device in this embodiment of the present application includes:
第一计算模块201,用于根据第一公式计算每帧音频信号的幅度谱;a first calculating module 201, configured to calculate an amplitude spectrum of each frame of the audio signal according to the first formula;
第一确定模块202,用于根据各候选噪声频率确定噪声频率,该各候选噪声频率由对每帧音频信号进行倒谱分析得到;The first determining module 202 is configured to determine a noise frequency according to each candidate noise frequency, where each candidate noise frequency is obtained by performing cepstrum analysis on each frame of the audio signal;
第二计算模块203,用于根据该噪声频率,该幅度谱,和预设计算方法进行计算得到 每帧音频信号的噪声能量值。a second calculating module 203, configured to calculate, according to the noise frequency, the amplitude spectrum, and a preset calculation method The noise energy value of each frame of the audio signal.
如图3所示,可选地,在一种可能的设计中,终端设备还包括:第二确定模块304,第三计算模块305和第三确定模块306;其中,各模块功能具体如下:As shown in FIG. 3, optionally, in a possible design, the terminal device further includes: a second determining module 304, a third calculating module 305, and a third determining module 306; wherein each module function is as follows:
第二确定模块304,用于在预置的频率搜索区间内,将每帧音频信号中幅度值超过预设阈值的各频率确定为各目标频率;The second determining module 304 is configured to determine, in the preset frequency search interval, each frequency of the amplitude value of each frame of the audio signal exceeding a preset threshold as each target frequency;
第三计算模块305,用于将每个目标频率作为基频,计算基频,该基频的二次谐波频率,和该基频的三次谐波频率分别对应的幅度值的几何平均值;a third calculating module 305, configured to calculate, as a fundamental frequency, a fundamental frequency, a second harmonic frequency of the fundamental frequency, and a geometric mean value of the amplitude value respectively corresponding to the third harmonic frequency of the fundamental frequency;
第三确定模块306,用于将最大几何平均值对应的各目标频率确定为各候选噪声频率。The third determining module 306 is configured to determine each target frequency corresponding to the maximum geometric mean as each candidate noise frequency.
如图3所示,可选地,在一种可能的设计中,终端设备还包括:第一处理模块307,其中,第一处理模块307,用于将采样信号进行分帧和加窗处理,得到至少两帧音频信号。As shown in FIG. 3, optionally, in a possible design, the terminal device further includes: a first processing module 307, wherein the first processing module 307 is configured to perform framing and windowing processing on the sampling signal, Obtain at least two frames of audio signals.
如图3所示,可选地,在一种可能的设计中,终端设备还包括:第二处理模块308,其中,第二处理模块308,用于将输入音频信号进行N倍的插零上采样,得到该采样信号,其中,该N为不小于2的正整数。As shown in FIG. 3, optionally, in a possible design, the terminal device further includes: a second processing module 308, wherein the second processing module 308 is configured to perform N times of insertion of the input audio signal. Sampling, the sampled signal is obtained, wherein the N is a positive integer not less than 2.
本实施例中,通过对每帧信号进行计算预先得到相应的幅度谱,进而通过由对每帧信号进行倒谱分析得到的候选噪声频率分布,确定出噪声频率,最终,根据上述幅度谱和上述噪声频率进行相应的计算得到每帧信号的噪声能量值,因此,本申请实施例可以有效地检测出每帧信号中噪声能量大小。In this embodiment, the corresponding amplitude spectrum is obtained in advance by calculating each frame signal, and then the noise frequency is determined by the candidate noise frequency distribution obtained by performing cepstrum analysis on each frame signal, and finally, according to the above amplitude spectrum and the above The noise frequency is calculated correspondingly to obtain the noise energy value of each frame signal. Therefore, the embodiment of the present application can effectively detect the noise energy amount in each frame signal.
上述实施例二从虚拟的功能性装置方面对本申请实施例中终端设备进行了详细说明,下面从实体结构方面对本申请实施例中终端设备进行说明,可具体如下:The foregoing embodiment 2 describes the terminal device in the embodiment of the present application in detail from the aspect of the virtual functional device. The following describes the terminal device in the embodiment of the present application from the aspect of the physical structure, which may be specifically as follows:
实施例三,如图4所示,本申请实施例中终端设备的另一个实施例,包括:接收器401、发射器402、处理器403、存储器404和总线405。The third embodiment, as shown in FIG. 4, another embodiment of the terminal device in the embodiment of the present application includes: a receiver 401, a transmitter 402, a processor 403, a memory 404, and a bus 405.
其中,存储器404可以包括只读存储器和随机存取存储器,并向处理器403提供指令和数据。存储器404的一部分还可以包括非易失性随机存取存储器(英文全称:Non-Volatile Random Access Memory,英文缩写:NVRAM)。The memory 404 can include read only memory and random access memory and provides instructions and data to the processor 403. A portion of the memory 404 may also include a non-volatile random access memory (English name: Non-Volatile Random Access Memory, English abbreviation: NVRAM).
存储器404存储了如下的元素,可执行模块或者数据结构,或者它们的子集,或者它们的扩展集:The memory 404 stores the following elements, executable modules or data structures, or a subset thereof, or an extended set thereof:
操作指令:包括各种操作指令,用于实现各种操作;Operation instructions: including various operation instructions for implementing various operations;
操作系统:包括各种系统程序,用于实现各种基础业务以及处理基于硬件的任务。Operating system: Includes a variety of system programs for implementing various basic services and handling hardware-based tasks.
其中,本申请实施例中处理器403可以用于执行上述实施例中第一通信网元对应的操作,可以包括如下操作:The processor 403 in the embodiment of the present application may be used to perform operations corresponding to the first communication network element in the foregoing embodiment, and may include the following operations:
根据第一公式计算每帧音频信号的幅度谱;Calculating an amplitude spectrum of each frame of the audio signal according to the first formula;
根据各候选噪声频率确定噪声频率,该各候选噪声频率由对每帧音频信号进行倒谱分析得到;Determining a noise frequency according to each candidate noise frequency, wherein each candidate noise frequency is obtained by performing cepstrum analysis on each frame of the audio signal;
根据该噪声频率,该幅度谱,和预设计算方法进行计算得到每帧音频信号的噪声能量值。 According to the noise frequency, the amplitude spectrum is calculated by a preset calculation method to obtain a noise energy value of each frame of the audio signal.
可选的,处理器403可以用于执行如下步骤:在预置的频率搜索区间内,将每帧音频信号中幅度值超过预设阈值的各频率确定为各目标频率;Optionally, the processor 403 may be configured to: determine, in a preset frequency search interval, each frequency in the audio signal of each frame that exceeds a preset threshold as each target frequency;
将每个目标频率作为基频,计算基频,该基频的二次谐波频率,和该基频的三次谐波频率分别对应的幅度值的几何平均值;Using each target frequency as a fundamental frequency, calculating a fundamental frequency, a second harmonic frequency of the fundamental frequency, and a geometric mean value of the amplitude value corresponding to the third harmonic frequency of the fundamental frequency;
将最大几何平均值对应的各目标频率确定为各候选噪声频率。Each target frequency corresponding to the maximum geometric mean is determined as each candidate noise frequency.
可选的,处理器403可以用于执行如下步骤:将输入音频信号进行N倍的插零上采样,得到该采样信号,其中,该N为不小于2的正整数;Optionally, the processor 403 is configured to perform the following steps: performing N-times up-sampling on the input audio signal to obtain the sampling signal, where the N is a positive integer not less than 2;
将采样信号进行分帧和加窗处理,得到至少两帧音频信号。The sampled signal is subjected to framing and windowing to obtain at least two frames of audio signals.
处理器403控制第一通信网元的操作,处理器403还可以称为中央处理单元(英文全称:Central Processing Unit,英文缩写:CPU)。存储器404可以包括只读存储器和随机存取存储器,并向处理器403提供指令和数据。存储器404的一部分还可以包括NVRAM。具体的应用中,第一通信网元的各个组件通过总线系统405耦合在一起,其中总线系统405除包括数据总线之外,还可以包括电源总线、控制总线和状态信号总线等。但是为了清楚说明起见,在图中将各种总线都标为总线系统405。The processor 403 controls the operation of the first communication network element, and the processor 403 may also be referred to as a central processing unit (English full name: Central Processing Unit, English abbreviation: CPU). Memory 404 can include read only memory and random access memory and provides instructions and data to processor 403. A portion of memory 404 may also include NVRAM. In a specific application, the components of the first communication network element are coupled together by a bus system 405. The bus system 405 may include a power bus, a control bus, a status signal bus, and the like in addition to the data bus. However, for clarity of description, various buses are labeled as bus system 405 in the figure.
上述本申请实施例揭示的方法可以应用于处理器403中,或者由处理器403实现。处理器403可能是一种集成电路芯片,具有信号的处理能力。在实现过程中,上述方法的各步骤可以通过处理器403中的硬件的集成逻辑电路或者软件形式的指令完成。上述的处理器403可以是通用处理器、数字信号处理器(英文全称:Digital Signal Processing,英文缩写:DSP)、专用集成电路(英文全称:Application Specific Integrated Circuit,英文缩写:ASIC)、现成可编程门阵列(英文全称:Field-Programmable Gate Array,英文缩写:FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件。可以实现或者执行本申请实施例中的公开的各方法、步骤及逻辑框图。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。结合本申请实施例所公开的方法的步骤可以直接体现为硬件译码处理器执行完成,或者用译码处理器中的硬件及软件模块组合执行完成。软件模块可以位于随机存储器,闪存、只读存储器,可编程只读存储器或者电可擦写可编程存储器、寄存器等本领域成熟的存储介质中。该存储介质位于存储器404,处理器403读取存储器404中的信息,结合其硬件完成上述方法的步骤。The method disclosed in the foregoing embodiment of the present application may be applied to the processor 403 or implemented by the processor 403. Processor 403 may be an integrated circuit chip with signal processing capabilities. In the implementation process, each step of the foregoing method may be completed by an integrated logic circuit of hardware in the processor 403 or an instruction in a form of software. The processor 403 may be a general-purpose processor, a digital signal processor (English name: Digital Signal Processing, English abbreviation: DSP), an application specific integrated circuit (English name: Application Specific Integrated Circuit, English abbreviation: ASIC), ready-made programmable Gate array (English name: Field-Programmable Gate Array, English abbreviation: FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components. The methods, steps, and logical block diagrams disclosed in the embodiments of the present application can be implemented or executed. The general purpose processor may be a microprocessor or the processor or any conventional processor or the like. The steps of the method disclosed in the embodiments of the present application may be directly implemented by the hardware decoding processor, or may be performed by a combination of hardware and software modules in the decoding processor. The software module can be located in a conventional storage medium such as random access memory, flash memory, read only memory, programmable read only memory or electrically erasable programmable memory, registers, and the like. The storage medium is located in memory 404, and processor 403 reads the information in memory 404 and, in conjunction with its hardware, performs the steps of the above method.
图4的相关描述可以参阅图1方法部分的相关描述和效果进行理解,此处不做过多赘述。The related description of FIG. 4 can be understood by referring to the related description and effect of the method part of FIG. 1, and no further description is made here.
所属领域的技术人员可以清楚地了解到,为描述的方便和简洁,上述描述的系统,装置和单元的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。A person skilled in the art can clearly understand that for the convenience and brevity of the description, the specific working process of the system, the device and the unit described above can refer to the corresponding process in the foregoing method embodiment, and details are not described herein again.
在本申请所提供的几个实施例中,应该理解到,所揭露的系统,装置和方法,可以通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如,所述单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性,机械或其它的形式。 In the several embodiments provided by the present application, it should be understood that the disclosed system, apparatus, and method may be implemented in other manners. For example, the device embodiments described above are merely illustrative. For example, the division of the unit is only a logical function division. In actual implementation, there may be another division manner, for example, multiple units or components may be combined or Can be integrated into another system, or some features can be ignored or not executed. In addition, the mutual coupling or direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection through some interface, device or unit, and may be in an electrical, mechanical or other form.
所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。The units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
另外,在本申请各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。In addition, each functional unit in each embodiment of the present application may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit. The above integrated unit can be implemented in the form of hardware or in the form of a software functional unit.
所述集成的单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的全部或部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本申请各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、磁碟或者光盘等各种可以存储程序代码的介质。The integrated unit, if implemented in the form of a software functional unit and sold or used as a standalone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present application, in essence or the contribution to the prior art, or all or part of the technical solution may be embodied in the form of a software product stored in a storage medium. A number of instructions are included to cause a computer device (which may be a personal computer, server, or network device, etc.) to perform all or part of the steps of the methods described in various embodiments of the present application. The foregoing storage medium includes: a U disk, a mobile hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disk, and the like. .
以上所述,以上实施例仅用以说明本申请的技术方案,而非对其限制;尽管参照前述实施例对本申请进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本申请各实施例技术方案的精神和范围。 The above embodiments are only used to explain the technical solutions of the present application, and are not limited thereto; although the present application has been described in detail with reference to the foregoing embodiments, those skilled in the art should understand that they can still The technical solutions described in the embodiments are modified, or the equivalents of the technical features are replaced by the equivalents. The modifications and substitutions of the embodiments do not depart from the spirit and scope of the technical solutions of the embodiments of the present application.

Claims (11)

  1. 一种噪声检测方法,其特征在于,包括:A noise detecting method, comprising:
    根据第一公式计算每帧音频信号的幅度谱;Calculating an amplitude spectrum of each frame of the audio signal according to the first formula;
    根据各候选噪声频率确定噪声频率,所述各候选噪声频率由对每帧音频信号进行倒谱分析得到;Determining a noise frequency according to each candidate noise frequency, wherein each candidate noise frequency is obtained by performing cepstrum analysis on each frame of the audio signal;
    根据所述噪声频率,所述幅度谱,和预设计算方法进行计算得到每帧音频信号的噪声能量值。According to the noise frequency, the amplitude spectrum, and a preset calculation method are calculated to obtain a noise energy value of each frame of the audio signal.
  2. 根据权利要求1所述的噪声检测方法,其特征在于,在所述根据各候选噪声频率确定噪声频率之前,所述方法还包括:The noise detecting method according to claim 1, wherein before the determining the noise frequency according to each candidate noise frequency, the method further comprises:
    在预置的频率搜索区间内,将每帧音频信号中幅度值超过预设阈值的各频率确定为各目标频率;Determining, in a preset frequency search interval, each frequency of the amplitude value of each frame of the audio signal exceeding a preset threshold as each target frequency;
    将每个目标频率作为基频,计算基频,所述基频的二次谐波频率,和所述基频的三次谐波频率分别对应的幅度值的几何平均值;Calculating a fundamental frequency, a fundamental frequency, a second harmonic frequency of the fundamental frequency, and a geometric mean value of amplitude values respectively corresponding to a third harmonic frequency of the fundamental frequency;
    将最大几何平均值对应的各目标频率确定为各候选噪声频率。Each target frequency corresponding to the maximum geometric mean is determined as each candidate noise frequency.
  3. 根据权利要求1或2所述的噪声检测方法,其特征在于,在所述根据第一公式计算每帧音频信号的幅度值谱之前,所述方法还包括:The noise detecting method according to claim 1 or 2, wherein before the calculating the amplitude value spectrum of each frame of the audio signal according to the first formula, the method further comprises:
    将采样信号进行分帧和加窗处理,得到至少两帧音频信号。The sampled signal is subjected to framing and windowing to obtain at least two frames of audio signals.
  4. 根据权利要求3中所述的噪声检测方法,其特征在于,在所述将采样信号信号进行分帧和加窗处理,得到至少两帧以上音频信号之前,所述方法还包括:The noise detecting method according to claim 3, wherein before the framing and windowing processing of the sampled signal signal to obtain at least two or more audio signals, the method further comprises:
    将输入音频信号进行N倍的插零上采样,得到所述采样信号,其中,所述N为不小于2的正整数。The input audio signal is N-times up-sampled to obtain the sampled signal, wherein the N is a positive integer not less than 2.
  5. 根据权利要求1至4中任一项所述的噪声检测方法,其特征在于,所述预设计算方法包括响度计算方法。The noise detecting method according to any one of claims 1 to 4, wherein the preset calculating method includes a loudness calculating method.
  6. 一种终端设备,其特征在于,包括:A terminal device, comprising:
    第一计算模块,用于根据第一公式计算每帧音频信号的幅度谱;a first calculating module, configured to calculate an amplitude spectrum of each frame of the audio signal according to the first formula;
    第一确定模块,用于根据各候选噪声频率确定噪声频率,所述各候选噪声频率由对每帧音频信号进行倒谱分析得到;a first determining module, configured to determine a noise frequency according to each candidate noise frequency, where each candidate noise frequency is obtained by performing cepstrum analysis on each frame of the audio signal;
    第二计算模块,用于根据所述噪声频率,所述幅度谱,和预设计算方法进行计算得到每帧音频信号的噪声能量值。And a second calculating module, configured to calculate, according to the noise frequency, the amplitude spectrum, and a preset calculation method, a noise energy value of each frame of the audio signal.
  7. 根据权利要求6所述的终端设备,其特征在于,所述终端设备还包括:The terminal device according to claim 6, wherein the terminal device further comprises:
    第二确定模块,用于在预置的频率搜索区间内,将每帧音频信号中幅度值超过预设阈值的各频率确定为各目标频率;a second determining module, configured to determine, in a preset frequency search interval, each frequency of the amplitude value of each frame of the audio signal exceeding a preset threshold as each target frequency;
    第三计算模块,用于将每个目标频率作为基频,计算基频,所述基频的二次谐波频率,和所述基频的三次谐波频率分别对应的幅度值的几何平均值;a third calculating module, configured to calculate, as a fundamental frequency, a fundamental frequency, a second harmonic frequency of the fundamental frequency, and a geometric mean value of the amplitude value corresponding to the third harmonic frequency of the fundamental frequency ;
    第三确定模块,用于将最大几何平均值对应的各目标频率确定为各候选噪声频率。And a third determining module, configured to determine each target frequency corresponding to the maximum geometric mean as each candidate noise frequency.
  8. 根据权利要求6或7所述的终端设备,其特征在于,所述终端设备还包括:The terminal device according to claim 6 or 7, wherein the terminal device further comprises:
    第一处理模块,用于将采样信号进行分帧和加窗处理,得到至少两帧音频信号。 The first processing module is configured to perform framing and windowing processing on the sampling signal to obtain at least two frames of audio signals.
  9. 根据权利要求8所述的终端设备,其特征在于,所述终端设备还包括:The terminal device according to claim 8, wherein the terminal device further comprises:
    第二处理模块,用于将输入音频信号进行N倍的插零上采样,得到所述采样信号,其中,所述N为不小于2的正整数。And a second processing module, configured to perform N times of interpolation on the input audio signal to obtain the sampling signal, where the N is a positive integer not less than 2.
  10. 根据权利要求6至9中任一项所述的终端设备,其特征在于,所述预设计算方法包括响度计算方法。The terminal device according to any one of claims 6 to 9, characterized in that the preset calculation method comprises a loudness calculation method.
  11. 一种终端设备,其特征在于,包括:A terminal device, comprising:
    接收器、发射器、存储器、总线和处理器;Receiver, transmitter, memory, bus, and processor;
    所述总线,用于连接所述接收器、所述发射器、所述存储器和所述处理器;The bus for connecting the receiver, the transmitter, the memory, and the processor;
    所述存储器,用于存储操作指令;The memory is configured to store an operation instruction;
    所述处理器,用于通过调用所述操作指令,执行上述权利要求1至5中的操作。 The processor is configured to perform the operations in the above claims 1 to 5 by calling the operation instruction.
PCT/CN2017/083765 2017-03-07 2017-05-10 Noise detection method, and terminal apparatus WO2018161429A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201780026318.9A CN109074814B (en) 2017-03-07 2017-05-10 Noise detection method and terminal equipment

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201710131996 2017-03-07
CN201710131996.3 2017-03-07

Publications (1)

Publication Number Publication Date
WO2018161429A1 true WO2018161429A1 (en) 2018-09-13

Family

ID=63447205

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/083765 WO2018161429A1 (en) 2017-03-07 2017-05-10 Noise detection method, and terminal apparatus

Country Status (2)

Country Link
CN (1) CN109074814B (en)
WO (1) WO2018161429A1 (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112614507A (en) * 2020-12-09 2021-04-06 腾讯音乐娱乐科技(深圳)有限公司 Method and apparatus for detecting noise
CN115086852A (en) * 2022-05-30 2022-09-20 歌尔股份有限公司 Earphone detection method and device, electronic equipment and computer readable storage medium
CN115457972A (en) * 2022-08-31 2022-12-09 海信冰箱有限公司 Noise data processing method and device for washing machine
CN117792525A (en) * 2023-12-30 2024-03-29 深圳朴为科技有限公司 A low-interference filtering device for radio frequency systems

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111554314B (en) * 2020-05-15 2024-08-16 腾讯科技(深圳)有限公司 Noise detection method, device, terminal and storage medium
CN114125095B (en) * 2020-08-31 2024-09-10 北京小米移动软件有限公司 Terminal equipment, vibration noise control method, device and medium
CN113726367B (en) * 2021-09-01 2023-01-20 嘉兴中科声学科技有限公司 Signal detection method and device and electronic equipment
CN114040309B (en) * 2021-09-24 2024-03-19 北京小米移动软件有限公司 Wind noise detection method, device, electronic equipment and storage medium
CN114385977B (en) * 2021-12-13 2024-05-28 广州方硅信息技术有限公司 Signal effective frequency detection method, terminal equipment and storage medium
CN115083440A (en) * 2022-06-15 2022-09-20 阿里巴巴(中国)有限公司 Audio signal noise reduction method, electronic device, and storage medium
CN115389198B (en) * 2022-08-29 2025-08-08 上汽通用五菱汽车股份有限公司 Signal interference elimination method, device, equipment and computer-readable storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120095753A1 (en) * 2010-10-15 2012-04-19 Honda Motor Co., Ltd. Noise power estimation system, noise power estimating method, speech recognition system and speech recognizing method
CN103646648A (en) * 2013-11-19 2014-03-19 清华大学 Noise power estimation method
CN103730126A (en) * 2012-10-16 2014-04-16 联芯科技有限公司 Noise suppression method and noise suppressor
CN104952449A (en) * 2015-01-09 2015-09-30 珠海高凌技术有限公司 Method and device for identifying environmental noise sources
CN105761726A (en) * 2014-12-15 2016-07-13 华为终端(东莞)有限公司 Method and device for eliminating TDD noise

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6266633B1 (en) * 1998-12-22 2001-07-24 Itt Manufacturing Enterprises Noise suppression and channel equalization preprocessor for speech and speaker recognizers: method and apparatus
CN101872616B (en) * 2009-04-22 2013-02-06 索尼株式会社 Endpoint detection method and system using same
CN103794222B (en) * 2012-10-31 2017-02-22 展讯通信(上海)有限公司 Method and apparatus for detecting voice fundamental tone frequency

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120095753A1 (en) * 2010-10-15 2012-04-19 Honda Motor Co., Ltd. Noise power estimation system, noise power estimating method, speech recognition system and speech recognizing method
CN103730126A (en) * 2012-10-16 2014-04-16 联芯科技有限公司 Noise suppression method and noise suppressor
CN103646648A (en) * 2013-11-19 2014-03-19 清华大学 Noise power estimation method
CN105761726A (en) * 2014-12-15 2016-07-13 华为终端(东莞)有限公司 Method and device for eliminating TDD noise
CN104952449A (en) * 2015-01-09 2015-09-30 珠海高凌技术有限公司 Method and device for identifying environmental noise sources

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112614507A (en) * 2020-12-09 2021-04-06 腾讯音乐娱乐科技(深圳)有限公司 Method and apparatus for detecting noise
CN112614507B (en) * 2020-12-09 2024-06-11 腾讯音乐娱乐科技(深圳)有限公司 Method and device for detecting noise
CN115086852A (en) * 2022-05-30 2022-09-20 歌尔股份有限公司 Earphone detection method and device, electronic equipment and computer readable storage medium
CN115457972A (en) * 2022-08-31 2022-12-09 海信冰箱有限公司 Noise data processing method and device for washing machine
CN117792525A (en) * 2023-12-30 2024-03-29 深圳朴为科技有限公司 A low-interference filtering device for radio frequency systems

Also Published As

Publication number Publication date
CN109074814B (en) 2023-05-09
CN109074814A (en) 2018-12-21

Similar Documents

Publication Publication Date Title
WO2018161429A1 (en) Noise detection method, and terminal apparatus
US11056130B2 (en) Speech enhancement method and apparatus, device and storage medium
US11323807B2 (en) Echo cancellation method and apparatus based on time delay estimation
EP3742756A1 (en) Method and device for detecting wearing state of earphone, earphone, and storage medium
US9916840B1 (en) Delay estimation for acoustic echo cancellation
CN106486131B (en) Method and device for voice denoising
WO2019101123A1 (en) Voice activity detection method, related device, and apparatus
WO2020037555A1 (en) Method, device, apparatus, and system for evaluating microphone array consistency
CN113160846B (en) Noise suppression method and electronic device
US9520141B2 (en) Keyboard typing detection and suppression
US9374651B2 (en) Sensitivity calibration method and audio device
WO2021000498A1 (en) Composite speech recognition method, device, equipment, and computer-readable storage medium
CN103903633A (en) Method and apparatus for detecting voice signal
TW202322106A (en) Method of suppressing wind noise of microphone and electronic device
WO2023000444A1 (en) Method and apparatus for detecting noise of loudspeaker, and electronic device and storage medium
WO2023103253A1 (en) Audio detection method and apparatus, and terminal device
BR112014009647B1 (en) NOISE Attenuation APPLIANCE AND NOISE Attenuation METHOD
CN106024017A (en) Voice detection method and device
CN113345469A (en) Voice signal processing method and device, electronic equipment and storage medium
CN103390403B (en) The extracting method of MFCC feature and device
CN109119097B (en) Pitch detection method, device, storage medium and mobile terminal
US8150062B2 (en) Determination of the adequate measurement window for sound source localization in echoic environments
CN118737196A (en) Noise suppression method, storage medium, electronic device and chip
CN110189763B (en) Sound wave configuration method and device and terminal equipment
CN113205824A (en) Sound signal processing method, device, storage medium, chip and related equipment

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17899684

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17899684

Country of ref document: EP

Kind code of ref document: A1