[go: up one dir, main page]

CN108831497B - Echo compression method and device, storage medium and electronic equipment - Google Patents

Echo compression method and device, storage medium and electronic equipment Download PDF

Info

Publication number
CN108831497B
CN108831497B CN201810495505.8A CN201810495505A CN108831497B CN 108831497 B CN108831497 B CN 108831497B CN 201810495505 A CN201810495505 A CN 201810495505A CN 108831497 B CN108831497 B CN 108831497B
Authority
CN
China
Prior art keywords
frequency point
determining
signal
correlation coefficient
gain value
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810495505.8A
Other languages
Chinese (zh)
Other versions
CN108831497A (en
Inventor
周舒然
李志飞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Volkswagen China Investment Co Ltd
Mobvoi Innovation Technology Co Ltd
Original Assignee
Mobvoi Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Mobvoi Information Technology Co Ltd filed Critical Mobvoi Information Technology Co Ltd
Priority to CN201810495505.8A priority Critical patent/CN108831497B/en
Publication of CN108831497A publication Critical patent/CN108831497A/en
Priority to PCT/CN2019/087301 priority patent/WO2019223603A1/en
Application granted granted Critical
Publication of CN108831497B publication Critical patent/CN108831497B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0264Noise filtering characterised by the type of parameter measurement, e.g. correlation techniques, zero crossing techniques or predictive techniques
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L2021/02082Noise filtering the noise being echo, reverberation of the speech

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Cable Transmission Systems, Equalization Of Radio And Reduction Of Echo (AREA)
  • Telephone Function (AREA)

Abstract

The invention provides an echo compression method and device, a storage medium and electronic equipment, wherein the echo compression method comprises the following steps: determining near-end voice information according to the near-end signal, the reference signal and the residual echo signal; the near-end signal, the reference signal and the residual echo signal are all signals in a set frequency band; determining gain information according to the near-end voice information; and performing echo compression processing on the residual echo signal by using the gain information. Therefore, the scheme provided by the invention can improve the signal-to-noise ratio.

Description

Echo compression method and device, storage medium and electronic equipment
Technical Field
The embodiment of the invention relates to the technical field of voice processing, in particular to an echo compression method and device, a storage medium and electronic equipment.
Background
The intelligent voice technology is applied more and more widely at present, and each intelligent voice device can interact with a user by utilizing the intelligent voice technology. The voice signal received by the smart voice device may include a near-end signal and a reference signal. After a reference signal received by the voice terminal is sounded through a loudspeaker, the reference signal can form an echo.
Currently, echo compression is usually performed on the echo to reduce the echo. But often results in distortion of the near-end speech while reducing the residual echo during the echo compression process. The sound sounds not flat and harsh.
In the course of the inventive process, the inventor found that the echo compression process in the prior art results in a low signal-to-noise ratio.
Disclosure of Invention
In view of this, embodiments of the present invention provide an echo compression method and apparatus, a storage medium, and an electronic device, and mainly aim to improve a signal-to-noise ratio.
In a first aspect, an embodiment of the present invention provides an echo compression method, where the echo compression method includes:
determining near-end voice information according to the near-end signal, the reference signal and the residual echo signal; the near-end signal, the reference signal and the residual echo signal are all signals in a set frequency band;
determining gain information according to the near-end voice information;
and performing echo compression processing on the residual echo signal by using the gain information.
Alternatively to this, the first and second parts may,
the gain information comprises at least one frequency point gain value;
the performing echo compression processing on the residual echo signal by using the gain information includes:
the set overload amount is used for inhibiting the at least one frequency point gain value to obtain at least one overload gain value;
performing smoothing processing on the at least one overload gain value to obtain an adjustment gain value corresponding to the frequency band;
and determining echo compression output corresponding to the residual echo signal by using the adjustment gain value.
Alternatively to this, the first and second parts may,
the suppressing the at least one frequency point gain value by using the set overload amount to obtain at least one overload gain value includes:
suppressing the gain value of the at least one frequency point by using a formula (1) to obtain an overload gain value corresponding to each frequency point;
gain2i=gain1i t(1)
wherein, the gain1iRepresenting a frequency point gain value corresponding to the ith frequency point; the gain2iRepresenting an overload gain value corresponding to the ith frequency point; the t characterizes the overload.
Alternatively to this, the first and second parts may,
the smoothing of the at least one overload gain value to obtain an adjustment gain value corresponding to the frequency band includes:
determining a smooth gain value corresponding to each frequency point according to a formula;
gain3i=K1×gain2(i-1)+K2×gain2i+K3×gain2(i+1)(2)
wherein, the gain3iRepresenting a smooth gain value corresponding to the ith frequency point; the gain2(i-1)Representing an overload gain value corresponding to the (i-1) th frequency point; the gain2(i+1)Representing an overload gain value corresponding to the (i + 1) th frequency point; the K1 characterizes a first constant; the K2 characterizing a second constant; the K3 characterizes a third constant;
determining the adjustment gain value through a formula (3) according to the smooth gain value corresponding to each frequency point;
M=[gain31,……gain3i](3)
wherein the M characterizes the adjustment gain value.
Alternatively to this, the first and second parts may,
the determining the echo compression output corresponding to the residual echo signal by using the adjustment gain value includes:
determining the echo compression output by formula (4) by using the adjustment gain value;
out=M×E (4)
wherein the out characterizes the echo compressed output; the M characterizes the adjusted gain value; the E characterizes the residual echo signal.
Alternatively to this, the first and second parts may,
the determining near-end speech information according to the near-end signal, the reference signal and the residual echo signal includes:
determining a first average correlation coefficient between the near-end signal and the reference signal;
determining a second average correlation coefficient between the near-end signal and the residual echo signal;
and determining near-end voice information according to the first average correlation coefficient and the second average correlation coefficient.
Alternatively to this, the first and second parts may,
the determining a first average correlation coefficient between the near-end signal and the reference signal comprises:
determining at least one frequency point included in the frequency band;
determining at least one first correlation coefficient corresponding to the at least one frequency point according to the near-end signal and the reference signal;
and determining the first average correlation coefficient according to the at least one first correlation coefficient.
Alternatively to this, the first and second parts may,
said determining a second average correlation coefficient between said near-end signal and said residual echo signal comprises:
determining at least one frequency point included in the frequency band;
determining at least one second correlation coefficient corresponding to the at least one frequency point according to the near-end signal and the residual echo signal;
and determining the second average correlation coefficient according to the at least one second correlation coefficient.
Alternatively to this, the first and second parts may,
the determining near-end speech information according to the first average correlation coefficient and the second average correlation coefficient includes:
determining a near-end speech value through an equation set (1) according to the first average correlation coefficient and the second average correlation coefficient;
Figure GDA0002406110680000041
wherein T represents the near-end speech value; the above-mentioned
Figure GDA0002406110680000042
Characterizing the first average correlation coefficient; the above-mentioned
Figure GDA0002406110680000043
Characterizing the second average correlation coefficient; the K1 characterizes a first constant; the K2 characterizing a second constant; the K3 characterizes a third constant; the K4 characterizes a fourth constant;
when the near-end voice value is 1, the near-end voice information is that near-end voice exists;
and when the near-end voice value is 0, the near-end voice information is that near-end voice does not exist.
In a second aspect, the present invention provides an echo compression device, comprising:
the near-end voice determining module is used for determining near-end voice information according to the near-end signal, the reference signal and the residual echo signal; the near-end signal, the reference signal and the residual echo signal are all signals in a set frequency band;
a gain information determining module, configured to determine gain information according to the near-end speech information determined by the near-end speech determining module;
and the processing module is used for performing echo compression processing on the residual echo signal by using the gain information determined by the gain information determining module.
In a third aspect, the present invention provides a storage medium, where the storage medium includes a stored program, and where the apparatus in which the storage medium is located is controlled to execute any one of the above echo compression methods when the program runs.
In a fourth aspect, the present invention provides an electronic device, including a processor, a memory, and a bus; the processor and the memory complete mutual communication through the bus; the processor is configured to call program instructions in the memory to perform the echo compression method of any of the above.
The embodiment of the invention provides an echo compression method and device, a storage medium and electronic equipment, which determine near-end voice information (the near-end voice information can be near-end voice or not) according to a near-end signal, a reference signal and a residual echo signal in a set frequency band. And then determining gain information according to the near-end voice information, and performing echo compression processing on the residual echo signal by using the gain information so as to reduce the residual echo signal. As can be seen from the above, in the embodiment of the present invention, the echo compression processing may be performed on the residual echo signal according to the gain information determined by the near-end speech information, so that the residual echo signal may be reduced to the maximum. Therefore, the scheme provided by the embodiment of the invention can improve the signal to noise ratio.
The foregoing description is only an overview of the technical solutions of the present invention, and the embodiments of the present invention are described below in order to make the technical means of the present invention more clearly understood and to make the above and other objects, features, and advantages of the present invention more clearly understandable.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly introduced below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to these drawings without creative efforts.
Fig. 1 is a flow chart illustrating an echo compression method according to an embodiment of the present invention;
fig. 2 is a flow chart of an echo compression method according to another embodiment of the present invention;
fig. 3 is a schematic structural diagram of an echo compressing device according to an embodiment of the present invention;
fig. 4 is a schematic structural diagram of an echo compressing device according to another embodiment of the present invention;
fig. 5 is a schematic structural diagram of an echo compressing device according to yet another embodiment of the present invention;
fig. 6 is a schematic structural diagram illustrating an echo compressing device according to yet another embodiment of the present invention;
fig. 7 is a schematic structural diagram of an electronic device according to an embodiment of the present invention.
Detailed Description
Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. While exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the disclosure to those skilled in the art.
As shown in fig. 1, an embodiment of the present invention provides an echo compression method, which may include the following steps:
step 101: determining near-end voice information according to the near-end signal, the reference signal and the residual echo signal; the near-end signal, the reference signal and the residual echo signal are all signals in a set frequency band;
step 102: determining gain information according to the near-end voice information;
step 103: and performing echo compression processing on the residual echo signal by using the gain information.
According to the embodiment shown in fig. 1, near-end speech information (which may be the presence of near-end speech or the absence of near-end speech) is determined according to a near-end signal, a reference signal and a residual echo signal within a set frequency band. And then determining gain information according to the near-end voice information, and performing echo compression processing on the residual echo signal by using the gain information so as to reduce the residual echo signal. As can be seen from the above, in the embodiment of the present invention, the echo compression processing may be performed on the residual echo signal according to the gain information determined by the near-end speech information, so that the residual echo signal may be reduced to the maximum. Therefore, the scheme provided by the embodiment of the invention can improve the signal to noise ratio.
In an embodiment of the present invention, the near-end signal, the reference signal, and the residual echo signal involved in the flowchart shown in fig. 1 are all located in a set frequency band, which may be determined according to service requirements. For example, the frequency band may be 500Hz to 3000 Hz.
In an embodiment of the present invention, the step 101 in the flowchart shown in fig. 1 determines the near-end speech information according to the near-end signal, the reference signal and the residual echo signal, which may include:
a1: determining a first average correlation coefficient between the near-end signal and the reference signal;
a2: determining a second average correlation coefficient between the near-end signal and the residual echo signal;
a3: and determining near-end voice information according to the first average correlation coefficient and the second average correlation coefficient.
In an embodiment of the present invention, the step a1 involved in the foregoing embodiment of determining the first average correlation coefficient between the near-end signal and the reference signal may include:
determining at least one frequency point included in the frequency band;
determining at least one first correlation coefficient corresponding to the at least one frequency point according to the near-end signal and the reference signal;
and determining the first average correlation coefficient according to the at least one first correlation coefficient.
In this embodiment, at least one frequency point corresponding to the frequency band may be determined according to a service requirement, where each frequency point corresponds to a part of the reference signal and a part of the near-end signal, respectively.
In this embodiment, the near-end signal and the reference signal are used to determine a first near-end smooth value, a reference smooth value and a first integrated smooth value, and each of the smooth values can be determined according to equation set (2).
Figure GDA0002406110680000071
Wherein R isabiRepresenting a first comprehensive smooth value corresponding to the ith frequency point; raaiRepresenting a first near-end smooth value corresponding to the ith frequency point; rbbiRepresenting the reference smooth value corresponding to the ith frequency point, β representing the first smooth coefficient, Rab(i-1)Representing a first comprehensive smooth value corresponding to a near-end signal and a reference signal corresponding to the (i-1) th frequency point; raa(i-1)Representing a first near-end smooth value corresponding to a near-end signal corresponding to the (i-1) th frequency point; rbb(i-1)Representing a reference smooth value corresponding to a reference signal corresponding to the (i-1) th frequency point; a isiA near-end signal corresponding to the ith frequency point; biRepresenting reference signal corresponding to ith frequency point。
Then, the first correlation coefficient corresponding to each frequency point can be calculated according to the following formula (5).
Figure GDA0002406110680000072
Wherein M isabiAnd characterizing a first correlation coefficient corresponding to the ith frequency point.
In this embodiment, after the first correlation coefficients corresponding to the frequency points are determined, the first correlation coefficients are summed, and the sum is divided by the first average correlation coefficient of the total number of the frequency points.
According to the embodiment, the first average correlation coefficient is determined by using the first correlation coefficient corresponding to each frequency point and the total amount of the frequency points. Therefore, the determined first average correlation coefficient can reflect the current states of the near-end signal and the reference signal.
In an embodiment of the present invention, the step a2 involved in the foregoing embodiment of determining the second average correlation coefficient between the near-end signal and the residual echo signal may include:
determining at least one frequency point included in the frequency band;
determining at least one second correlation coefficient corresponding to the at least one frequency point according to the near-end signal and the residual echo signal;
and determining the second average correlation coefficient according to the at least one second correlation coefficient.
In this embodiment, at least one frequency point corresponding to the frequency band may be determined according to a service requirement, where each frequency point corresponds to a part of the reference signal and a part of the residual echo signal, respectively. It should be noted that each frequency point may be the same as the frequency point determined when the first average correlation coefficient is determined.
In this embodiment, the near-end signal and the residual echo signal are used to determine a second near-end smoothing value, a residual echo smoothing value, and a second combined smoothing value, and the above smoothing values can be determined by equation set (3).
Figure GDA0002406110680000081
Wherein, R isaciRepresenting a second comprehensive smooth value corresponding to the ith frequency point; the R isaaiRepresenting a second near-end smooth value corresponding to the ith frequency point; the R iscciRepresenting a residual echo smooth value corresponding to the ith frequency point; the gamma characterizes a second smoothing coefficient; the R isac(i-1)Representing a second comprehensive smooth value corresponding to the near-end signal and the residual echo signal corresponding to the (i-1) th frequency point; the R isaa(i-1)Representing a second near-end smooth value corresponding to the near-end signal corresponding to the i-1 frequency point; the R iscc(i-1)Representing a residual echo smooth value corresponding to a reference signal corresponding to the (i-1) th frequency point; a isiA near-end signal corresponding to the ith frequency point; c. CiAnd characterizing the residual echo signal corresponding to the ith frequency point.
Then, the second correlation coefficient corresponding to each frequency point can be calculated according to the following formula (6).
Figure GDA0002406110680000082
Wherein M isaciRepresenting a second correlation coefficient corresponding to the ith frequency point; raciRepresenting a second comprehensive smooth value corresponding to the ith frequency point; raaiRepresenting a second near-end smooth value corresponding to the ith frequency point; rcciAnd representing a residual echo smooth value corresponding to the ith frequency point.
In this embodiment, after the second correlation numbers corresponding to the frequency points are determined, the second correlation numbers are summed, and the sum is divided by the second average correlation coefficient of the total number of the frequency points.
According to the embodiment, the second average correlation coefficient is determined by using the second correlation number corresponding to each frequency point and the total amount of the frequency points. Therefore, the determined second average correlation coefficient can reflect the current states of the near-end signal and the residual echo signal.
In an embodiment of the present invention, the step a3 related in the foregoing embodiment of determining the near-end speech information according to the first average correlation coefficient and the second average correlation coefficient may include:
determining a near-end speech value through an equation set (1) according to the first average correlation coefficient and the second average correlation coefficient;
Figure GDA0002406110680000091
wherein T represents the near-end speech value; the above-mentioned
Figure GDA0002406110680000092
Characterizing the first average correlation coefficient; the above-mentioned
Figure GDA0002406110680000093
Characterizing the second average correlation coefficient; the K1 characterizes a first constant; the K2 characterizing a second constant; the K3 characterizes a third constant; the K4 characterizes a fourth constant;
when the near-end voice value is 1, the near-end voice information is that near-end voice exists;
and when the near-end voice value is 0, the near-end voice information is that near-end voice does not exist.
In the embodiment, K1-K4 can be determined according to the service requirement. For example, the first constant K1 is preferably 0.95; the second constant K2 is 0.15; the third constant is 0.9; the fourth constant is 0.2.
In the present embodiment, the following are exemplified: at the moment of determining
Figure GDA0002406110680000094
Greater than 0.95, and
Figure GDA0002406110680000095
if the value is less than 0.15, the value of the near-end voice is determined to be 1, and then the existence of the near-end voice can be judged.
In the present embodiment, the following are exemplified: at the moment of determining
Figure GDA0002406110680000096
Less than 0.9, and
Figure GDA0002406110680000097
if the value is greater than 0.2, the value of the near-end voice is determined to be 0, and then the fact that the near-end voice does not exist can be judged.
According to the embodiment, whether the near-end speech exists can be determined according to the relationship between the first average correlation coefficient and the set constant and the second average correlation coefficient, so that the determined near-end speech information is more accurate.
In an embodiment of the present invention, the gain information related in the flowchart shown in fig. 1 includes at least one frequency point gain value;
when the near-end speech information referred to in the above flowchart shown in fig. 1 is that no near-end speech exists,
then, the step 102 in the flowchart shown in fig. 1 determines gain information according to the near-end speech information, which may include:
determining at least one frequency point included in the frequency band;
determining at least one first correlation coefficient corresponding to the at least one frequency point according to the near-end signal and the reference signal;
determining at least one second correlation coefficient corresponding to the at least one frequency point according to the near-end signal and the residual echo signal;
and determining a frequency point gain value corresponding to each frequency point according to the at least one correlation coefficient and the at least one second correlation coefficient.
In this embodiment, the determining, according to the at least one correlation coefficient and the at least one second correlation coefficient, a frequency point gain value corresponding to each frequency band may include:
determining a frequency point gain value corresponding to each frequency point according to a formula (7);
Figure GDA0002406110680000101
wherein, the gain1iRepresenting a frequency point gain value corresponding to the ith frequency point; said C isaciRepresenting a second correlation coefficient corresponding to the ith frequency point; said C isabiAnd characterizing a first correlation coefficient corresponding to the ith frequency point.
In an embodiment of the present invention, the gain information related in the flowchart shown in fig. 1 includes at least one frequency point gain value;
when the near-end speech information referred to in the above flowchart shown in fig. 1 is the presence of near-end speech,
then, the step 102 in the flowchart shown in fig. 1 determines gain information according to the near-end speech information, which may include:
determining at least one frequency point included in the frequency band;
determining at least one second correlation coefficient corresponding to the at least one frequency point according to the near-end signal and the residual echo signal;
executing for each frequency point: and determining the frequency point gain value of the frequency point as a second correlation coefficient corresponding to the frequency point.
According to the embodiment, the frequency point gain value corresponding to each frequency point can be determined in a targeted manner no matter the near-end voice information exists or the near-end voice signal does not exist.
In an embodiment of the present invention, the gain information related in the flowchart shown in fig. 1 includes at least one frequency point gain value;
the step 103 in the flowchart shown in fig. 1 performs echo compression processing on the residual echo signal by using the gain information, which may include:
b1: the set overload capacity is used for carrying out suppression processing on the at least one gain value to obtain at least one overload gain value;
b2: performing smoothing processing on the at least one overload gain value to obtain an adjustment gain value corresponding to the frequency band;
b3: and determining echo compression output corresponding to the residual echo signal by using the adjustment gain value.
In an embodiment of the present invention, the step B1 related in the foregoing embodiment performs suppression processing on the at least one frequency point gain value by using the set overload amount to obtain at least one overload gain value, where the method includes:
suppressing the gain value of the at least one frequency point by using a formula (1) to obtain an overload gain value corresponding to each frequency point;
gain2i=gain1i t(1)
wherein, the gain1iRepresenting a frequency point gain value corresponding to the ith frequency point; the gain2iRepresenting an overload gain value corresponding to the ith frequency point; the t characterizes the overload.
In this embodiment, the amount of overload may be determined based on traffic requirements. For example, the amount of excess may be any value between-1 and 1.
In this embodiment, the overload amount is utilized to suppress the frequency point gain value of each frequency point, and particularly, the suppression effect is strong in a high frequency band.
According to the embodiment, the overload capacity is utilized to suppress the gain value of each frequency point, so that the overload gain value corresponding to each frequency point is obtained. Therefore, the overload gain values corresponding to the frequency points can be utilized to inhibit the frequency points.
In an embodiment of the present invention, the step B2 of the above embodiment of smoothing the at least one overload gain value to obtain an adjustment gain value corresponding to the frequency band may include:
determining a smooth gain value corresponding to each frequency point according to a formula (2);
gain3i=K1×gain2(i-1)+K2×gain2i+K3×gain2(i+1)(2)
wherein, the gain3iRepresenting a smooth gain value corresponding to the ith frequency point; the gain2(i-1)Representing an overload gain value corresponding to the (i-1) th frequency point; the gain2(i+1)Representing an overload gain value corresponding to the (i + 1) th frequency point; the K1 characterizes a first constant; the K2 characterizing a second constant; the K3 characterizes a third constant;
determining the adjustment gain value through a formula (3) according to the smooth gain value corresponding to each frequency point;
M=[gain31,……gain3i](3)
wherein the M characterizes the adjustment gain value.
In this embodiment, the smooth gain value corresponding to each frequency point is determined according to the overload gain value of the frequency point, the overload gain value of the previous frequency point adjacent to the frequency point, and the overload gain value corresponding to the next frequency point adjacent to the frequency point. Therefore, the smooth gain values corresponding to the frequency points can reflect the change condition of the signal as a whole.
In this embodiment, the adjustment gain value is determined according to the smooth gain value corresponding to each frequency point, so that the adjustment gain value can reduce the echo to the maximum.
In an embodiment of the present invention, the step B3 in the foregoing embodiment, determining an echo compression output corresponding to the residual echo signal by using the adjusted gain value, may include:
determining the echo compression output by formula (4) by using the adjustment gain value;
out=M×E (4)
wherein the out characterizes the echo compressed output; the M characterizes the adjusted gain value; the E characterizes the residual echo signal.
In this embodiment, the adjustment gain value is obtained after being subjected to the overload processing and the smoothing processing. Therefore, when the residual echo signal is compressed by using the adjustment gain value, the noise can be reduced to the maximum extent, so that the voice is more gentle and not harsh.
The echo compression method will be described below by taking the example of obtaining the near-end signal, the reference signal and the residual echo signal in the frequency band of "500 Hz to 3000 Hz". As shown in fig. 2, the echo compression method includes:
step 201: and acquiring a near-end signal, a reference signal and a residual echo signal with the frequency of 500Hz-3000 Hz.
Step 202: at least one frequency point included in the frequency band is determined.
Step 203: and determining at least one first correlation coefficient corresponding to at least one frequency point according to the near-end signal and the reference signal.
In this step, the first correlation coefficient corresponding to each frequency point can be determined according to the formula (5).
Step 204: a first average correlation coefficient is determined from the at least one first correlation coefficient.
In this step, the respective first correlation coefficients are summed, and the sum is divided by the total number of frequency points to the first average correlation coefficient.
Step 205: and determining at least one second correlation coefficient corresponding to at least one frequency point according to the near-end signal and the residual echo signal.
In this step, the second correlation coefficient corresponding to each frequency point can be determined according to the formula (6).
Step 206: and determining a second average correlation coefficient according to the at least one second correlation coefficient.
In this step, the second correlation numbers are summed, and the sum is divided by the total number of frequency points to obtain a second average correlation coefficient.
Step 207: determining near-end voice information according to the first average correlation coefficient and the second average correlation coefficient; when the near-end voice information is that near-end voice exists, executing step 208; when the near-end voice information is that no near-end voice exists, step 209 is executed.
In this step, the determination of the near-end speech information is made according to equation set (1). Wherein the first constant K1 in equation set (1) is 0.95; the second constant K2 is 0.15; the third constant is 0.9; the fourth constant is 0.2.
In this step, the determination is made
Figure GDA0002406110680000131
Less than 0.9, and
Figure GDA0002406110680000132
if it is greater than 0.2, it is determined that the value of the near-end speech is 0, and it can be determined that there is no near-end speech, and step 209 is executed.
Step 208: and determining the frequency point gain value corresponding to each frequency point according to each first correlation coefficient and each second correlation coefficient, and executing the step 210.
Step 209: and determining the frequency point gain value of each frequency point as the corresponding second correlation coefficient, and executing step 210.
In this step, a frequency point gain value corresponding to each frequency point is determined according to each second correlation coefficient determined in step 205.
Step 210: and inhibiting the gain value of each frequency point to obtain the overload gain value corresponding to each frequency point.
In this step, the suppression processing is performed on each bin gain value according to the formula (1). For example, the selected amount of overload t is 0.5.
Step 211: and smoothing each overload gain value to determine a smooth gain value corresponding to each frequency point.
In this step, the formula (2) is adopted to smooth each overload gain value, and a smooth gain value corresponding to each frequency point is obtained.
Step 212: and determining an adjustment gain value according to the determined smooth gain values.
In this step, the adjustment gain value is determined by each smoothed gain value according to equation (3).
Step 213: and determining the echo compression output by using the adjusted gain value.
In this step, the residual echo signal is subjected to echo compression processing by using the formula (4), so as to obtain echo compression output. Because the adjustment gain value is utilized to compress the residual echo signal, the noise can be reduced to the maximum extent, and the voice is more smooth and not harsh.
As shown in fig. 3, an embodiment of the present invention provides an echo compression device, which may include:
a near-end speech determining module 301, configured to determine near-end speech information according to the near-end signal, the reference signal, and the residual echo signal; the near-end signal, the reference signal and the residual echo signal are all signals in a set frequency band;
a gain information determining module 302, configured to determine gain information according to the near-end speech information determined by the near-end speech determining module 301;
a processing module 303, configured to perform echo compression processing on the residual echo signal by using the gain information determined by the gain information determining module 302.
According to the embodiment shown in fig. 3, in the embodiment of the present invention, the processing module may perform echo compression processing on the residual echo signal according to the gain information determined by the near-end speech information, so as to reduce the residual echo signal to the maximum. Therefore, the scheme provided by the embodiment of the invention can improve the signal to noise ratio.
In an embodiment of the present invention, when the gain information includes at least one frequency point gain value and when the near-end speech information is that near-end speech does not exist, as shown in fig. 4,
the gain information determination module 302 may include: a coefficient determining submodule 3021 and a frequency point gain value determining submodule 3022;
the coefficient determining submodule 3021 is configured to determine at least one frequency point included in the frequency band; determining at least one first correlation coefficient corresponding to the at least one frequency point according to the near-end signal and the reference signal; determining at least one second correlation coefficient corresponding to the at least one frequency point according to the near-end signal and the residual echo signal;
the frequency point gain value determining sub-module 3022 is configured to determine, according to the at least one correlation coefficient and the at least one second correlation coefficient, a frequency point gain value corresponding to each frequency point.
In an embodiment of the present invention, the frequency point gain value determining sub-module 3022 is configured to determine, according to formula (7), a frequency point gain value corresponding to each frequency point;
Figure GDA0002406110680000151
wherein, the gain1iRepresenting a frequency point gain value corresponding to the ith frequency point; said C isaciRepresenting a second correlation coefficient corresponding to the ith frequency point; said C isabiAnd characterizing a first correlation coefficient corresponding to the ith frequency point.
In an embodiment of the present invention, when the gain information includes at least one frequency point gain value, and when the near-end speech information is near-end speech, the gain information determining module 302 is configured to determine at least one frequency point included in the frequency band; determining at least one second correlation coefficient corresponding to the at least one frequency point according to the near-end signal and the residual echo signal; executing for each frequency point: and determining the frequency point gain value of the frequency point as a second correlation coefficient corresponding to the frequency point.
In one embodiment of the present invention, as shown in fig. 5, when the gain information includes at least one bin gain value,
the processing module 303 may include: a suppression submodule 3031, a smoothing submodule 3032 and a determination submodule 3033;
the suppression submodule 3031 is configured to perform suppression processing on the at least one frequency point gain value by using a set overload amount to obtain at least one overload gain value;
the smoothing submodule 3032 is configured to smooth the at least one overload gain value to obtain an adjustment gain value corresponding to the frequency band;
the determining submodule 3033 is configured to determine, by using the adjustment gain value, an echo compression output corresponding to the residual echo signal.
In an embodiment of the present invention, the suppressor module 3031 is configured to perform suppression processing on the gain value of the at least one frequency point according to formula (1) to obtain an overload gain value corresponding to each frequency point;
gain2i=gain1i t(1)
wherein, the gain1iRepresenting a frequency point gain value corresponding to the ith frequency point; the gain2iRepresenting an overload gain value corresponding to the ith frequency point; the t characterizes the overload.
In an embodiment of the present invention, the smoothing sub-module 3032 is configured to determine a smoothing gain value corresponding to each frequency point according to formula (2); determining the adjustment gain value through a formula (3) according to the smooth gain value corresponding to each frequency point;
gain3i=K1×gain2(i-1)+K2×gain2i+K3×gain2(i+1)(2)
wherein, the gain3iRepresenting a smooth gain value corresponding to the ith frequency point; the gain2(i-1)Representing an overload gain value corresponding to the (i-1) th frequency point; the gain2(i+1)Representing an overload gain value corresponding to the (i + 1) th frequency point; the K1 characterizes a first constant; the K2 characterizing a second constant; the K3 characterizes a third constant;
M=[gain31,……gain3i](3)
wherein the M characterizes the adjustment gain value.
In an embodiment of the present invention, the determining submodule 3033 is configured to determine the echo compression output according to formula (4) by using the adjustment gain value;
out=M×E(4)
wherein the out characterizes the echo compressed output; the M characterizes the adjusted gain value; the E characterizes the residual echo signal.
In an embodiment of the present invention, as shown in fig. 6, the near-end speech determination module 301 may include: a first coefficient determination sub-module 3011, a second coefficient determination sub-module 3012, and a near-end speech determination sub-module 3013;
the first coefficient determination sub-module 3011, configured to determine a first average correlation coefficient between the near-end signal and the reference signal;
the second coefficient determining sub-module 3012, configured to determine a second average correlation coefficient between the near-end signal and the residual echo signal;
the near-end speech determining sub-module 3013 is configured to determine near-end speech information according to the first average correlation coefficient and the second average correlation coefficient.
In an embodiment of the present invention, the first coefficient determining sub-module 3011 is configured to determine at least one frequency point included in the frequency band; determining at least one first correlation coefficient corresponding to the at least one frequency point according to the near-end signal and the reference signal; and determining the first average correlation coefficient according to the at least one first correlation coefficient.
In an embodiment of the present invention, the second coefficient determining sub-module 3012 is configured to determine at least one frequency point included in the frequency band; determining at least one second correlation coefficient corresponding to the at least one frequency point according to the near-end signal and the residual echo signal; and determining the second average correlation coefficient according to the at least one second correlation coefficient.
In an embodiment of the present invention, the near-end speech determining sub-module 3013 is configured to determine a near-end speech value according to the first average correlation coefficient and the second average correlation coefficient through an equation set (1); when the near-end voice value is 1, the near-end voice information is that near-end voice exists; when the near-end voice value is 0, the near-end voice information is that near-end voice does not exist;
Figure GDA0002406110680000171
wherein T represents the near-end speech value; the above-mentioned
Figure GDA0002406110680000172
Characterizing the first average correlation coefficient; the above-mentioned
Figure GDA0002406110680000173
Characterizing the second average correlation coefficient; the K1 characterizes a first constant; the K2 characterizing a second constant; the K3 characterizes a third constant; the K4 characterizes a fourth constant;
in one embodiment of the present invention, the first constant K1 is 0.95; the second constant K2 is 0.15; the third constant is 0.9; the fourth constant is 0.2.
An embodiment of the present invention provides a storage medium, where the storage medium includes a stored program, where when the program runs, a device in which the storage medium is located is controlled to execute the echo compression method described in any one of the above.
In an embodiment of the present invention, an electronic device is provided, as shown in fig. 7, which includes a processor 401, a memory 402, and a bus 403; the processor 401 and the memory 402 complete communication with each other through the bus 403; the processor 401 is configured to call program instructions in the memory 402 to execute the echo compression method described in any of the above.
Because the information interaction, execution process, and other contents between the units in the device are based on the same concept as the method embodiment of the present invention, specific contents may refer to the description in the method embodiment of the present invention, and are not described herein again.
The embodiments of the invention have at least the following beneficial effects:
1. in the embodiment of the present invention, near-end speech information (the near-end speech information may be the presence of near-end speech or the absence of near-end speech) is determined according to a near-end signal, a reference signal and a residual echo signal within a set frequency band. And then determining gain information according to the near-end voice information, and performing echo compression processing on the residual echo signal by using the gain information so as to reduce the residual echo signal. Therefore, in the scheme, the echo compression processing can be performed on the residual echo signal according to the gain information determined by the near-end voice information, so that the residual echo signal can be reduced to the maximum extent. Therefore, the scheme provided by the embodiment of the invention can improve the signal to noise ratio.
2. In the embodiment of the invention, the first average correlation coefficient is determined by using the first correlation coefficient corresponding to each frequency point and the total amount of the frequency points. Therefore, the determined first average correlation coefficient can reflect the current states of the near-end signal and the reference signal.
3. In the embodiment of the invention, the second average correlation coefficient is determined by using the second correlation number corresponding to each frequency point and the total amount of the frequency points. Therefore, the determined second average correlation coefficient can reflect the current states of the near-end signal and the residual echo signal.
4. In the embodiment of the present invention, since whether the near-end speech exists can be determined according to the relationship between the first average correlation coefficient and the set constant, the determined near-end speech information is more accurate.
5. In the embodiment of the invention, the frequency point gain value corresponding to each frequency point can be determined in a targeted manner no matter the near-end voice information exists or the near-end voice signal does not exist.
6. In the embodiment of the invention, the overload capacity is utilized to carry out suppression processing on the gain value of each frequency point, so that the overload gain value corresponding to each frequency point is obtained. Therefore, the overload gain values corresponding to the frequency points can be utilized to inhibit the frequency points.
7. In the embodiment of the invention, the adjustment gain value is determined according to the smooth gain value corresponding to each frequency point, so that the adjustment gain value can reduce the echo to the maximum extent.
8. In the embodiment of the invention, the adjustment gain value is obtained after overload processing and smoothing processing. Therefore, when the residual echo signal is compressed by using the adjustment gain value, the noise can be reduced to the maximum extent, so that the voice is more gentle and not harsh.
It is noted that, herein, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising a" does not exclude the presence of other similar elements in a process, method, article, or apparatus that comprises the element.
Those of ordinary skill in the art will understand that: all or part of the steps for realizing the method embodiments can be completed by hardware related to program instructions, the program can be stored in a computer readable storage medium, and the program executes the steps comprising the method embodiments when executed; and the aforementioned storage medium includes: various media that can store program codes, such as ROM, RAM, magnetic or optical disks.
Finally, it is to be noted that: the above description is only a preferred embodiment of the present invention, and is only used to illustrate the technical solutions of the present invention, and not to limit the protection scope of the present invention. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention shall fall within the protection scope of the present invention.

Claims (9)

1. A method of echo compression, comprising:
determining near-end voice information according to the near-end signal, the reference signal and the residual echo signal; the near-end signal, the reference signal and the residual echo signal are all signals in a set frequency band;
determining gain information according to the near-end voice information;
performing echo compression processing on the residual echo signal by using the gain information;
the determining near-end speech information according to the near-end signal, the reference signal and the residual echo signal includes:
determining a first average correlation coefficient between the near-end signal and the reference signal;
determining a second average correlation coefficient between the near-end signal and the residual echo signal;
determining near-end voice information according to the first average correlation coefficient and the second average correlation coefficient;
the determining a first average correlation coefficient between the near-end signal and the reference signal comprises:
determining a first near-end smoothing value, a reference smoothing value and a first comprehensive smoothing value by using the near-end signal and the reference signal through the following equation set;
Figure FDA0002406110670000011
wherein R isabiRepresenting a first comprehensive smooth value corresponding to the ith frequency point; raaiRepresenting a first near-end smooth value corresponding to the ith frequency point; rbbiRepresenting the reference smooth value corresponding to the ith frequency point, β representing the first smooth coefficient, Rab(i-1)Representing a first comprehensive smooth value corresponding to a near-end signal and a reference signal corresponding to the (i-1) th frequency point; r'aa(i-1)Representing a first near-end smooth value corresponding to a near-end signal corresponding to the (i-1) th frequency point; r'bb(i-1)Representing a reference smooth value corresponding to a reference signal corresponding to the (i-1) th frequency point; a isiA near-end signal corresponding to the ith frequency point; biRepresenting a reference signal corresponding to the ith frequency point;
determining a first correlation coefficient of each frequency point according to the first near-end smooth value, the reference smooth value and the first comprehensive smooth value of each frequency point;
and summing the first correlation coefficients of the frequency points, and determining the quotient of the summation result and the total number of the frequency points as a first average correlation coefficient.
2. The echo compression method of claim 1,
the gain information comprises at least one frequency point gain value;
the performing echo compression processing on the residual echo signal by using the gain information includes:
the set overload amount is used for inhibiting the at least one frequency point gain value to obtain at least one overload gain value;
performing smoothing processing on the at least one overload gain value to obtain an adjustment gain value corresponding to the frequency band;
and determining echo compression output corresponding to the residual echo signal by using the adjustment gain value.
3. The echo compression method of claim 2,
the suppressing the at least one frequency point gain value by using the set overload amount to obtain at least one overload gain value includes:
the gain value of at least one frequency point is suppressed through a first formula, and an overload gain value corresponding to each frequency point is obtained;
the first formula includes:
gain2i=gain1i t
wherein, the gain1iRepresenting a frequency point gain value corresponding to the ith frequency point; the gain2iRepresenting an overload gain value corresponding to the ith frequency point; the t represents the overload amount;
and/or the presence of a gas in the gas,
the smoothing of the at least one overload gain value to obtain an adjustment gain value corresponding to the frequency band includes:
determining a smooth gain value corresponding to each frequency point according to a second formula;
the second formula includes:
gain3i=K1×gain2(i-1)+K2×gain2i+K3×gain2(i+1)
wherein, the gain3iRepresenting a smooth gain value corresponding to the ith frequency point; the gain2(i-1)Representing an overload gain value corresponding to the (i-1) th frequency point; the gain2(i+1)Representing an overload gain value corresponding to the (i + 1) th frequency point; the gain2iRepresenting an overload gain value corresponding to the ith frequency point; the K1 characterizes a first constant; the K2 characterizing a second constant; the K3 characterizes a third constant;
determining the adjustment gain value through a third formula according to the smooth gain value corresponding to each frequency point;
the third formula includes:
M=[gain31,……gain3i]
wherein the M characterizes the adjustment gain value.
4. The echo compression method of claim 3,
the determining the echo compression output corresponding to the residual echo signal by using the adjustment gain value includes:
determining the echo compression output by a fourth formula by using the adjustment gain value;
the fourth formula includes:
out=M×E
wherein the out characterizes the echo compressed output; the M characterizes the adjusted gain value; the E characterizes the residual echo signal.
5. The echo compression method of claim 1,
the determining a first average correlation coefficient between the near-end signal and the reference signal comprises:
determining at least one frequency point included in the frequency band;
determining at least one first correlation coefficient corresponding to the at least one frequency point according to the near-end signal and the reference signal;
determining the first average correlation coefficient according to the at least one first correlation coefficient;
and/or the presence of a gas in the gas,
said determining a second average correlation coefficient between said near-end signal and said residual echo signal comprises:
determining at least one frequency point included in the frequency band;
determining at least one second correlation coefficient corresponding to the at least one frequency point according to the near-end signal and the residual echo signal;
and determining the second average correlation coefficient according to the at least one second correlation coefficient.
6. The echo compression method of claim 5,
the determining near-end speech information according to the first average correlation coefficient and the second average correlation coefficient includes:
determining a near-end speech value through a first equation set according to the first average correlation coefficient and the second average correlation coefficient;
the first set of equations includes:
Figure FDA0002406110670000041
wherein T represents the near-end speech value; the above-mentioned
Figure FDA0002406110670000042
Characterizing the first average correlation coefficient; the above-mentioned
Figure FDA0002406110670000043
Characterizing the second average correlation coefficient; the K1 characterizes a first constant; the K2 characterizing a second constant; the K3 characterizes a third constant; the K4 characterizes a fourth constant;
when the near-end voice value is 1, the near-end voice information is that near-end voice exists;
and when the near-end voice value is 0, the near-end voice information is that near-end voice does not exist.
7. An echo compression device, comprising:
the near-end voice determining module is used for determining near-end voice information according to the near-end signal, the reference signal and the residual echo signal; the near-end signal, the reference signal and the residual echo signal are all signals in a set frequency band;
a gain information determining module, configured to determine gain information according to the near-end speech information determined by the near-end speech determining module;
the processing module is used for performing echo compression processing on the residual echo signal by using the gain information determined by the gain information determining module;
the near-end speech determination module comprises:
a first coefficient determination submodule for determining a first average correlation coefficient between the near-end signal and the reference signal;
a second coefficient determination submodule for determining a second average correlation coefficient between the near-end signal and the residual echo signal;
the near-end voice determining submodule is used for determining near-end voice information according to the first average correlation coefficient and the second average correlation coefficient;
a first coefficient determination submodule for determining a first near-end smoothing value, a reference smoothing value and a first integrated smoothing value by using the near-end signal and the reference signal through the following equation set; determining a first correlation coefficient of each frequency point according to the first near-end smooth value, the reference smooth value and the first comprehensive smooth value of each frequency point; summing the first correlation coefficients of all the frequency points, and determining the quotient of the sum result and the total number of the frequency points as a first average correlation coefficient;
Figure FDA0002406110670000051
wherein R isabiRepresenting a first comprehensive smooth value corresponding to the ith frequency point; raaiRepresenting a first near-end smooth value corresponding to the ith frequency point; rbbiRepresenting the reference smooth value corresponding to the ith frequency point, β representing the first smooth coefficient, Rab(i-1)Representing a first comprehensive smooth value corresponding to a near-end signal and a reference signal corresponding to the (i-1) th frequency point; r'aa(i-1)Representing a first near-end smooth value corresponding to a near-end signal corresponding to the (i-1) th frequency point; r'bb(i-1)Representing a reference smooth value corresponding to a reference signal corresponding to the (i-1) th frequency point; a isiA near-end signal corresponding to the ith frequency point; biAnd characterizing the reference signal corresponding to the ith frequency point.
8. A storage medium comprising a stored program, wherein the program, when executed, controls an apparatus in which the storage medium is located to perform the echo compression method according to any one of claims 1 to 6.
9. An electronic device, wherein the electronic device comprises a processor, a memory and a bus; the processor and the memory complete mutual communication through the bus; the processor is configured to call program instructions in the memory to perform the echo compression method of any one of claims 1 to 6.
CN201810495505.8A 2018-05-22 2018-05-22 Echo compression method and device, storage medium and electronic equipment Active CN108831497B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201810495505.8A CN108831497B (en) 2018-05-22 2018-05-22 Echo compression method and device, storage medium and electronic equipment
PCT/CN2019/087301 WO2019223603A1 (en) 2018-05-22 2019-05-16 Voice processing method and apparatus and electronic device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810495505.8A CN108831497B (en) 2018-05-22 2018-05-22 Echo compression method and device, storage medium and electronic equipment

Publications (2)

Publication Number Publication Date
CN108831497A CN108831497A (en) 2018-11-16
CN108831497B true CN108831497B (en) 2020-06-09

Family

ID=64148971

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810495505.8A Active CN108831497B (en) 2018-05-22 2018-05-22 Echo compression method and device, storage medium and electronic equipment

Country Status (1)

Country Link
CN (1) CN108831497B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2019223603A1 (en) * 2018-05-22 2019-11-28 出门问问信息科技有限公司 Voice processing method and apparatus and electronic device
CN109741760B (en) * 2018-12-18 2020-12-22 科大讯飞股份有限公司 Noise estimation method and system

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101888455A (en) * 2010-04-09 2010-11-17 熔点网讯(北京)科技有限公司 A Frequency Domain Adaptive Echo Cancellation Method
CN101917527A (en) * 2010-09-02 2010-12-15 杭州华三通信技术有限公司 Method and device of echo elimination
CN103325379A (en) * 2012-03-23 2013-09-25 杜比实验室特许公司 Method and device used for acoustic echo control
CN103718538A (en) * 2011-05-17 2014-04-09 谷歌公司 Non-linear post-processing for acoustic echo cancellation
CN104395957A (en) * 2012-04-30 2015-03-04 创新科技有限公司 A universal reconfigurable echo cancellation system
US9196258B2 (en) * 2008-05-12 2015-11-24 Broadcom Corporation Spectral shaping for speech intelligibility enhancement
CN105810202A (en) * 2014-12-31 2016-07-27 展讯通信(上海)有限公司 Method and device for reducing echoes, and communication equipment
CN105872156A (en) * 2016-05-25 2016-08-17 腾讯科技(深圳)有限公司 Echo time delay tracking method and device

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104835502B (en) * 2015-05-20 2018-04-10 北京捷思锐科技股份有限公司 Acoustic signal processing method, device and electronic equipment
CN105791611B (en) * 2016-02-22 2020-07-07 腾讯科技(深圳)有限公司 Echo cancellation method, device, terminal and storage medium
CN106782586B (en) * 2016-11-14 2020-08-07 阔地教育科技有限公司 Audio signal processing method and device

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9196258B2 (en) * 2008-05-12 2015-11-24 Broadcom Corporation Spectral shaping for speech intelligibility enhancement
CN101888455A (en) * 2010-04-09 2010-11-17 熔点网讯(北京)科技有限公司 A Frequency Domain Adaptive Echo Cancellation Method
CN101917527A (en) * 2010-09-02 2010-12-15 杭州华三通信技术有限公司 Method and device of echo elimination
CN103718538A (en) * 2011-05-17 2014-04-09 谷歌公司 Non-linear post-processing for acoustic echo cancellation
CN103325379A (en) * 2012-03-23 2013-09-25 杜比实验室特许公司 Method and device used for acoustic echo control
CN104395957A (en) * 2012-04-30 2015-03-04 创新科技有限公司 A universal reconfigurable echo cancellation system
CN105810202A (en) * 2014-12-31 2016-07-27 展讯通信(上海)有限公司 Method and device for reducing echoes, and communication equipment
CN105872156A (en) * 2016-05-25 2016-08-17 腾讯科技(深圳)有限公司 Echo time delay tracking method and device

Also Published As

Publication number Publication date
CN108831497A (en) 2018-11-16

Similar Documents

Publication Publication Date Title
JP3963850B2 (en) Voice segment detection device
JP4681163B2 (en) Howling detection and suppression device, acoustic device including the same, and howling detection and suppression method
JP4307557B2 (en) Voice activity detector
CN102273222B (en) Method, system and apparatus for selectively switching between multiple microphones
CN108766456B (en) Voice processing method and device
US11349525B2 (en) Double talk detection method, double talk detection apparatus and echo cancellation system
CN102164210A (en) Systems and methods for echo cancellation and echo suppression
JP2004507141A (en) Voice enhancement system
CN110782914B (en) Signal processing method and device, terminal equipment and storage medium
CN1332911A (en) Methods and apparatus for providing echo suppression using frequency domain nonlinear processing
US8543390B2 (en) Multi-channel periodic signal enhancement system
US20110137646A1 (en) Noise Suppression Method and Apparatus
JP2012501152A (en) Method for determining updated filter coefficients of an adaptive filter adapted by an LMS algorithm with pre-whitening
CN111989934B (en) Echo cancellation device, echo cancellation method, signal processing chip, and electronic apparatus
CN108806713B (en) Method and device for detecting double-speech state
CN111524532B (en) Echo suppression method, device, equipment and storage medium
US20020103636A1 (en) Frequency-domain post-filtering voice-activity detector
KR20010043833A (en) Signal noise reduction by spectral subtraction using spectrum dependent exponential gain function averaging
CN112602150A (en) Noise estimation method, noise estimation device, voice processing chip and electronic equipment
CN108831497B (en) Echo compression method and device, storage medium and electronic equipment
US8964967B2 (en) Subband domain echo masking for improved duplexity of spectral domain echo suppressors
CN106941006A (en) Audio signal is separated into harmonic wave and transient signal component and audio signal bass boost
JP2000330597A (en) Noise suppression device
CN109215672B (en) Method, device and equipment for processing sound information
CN108831496B (en) echo suppression method and device, storage medium and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right
TR01 Transfer of patent right

Effective date of registration: 20230614

Address after: 210034 floor 8, building D11, Hongfeng Science Park, Nanjing Economic and Technological Development Zone, Jiangsu Province

Patentee after: New Technology Co.,Ltd.

Patentee after: VOLKSWAGEN (CHINA) INVESTMENT Co.,Ltd.

Address before: 100094 1001, 10th floor, office building a, 19 Zhongguancun Street, Haidian District, Beijing

Patentee before: MOBVOI INFORMATION TECHNOLOGY Co.,Ltd.