Blanchette, 2014 - Google Patents
Short-time multichannel noise power spectral density estimators for acoustic signalsBlanchette, 2014
View PDF- Document ID
- 7601245830532948906
- Author
- Blanchette J
- Publication year
External Links
Snippet
The estimation of power spectral densities is a critical step in many speech enhancement algorithms. The demand for multi-channel speech enhancement systems is high with applications in teleconferencing, cellular phones, and hearing aids. The first objective of the …
- 230000003595 spectral 0 title abstract description 30
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01S—RADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
- G01S3/00—Direction-finders for determining the direction from which infrasonic, sonic, ultrasonic, or electromagnetic waves, or particle emission, not having a directional significance, are being received
- G01S3/80—Direction-finders for determining the direction from which infrasonic, sonic, ultrasonic, or electromagnetic waves, or particle emission, not having a directional significance, are being received using ultrasonic, sonic or infrasonic waves
- G01S3/802—Systems for determining direction or deviation from predetermined direction
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
- H04S7/30—Control circuits for electronic adaptation of the sound field
- H04S7/301—Automatic calibration of stereophonic sound system, e.g. with test microphone
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01V—GEOPHYSICS; GRAVITATIONAL MEASUREMENTS; DETECTING MASSES OR OBJECTS
- G01V1/00—Seismology; Seismic or acoustic prospecting or detecting
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Gannot et al. | A consolidated perspective on multimicrophone speech enhancement and source separation | |
| CN109839612B (en) | Sound source direction estimation method and device based on time-frequency masking and deep neural network | |
| Wang et al. | Robust speaker localization guided by deep learning-based time-frequency masking | |
| Kuklasiński et al. | Maximum likelihood PSD estimation for speech enhancement in reverberation and noise | |
| Yoshioka et al. | Generalization of multi-channel linear prediction methods for blind MIMO impulse response shortening | |
| Schwartz et al. | Multi-microphone speech dereverberation and noise reduction using relative early transfer functions | |
| Schwartz et al. | An expectation-maximization algorithm for multimicrophone speech dereverberation and noise reduction with coherence matrix estimation | |
| Alinaghi et al. | Joint mixing vector and binaural model based stereo source separation | |
| Koldovský et al. | Spatial source subtraction based on incomplete measurements of relative transfer function | |
| MX2014006499A (en) | Apparatus and method for microphone positioning based on a spatial power density. | |
| Braun et al. | A multichannel diffuse power estimator for dereverberation in the presence of multiple sources | |
| Srivastava et al. | Blind room parameter estimation using multiple multichannel speech recordings | |
| Dadvar et al. | Robust binaural speech separation in adverse conditions based on deep neural network with modified spatial features and training target | |
| Zheng et al. | Statistical analysis of the multichannel Wiener filter using a bivariate normal distribution for sample covariance matrices | |
| Zhang et al. | A speech separation algorithm based on the comb-filter effect | |
| Bologni et al. | Wideband relative transfer function (rtf) estimation exploiting frequency correlations | |
| KR101537653B1 (en) | Method and system for noise reduction based on spectral and temporal correlations | |
| Poschadel et al. | Room geometry estimation from higher-order ambisonics signals using convolutional recurrent neural networks | |
| Mustière et al. | Design of multichannel frequency domain statistical-based enhancement systems preserving spatial cues via spectral distances minimization | |
| Astapov et al. | Directional Clustering with Polyharmonic Phase Estimation for Enhanced Speaker Localization | |
| Blanchette | Short-time multichannel noise power spectral density estimators for acoustic signals | |
| Townsend | Enhancements to the generalized sidelobe canceller for audio beamforming in an immersive environment | |
| Laufer et al. | ML estimation and CRBs for reverberation, speech, and noise PSDs in rank-deficient noise field | |
| Xiong et al. | Joint doa estimation and dereverberation based on multi-channel linear prediction filtering and azimuth sparsity | |
| Weisman et al. | Spatial Covariance Matrix Estimation for Reverberant Speech with Application to Speech Enhancement. |