Habets et al., 2018 - Google Patents
DereverberationHabets et al., 2018
- Document ID
- 11867613678460066609
- Author
- Habets E
- Naylor P
- Publication year
- Publication venue
- Audio Source Separation and Speech Enhancement
External Links
Snippet
The degrading effects of reverberation on speech are a major concern in applications including speech telecommunications and automatic speech recognition (ASR). The distortion caused by reverberation can be modeled as a convolutive distortion or an additive …
- 239000000654 additive 0 abstract description 2
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G10L2021/02166—Microphone arrays; Beamforming
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0272—Voice signal separating
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. hidden Markov models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding, i.e. using interchannel correlation to reduce redundancies, e.g. joint-stereo, intensity-coding, matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/04—Training, enrolment or model building
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Nakatani et al. | Speech dereverberation based on variance-normalized delayed linear prediction | |
| Doclo et al. | GSVD-based optimal filtering for single and multimicrophone speech enhancement | |
| Togami et al. | Optimized speech dereverberation from probabilistic perspective for time varying acoustic transfer function | |
| EP3685378B1 (en) | Signal processor and method for providing a processed audio signal reducing noise and reverberation | |
| Braun et al. | Online dereverberation for dynamic scenarios using a Kalman filter with an autoregressive model | |
| Schmid et al. | Variational Bayesian inference for multichannel dereverberation and noise reduction | |
| Doire et al. | Single-channel online enhancement of speech corrupted by reverberation and noise | |
| Krueger et al. | Model-based feature enhancement for reverberant speech recognition | |
| Nakatani et al. | Speech dereverberation based on maximum-likelihood estimation with time-varying Gaussian source model | |
| Koldovský et al. | Spatial source subtraction based on incomplete measurements of relative transfer function | |
| Schwartz et al. | An expectation-maximization algorithm for multimicrophone speech dereverberation and noise reduction with coherence matrix estimation | |
| Habets et al. | Dereverberation | |
| Wang et al. | Noise power spectral density estimation using MaxNSR blocking matrix | |
| Mohammadiha et al. | Speech dereverberation using non-negative convolutive transfer function and spectro-temporal modeling | |
| Nakatani et al. | Dominance based integration of spatial and spectral features for speech enhancement | |
| Habets | Speech dereverberation using statistical reverberation models | |
| Nesta et al. | A flexible spatial blind source extraction framework for robust speech recognition in noisy environments | |
| Thuene et al. | Maximum-likelihood approach to adaptive multichannel-Wiener postfiltering for wind-noise reduction | |
| Li et al. | Multichannel online dereverberation based on spectral magnitude inverse filtering | |
| Zhao et al. | Robust speech recognition using beamforming with adaptive microphone gains and multichannel noise reduction | |
| Song et al. | An integrated multi-channel approach for joint noise reduction and dereverberation | |
| Li et al. | Multichannel identification and nonnegative equalization for dereverberation and noise reduction based on convolutive transfer function | |
| Sehr et al. | Towards robust distant-talking automatic speech recognition in reverberant environments | |
| Pfeifenberger et al. | Eigenvector-Based Speech Mask Estimation Using Logistic Regression. | |
| Nakatani et al. | Simultaneous denoising, dereverberation, and source separation using a unified convolutional beamformer |