Koldovský et al., 2015 - Google Patents
Spatial source subtraction based on incomplete measurements of relative transfer function
- Document ID
- 17137663650368946420
- Author
- Koldovský Z
- Málek J
- Gannot S
- Publication year
- 2015
- Publication venue
- IEEE/ACM Transactions on Audio, Speech, and Language Processing
Snippet
Relative impulse responses between microphones are usually long and dense due to the reverberant acoustic environment. Estimating them from short and noisy recordings poses a long-standing challenge of audio signal processing. In this paper, we apply a novel strategy …
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G10L2021/02166—Microphone arrays; Beamforming
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0272—Voice signal separating
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/04—Training, enrolment or model building
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers, loudspeakers or microphones
- H04R3/005—Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| Koldovský et al. | Spatial source subtraction based on incomplete measurements of relative transfer function | |
| Gannot et al. | A consolidated perspective on multimicrophone speech enhancement and source separation | |
| Higuchi et al. | Online MVDR beamformer based on complex Gaussian mixture model with spatial prior for noise robust ASR | |
| Braun et al. | Evaluation and comparison of late reverberation power spectral density estimators | |
| Kumatani et al. | Microphone array processing for distant speech recognition: From close-talking microphones to far-field sensors | |
| Taseska et al. | Informed spatial filtering for sound extraction using distributed microphone arrays | |
| Schwartz et al. | An expectation-maximization algorithm for multimicrophone speech dereverberation and noise reduction with coherence matrix estimation | |
| Schmid et al. | Variational Bayesian inference for multichannel dereverberation and noise reduction | |
| Wang et al. | Noise power spectral density estimation using MaxNSR blocking matrix | |
| Kumatani et al. | Beamforming with a maximum negentropy criterion | |
| Peled et al. | Linearly-constrained minimum-variance method for spherical microphone arrays based on plane-wave decomposition of the sound field | |
| Li et al. | Multichannel speech separation and enhancement using the convolutive transfer function | |
| Schwartz et al. | Two model-based EM algorithms for blind source separation in noisy environments | |
| Li et al. | Multichannel online dereverberation based on spectral magnitude inverse filtering | |
| Habets et al. | Dereverberation | |
| Martín-Doñas et al. | Dual-channel DNN-based speech enhancement for smartphones | |
| Haeb-Umbach et al. | Microphone Array Signal Processing and Deep Learning for Speech Enhancement: Combining model-based and data-driven approaches to parameter estimation and filtering [Special Issue On Model-Based and Data-Driven Audio Signal Processing] | |
| Schwartz et al. | Nested generalized sidelobe canceller for joint dereverberation and noise reduction | |
| Aroudi et al. | Cognitive-driven convolutional beamforming using EEG-based auditory attention decoding | |
| Li et al. | Low complex accurate multi-source RTF estimation | |
| Yadav et al. | Joint dereverberation and beamforming with blind estimation of the shape parameter of the desired source prior | |
| Taghia et al. | Dual-channel noise reduction based on a mixture of circular-symmetric complex Gaussians on unit hypersphere | |
| Pfeifenberger et al. | Blind source extraction based on a direction-dependent a-priori SNR. | |
| Ali et al. | Completing the RTF vector for an MVDR beamformer as applied to a local microphone array and an external microphone | |
| Malek et al. | Speaker extraction using LCMV beamformer with DNN-based SPP and RTF identification scheme |