Koldovský et al., 2015 - Google Patents

Spatial source subtraction based on incomplete measurements of relative transfer function

Koldovský et al., 2015

Document ID: 17137663650368946420
Author: Koldovský Z; Málek J; Gannot S
Publication year: 2015
Publication venue: IEEE/ACM Transactions on Audio, Speech, and Language Processing

External Links

Cited by

Snippet

Relative impulse responses between microphones are usually long and dense due to the reverberant acoustic environment. Estimating them from short and noisy recordings poses a long-standing challenge of audio signal processing. In this paper, we apply a novel strategy …

Continue reading at arxiv.org (PDF) (other versions)

238000005259 measurement 0 title abstract description 13

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G10L2021/02166—Microphone arrays; Beamforming
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0272—Voice signal separating
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/04—Training, enrolment or model building
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers, loudspeakers or microphones
- H04R3/005—Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones

Similar Documents

Publication	Publication Date	Title
Koldovský et al.	2015	Spatial source subtraction based on incomplete measurements of relative transfer function
Gannot et al.	2017	A consolidated perspective on multimicrophone speech enhancement and source separation
Higuchi et al.	2017	Online MVDR beamformer based on complex Gaussian mixture model with spatial prior for noise robust ASR
Braun et al.	2018	Evaluation and comparison of late reverberation power spectral density estimators
Kumatani et al.	2012	Microphone array processing for distant speech recognition: From close-talking microphones to far-field sensors
Taseska et al.	2014	Informed spatial filtering for sound extraction using distributed microphone arrays
Schwartz et al.	2016	An expectation-maximization algorithm for multimicrophone speech dereverberation and noise reduction with coherence matrix estimation
Schmid et al.	2014	Variational Bayesian inference for multichannel dereverberation and noise reduction
Wang et al.	2015	Noise power spectral density estimation using MaxNSR blocking matrix
Kumatani et al.	2009	Beamforming with a maximum negentropy criterion
Peled et al.	2013	Linearly-constrained minimum-variance method for spherical microphone arrays based on plane-wave decomposition of the sound field
Li et al.	2019	Multichannel speech separation and enhancement using the convolutive transfer function
Schwartz et al.	2017	Two model-based EM algorithms for blind source separation in noisy environments
Li et al.	2019	Multichannel online dereverberation based on spectral magnitude inverse filtering
Habets et al.	2018	Dereverberation
Martín-Doñas et al.	2017	Dual-channel DNN-based speech enhancement for smartphones
Hëb-Umbach et al.	2025	Microphone Array Signal Processing and Deep Learning for Speech Enhancement: Combining model-based and data-driven approaches to parameter estimation and filtering [Special Issue On Model-Based and Data-Driven Audio Signal Processing]
Schwartz et al.	2015	Nested generalized sidelobe canceller for joint dereverberation and noise reduction
Aroudi et al.	2020	Cognitive-driven convolutional beamforming using EEG-based auditory attention decoding
Li et al.	2022	Low complex accurate multi-source RTF estimation
Yadav et al.	2023	Joint dereverberation and beamforming with blind estimation of the shape parameter of the desired source prior
Taghia et al.	2013	Dual-channel noise reduction based on a mixture of circular-symmetric complex Gaussians on unit hypersphere
Pfeifenberger et al.	2014	Blind source extraction based on a direction-dependent a-priori SNR.
Ali et al.	2018	Completing the RTF vector for an MVDR beamformer as applied to a local microphone array and an external microphone
Malek et al.	2017	Speaker extraction using LCMV beamformer with DNN-based SPP and RTF identification scheme