Koldovský et al., 2015 - Google Patents

Spatial source subtraction based on incomplete measurements of relative transfer function

Document ID
17137663650368946420
Author
Koldovský Z
Málek J
Gannot S
Publication year
2015
Publication venue
IEEE/ACM Transactions on Audio, Speech, and Language Processing

Snippet

Relative impulse responses between microphones are usually long and dense due to the reverberant acoustic environment. Estimating them from short and noisy recordings poses a long-standing challenge of audio signal processing. In this paper, we apply a novel strategy …

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • G10L21/0216 Noise filtering characterised by the method used for estimating noise
    • G10L2021/02161 Number of inputs available containing the signal or the noise to be suppressed
    • G10L2021/02166 Microphone arrays; Beamforming
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0272 Voice signal separating
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00 Speaker identification or verification
    • G10L17/04 Training, enrolment or model building
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/20 Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04R LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00 Circuits for transducers, loudspeakers or microphones
    • H04R3/005 Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones

Similar Documents

Publication Publication Date Title
Koldovský et al. Spatial source subtraction based on incomplete measurements of relative transfer function
Gannot et al. A consolidated perspective on multimicrophone speech enhancement and source separation
Higuchi et al. Online MVDR beamformer based on complex Gaussian mixture model with spatial prior for noise robust ASR
Braun et al. Evaluation and comparison of late reverberation power spectral density estimators
Kumatani et al. Microphone array processing for distant speech recognition: From close-talking microphones to far-field sensors
Taseska et al. Informed spatial filtering for sound extraction using distributed microphone arrays
Schwartz et al. An expectation-maximization algorithm for multimicrophone speech dereverberation and noise reduction with coherence matrix estimation
Schmid et al. Variational Bayesian inference for multichannel dereverberation and noise reduction
Wang et al. Noise power spectral density estimation using MaxNSR blocking matrix
Kumatani et al. Beamforming with a maximum negentropy criterion
Peled et al. Linearly-constrained minimum-variance method for spherical microphone arrays based on plane-wave decomposition of the sound field
Li et al. Multichannel speech separation and enhancement using the convolutive transfer function
Schwartz et al. Two model-based EM algorithms for blind source separation in noisy environments
Li et al. Multichannel online dereverberation based on spectral magnitude inverse filtering
Habets et al. Dereverberation
Martín-Doñas et al. Dual-channel DNN-based speech enhancement for smartphones
Haeb-Umbach et al. Microphone Array Signal Processing and Deep Learning for Speech Enhancement: Combining model-based and data-driven approaches to parameter estimation and filtering [Special Issue On Model-Based and Data-Driven Audio Signal Processing]
Schwartz et al. Nested generalized sidelobe canceller for joint dereverberation and noise reduction
Aroudi et al. Cognitive-driven convolutional beamforming using EEG-based auditory attention decoding
Li et al. Low complex accurate multi-source RTF estimation
Yadav et al. Joint dereverberation and beamforming with blind estimation of the shape parameter of the desired source prior
Taghia et al. Dual-channel noise reduction based on a mixture of circular-symmetric complex Gaussians on unit hypersphere
Pfeifenberger et al. Blind source extraction based on a direction-dependent a-priori SNR
Ali et al. Completing the RTF vector for an MVDR beamformer as applied to a local microphone array and an external microphone
Malek et al. Speaker extraction using LCMV beamformer with DNN-based SPP and RTF identification scheme