Blanchette, 2014 - Google Patents

Short-time multichannel noise power spectral density estimators for acoustic signals

Document ID
7601245830532948906
Author
Blanchette J
Publication year
2014

Snippet

The estimation of power spectral densities is a critical step in many speech enhancement algorithms. The demand for multi-channel speech enhancement systems is high with applications in teleconferencing, cellular phones, and hearing aids. The first objective of the …
Full text available at ruor.uottawa.ca (PDF)

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • G10L21/0216 Noise filtering characterised by the method used for estimating noise
    • G10L2021/02161 Number of inputs available containing the signal or the noise to be suppressed
    • G PHYSICS
    • G01 MEASURING; TESTING
    • G01S RADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
    • G01S3/00 Direction-finders for determining the direction from which infrasonic, sonic, ultrasonic, or electromagnetic waves, or particle emission, not having a directional significance, are being received
    • G01S3/80 Direction-finders for determining the direction from which infrasonic, sonic, ultrasonic, or electromagnetic waves, or particle emission, not having a directional significance, are being received using ultrasonic, sonic or infrasonic waves
    • G01S3/802 Systems for determining direction or deviation from predetermined direction
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S7/00 Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30 Control circuits for electronic adaptation of the sound field
    • H04S7/301 Automatic calibration of stereophonic sound system, e.g. with test microphone
    • G PHYSICS
    • G01 MEASURING; TESTING
    • G01V GEOPHYSICS; GRAVITATIONAL MEASUREMENTS; DETECTING MASSES OR OBJECTS
    • G01V1/00 Seismology; Seismic or acoustic prospecting or detecting

Similar Documents

Publication | Title
Gannot et al. | A consolidated perspective on multimicrophone speech enhancement and source separation
CN109839612B (en) | Sound source direction estimation method and device based on time-frequency masking and deep neural network
Wang et al. | Robust speaker localization guided by deep learning-based time-frequency masking
Kuklasiński et al. | Maximum likelihood PSD estimation for speech enhancement in reverberation and noise
Yoshioka et al. | Generalization of multi-channel linear prediction methods for blind MIMO impulse response shortening
Schwartz et al. | Multi-microphone speech dereverberation and noise reduction using relative early transfer functions
Schwartz et al. | An expectation-maximization algorithm for multimicrophone speech dereverberation and noise reduction with coherence matrix estimation
Alinaghi et al. | Joint mixing vector and binaural model based stereo source separation
Koldovský et al. | Spatial source subtraction based on incomplete measurements of relative transfer function
MX2014006499A (en) | Apparatus and method for microphone positioning based on a spatial power density
Braun et al. | A multichannel diffuse power estimator for dereverberation in the presence of multiple sources
Srivastava et al. | Blind room parameter estimation using multiple multichannel speech recordings
Dadvar et al. | Robust binaural speech separation in adverse conditions based on deep neural network with modified spatial features and training target
Zheng et al. | Statistical analysis of the multichannel Wiener filter using a bivariate normal distribution for sample covariance matrices
Zhang et al. | A speech separation algorithm based on the comb-filter effect
Bologni et al. | Wideband relative transfer function (RTF) estimation exploiting frequency correlations
KR101537653B1 (en) | Method and system for noise reduction based on spectral and temporal correlations
Poschadel et al. | Room geometry estimation from higher-order ambisonics signals using convolutional recurrent neural networks
Mustière et al. | Design of multichannel frequency domain statistical-based enhancement systems preserving spatial cues via spectral distances minimization
Astapov et al. | Directional Clustering with Polyharmonic Phase Estimation for Enhanced Speaker Localization
Blanchette | Short-time multichannel noise power spectral density estimators for acoustic signals
Townsend | Enhancements to the generalized sidelobe canceller for audio beamforming in an immersive environment
Laufer et al. | ML estimation and CRBs for reverberation, speech, and noise PSDs in rank-deficient noise field
Xiong et al. | Joint DOA estimation and dereverberation based on multi-channel linear prediction filtering and azimuth sparsity
Weisman et al. | Spatial Covariance Matrix Estimation for Reverberant Speech with Application to Speech Enhancement