[go: up one dir, main page]

Garcia et al., 2006 - Google Patents

Formants Measurement for Esophageal Speech Using Wavelets with Band and Resolution Adjustment

Garcia et al., 2006

Document ID
14515524108143179841
Author
Garcia B
Ruiz I
Vicente J
Alonso A
Publication year
Publication venue
2006 IEEE International Symposium on Signal Processing and Information Technology

External Links

Snippet

Patients who have undergone a laryngectomy as a result of larynx cancer have extremely low intelligibility. This is due to the removal of their vocal fold, which forces them to use the air flowing through the esophagus: this is known as esophageal speech. Measurement of …
Continue reading at ieeexplore.ieee.org (other versions)

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
    • G10L25/90Pitch determination of speech signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters
    • G10L25/18Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0204Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/038Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
    • G10L25/93Discriminating between voiced and unvoiced parts of speech signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B5/00Detecting, measuring or recording for diagnostic purposes; Identification of persons
    • A61B5/72Signal processing specially adapted for physiological signals or for diagnostic purposes
    • A61B5/7235Details of waveform analysis
    • A61B5/7253Details of waveform analysis characterised by using transforms
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification

Similar Documents

Publication Publication Date Title
Gonzalez et al. PEFAC-A pitch estimation algorithm robust to high levels of noise
JP3277398B2 (en) Voiced sound discrimination method
US7124075B2 (en) Methods and apparatus for pitch determination
CN102054480B (en) A Monophonic Aliasing Speech Separation Method Based on Fractional Fourier Transform
Sukhostat et al. A comparative analysis of pitch detection methods under the influence of different noise conditions
Veprek et al. Analysis, enhancement and evaluation of five pitch determination techniques
Hadjitodorov et al. Laryngeal pathology detection by means of class-specific neural maps
Liu et al. Fundamental frequency estimation based on the joint time-frequency analysis of harmonic spectral structure
Wiśniewski et al. Automatic detection of disorders in a continuous speech with the hidden Markov models approach
Roy et al. Precise detection of speech endpoints dynamically: A wavelet convolution based approach
CN108172214A (en) A Method of Extracting Feature Parameters of Wavelet Speech Recognition Based on Mel Domain
Hsu et al. Voice activity detection based on frequency modulation of harmonics
Garcia et al. Formants Measurement for Esophageal Speech Using Wavelets with Band and Resolution Adjustment
Eshaghi et al. Voice activity detection based on using wavelet packet
Hamzenejadi et al. Extraction of speech pitch and formant frequencies using discrete wavelet transform
Nemer et al. Speech enhancement using fourth-order cumulants and optimum filters in the subband domain
Yusnita et al. Classification of speaker accent using hybrid DWT-LPC features and K-nearest neighbors in ethnically diverse Malaysian English
Qaisar et al. An event-driven approach for time-domain recognition of spoken English letters
Thirumuru et al. Improved vowel region detection from a continuous speech using post processing of vowel onset points and vowel end-points
Karan et al. Intelligent speech processing in the time-frequency domain
Thirumuru et al. Application of non-negative frequency-weighted energy operator for vowel region detection
Shome et al. Non-negative frequency-weighted energy-based speech quality estimation for different modes and quality of speech
de León et al. A complex wavelet based fundamental frequency estimator in singlechannel polyphonic signals
Nassar et al. Endpoints detection for noisy speech using a wavelet based algorithm
Every et al. Enhancement of harmonic content of speech based on a dynamic programming pitch tracking algorithm