Formants Measurement for Esophageal Speech Using Wavelets with Band and Resolution Adjustment
Garcia et al., 2006
- Document ID
- 14515524108143179841
- Author
- Garcia B
- Ruiz I
- Vicente J
- Alonso A
- Publication year
- 2006
- Publication venue
- 2006 IEEE International Symposium on Signal Processing and Information Technology
Snippet
Patients who have undergone a laryngectomy as a result of laryngeal cancer speak with extremely low intelligibility. This is due to the removal of their vocal folds, which forces them to use the air flowing through the esophagus: this is known as esophageal speech. Measurement of …
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/90—Pitch determination of speech signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters
- G10L25/18—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0204—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/038—Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/93—Discriminating between voiced and unvoiced parts of speech signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61B—DIAGNOSIS; SURGERY; IDENTIFICATION
- A61B5/00—Detecting, measuring or recording for diagnostic purposes; Identification of persons
- A61B5/72—Signal processing specially adapted for physiological signals or for diagnostic purposes
- A61B5/7235—Details of waveform analysis
- A61B5/7253—Details of waveform analysis characterised by using transforms
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| Gonzalez et al. | PEFAC - A pitch estimation algorithm robust to high levels of noise | |
| JP3277398B2 (en) | Voiced sound discrimination method | |
| US7124075B2 (en) | Methods and apparatus for pitch determination | |
| CN102054480B (en) | A Monophonic Aliasing Speech Separation Method Based on Fractional Fourier Transform | |
| Sukhostat et al. | A comparative analysis of pitch detection methods under the influence of different noise conditions | |
| Veprek et al. | Analysis, enhancement and evaluation of five pitch determination techniques | |
| Hadjitodorov et al. | Laryngeal pathology detection by means of class-specific neural maps | |
| Liu et al. | Fundamental frequency estimation based on the joint time-frequency analysis of harmonic spectral structure | |
| Wiśniewski et al. | Automatic detection of disorders in a continuous speech with the hidden Markov models approach | |
| Roy et al. | Precise detection of speech endpoints dynamically: A wavelet convolution based approach | |
| CN108172214A (en) | A Method of Extracting Feature Parameters of Wavelet Speech Recognition Based on Mel Domain | |
| Hsu et al. | Voice activity detection based on frequency modulation of harmonics | |
| Garcia et al. | Formants Measurement for Esophageal Speech Using Wavelets with Band and Resolution Adjustment | |
| Eshaghi et al. | Voice activity detection based on using wavelet packet | |
| Hamzenejadi et al. | Extraction of speech pitch and formant frequencies using discrete wavelet transform | |
| Nemer et al. | Speech enhancement using fourth-order cumulants and optimum filters in the subband domain | |
| Yusnita et al. | Classification of speaker accent using hybrid DWT-LPC features and K-nearest neighbors in ethnically diverse Malaysian English | |
| Qaisar et al. | An event-driven approach for time-domain recognition of spoken English letters | |
| Thirumuru et al. | Improved vowel region detection from a continuous speech using post processing of vowel onset points and vowel end-points | |
| Karan et al. | Intelligent speech processing in the time-frequency domain | |
| Thirumuru et al. | Application of non-negative frequency-weighted energy operator for vowel region detection | |
| Shome et al. | Non-negative frequency-weighted energy-based speech quality estimation for different modes and quality of speech | |
| de León et al. | A complex wavelet based fundamental frequency estimator in single-channel polyphonic signals | |
| Nassar et al. | Endpoints detection for noisy speech using a wavelet based algorithm | |
| Every et al. | Enhancement of harmonic content of speech based on a dynamic programming pitch tracking algorithm |