An improved time domain pitch detection algorithm for pathological voice
Jamaludin et al., 2012
- Document ID
- 14991646874381354464
- Author
- Jamaludin M
- Salleh S
- Swee T
- Ahmad K
- Ibrahim A
- Ismail K
- Publication year
- 2012
- Publication venue
- American Journal of Applied Sciences
Snippet
Problem statement: The present study proposes a new pitch detection algorithm which could potentially be used to detect pitch for disordered or pathological voices. One of the parameters required for dysphonia diagnosis is pitch and this prompted the development of …
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/66—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination for extracting parameters related to health condition
 
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/90—Pitch determination of speech signals
 
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
 
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
 
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
 
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/04—Training, enrolment or model building
 
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters
 
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
 
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/93—Discriminating between voiced and unvoiced parts of speech signals
 
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/04—Segmentation; Word boundary detection
 
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
 
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
 
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
 
Similar Documents
| Publication | Title |
|---|---|
| Gonzalez et al. | PEFAC - A pitch estimation algorithm robust to high levels of noise |
| Deshmukh et al. | Use of temporal information: Detection of periodicity, aperiodicity, and pitch in speech |
| Huang et al. | Pitch estimation in noisy speech using accumulated peak spectrum and sparse estimation technique |
| CN103503060B (en) | Speech syllable/vowel/phoneme boundary detection using auditory attention cues |
| Sukhostat et al. | A comparative analysis of pitch detection methods under the influence of different noise conditions |
| CN104221079B (en) | Improved Mel filter bank structure for speech analysis using spectral characteristics |
| CN103646649A (en) | High-efficiency voice detecting method |
| CN104900229A (en) | Method for extracting mixed characteristic parameters of voice signals |
| Mayer et al. | Impact of phase estimation on single-channel speech separation based on time-frequency masking |
| US8942977B2 (en) | System and method for speech recognition using pitch-synchronous spectral parameters |
| Pati et al. | Subsegmental, segmental and suprasegmental processing of linear prediction residual for speaker information |
| WO2020061346A1 (en) | Methods and apparatuses for tracking weak signal traces |
| Jamaludin et al. | An improved time domain pitch detection algorithm for pathological voice |
| US9899039B2 (en) | Method for determining alcohol consumption, and recording medium and terminal for carrying out same |
| Yarra et al. | A mode-shape classification technique for robust speech rate estimation and syllable nuclei detection |
| Li et al. | Instantaneous pitch estimation based on empirical wavelet transform |
| CN104036785A (en) | Speech signal processing method, speech signal processing device and speech signal analyzing system |
| Bouzid et al. | Voice source parameter measurement based on multi-scale analysis of electroglottographic signal |
| TWI299855B (en) | Detection method for voice activity endpoint |
| Deshpande et al. | A successive difference feature for detecting emotional valence from speech |
| Messaoud et al. | Using multi-scale product spectrum for single and multi-pitch estimation |
| Shome et al. | Non-negative frequency-weighted energy-based speech quality estimation for different modes and quality of speech |
| JP5203404B2 (en) | Tempo value detection device and tempo value detection method |
| Sharma et al. | Speech Diarization and ASR with GMM |
| Every et al. | Enhancement of harmonic content of speech based on a dynamic programming pitch tracking algorithm |