Jamaludin et al., 2012 - Google Patents

An improved time domain pitch detection algorithm for pathological voice

Jamaludin et al., 2012

Document ID: 14991646874381354464
Author: Jamaludin M; Salleh S; Swee T; Ahmad K; Ibrahim A; Ismail K
Publication year: 2012
Publication venue: American Journal of Applied Sciences

External Links

Cited by

Snippet

Problem statement: The present study proposes a new pitch detection algorithm which could potentially be used to detect pitch for disordered or pathological voices. One of the parameters required for dysphonia diagnosis is pitch and this prompted the development of …

Continue reading at www.researchgate.net (PDF) (other versions)

230000001575 pathological 0 title abstract description 40

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/66—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination for extracting parameters related to health condition
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/90—Pitch determination of speech signals
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/04—Training, enrolment or model building
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/93—Discriminating between voiced and unvoiced parts of speech signals
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/04—Segmentation; Word boundary detection
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis

Similar Documents

Publication	Publication Date	Title
Gonzalez et al.	2014	PEFAC-A pitch estimation algorithm robust to high levels of noise
Deshmukh et al.	2005	Use of temporal information: Detection of periodicity, aperiodicity, and pitch in speech
Huang et al.	2012	Pitch estimation in noisy speech using accumulated peak spectrum and sparse estimation technique
CN103503060B (en)	2015-07-22	Speech syllable/vowel/phoneme boundary detection using auditory attention cues
Sukhostat et al.	2015	A comparative analysis of pitch detection methods under the influence of different noise conditions
CN104221079B (en)	2017-03-01	Carry out the improved Mel filter bank structure of phonetic analysiss using spectral characteristic
CN103646649A (en)	2014-03-19	High-efficiency voice detecting method
CN104900229A (en)	2015-09-09	Method for extracting mixed characteristic parameters of voice signals
Mayer et al.	2017	Impact of phase estimation on single-channel speech separation based on time-frequency masking
US8942977B2 (en)	2015-01-27	System and method for speech recognition using pitch-synchronous spectral parameters
Pati et al.	2011	Subsegmental, segmental and suprasegmental processing of linear prediction residual for speaker information
WO2020061346A1 (en)	2020-03-26	Methods and apparatuses for tracking weak signal traces
Jamaludin et al.	2012	An improved time domain pitch detection algorithm for pathological voice
US9899039B2 (en)	2018-02-20	Method for determining alcohol consumption, and recording medium and terminal for carrying out same
Yarra et al.	2016	A mode-shape classification technique for robust speech rate estimation and syllable nuclei detection
Li et al.	2014	Instantaneous pitch estimation based on empirical wavelet transform
CN104036785A (en)	2014-09-10	Speech signal processing method, speech signal processing device and speech signal analyzing system
Bouzid et al.	2009	Voice source parameter measurement based on multi-scale analysis of electroglottographic signal
TWI299855B (en)	2008-08-11	Detection method for voice activity endpoint
Deshpande et al.	2019	A successive difference feature for detecting emotional valence from speech
Messaoud et al.	2011	Using multi-scale product spectrum for single and multi-pitch estimation
Shome et al.	2022	Non-negative frequency-weighted energy-based speech quality estimation for different modes and quality of speech
JP5203404B2 (en)	2013-06-05	Tempo value detection device and tempo value detection method
Sharma et al.	2023	Speech Diarization and ASR with GMM
Every et al.	2006	Enhancement of harmonic content of speech based on a dynamic programming pitch tracking algorithm