Muhammad, 2008 - Google Patents
Noise robust pitch detection based on extended AMDFMuhammad, 2008
View PDF- Document ID
- 12767264921036107240
- Author
- Muhammad G
- Publication year
- Publication venue
- 2008 IEEE International Symposium on Signal Processing and Information Technology
External Links
Snippet
This paper introduces a new extended average magnitude difference function (EAMDF) for noise robust pitch detection. EAMDF involves in sufficient number of averaging for all lag values compared to the original AMDF, and thereby eliminates the falling tendency of the …
- 238000001514 detection method 0 title abstract description 22
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/90—Pitch determination of speech signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L21/0232—Processing in the frequency domain
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters
- G10L25/18—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters
- G10L25/09—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters the extracted parameters being zero crossing rates
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/93—Discriminating between voiced and unvoiced parts of speech signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US10510363B2 (en) | Pitch detection algorithm based on PWVT | |
| Hui et al. | A pitch detection algorithm based on AMDF and ACF | |
| JPH01118900A (en) | Noise suppressor | |
| Muhammad | Extended average magnitude difference function based pitch detection | |
| Muhammad | Noise robust pitch detection based on extended AMDF | |
| EP1944754B1 (en) | Speech fundamental frequency estimator and method for estimating a speech fundamental frequency | |
| US8108164B2 (en) | Determination of a common fundamental frequency of harmonic signals | |
| Li et al. | A pitch estimation algorithm for speech in complex noise environments based on the radon transform | |
| CN118337917B (en) | Call center intelligent customer service interaction method based on voice processing | |
| KR20050080649A (en) | Voiced sound and unvoiced sound detection method and apparatus | |
| JP4125322B2 (en) | Basic frequency extraction device, method thereof, program thereof, and recording medium recording the program | |
| Sundaram et al. | Usable Speech Detection Using Linear Predictive Analysis–A Model-Based Approach | |
| Hasan et al. | MMSE estimator for speech enhancement considering the constructive and destructive interference of noise | |
| Chang et al. | Pitch estimation of speech signal based on adaptive lattice notch filter | |
| Zhao et al. | A New Pitch Estimation Method Based on AMDF. | |
| Roy et al. | Harmonic modification and data adaptive filtering based approach to robust pitch estimation | |
| Molla et al. | Pitch estimation of noisy speech signals using empirical mode decomposition. | |
| Zeremdini et al. | Multi-pitch estimation based on multi-scale product analysis, improved comb filter and dynamic programming | |
| JP3841705B2 (en) | Occupancy degree extraction device and fundamental frequency extraction device, method thereof, program thereof, and recording medium recording the program | |
| Liu et al. | Real-time pitch tracking based on combined SMDSF. | |
| Zeremdini et al. | Contribution to the Multipitch Estimation by Multi-scale Product Analysis | |
| Hasan et al. | A fundamental frequency extraction method based on windowless and normalized autocorrelation functions | |
| Roy et al. | Dominant harmonic modification with data adaptive filter based algorithm for robust pitch estimation | |
| Li et al. | Pitch detection method for noisy speech signals based on pre-filter and weighted wavelet coefficients | |
| Selvi et al. | Speech Enhancement using Adaptive Filtering with Different Window Functions and Overlapping Sizes |