Method for the selection of sentence materials for efficient measurement of the speech reception threshold
Versfeld et al., 2000
- Document ID
- 1657067768343986938
- Author
- Versfeld N
- Daalder L
- Festen J
- Houtgast T
- Publication year
- 2000
- Publication venue
- The Journal of the Acoustical Society of America
Snippet
A method is described to select sentence materials for efficient measurement of the speech reception threshold (SRT). The first part of the paper addresses the creation of the sentence materials, the recording procedure, and a listening experiment to evaluate the new speech …
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/003—Changing voice quality, e.g. pitch or formants
- G10L21/007—Changing voice quality, e.g. pitch or formants characterised by the process used
- G10L21/013—Adapting to target pitch
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0202—Applications
- G10L21/0205—Enhancement of intelligibility of clean or coded speech
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/06—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
- G10L21/10—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids transforming into visible information
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/66—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination for extracting parameters related to health condition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/06—Elementary speech units used in speech synthesisers; Concatenation rules
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/033—Voice editing, e.g. manipulating the voice of the synthesiser
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| Versfeld et al. | Method for the selection of sentence materials for efficient measurement of the speech reception threshold | |
| Chen et al. | Large-scale training to increase speech intelligibility for hearing-impaired listeners in novel noises | |
| Peelle et al. | Dissociations in perceptual learning revealed by adult age differences in adaptation to time-compressed speech. | |
| Besser et al. | Speech-in-speech listening on the LiSN-S test by older adults with good audiograms depends on cognition and hearing acuity at high frequencies | |
| Rhebergen et al. | Release from informational masking by time reversal of native and non-native interfering speech | |
| Rhebergen et al. | Extended speech intelligibility index for the prediction of the speech reception threshold in fluctuating noise | |
| Khouw et al. | Perceptual correlates of Cantonese tones | |
| Pittman et al. | Recognition of speech produced in noise | |
| Oxenham et al. | Masking release for low-and high-pass-filtered speech in the presence of noise and single-talker interference | |
| Gustafson et al. | Listening effort and perceived clarity for normal-hearing children with the use of digital noise reduction | |
| Gaudrain et al. | Factors limiting vocal-tract length discrimination in cochlear implant simulations | |
| Humes et al. | Auditory speech recognition and visual text recognition in younger and older adults: Similarities and differences between modalities and the effects of presentation rate | |
| Humes et al. | Development and efficacy of a frequent-word auditory training protocol for older adults with impaired hearing | |
| Montgomery et al. | Evaluation of two speech enhancement techniques to improve intelligibility for hearing-impaired adults | |
| Jin et al. | Speech perception in gated noise: The effects of temporal resolution | |
| Agus et al. | Informational masking in young and elderly listeners for speech masked by simultaneous speech and noise | |
| Calandruccio et al. | The effect of target/masker fundamental frequency contour similarity on masked-speech recognition | |
| Watkins et al. | Perceptual compensation for speaker differences and for spectral‐envelope distortion | |
| Monson et al. | Detection of high-frequency energy level changes in speech and singing | |
| Phatak et al. | Consonant recognition loss in hearing impaired listeners | |
| Iverson et al. | Vowel recognition via cochlear implants and noise vocoders: Effects of formant movement and duration | |
| Marriage et al. | Effects of three amplification strategies on speech perception by children with severe and profound hearing loss | |
| Wong et al. | Development of the Cantonese speech intelligibility index | |
| Kleczkowski et al. | Lombard effect in Polish speech and its comparison in English speech | |
| Wasiuk et al. | Predicting speech-in-speech recognition: Short-term audibility, talker sex, and listener factors |