Hollier et al., 1994 - Google Patents
Error activity and error entropy as a measure of psychoacoustic significance in the perceptual domain
- Document ID
- 8532668084568857346
- Author
- Hollier M
- Hawksford M
- Guard D
- Publication year
- 1994
- Publication venue
- IEE Proceedings-Vision, Image and Signal Processing
Snippet
Several models have been described in the literature which seek to represent audio stimuli in the perceptual domain to best predict the audibility of errors and distortions. By modelling the principal nonlinear processes of human hearing it is possible to calculate a perceptual …
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/04—Training, enrolment or model building
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/27—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the analysis technique
- G10L25/30—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the analysis technique using neural networks
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/93—Discriminating between voiced and unvoiced parts of speech signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
Similar Documents
| Publication | Title |
|---|---|
| Hollier et al. | Error activity and error entropy as a measure of psychoacoustic significance in the perceptual domain |
| EP0776567B1 (en) | Analysis of audio quality |
| US5794188A (en) | Speech signal distortion measurement which varies as a function of the distribution of measured distortion over time and frequency |
| US6446038B1 (en) | Method and system for objectively evaluating speech |
| Falk et al. | Single-ended speech quality measurement using machine learning methods |
| EP0840975B1 (en) | Assessment of signal quality |
| EP1091689B1 (en) | Apparatus and method for estimating the pulmonary function, by analysing speech parameters |
| EP0722164A1 (en) | Method and apparatus for characterizing an input signal |
| US6609092B1 (en) | Method and apparatus for estimating subjective audio signal quality from objective distortion measures |
| CN101411171A (en) | Non-intrusive signal quality evaluation |
| JP2009500952A (en) | Voice quality evaluation method and voice quality evaluation system |
| Yu et al. | Metricnet: Towards improved modeling for non-intrusive speech quality assessment |
| US5799133A (en) | Training process |
| Mumtaz et al. | Nonintrusive perceptual audio quality assessment for user-generated content using deep learning |
| Picovici et al. | Output-based objective speech quality measure using self-organizing map |
| Narwaria et al. | Non-intrusive speech quality assessment with support vector regression |
| Spang et al. | Personalized task load prediction in speech communication |
| Dimolitsas | Subjective assessment methods for the measurement of digital speech coder quality |
| Hinterleitner et al. | Comparison of Approaches for Instrumentally Predicting the Quality of Text-to-Speech Systems: Data from Blizzard Challenges 2008 and 2009. |
| Shu et al. | RNN based noise annoyance measurement for urban noise evaluation |
| Kubichek et al. | Speech quality assessment using expert pattern recognition |
| Hauenstein | Application of Meddis' inner hair-cell model to the prediction of subjective speech quality |
| Huang et al. | Exploration of audio quality assessment and anomaly localisation using attention models |
| Hollier et al. | Algorithms for assessing the subjectivity of perceptually weighted audible errors |
| Aburas et al. | Perceptual evaluation of speech quality-implementation using a non-traditional symbian operating system |