Oria et al., 2005 - Google Patents
Automatic generation of speech interfaces for Web-based applicationsOria et al., 2005
- Document ID
- 5817993667373166155
- Author
- Oria D
- Vetek A
- Publication year
- Publication venue
- IEEE Workshop on Automatic Speech Recognition and Understanding, 2005.
External Links
Snippet
This paper describes a framework for automatically providing speech access to Web-based corporate applications that are accessible via SMS, WAP and Web browsers and are authored using a subset of HTML. A small subset of voice-related extensions was defined …
- 238000000034 method 0 abstract description 6
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1822—Parsing for meaning understanding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
- G10L15/265—Speech recognisers specially adapted for particular applications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/027—Concept to speech synthesisers; Generation of natural phrases from machine-based concepts
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/44—Arrangements for executing specific programmes
- G06F9/4443—Execution mechanisms for user interfaces
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| RU2352979C2 (en) | Synchronous comprehension of semantic objects for highly active interface | |
| US9083798B2 (en) | Enabling voice selection of user preferences | |
| RU2349969C2 (en) | Synchronous understanding of semantic objects realised by means of tags of speech application | |
| US7552055B2 (en) | Dialog component re-use in recognition systems | |
| US7962344B2 (en) | Depicting a speech user interface via graphical elements | |
| Reddy et al. | Speech to text conversion using android platform | |
| US8160883B2 (en) | Focus tracking in dialogs | |
| US8290775B2 (en) | Pronunciation correction of text-to-speech systems between different spoken languages | |
| US8224650B2 (en) | Web server controls for web enabled recognition and/or audible prompting | |
| EP1215656B1 (en) | Idiom handling in voice service systems | |
| US6832196B2 (en) | Speech driven data selection in a voice-enabled program | |
| US7711570B2 (en) | Application abstraction with dialog purpose | |
| US7739117B2 (en) | Method and system for voice-enabled autofill | |
| US7546382B2 (en) | Methods and systems for authoring of mixed-initiative multi-modal interactions and related browsing mechanisms | |
| US20040230637A1 (en) | Application controls for speech enabled recognition | |
| US20040230434A1 (en) | Web server controls for web enabled recognition and/or audible prompting for call controls | |
| KR20080040644A (en) | Voice Application Instrumentation and Logging | |
| Di Fabbrizio et al. | AT&t help desk. | |
| JP4809358B2 (en) | Method and system for improving the fidelity of a dialogue system | |
| Oria et al. | Automatic generation of speech interfaces for Web-based applications | |
| Schnelle et al. | Audio Navigation Patterns. | |
| Paternò et al. | Deriving Vocal Interfaces from Logical Descriptions in Multi-device Authoring Environments | |
| Sharman | Speech interfaces for computer systems: Problems and potential | |
| Turunen et al. | Speech application design and development | |
| Dobrišek et al. | A voice-driven Web browser for blind people |