Oria et al., 2005 - Google Patents

Automatic generation of speech interfaces for Web-based applications

Oria et al., 2005

Document ID: 5817993667373166155
Author: Oria D; Vetek A
Publication year: 2005
Publication venue: IEEE Workshop on Automatic Speech Recognition and Understanding, 2005.

External Links

Cited by

Snippet

This paper describes a framework for automatically providing speech access to Web-based corporate applications that are accessible via SMS, WAP and Web browsers and are authored using a subset of HTML. A small subset of voice-related extensions was defined …

Continue reading at ieeexplore.ieee.org (other versions)

238000000034 method 0 abstract description 6

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1822—Parsing for meaning understanding
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
- G10L15/265—Speech recognisers specially adapted for particular applications
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/027—Concept to speech synthesisers; Generation of natural phrases from machine-based concepts
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/44—Arrangements for executing specific programmes
- G06F9/4443—Execution mechanisms for user interfaces

Similar Documents

Publication	Publication Date	Title
RU2352979C2 (en)	2009-04-20	Synchronous comprehension of semantic objects for highly active interface
US9083798B2 (en)	2015-07-14	Enabling voice selection of user preferences
RU2349969C2 (en)	2009-03-20	Synchronous understanding of semantic objects realised by means of tags of speech application
US7552055B2 (en)	2009-06-23	Dialog component re-use in recognition systems
US7962344B2 (en)	2011-06-14	Depicting a speech user interface via graphical elements
Reddy et al.	2013	Speech to text conversion using android platform
US8160883B2 (en)	2012-04-17	Focus tracking in dialogs
US8290775B2 (en)	2012-10-16	Pronunciation correction of text-to-speech systems between different spoken languages
US8224650B2 (en)	2012-07-17	Web server controls for web enabled recognition and/or audible prompting
EP1215656B1 (en)	2005-06-15	Idiom handling in voice service systems
US6832196B2 (en)	2004-12-14	Speech driven data selection in a voice-enabled program
US7711570B2 (en)	2010-05-04	Application abstraction with dialog purpose
US7739117B2 (en)	2010-06-15	Method and system for voice-enabled autofill
US7546382B2 (en)	2009-06-09	Methods and systems for authoring of mixed-initiative multi-modal interactions and related browsing mechanisms
US20040230637A1 (en)	2004-11-18	Application controls for speech enabled recognition
US20040230434A1 (en)	2004-11-18	Web server controls for web enabled recognition and/or audible prompting for call controls
KR20080040644A (en)	2008-05-08	Voice Application Instrumentation and Logging
Di Fabbrizio et al.	2002	AT&t help desk.
JP4809358B2 (en)	2011-11-09	Method and system for improving the fidelity of a dialogue system
Oria et al.	2005	Automatic generation of speech interfaces for Web-based applications
Schnelle et al.	2005	Audio Navigation Patterns.
Paternò et al.	2010	Deriving Vocal Interfaces from Logical Descriptions in Multi-device Authoring Environments
Sharman	1993	Speech interfaces for computer systems: Problems and potential
Turunen et al.	2004	Speech application design and development
Dobrišek et al.	2002	A voice-driven Web browser for blind people