Horvitz et al., 2000 - Google Patents

Deeplistener: harnessing expected utility to guide clarification dialog in spoken language systems.

Horvitz et al., 2000

Document ID: 4377204221556194569
Author: Horvitz E; Paek T
Publication year: 2000
Publication venue: INTERSPEECH

External Links

Cited by

Snippet

We describe research on endowing spoken language systems with the ability to consider the cost of misrecognition, and using that knowledge to guide clarification dialog about a user's intentions. Our approach relies on coupling utility-directed policies for dialog with the …

Continue reading at www.cs.cmu.edu (PDF) (other versions)

238000005352 clarification 0 title abstract description 14

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1822—Parsing for meaning understanding
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
- G10L2015/0635—Training updating or merging of old and new templates; Mean values; Weighting
- G10L2015/0636—Threshold criteria for the updating
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/66—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination for extracting parameters related to health condition
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems

Similar Documents

Publication	Publication Date	Title
US20220122607A1 (en)	2022-04-21	Controlling an engagement state of an agent during a human-machine dialog
CN114127710B (en)	2025-05-23	Ambiguous solutions using conversational search history
US7580908B1 (en)	2009-08-25	System and method providing utility-based decision making about clarification dialog given communicative uncertainty
US10319381B2 (en)	2019-06-11	Iteratively updating parameters for dialog states
Smith et al.	2011	Interaction strategies for an affective conversational agent
US20220115001A1 (en)	2022-04-14	Method, System and Apparatus for Understanding and Generating Human Conversational Cues
Horvitz et al.	2003	Models of attention in computing and communication: from principles to applications
KR101622111B1 (en)	2016-05-18	Dialog system and conversational method thereof
US20030061029A1 (en)	2003-03-27	Device for conducting expectation based mixed initiative natural language dialogs
Trung	2006	Multimodal dialogue management-state of the art
EP4091161B1 (en)	2024-10-09	Synthesized speech audio data generated on behalf of human participant in conversation
US11437039B2 (en)	2022-09-06	Intelligent software agent
Horvitz et al.	2001	Harnessing models of users’ goals to mediate clarification dialog in spoken language systems
Horvitz et al.	2000	Deeplistener: harnessing expected utility to guide clarification dialog in spoken language systems.
CN116417003A (en)	2023-07-11	Voice interaction system, method, electronic device and storage medium
US11669697B2 (en)	2023-06-06	Hybrid policy dialogue manager for intelligent personal assistants
US12148417B1 (en)	2024-11-19	Label confidence scoring
CN114127694A (en)	2022-03-01	Error recovery for the session system
Feng et al.	2021	ASR-GLUE: A new multi-task benchmark for asr-robust natural language understanding
Paul et al.	2022	Intent based multimodal speech and gesture fusion for human-robot communication in assembly situation
US11430446B1 (en)	2022-08-30	Dialogue system and a dialogue method
CN111292749B (en)	2023-06-09	Session control method and device of intelligent voice platform
EP4089569A1 (en)	2022-11-16	A dialogue system and a dialogue method
EP4158621B1 (en)	2025-04-23	Enabling natural conversations with soft endpointing for an automated assistant
Hofmann	2016	Intuitive speech interface technology for information exchange tasks