Kurimo, 2002 - Google Patents

Thematic indexing of spoken documents by using self-organizing maps

Kurimo, 2002

View PS

Document ID: 16495195842976336891
Author: Kurimo M
Publication year: 2002
Publication venue: Speech Communication

External Links

Cited by

Snippet

A method is presented to provide a useful searchable index for spoken audio documents. The task differs from the traditional (text) document indexing, because large audio databases are decoded by automatic speech recognition and decoding errors occur …

Continue reading at users.ics.aalto.fi (PS) (other versions)

238000009499 grossing 0 abstract description 28

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30634—Querying
- G06F17/30657—Query processing
- G06F17/30675—Query execution
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30861—Retrieval from the Internet, e.g. browsers
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
- Y10S707/99936—Pattern matching access
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker

Similar Documents

Publication	Publication Date	Title
CN112131350B (en)	2024-04-30	Text label determining method, device, terminal and readable storage medium
US11182435B2 (en)	2021-11-23	Model generation device, text search device, model generation method, text search method, data structure, and program
JP4664423B2 (en)	2011-04-06	How to find relevant information
US7739286B2 (en)	2010-06-15	Topic specific language models built from large numbers of documents
US6556987B1 (en)	2003-04-29	Automatic text classification system
US8543565B2 (en)	2013-09-24	System and method using a discriminative learning approach for question answering
US20020099730A1 (en)	2002-07-25	Automatic text classification system
US20070271226A1 (en)	2007-11-22	Annotation by Search
CN107229610A (en)	2017-10-03	The analysis method and device of a kind of affection data
CN111414763A (en)	2020-07-14	A semantic disambiguation method, device, device and storage device for sign language computing
Jin et al.	2014	Entity linking at the tail: sparse signals, unknown entities, and phrase models
CN113761125B (en)	2025-06-03	Dynamic summary determination method and device, computing device and computer storage medium
US20070112720A1 (en)	2007-05-17	Two stage search
CN119577459B (en)	2025-04-22	Intelligent customer service training method and device for multi-mode large model and storage medium
CN112307364A (en)	2021-02-02	A Character Representation Oriented Extraction Method of News Text Occurrence
Kurimo	1999	Indexing audio documents by using latent semantic analysis and som
Celikyilmaz et al.	2011	Leveraging web query logs to learn user intent via bayesian latent variable model
Kurimo	2002	Thematic indexing of spoken documents by using self-organizing maps
Lin et al.	2019	Enhanced BERT-based ranking models for spoken document retrieval
CA3017999A1 (en)	2017-09-21	Audio search user interface
CN111259650A (en)	2020-06-09	An automatic text generation method based on the generative adversarial model of the class label sequence
Xue et al.	2008	Fast query by example of environmental sounds via robust and efficient cluster-based indexing
US8862459B2 (en)	2014-10-14	Generating Chinese language banners
Chen et al.	2014	A recurrent neural network language modeling framework for extractive speech summarization
CN116108181A (en)	2023-05-12	Client information processing method and device and electronic equipment