Kurimo, 2002 - Google Patents
Thematic indexing of spoken documents by using self-organizing mapsKurimo, 2002
View PS- Document ID
- 16495195842976336891
- Author
- Kurimo M
- Publication year
- Publication venue
- Speech Communication
External Links
Snippet
A method is presented to provide a useful searchable index for spoken audio documents. The task differs from the traditional (text) document indexing, because large audio databases are decoded by automatic speech recognition and decoding errors occur …
- 238000009499 grossing 0 abstract description 28
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30634—Querying
- G06F17/30657—Query processing
- G06F17/30675—Query execution
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30861—Retrieval from the Internet, e.g. browsers
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
- Y10S707/99936—Pattern matching access
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN112131350B (en) | Text label determining method, device, terminal and readable storage medium | |
| US11182435B2 (en) | Model generation device, text search device, model generation method, text search method, data structure, and program | |
| JP4664423B2 (en) | How to find relevant information | |
| US7739286B2 (en) | Topic specific language models built from large numbers of documents | |
| US6556987B1 (en) | Automatic text classification system | |
| US8543565B2 (en) | System and method using a discriminative learning approach for question answering | |
| US20020099730A1 (en) | Automatic text classification system | |
| US20070271226A1 (en) | Annotation by Search | |
| CN107229610A (en) | The analysis method and device of a kind of affection data | |
| CN111414763A (en) | A semantic disambiguation method, device, device and storage device for sign language computing | |
| Jin et al. | Entity linking at the tail: sparse signals, unknown entities, and phrase models | |
| CN113761125B (en) | Dynamic summary determination method and device, computing device and computer storage medium | |
| US20070112720A1 (en) | Two stage search | |
| CN119577459B (en) | Intelligent customer service training method and device for multi-mode large model and storage medium | |
| CN112307364A (en) | A Character Representation Oriented Extraction Method of News Text Occurrence | |
| Kurimo | Indexing audio documents by using latent semantic analysis and som | |
| Celikyilmaz et al. | Leveraging web query logs to learn user intent via bayesian latent variable model | |
| Kurimo | Thematic indexing of spoken documents by using self-organizing maps | |
| Lin et al. | Enhanced BERT-based ranking models for spoken document retrieval | |
| CA3017999A1 (en) | Audio search user interface | |
| CN111259650A (en) | An automatic text generation method based on the generative adversarial model of the class label sequence | |
| Xue et al. | Fast query by example of environmental sounds via robust and efficient cluster-based indexing | |
| US8862459B2 (en) | Generating Chinese language banners | |
| Chen et al. | A recurrent neural network language modeling framework for extractive speech summarization | |
| CN116108181A (en) | Client information processing method and device and electronic equipment |