[go: up one dir, main page]

Kurimo, 2002 - Google Patents

Thematic indexing of spoken documents by using self-organizing maps

Kurimo, 2002

View PS
Document ID
16495195842976336891
Author
Kurimo M
Publication year
Publication venue
Speech Communication

External Links

Snippet

A method is presented to provide a useful searchable index for spoken audio documents. The task differs from the traditional (text) document indexing, because large audio databases are decoded by automatic speech recognition and decoding errors occur …
Continue reading at users.ics.aalto.fi (PS) (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/3061Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F17/30634Querying
    • G06F17/30657Query processing
    • G06F17/30675Query execution
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30286Information retrieval; Database structures therefor; File system structures therefor in structured data stores
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30861Retrieval from the Internet, e.g. browsers
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/62Methods or arrangements for recognition using electronic means
    • G06K9/6217Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99931Database or file accessing
    • Y10S707/99933Query processing, i.e. searching
    • Y10S707/99936Pattern matching access
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/065Adaptation
    • G10L15/07Adaptation to the speaker

Similar Documents

Publication Publication Date Title
CN112131350B (en) Text label determining method, device, terminal and readable storage medium
US11182435B2 (en) Model generation device, text search device, model generation method, text search method, data structure, and program
JP4664423B2 (en) How to find relevant information
US7739286B2 (en) Topic specific language models built from large numbers of documents
US6556987B1 (en) Automatic text classification system
US8543565B2 (en) System and method using a discriminative learning approach for question answering
US20020099730A1 (en) Automatic text classification system
US20070271226A1 (en) Annotation by Search
CN107229610A (en) The analysis method and device of a kind of affection data
CN111414763A (en) A semantic disambiguation method, device, device and storage device for sign language computing
Jin et al. Entity linking at the tail: sparse signals, unknown entities, and phrase models
CN113761125B (en) Dynamic summary determination method and device, computing device and computer storage medium
US20070112720A1 (en) Two stage search
CN119577459B (en) Intelligent customer service training method and device for multi-mode large model and storage medium
CN112307364A (en) A Character Representation Oriented Extraction Method of News Text Occurrence
Kurimo Indexing audio documents by using latent semantic analysis and som
Celikyilmaz et al. Leveraging web query logs to learn user intent via bayesian latent variable model
Kurimo Thematic indexing of spoken documents by using self-organizing maps
Lin et al. Enhanced BERT-based ranking models for spoken document retrieval
CA3017999A1 (en) Audio search user interface
CN111259650A (en) An automatic text generation method based on the generative adversarial model of the class label sequence
Xue et al. Fast query by example of environmental sounds via robust and efficient cluster-based indexing
US8862459B2 (en) Generating Chinese language banners
Chen et al. A recurrent neural network language modeling framework for extractive speech summarization
CN116108181A (en) Client information processing method and device and electronic equipment