Looking at projected documents: Event detection & document identification
Behera et al., 2004
- Document ID
- 3963098646208752285
- Author
- Behera A
- Lalanne D
- Ingold R
- Publication year
- 2004
- Publication venue
- 2004 IEEE International Conference on Multimedia and Expo (ICME)(IEEE Cat. No. 04TH8763)
Snippet
In the context of a multimodal application, the article proposes an image-based method for bridging the gap between document excerpts and video extracts. The approach, called document image alignment, takes advantage of the observable events related to documents …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30781—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F17/30784—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
- G06F17/30799—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre using low-level visual features of the video content
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30244—Information retrieval; Database structures therefor; File system structures therefor in image databases
- G06F17/30247—Information retrieval; Database structures therefor; File system structures therefor in image databases based on features automatically derived from the image data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30017—Multimedia data retrieval; Retrieval of more than one type of audiovisual media
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G06K9/00288—Classification, e.g. identification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/20—Image acquisition
- G06K9/32—Aligning or centering of the image pick-up or image-field
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00624—Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
- G06K9/00711—Recognising video content, e.g. extracting audiovisual features from movies, extracting representative key-frames, discriminating news vs. sport content
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/222—Studio circuitry; Studio devices; Studio equipment; Cameras comprising an electronic image sensor, e.g. digital cameras, video cameras, TV cameras, camcorders, webcams, camera modules for embedding in other devices, e.g. mobile phones, computers or vehicles
- H04N5/262—Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effect; Cameras specially adapted for the electronic generation of special effects
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Erol et al. | | Linking multimedia presentations with their symbolic source documents: algorithm and applications |
| US7372991B2 (en) | | Method and apparatus for summarizing and indexing the contents of an audio-visual presentation |
| CN1477590B (en) | | A system and method for whiteboard and audio capture |
| Cotsaces et al. | | Video shot detection and condensed representation. a review |
| US20110080424A1 (en) | | Image processing |
| US20140164927A1 (en) | | Talk Tags |
| JP5050075B2 (en) | | Image discrimination method |
| Chen et al. | | Visual storylines: Semantic visualization of movie sequence |
| US7904815B2 (en) | | Content-based dynamic photo-to-video methods and apparatuses |
| Fan et al. | | Matching slides to presentation videos using sift and scene background matching |
| Ma et al. | | Lecture video segmentation and indexing |
| Behera et al. | | Looking at projected documents: Event detection & document identification |
| Bertini et al. | | Semantic adaptation of sport videos with user-centred performance analysis |
| Behera et al. | | DocMIR: An automatic document-based indexing system for meeting retrieval |
| Wilk et al. | | Robust tracking for interactive social video |
| Zeng et al. | | Instant video summarization during shooting with mobile phone |
| Yeh | | Selecting interesting image regions to automatically create cinemagraphs |
| El-Bendary et al. | | PCA-based home videos annotation system |
| Wang et al. | | Robust alignment of presentation videos with slides |
| Hou et al. | | An automatic extraction system for screen-shot documents based on deep learning |
| Wamane et al. | | Embedded Technology Based Image and Video Data Extraction |
| Charara et al. | | Tracking a screen and detecting its rate of change in 3-D video scenes of multipurpose halls |
| DeCamp | | Headlock: Wide-range head pose estimation for low resolution video |
| Leon | | Content Identification using video tomography |
| Behera | | A visual signature-based identification method of low-resolution document images and its exploitation to automate indexing of multimodal recordings |