FOCUS: Clustering crowdsourced videos by line-of-sight
Jain et al., 2013
- Document ID
- 15223740841878377508
- Author
- Jain P
- Manweiler J
- Acharya A
- Beaty K
- Publication year
- 2013
- Publication venue
- Proceedings of the 11th ACM Conference on Embedded Networked Sensor Systems
Snippet
Crowdsourced video often provides engaging and diverse perspectives not captured by professional videographers. Broad appeal of user-uploaded video has been widely confirmed: freely distributed on YouTube, by subscription on Vimeo, and to peers on …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30781—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F17/30784—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
- G06F17/30799—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre using low-level visual features of the video content
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30244—Information retrieval; Database structures therefor; File system structures therefor in image databases
- G06F17/30247—Information retrieval; Database structures therefor; File system structures therefor in image databases based on features automatically derived from the image data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00624—Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
- G06K9/00711—Recognising video content, e.g. extracting audiovisual features from movies, extracting representative key-frames, discriminating news vs. sport content
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00624—Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
- G06K9/00664—Recognising scenes such as could be captured by a camera operated by a pedestrian or robot, including objects at substantially different ranges from the camera
- G06K9/00684—Categorising the entire scene, e.g. birthday party or wedding scene
- G06K9/00697—Outdoor scenes
- G06K9/00704—Urban scenes
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30017—Multimedia data retrieval; Retrieval of more than one type of audiovisual media
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/222—Studio circuitry; Studio devices; Studio equipment; Cameras comprising an electronic image sensor, e.g. digital cameras, video cameras, TV cameras, video cameras, camcorders, webcams, camera modules for embedding in other devices, e.g. mobile phones, computers or vehicles
- H04N5/225—Television cameras; Cameras comprising an electronic image sensor, e.g. digital cameras, video cameras, video cameras, camcorders, webcams, camera modules for embedding in other devices, e.g. mobile phones, computers or vehicles
- H04N5/232—Devices for controlling television cameras, e.g. remote control; Control of cameras comprising an electronic image sensor, e.g. digital cameras, video cameras, TV cameras, video cameras, camcorders, webcams, camera modules for embedding in, e.g. mobile phones, computers or vehicles
- H04N5/23238—Control of image capture or reproduction to achieve a very large field of view, e.g. panorama
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Jain et al. | | FOCUS: Clustering crowdsourced videos by line-of-sight |
| US9570111B2 (en) | | Clustering crowdsourced videos by line-of-sight |
| Chen et al. | | City-scale landmark identification on mobile devices |
| Chen et al. | | Rise of the indoor crowd: Reconstruction of building interior view via mobile crowdsourcing |
| Irschara et al. | | From structure-from-motion point clouds to fast location recognition |
| US9626585B2 (en) | | Composition modeling for photo retrieval through geometric image segmentation |
| US20090295791A1 (en) | | Three-dimensional environment created from video |
| Tompkin et al. | | Videoscapes: exploring sparse, unstructured video collections |
| Bettadapura et al. | | Egocentric field-of-view localization using first-person point-of-view devices |
| Baker et al. | | Localization and tracking of stationary users for augmented reality |
| Porzi et al. | | Learning contours for automatic annotations of mountains pictures on a smartphone |
| Zhang et al. | | Multi-video summary and skim generation of sensor-rich videos in geo-space |
| KR20210004918A (en) | | Method of Discovering Region of Attractions from Geo-tagged Photos and Apparatus Thereof |
| Zhu et al. | | Large-scale architectural asset extraction from panoramic imagery |
| Revaud et al. | | Did it change? learning to detect point-of-interest changes for proactive map updates |
| Hao et al. | | Keyframe presentation for browsing of user-generated videos on map interfaces |
| Park et al. | | Estimating the camera direction of a geotagged image using reference images |
| Brejcha et al. | | Camera orientation estimation in natural scenes using semantic cues |
| Hao et al. | | Point of interest detection and visual distance estimation for sensor-rich video |
| Lu et al. | | A fast 3D indoor-localization approach based on video queries |
| Chippendale et al. | | Spatial and temporal attractiveness analysis through geo-referenced photo alignment |
| Liu et al. | | Robust and accurate mobile visual localization and its applications |
| Doulamis | | Automatic 3D reconstruction from unstructured videos combining video summarization and structure from motion |
| Cricri et al. | | Multimodal Semantics Extraction from User‐Generated Videos |
| Kroepfl et al. | | Efficiently locating photographs in many panoramas |