Kumar et al., 2018 - Google Patents
Monocular fisheye camera depth estimation using sparse lidar supervisionKumar et al., 2018
View PDF- Document ID
- 9376119594460204816
- Author
- Kumar V
- Milz S
- Witt C
- Simon M
- Amende K
- Petzold J
- Yogamani S
- Pech T
- Publication year
- Publication venue
- 2018 21st International Conference on Intelligent Transportation Systems (ITSC)
External Links
Snippet
Near-field depth estimation around a self-driving car is an important function that can be achieved by four wide-angle fisheye cameras having a field of view of over 180°. Depth estimation based on convolutional neural networks (CNNs) produce state of the art results …
- 230000001537 neural 0 abstract description 11
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20112—Image segmentation details
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6201—Matching; Proximity measures
- G06K9/6202—Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
- G06T2207/30196—Human being; Person
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10016—Video; Image sequence
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00624—Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00362—Recognising human body or animal bodies, e.g. vehicle occupant, pedestrian; Recognising body parts, e.g. hand
- G06K9/00369—Recognition of whole body, e.g. static pedestrian or occupant recognition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/20—Image acquisition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T17/00—Three dimensional [3D] modelling, e.g. data description of 3D objects
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Kumar et al. | Monocular fisheye camera depth estimation using sparse lidar supervision | |
| US11830246B2 (en) | Systems and methods for extracting and vectorizing features of satellite imagery | |
| Mou et al. | IM2HEIGHT: Height estimation from single monocular imagery via fully residual convolutional-deconvolutional network | |
| Guerry et al. | Snapnet-r: Consistent 3d multi-view semantic labeling for robotics | |
| Hepp et al. | Learn-to-score: Efficient 3d scene exploration by predicting view utility | |
| CN118314180A (en) | Point cloud matching method and system based on derivative-free optimization | |
| JP2019520654A (en) | Object detection in video data | |
| Bešić et al. | Dynamic object removal and spatio-temporal RGB-D inpainting via geometry-aware adversarial learning | |
| JP2018520425A (en) | 3D space modeling | |
| WO2009023044A2 (en) | Method and system for fast dense stereoscopic ranging | |
| CN113160275A (en) | Automatic target tracking and track calculating method based on multiple videos | |
| CN103886287A (en) | Perspective-crossing gait recognition method based on 3D projection | |
| WO2025133902A1 (en) | Methods and systems for generating 3d representations of scenes from a plurality of images using pointmaps | |
| Khan et al. | Towards monocular neural facial depth estimation: Past, present, and future | |
| Sengupta | Semantic Mapping of Road Scenes | |
| Pu et al. | Sdf-gan: Semi-supervised depth fusion with multi-scale adversarial networks | |
| Kumar et al. | Monocular Fisheye Camera Depth Estimation Using Semisupervised Sparse Velodyne Data | |
| Zhang et al. | A self-supervised monocular depth estimation approach based on UAV aerial images | |
| de Carvalho | Deep depth from defocus: Neural networks for monocular depth estimation | |
| US20250285359A1 (en) | Dynamic multi-dimensional media content projection | |
| Sangregorio | Estimating Depth Images from Monocular Camera with Deep Learning for Service Robotics Applications | |
| Nadar et al. | Sensor simulation for monocular depth estimation using deep neural networks | |
| Janai | Addressing the Data Scarcity of Learning-based Optical Flow Approaches | |
| Zhang | Extending deep learning based Multi-view stereo algorithms for aerial datasets | |
| Bastico | Simultaneous depth completion and 3D object detection via deep learning for scene reconstruction in autonomous driving scenarios |