Fernando et al., 2017 - Google Patents

Discriminatively learned hierarchical rank pooling networks

Fernando et al., 2017

Document ID: 11300584018311415162
Author: Fernando B; Gould S
Publication year: 2017
Publication venue: International Journal of Computer Vision

External Links

Cited by

Snippet

Rank pooling is a temporal encoding method that summarizes the dynamics of a video sequence to a single vector which has shown good results in human action recognition in prior work. In this work, we present novel temporal encoding methods for action and activity …

Continue reading at arxiv.org (PDF) (other versions)

238000011176 pooling 0 title abstract description 259

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6268—Classification techniques relating to the classification paradigm, e.g. parametric or non-parametric approaches
- G06K9/6269—Classification techniques relating to the classification paradigm, e.g. parametric or non-parametric approaches based on the distance between the decision surface and training patterns lying on the boundary of the class cluster, e.g. support vector machines
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G06K9/6232—Extracting features by transforming the feature space, e.g. multidimensional scaling; Mappings, e.g. subspace methods
- G06K9/6247—Extracting features by transforming the feature space, e.g. multidimensional scaling; Mappings, e.g. subspace methods based on an approximation criterion, e.g. principal component analysis
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6279—Classification techniques relating to the number of classes
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G06K9/6256—Obtaining sets of training patterns; Bootstrap methods, e.g. bagging, boosting
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/18—Digital computers in general; Data processing equipment in general in which a programme is changed according to experience gained by the computer itself during a complete run; Learning machines
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis

Similar Documents

Publication	Publication Date	Title
Zou et al.	2023	Object detection in 20 years: A survey
Koohzadi et al.	2017	Survey on deep learning methods in human action recognition
Xu et al.	2018	Dual-stream recurrent neural network for video captioning
Fernando et al.	2017	Discriminatively learned hierarchical rank pooling networks
Escorcia et al.	2016	Daps: Deep action proposals for action understanding
Fernando et al.	2016	Learning end-to-end video classification with rank-pooling
Kishore et al.	2018	Indian classical dance action identification and classification with convolutional neural networks
Lin et al.	2016	A deep structured model with radius–margin bound for 3D human activity recognition
Bilal et al.	2022	A transfer learning-based efficient spatiotemporal human action recognition framework for long and overlapping action classes
Asadi-Aghbolaghi et al.	2017	Deep learning for action and gesture recognition in image sequences: A survey
Zhang et al.	2020	Key frame proposal network for efficient pose estimation in videos
Wei et al.	2017	Robotic grasping recognition using multi-modal deep extreme learning machine
Gao et al.	2016	Object-centric representation learning from unlabeled videos
Yu et al.	2017	Fully convolutional networks for action recognition
Zong et al.	2016	Emotion recognition in the wild via sparse transductive transfer linear discriminant analysis
Ramesh et al.	2020	Low-power dynamic object detection and classification with freely moving event cameras
Nikpour et al.	2024	Deep reinforcement learning in human activity recognition: A survey and outlook
Li et al.	2017	Learning hierarchical video representation for action recognition
Yu et al.	2020	Rhyrnn: Rhythmic rnn for recognizing events in long and complex videos
Zhu et al.	2018	Random temporal skipping for multirate video analysis
Aubret et al.	2024	Self-supervised visual learning from interactions with objects
Zhong et al.	2023	Multimodal cooperative self‐attention network for action recognition
Karim et al.	2023	Understanding video transformers for segmentation: A survey of application and interpretability
Bahroun et al.	2017	Building efficient deep hebbian networks for image classification tasks
Kang et al.	2021	Crowd activity recognition in live video streaming via 3D‐ResNet and region graph convolution network