Featureless: Bypassing feature extraction in action categorization

S. Pintea,Pascal Mettes,J. V. Gemert,A. Smeulders

Published 2016 in International Conference on Information Photonics

ABSTRACT

This method introduces an efficient manner of learning action categories without the need of feature estimation. The approach starts from low-level values, in a similar style to the successful CNN methods. However, rather than extracting general image features, we learn to predict specific video representations from raw video data. The benefit of such an approach is that at the same computational expense it can predict 2D video representations as well as 3D ones, based on motion. The proposed model relies on discriminative Wald-boost, which we enhance to a multiclass formulation for the purpose of learning video representations. The suitability of the proposed approach as well as its time efficiency are tested on the UCF11 action recognition dataset.

PUBLICATION RECORD

Publication year
2016
Venue
International Conference on Information Photonics
Publication date
2016-09-01
Fields of study
Computer Science
Identifiers
DOI 10.1109/ICIP.2016.7532346 arXiv 1803.06962
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Contextual Action Recognition with R*CNN
2015cited by this paper
Human Action Recognition Using Factorized Spatio-Temporal Convolutional Networks
2015cited by this paper
Modeling video evolution for action recognition
2015cited by this paper
P-CNN: Pose-Based CNN Features for Action Recognition
2015cited by this paper
What do 15,000 object categories tell us about classifying and localizing actions?
2015cited by this paper
Visualizing Object Detection Features
2015cited by this paper
Learning to Track for Spatio-Temporal Action Localization
2015cited by this paper
DISCOVER: Discovering Important Segments for Classification of Video Events and Recounting
2014cited by this paper
A discriminative CNN video representation for event detection
2014cited by this paper
From Categories to Individuals in Real Time -- A Unified Boosting Approach
2014cited by this paper
Efficient Action Localization with Approximately Normalized Fisher Vectors
2014cited by this paper
Online, Real-Time Tracking Using a Category-to-Individual Detector
2014cited by this paper
Two-Stream Convolutional Networks for Action Recognition in Videos
2014cited by this paper
Deep learning in neural networks: An overview
2014cited by this paper
Efficient Feature Extraction, Encoding, and Classification for Action Recognition
2014cited by this paper
Action Recognition with Stacked Fisher Vectors
2014cited by this paper
Large-Scale Video Classification with Convolutional Neural Networks
2014cited by this paper
Recognizing 50 human action categories of web videos
2013cited by this paper
Author manuscript, published in "International Journal of Computer Vision (2013)" International Journal of Computer Vision manuscript No. (will be inserted by the editor) Image Classification with the Fisher Vector: Theory and Practice
2013cited by this paper
UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild
2012cited by this paper
Aggregating Local Image Descriptors into Compact Codes
2012cited by this paper
ImageNet classification with deep convolutional neural networks
2012cited by this paper
Vlfeat: an open and portable library of computer vision algorithms
2010cited by this paper
Visual Word Ambiguity
2010cited by this paper
Evaluating Color Descriptors for Object and Scene Recognition
2010cited by this paper
Multi-class AdaBoost ∗
2009influential reference
Learning semantic visual vocabularies using diffusion distance
2009cited by this paper
Weighted Sampling for Large-Scale Boosting
2008cited by this paper
WaldBoost - learning for time constrained sequential detection
2005influential reference
Histograms of oriented gradients for human detection
2005cited by this paper
Distinctive Image Features from Scale-Invariant Keypoints
2004cited by this paper
Distinctive Image Features from Scale-Invariant Keypoints
2004cited by this paper
Video Google: a text retrieval approach to object matching in videos
2003cited by this paper

CITED BY

IoT Vulnerability Detection using Featureless LLM CyBert Model
2024cites this paper
Ripple20 Vulnerabilities Detection using a Featureless Deep Learning Model
2023cites this paper
Optimized feature selection-based clustering approach for computer-aided detection of lung nodules in different modalities
2019cites this paper
Continuous learning in computer vision
2017cites this paper
New classifier architecture and training methodologies for lung nodule detection in chest radiographs and computed tomography
2017cites this paper