YouTubeCat: Learning to categorize wild web videos

Zheshen Wang, Ming Zhao, Yang Song, Sanjiv Kumar, Baoxin Li

Published 2010 in 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition

ABSTRACT

Automatic categorization of videos in a Web-scale unconstrained collection such as YouTube is a challenging task. A key issue is how to build an effective training set in the presence of missing, sparse or noisy labels. We propose to achieve this by first manually creating a small labeled set and then extending it using additional sources such as related videos, searched videos, and text-based webpages. The data from such disparate sources have different properties and labeling quality, and thus fusing them in a coherent fashion is another practical challenge. We propose a fusion framework in which each data source is first combined with the manually-labeled set independently. Then, using the hierarchical taxonomy of the categories, a Conditional Random Field (CRF) based fusion strategy is designed. Based on the final fused classifier, category labels are predicted for the new videos. Extensive experiments on about 80K videos from the 29 most frequent categories in YouTube show the effectiveness of the proposed method for categorizing large-scale wild Web videos.
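The two-stage idea in the abstract (combine each extra source with the manual set independently, then fuse the resulting classifiers) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration: the toy 2-D features, the labels, the source names, the nearest-centroid classifier, and above all the fusion step, which simply averages per-source scores. The paper's CRF-based fusion over the category taxonomy is not reproduced.

```python
from collections import defaultdict


def train_centroids(samples):
    """Toy nearest-centroid 'classifier': mean feature vector per label."""
    sums, counts = {}, defaultdict(int)
    for features, label in samples:
        if label not in sums:
            sums[label] = [0.0] * len(features)
        sums[label] = [s + f for s, f in zip(sums[label], features)]
        counts[label] += 1
    return {lab: [s / counts[lab] for s in vec] for lab, vec in sums.items()}


def score(centroids, features):
    """Per-label score: negative squared distance to that label's centroid."""
    return {
        lab: -sum((c - f) ** 2 for c, f in zip(cen, features))
        for lab, cen in centroids.items()
    }


def fuse_and_predict(models, features):
    """Late fusion by score averaging (a stand-in, NOT the paper's CRF)."""
    per_model = [score(m, features) for m in models]
    labels = set.intersection(*(set(s) for s in per_model))
    fused = {lab: sum(s[lab] for s in per_model) / len(per_model) for lab in labels}
    return max(fused, key=fused.get)


# Hypothetical data: a small manually labeled set plus two noisier sources.
manual = [([0.0, 0.0], "sports"), ([1.0, 1.0], "music")]
sources = {
    "related_videos": [([0.1, 0.2], "sports"), ([0.9, 0.8], "music")],
    "searched_videos": [([0.2, 0.0], "sports"), ([1.0, 0.7], "music")],
}

# Stage 1: each source is combined with the manual set independently.
models = [train_centroids(manual + extra) for extra in sources.values()]

# Stage 2: fuse the per-source models to label a new video.
prediction = fuse_and_predict(models, [0.15, 0.1])
```

Training one model per source keeps a noisy source from contaminating the others; only the fusion step has to reconcile their disagreements.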

PUBLICATION RECORD

Publication year
2010
Venue
2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition
Publication date
2010-06-01
Fields of study
Computer Science
Identifiers
DOI 10.1109/CVPR.2010.5540125
Source metadata
Semantic Scholar

REFERENCES

Audiovisual celebrity recognition in unconstrained web videos
2009, cited by this paper
Automatic annotation of human actions in video
2009, cited by this paper
Co-training with noisy perceptual observations
2009, cited by this paper
VideoMule: a consensus learning approach to multi-label classification from noisy user-generated videos
2009, cited by this paper
Internet video category recognition
2008, cited by this paper
Solving the label resolution problem in supervised video content classification
2008, cited by this paper
LIBLINEAR: A Library for Large Linear Classification
2008, cited by this paper
A walk through the web’s video clips
2008, influential reference
Learning realistic human actions from movies
2008, cited by this paper
Watch, Listen & Learn: Co-training on Captioned Images and Videos
2008, cited by this paper
Cross-domain video concept detection using adaptive svms
2007, cited by this paper
Learning Bayesian networks
2007, cited by this paper
Manifold Regularization: A Geometric Framework for Learning from Labeled and Unlabeled Examples
2006, influential reference
Evaluation campaigns and TRECVid
2006, cited by this paper
"Hello! My name is... Buffy" – Automatic Naming of Characters in TV Video
2006, cited by this paper
Large scale image-based adult-content filtering
2006, cited by this paper
Early versus late fusion in semantic video analysis
2005, cited by this paper
Distinctive Image Features from Scale-Invariant Keypoints
2004, cited by this paper
Discriminative Fields for Modeling Spatial Dependencies in Natural Images
2003, influential reference
Learning from labeled and unlabeled data with label propagation
2002, cited by this paper
Agile Software Development
2002, cited by this paper
Representing and Recognizing the Visual Appearance of Materials using Three-dimensional Textons
2001, cited by this paper
Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data
2001, cited by this paper
Statistical Color Models with Application to Skin Detection
1999, cited by this paper
Automatic partitioning of full-motion video
1993, cited by this paper

CITED BY

Abstractive Multi-Video Captioning: Benchmark Dataset Construction and Extensive Evaluation
2024, cites this paper
Classifying YouTube Videos by Thumbnail
2021, cites this paper
Link between Facial Expressions and Emotional States Induced by Exposure to Multimedia Content
2019, cites this paper
Social web video clustering based on multi-modal and clustering ensemble
2019, cites this paper
Web video classification with visual and contextual semantics
2019, cites this paper
Movie Genres Classification using Collaborative Filtering
2019, cites this paper
Moocs Video Mining Using Decision Tree J48 and Naive Bayesian Classification Models
2018, influential citation
Web Video Clustering Based on Emotion Category
2018, cites this paper
Learning to Recognize Actions with Weak Supervision
2018, influential citation
Learning from Multiple Sources for Video
2018, cites this paper
Learning From Web Videos for Event Classification
2018, cites this paper
Cooperative Caching Plan of Popular Videos for Mobile Users by Grouping Preferences
2018, cites this paper
Learning from Multiple Sources for Video Summarisation
2018, cites this paper
Learning to Recognize Actions with Weak Supervision. (Reconnaissance d'actions de manière faiblement supervisée)
2018, influential citation
Context-Associative Hierarchical Memory Model for Human Activity Recognition and Prediction
2017, cites this paper
Mining Moocs Videos Metadata Using Classification Techniques
2017, cites this paper
Event Recognition and Classification in Sports Video
2017, cites this paper
Web Video Mining: Metadata Predictive Analysis using Classification Techniques
2016, cites this paper
Social Web Videos Clustering Based on Ensemble Technique
2016, cites this paper
Covariance of Motion and Appearance Features for Spatio Temporal Recognition Tasks
2016, cites this paper
A Physiologically Motivated Approach to the Classification of Natural Sounds using High Order Sound Statistics
2016, cites this paper
Web video categorization using category-predictive classifiers and category-specific concept classifiers
2016, cites this paper
From Traditional to Modern: Domain Adaptation for Action Classification in Short Social Video Clips
2016, cites this paper
Semi-supervised evolutionary ensembles for Web video categorization
2015, cites this paper
Methods to Obtain Training Videos for Fully Automated Application-Specific Classification
2015, cites this paper
Multi-Source Video Summarisation
2015, cites this paper
Learning from Multiple Sources for Video Summarisation
2015, cites this paper
Quality-Based Learning for Web Data Classification
2014, cites this paper
Minimally Needed Evidence for Complex Event Recognition in Unconstrained Videos
2014, cites this paper
Best practices for learning video concept detectors from social media examples
2014, cites this paper
Classification of Cinematographic Shots Using Lie Algebra and its Application to Complex Event Recognition
2014, cites this paper
Video Classification Using Semantic Concept Co-occurrences
2014, cites this paper
YouDACC: the Youtube Dialectal Arabic Comment Corpus
2014, cites this paper
A data-driven approach for tag refinement and localization in web videos
2014, cites this paper
Ground plane rectification from crowd motion
2014, cites this paper
Identifying Presentation Styles in Online Educational Videos MSR-TR-2014-141
2014, cites this paper
Tagging based Efficient Web Video Event Categorization
2014, cites this paper
Identifying Presentation Styles in Online Educational Videos
2014, cites this paper
An Efficient Gradient-based Approach to Optimizing Average Precision Through Maximal Figure-of-Merit Learning
2013, cites this paper
Subband autocorrelation features for video soundtrack classification
2013, cites this paper
Multiple Classifier Systems
2013, influential citation
Semi-supervised Clustering Ensemble for Web Video Categorization
2013, cites this paper
Film segmentation and indexing using autoassociative neural networks
2013, cites this paper
Recognizing 50 human action categories of web videos
2013, cites this paper
Video classification and recommendation based on affective analysis of viewers
2013, cites this paper
Recognition of complex events in open-source web-scale videos: a bottom up approach
2013, cites this paper
Scene image categorization and video event detection using Naive Bayes Nearest Neighbor
2013, cites this paper
Evaluating sources and strategies for learning video concepts from social media
2013, cites this paper
Noise robust keyword spotting for user generated video blogs
2013, influential citation
Characterizing Audio Events for Video Soundtrack Analysis
2013, cites this paper
Video Synopsis by Heterogeneous Multi-source Correlation
2013, cites this paper
Mining Conversational Social Video
2013, cites this paper
Recognition of Complex Events in Open-source Web-scale Videos: Features, Intermediate Representations and Their Temporal Interactions
2013, cites this paper
Sentiment Analysis: A Literature Survey
2013, cites this paper
Sentiment Analysis
2013, cites this paper
Multimodal genre classification of TV programs and YouTube videos
2013, cites this paper
Fully Automated Learning for Application-Specific Web Video Classification
2013, cites this paper
Template-based keyword spotting for user generated videos
2013, influential citation
The impact of YouTube videos on the student's learning
2012, cites this paper
Random-sampling-based spatial-temporal feature for consumer video concept classification
2012, cites this paper
On-the-fly Topic Adaptation for YouTube Video Transcription
2012, cites this paper
Complex Events Detection Using Data-Driven Concepts
2012, cites this paper
Multimodal feature fusion for robust event detection in web videos
2012, cites this paper
Multi-channel Shape-Flow Kernel Descriptors for Robust Video Event Detection and Retrieval
2012, cites this paper
Adaptation for YouTube Video Transcription
2012, cites this paper
Somebody helps me: Travel video scene detection using web-based context
2012, cites this paper
The Action Similarity Labeling Challenge
2012, cites this paper
Comparative Analysis of Content-based TV Genre Classification and Web Video Categorization
2012, cites this paper
Automatic collection of Web video shots corresponding to specific actions using Web images
2012, cites this paper
Semantic multimedia analysis using knowledge and context
2012, cites this paper
Action Recognition Using Particle Flow Fields
2012, cites this paper
YouCat: Weakly Supervised Youtube Video Categorization System from Meta Data & User Comments using WordNet & Wikipedia
2012, cites this paper
An audio-visual approach to web video categorization
2012, cites this paper
KIT at MediaEval 2012 - Content-based Genre Classification with Visual Cues
2012, cites this paper
Explicit Performance Metric Optimization for Fusion-Based Video Retrieval
2012, influential citation
Features with Feelings - Incorporating User Preferences in Video Categorization
2012, cites this paper
Mining Semantics from Low-level Features in Multimedia Computing
2011, cites this paper
Video indexing and recommendation based on affective analysis of viewers
2011, cites this paper
Boosting video classification using cross-video signals
2011, cites this paper
KIT at MediaEval 2011 - Content-based genre classification on web-videos
2011, cites this paper
Automatic construction of an action video shot database using web videos
2011, cites this paper
A Bayesian network modeling approach for cross media analysis
2011, cites this paper
Learning heterogeneous data for hierarchical web video classification
2011, cites this paper
Automatic annotation of Web videos
2011, cites this paper
YouTubeEvent: On large-scale video event classification
2011, cites this paper
Efficient Orthogonal Matching Pursuit using sparse random projections for scene and video classification
2011, cites this paper
Improving video classification via youtube video co-watch data
2011, cites this paper
Handling label noise in video classification via multiple instance learning
2011, cites this paper
Movie genre classification via scene categorization
2010, influential citation