Online Multi-object Tracking Using CNN-Based Single Object Tracker with Spatial-Temporal Attention Mechanism

Q. Chu,Wanli Ouyang,Hongsheng Li,Xiaogang Wang,Nenghai Yu

Published 2017 in IEEE International Conference on Computer Vision

ABSTRACT

In this paper, we propose a CNN-based framework for online MOT. This framework utilizes the merits of single object trackers in adapting appearance models and searching for target in the next frame. Simply applying single object tracker for MOT will encounter the problem in computational efficiency and drifted results caused by occlusion. Our framework achieves computational efficiency by sharing features and using ROI-Pooling to obtain individual features for each target. Some online learned target-specific CNN layers are used for adapting the appearance model for each target. In the framework, we introduce spatial-temporal attention mechanism (STAM) to handle the drift caused by occlusion and interaction among targets. The visibility map of the target is learned and used for inferring the spatial attention map. The spatial attention map is then applied to weight the features. Besides, the occlusion status can be estimated from the visibility map, which controls the online updating process via weighted loss on training samples with different occlusion statuses in different frames. It can be considered as temporal attention mechanism. The proposed algorithm achieves 34.3% and 46.0% in MOTA on challenging MOT15 and MOT16 benchmark dataset respectively.

PUBLICATION RECORD

Publication year
2017
Venue
IEEE International Conference on Computer Vision
Publication date
2017-08-09
Fields of study
Computer Science
Identifiers
DOI 10.1109/ICCV.2017.518 arXiv 1708.02843
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Factors in Finetuning Deep Model for Object Detection with Long-Tail Distribution
2016cited by this paper
STCT: Sequentially Training Convolutional Networks for Visual Tracking
2016cited by this paper
Multi-person Tracking by Multicut and Deep Matching
2016cited by this paper
Joint Learning of Convolutional Neural Networks and Temporally Constrained Metrics for Tracklet Association
2016cited by this paper
Online multi-person tracking using Integral Channel Features
2016cited by this paper
Learning Mutual Visibility Relationship for Pedestrian Detection with a Deep Model
2016cited by this paper
MOT16: A Benchmark for Multi-Object Tracking
2016influential reference
Learning by Tracking: Siamese CNN for Robust Target Association
2016cited by this paper
Online Multi-object Tracking via Structural Constraint Event Aggregation
2016cited by this paper
Tracking Multiple Persons Based on a Variational Bayesian Model
2016cited by this paper
Online Multi-target Tracking with Strong and Weak Detections
2016cited by this paper
Joint Probabilistic Data Association Revisited
2015cited by this paper
GMMCP tracker: Globally optimal Generalized Maximum Multi Clique problem for multiple object tracking
2015cited by this paper
Near-Online Multi-target Tracking with Aggregated Local Flow Descriptor
2015cited by this paper
Understanding pedestrian behaviors from stationary crowd groups
2015cited by this paper
Multiple Hypothesis Tracking Revisited
2015cited by this paper
Online Tracking by Learning Discriminative Saliency Map with Convolutional Neural Network
2015cited by this paper
MOTChallenge 2015: Towards a Benchmark for Multi-Target Tracking
2015cited by this paper
Bayesian Multi-object Tracking Using Motion Context from Multiple Objects
2015influential reference
Learning to Track: Online Multi-object Tracking by Decision Making
2015cited by this paper
Fast R-CNN
2015influential reference
Hierarchical Convolutional Features for Visual Tracking
2015cited by this paper
Subgraph decomposition for multi-target tracking
2015cited by this paper
Visual Tracking with Fully Convolutional Networks
2015cited by this paper
Learning Spatially Regularized Correlation Filters for Visual Tracking
2015cited by this paper
Cross-scene crowd counting via deep convolutional neural networks
2015cited by this paper
Learning Deep Representation with Large-Scale Attributes
2015cited by this paper
DeepID-Net: Deformable deep convolutional neural networks for object detection
2014cited by this paper
Very Deep Convolutional Networks for Large-Scale Image Recognition
2014cited by this paper
Detection and Tracking of Occluded People
2014cited by this paper
Continuous Energy Minimization for Multitarget Tracking
2014cited by this paper
Caffe: Convolutional Architecture for Fast Feature Embedding
2014cited by this paper
Fast Feature Pyramids for Object Detection
2014cited by this paper
Robust Online Multi-object Tracking Based on Tracklet Confidence and Online Discriminative Appearance Learning
2014cited by this paper
Structure Preserving Object Tracking
2013cited by this paper
The Way They Move: Tracking Multiple Targets with Similar Appearance
2013cited by this paper
Part-based multiple-person tracking with partial occlusion handling
2012cited by this paper
(MP)2T: Multiple People Multiple Parts Tracker
2012cited by this paper
Single and Multiple Object Tracking Using Log-Euclidean Riemannian Subspace and Block-Division Appearance Model
2012cited by this paper
Online Learned Discriminative Part-Based Appearance Models for Multi-human Tracking
2012cited by this paper
To Track or To Detect? An Ensemble Framework for Optimal Selection
2012cited by this paper
GMCP-Tracker: Global Multi-object Tracking Using Generalized Minimum Clique Graphs
2012cited by this paper
ImageNet classification with deep convolutional neural networks
2012cited by this paper
Globally-optimal greedy algorithms for tracking a variable number of objects
2011cited by this paper
Multiobject tracking as maximum weight independent set
2011cited by this paper
Robust Object Tracking with Online Multiple Instance Learning
2011cited by this paper
Struck: Structured output tracking with kernels
2011cited by this paper
Online Multiperson Tracking-by-Detection from a Single, Uncalibrated Camera
2011cited by this paper
Object Detection with Discriminatively Trained Part Based Models
2010cited by this paper
Learning to associate: HybridBoosted multi-target tracker for crowded scene
2009cited by this paper
Multi-object tracking through occlusions by local tracklets filtering and global tracklets association with detection responses
2009cited by this paper
Markov Chain Monte Carlo Data Association for Multi-Target Tracking
2009cited by this paper
ImageNet: A large-scale hierarchical image database
2009cited by this paper
Robust Object Tracking by Hierarchical Association of Detection Responses
2008cited by this paper
Evaluating Multiple Object Tracking Performance: The CLEAR MOT Metrics
2008cited by this paper
Global data association for multi-object tracking using network flows
2008cited by this paper
Detection and Tracking of Multiple, Partially Occluded Humans by Bayesian Combination of Edgelet based Part Detectors
2007cited by this paper
Histograms of oriented gradients for human detection
2005cited by this paper
Ieee Transactions on Pattern Analysis and Machine Intelligence High-speed Tracking with Kernelized Correlation Filters
year unknowncited by this paper

CITED BY

Propagating spatio-temporal state and progressively associating trajectory for satellite video multi-object tracking
2026cites this paper
DeMark: A Query-Free Black-Box Attack on Deepfake Watermarking Defenses
2026cites this paper
Surveillance to self-driving: a comprehensive review of object detection and tracking paradigms
2026cites this paper
Data-Driven Object Tracking: Integrating Modular Neural Networks into a Kalman Framework
2025cites this paper
Application and Challenges of Deep Learning in Object Detection for High-Altitude Observation Systems
2025cites this paper
Optimization of Multi-Object Tracking Algorithm Based on IMM-UKF-JPDAF
2025cites this paper
Fast highway abandoned object detection via block-based multi-group foreground extraction
2025cites this paper
AMF-MOT: Multi-Object Tracking Based on Motion-Appearance Feature Fusion for Object Vehicle Loss and Occlusion
2025cites this paper
Adaptive Low Light Enhancement via Joint Global-Local Illumination Adjustment
2025cites this paper
Dual attention for multi object tracking with intra sample context and cross sample interaction
2025cites this paper
A method for classroom behavior state recognition and teaching quality monitoring
2025cites this paper
High Performance Autonomous Target Tracking and Control Method for Quadrotors Using Artificial Intelligence
2025cites this paper
MOSEv2: A More Challenging Dataset for Video Object Segmentation in Complex Scenes
2025cites this paper
A Review of Vision-Based Tracking Methods: Temporal and Spatial Features
2025cites this paper
Potato precision planter metering system based on improved YOLOv5n-ByteTrack
2025cites this paper
Detecting subsurface diseases on airport road surface based on an improved SSD algorithm
2025cites this paper
Multi-source data-driven intelligent analysis and decision optimization for high-density pedestrian flows in urban public spaces
2025cites this paper
Crowd behavior detection: leveraging video swin transformer for crowd size and violence level analysis
2024cites this paper
Temporal Correlation Meets Embedding: Towards a 2nd Generation of JDE-based Real-Time Multi-Object Tracking
2024cites this paper
Object Recognition and Tracking System to Assist Visually Impaired: A Neural Network-Based Deep RBM Technique
2024cites this paper
An Approximate Dynamic Programming Framework for Occlusion-Robust Multi-Object Tracking
2024cites this paper
Autonomous Mobile Robot Navigation: Tracking problem
2024cites this paper
One-Stage Anchor-Free Online Multiple Target Tracking With Deformable Local Attention and Task-Aware Prediction
2024cites this paper
CenterADNet: Infrared Video Target Detection Based on Central Point Regression
2024cites this paper
A reliable unmanned aerial vehicle multi-target tracking system with global motion compensation for monitoring Procapra przewalskii
2024cites this paper
Recurrent neural network based on attention mechanism in prediction of glass forming ability by element proportion
2024cites this paper
Learning Discriminative Motion Models for Multiple Object Tracking
2024cites this paper
Estimating Dynamic Flow Features in Groups of Tracked Objects
2024cites this paper
An Explainable Non-local Network for COVID-19 Diagnosis
2024cites this paper
FACT: Feature Adaptive Continual-learning Tracker for Multiple Object Tracking
2024cites this paper
YOLOShipTracker: Tracking ships in SAR images using lightweight YOLOv8
2024cites this paper
Multi-Agent Reinforcement Learning as Interaction Model for Online Multi-Object Tracking
2024cites this paper
Detection of group-housed pigs feeding behavior using deep learning and edge devices
2024cites this paper
AMtrack: Anti-occlusion multi-object tracking algorithm
2024cites this paper
Transformers for Enhanced Multi-Object Tracking with Sensor Fusion
2024cites this paper
Detection of pine wilt disease infected pine trees using YOLOv5 optimized by attention mechanisms and loss functions
2024cites this paper
Interpretable Dynamic Graph Neural Networks for Small Occluded Object Detection and Tracking
2024cites this paper
Re-identification of people in a video stream based on a Kalman filter
2024cites this paper
SimpleTrackV2: Rethinking the Timing Characteristics for Multi-Object Tracking
2024cites this paper
Based on improved joint detection and tracking of UAV for multi-target detection of livestock
2024cites this paper
Rlm-tracking: online multi-pedestrian tracking supported by relative location mapping
2024cites this paper
Dynamic obstacle avoidance model of autonomous driving with attention mechanism and temporal residual block
2024cites this paper
PSMOT: Online Occlusion-Aware Multi-Object Tracking Exploiting Position Sensitivity
2024cites this paper
MAML MOT: Multiple Object Tracking Based on Meta-Learning
2024cites this paper
One-Shot Multiple Object Tracking With Robust ID Preservation
2024cites this paper
Deep Triply Attention Network for RGBT Tracking
2023cites this paper
Rethinking Attentive Object Detection via Neural Attention Learning
2023cites this paper
Multi-Camera Multi-Object Tracking: A Review of Current Trends and Future Advances
2023cites this paper
UTM: A Unified Multiple Object Tracking Model with Identity-Aware Feature Enhancement
2023cites this paper
DeepSORT Pedestrian Tracking Algorithm based on Azimuth Estimation
2023cites this paper
SiamMaskAttn: inverted residual attention block fusing multi-scale feature information for multitask visual object tracking networks
2023cites this paper
Dual-focus transfer network for zero-shot learning
2023cites this paper
Research on water extraction from high resolution remote sensing images based on deep learning
2023cites this paper
Smart Telescope System with Automatic Tracking
2023cites this paper
Research on land cover classification of multi-source remote sensing data based on improved U-net network
2023cites this paper
Deep MDP: A Modular Framework for Multi-Object Tracking
2023cites this paper
Multi-object Tracking with Spatial-Temporal Tracklet Association
2023cites this paper
STRAN: Student expression recognition based on spatio-temporal residual attention network in classroom teaching videos
2023cites this paper
Decode-MOT: How Can We Hurdle Frames to Go Beyond Tracking-by-Detection?
2023cites this paper
Autoregressive Visual Tracking
2023cites this paper
Benchmarking the Complementary-View Multi-human Association and Tracking
2023cites this paper
Multi-object tracking via deep feature fusion and association analysis
2023cites this paper
DC-MOT: Motion Deblurring and Compensation for Multi-Object Tracking in UAV Videos
2023cites this paper
Mixture of Dynamical Variational Autoencoders for Multi-Source Trajectory Modeling and Separation
2023cites this paper
Deep Neural Network-based Multi-Object Tracker in Complex Events
2023cites this paper
PANet: An End-to-end Network Based on Relative Motion for Online Multi-object Tracking
2023cites this paper
An optimized patch-point based approach for seismic fault interpretation using CNN
2023cites this paper
EPT-Net: Edge Perception Transformer for 3D Medical Image Segmentation
2023cites this paper
MOSE: A New Dataset for Video Object Segmentation in Complex Scenes
2023cites this paper
Open-Ended Online Learning for Autonomous Visual Perception
2023cites this paper
Bidirectional Multiple Object Tracking Based on Trajectory Criteria in Satellite Videos
2023cites this paper
Object tracking algorithm of Siamese network based on feature fusion and attention mechanism
2023cites this paper
DGM-VINS: Visual–Inertial SLAM for Complex Dynamic Environments With Joint Geometry Feature Extraction and Multiple Object Tracking
2023cites this paper
Automated recognition of individual performers from de-identified video sequences
2023cites this paper
Multi-Channel Weight-Sharing Autoencoder Based on Cascade Multi-Head Attention for Multimodal Emotion Recognition
2023cites this paper
Robust Multi-Ship Tracker in SAR Imagery by Fusing Feature Matching and Modified KCF
2023cites this paper
Multi-Object Multi-Camera Tracking Based on Deep Learning for Intelligent Transportation: A Review
2023cites this paper
Effect of Intermittent Exercise on Performance in 3D Multiple Objects Tracking in Children, Young and Older Adults-A Pilot Study.
2022cites this paper
Bio-Inspired Vision and Gesture-Based Robot-Robot Interaction for Human-Cooperative Package Delivery
2022cites this paper
A motion model based on recurrent neural networks for visual object tracking
2022cites this paper
Online multiple object tracking using joint detection and embedding network
2022cites this paper
Similarity based person re-identification for multi-object tracking using deep Siamese network
2022cites this paper
Real-Time Online Multi-Object Tracking in Compressed Domain
2022cites this paper
A Review of Deep Learning Techniques for Crowd Behavior Analysis
2022cites this paper
Robust S-Y-biLSTM object tracking method for on-road objects shoot from an unmanned aerial vehicle
2022cites this paper
Learning attention modules for visual tracking
2022cites this paper
Online Pedestrian Multiple-Object Tracking with Prediction Refinement and Track Classification
2022cites this paper
Multi-Camera Multiple 3D Object Tracking on the Move for Autonomous Vehicles
2022cites this paper
Unsupervised Multiple-Object Tracking with a Dynamical Variational Autoencoder
2022cites this paper
Fast Multi-shadow Tracking for Video-SAR Using Triplet Attention Mechanism
2022cites this paper
DSRRTracker: Dynamic Search Region Refinement for Attention-based Siamese Multi-Object Tracking
2022cites this paper
Multi-layer features template update object tracking algorithm based on SiamFC++
2022cites this paper
SOTVerse: A User-Defined Task Space of Single Object Tracking
2022cites this paper
Consistent Cell Tracking in Multi-frames with Spatio-Temporal Context by Object-Level Warping Loss
2022cites this paper
An end-to-end identity association network based on geometry refinement for multi-object tracking
2022cites this paper
Leveraging temporal-aware fine-grained features for robust multiple object tracking
2022cites this paper
Multi-cue multi-hypothesis tracking with re-identification for multi-object tracking
2022cites this paper
MobileNet-JDE: a lightweight multi-object tracking model for embedded systems
2022cites this paper
Transformer-based two-source motion model for multi-object tracking
2022cites this paper
Segmentation is Tracking: Spatial-Temporal Map Vehicle Trajectory Reconstruction and Validation
2022cites this paper