Fusion of Video and Inertial Sensing for Deep Learning–Based Human Action Recognition

Published 2019 in Italian National Conference on Sensors

ABSTRACT

This paper presents the simultaneous use of video images and inertial signals, captured at the same time by a video camera and a wearable inertial sensor, within a fusion framework, with the aim of achieving more robust human action recognition than when each sensing modality is used individually. The data captured by these sensors are converted into 3D video images and 2D inertial images, which are then fed into a 3D convolutional neural network and a 2D convolutional neural network, respectively, to recognize actions. Two types of fusion are considered: decision-level fusion and feature-level fusion. Experiments are conducted on the publicly available UTD-MHAD dataset, in which video images and inertial signals are captured simultaneously for a total of 27 actions. The results indicate that both the decision-level and feature-level fusion approaches achieve higher recognition accuracies than either sensing modality used individually. The highest accuracy, 95.6%, is obtained with the decision-level fusion approach.
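
The difference between the two fusion types named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not state the fusion rule, so the product of class probabilities used here for decision-level fusion is an assumption, as are all function names, scores, and feature values. Feature-level fusion is shown as plain concatenation of the two feature vectors before a shared classifier.

```python
import math

def softmax(scores):
    """Convert raw class scores into probabilities that sum to 1."""
    peak = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def decision_level_fusion(video_scores, inertial_scores):
    """Fuse after classification and return the winning class index.

    Assumption for this sketch: the per-class probabilities of the two
    networks are multiplied (product rule); the paper may use another rule.
    """
    p_video = softmax(video_scores)
    p_inertial = softmax(inertial_scores)
    fused = [pv * pi for pv, pi in zip(p_video, p_inertial)]
    return max(range(len(fused)), key=fused.__getitem__)

def feature_level_fusion(video_features, inertial_features):
    """Fuse before classification by concatenating the two feature vectors."""
    return list(video_features) + list(inertial_features)

# Toy example with 3 classes (UTD-MHAD has 27); all numbers are made up.
video_scores = [2.0, 1.9, 0.1]     # video network narrowly prefers class 0
inertial_scores = [0.2, 2.5, 0.1]  # inertial network clearly prefers class 1
fused_class = decision_level_fusion(video_scores, inertial_scores)

# Hypothetical feature vectors taken from each network before its classifier.
joint_features = feature_level_fusion([0.3, 0.7], [0.1, 0.9, 0.5])
```

In the toy example the video scores alone would narrowly favor class 0, while the fused decision follows the more confident inertial stream. The concatenated vector would be passed to a single classifier trained on both modalities.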

PUBLICATION RECORD

Publication year
2019
Venue
Italian National Conference on Sensors
Publication date
2019-08-24
Fields of study
Medicine, Computer Science, Engineering
Identifiers
DOI 10.3390/s19173680 PMID 31450609 PMCID 6749419
Source metadata
Semantic Scholar, PubMed

REFERENCES

Robust High Dimensional Stream Classification with Novel Class Detection (2019)
Wearable Embedded Intelligence for Detection of Falls Independently of on-Body Location (2019)
Coarse-Fine Convolutional Deep-Learning Strategy for Human Activity Recognition (2019)
A Differential Evolution Approach to Optimize Weights of Dynamic Time Warping for Multi-Sensor Based Gesture Recognition (2019)
Orientation Independent Activity/Gesture Recognition Using Wearable Motion Sensors (2019)
Semi-Supervised Faster RCNN-Based Person Detection and Load Classification for Far Field Video Surveillance (2019)
A Convolutional Neural Network-Based Sensor Fusion System for Monitoring Transition Movements in Healthcare Applications (2018)
Determining Number of Speakers from Single Microphone Speech Signals by Multi-Label Convolutional Neural Network (2018)
Action Detection and Recognition in Continuous Action Streams by Deep Learning-Based Sensing Fusion (2018)
Aligning Audiovisual Features for Audiovisual Speech Recognition (2018)
Action Recognition by an Attention-Aware Temporal Weighted Convolutional Neural Network (2018)
A Survey on Smart Homes for Aging in Place: Toward Solutions to the Specific Needs of the Elderly (2018)
Deep Learning-Based Person Detection and Classification for Far Field Video Surveillance (2018)
Real-Time Continuous Detection and Recognition of Subject-Specific Smart TV Gestures via Fusion of Depth and Inertial Sensing (2018)
TORNADO: A Spatio-Temporal Convolutional Regression Network for Video Action Proposal (2017)
Continuous detection and recognition of actions of interest among actions of non-interest using a depth camera (2017)
A Real-Time Human Action Recognition System Using Depth and Inertial Sensor Fusion (2016)
3D skeleton-based human action classification: A survey (2016)
Structured Feature Learning for Pose Estimation (2016)
Continuous Human Action Recognition Using Depth-MHI-HOG and a Spotter Model (2015)
UTD-MHAD: A multimodal dataset for human action recognition utilizing a depth camera and a wearable inertial sensor (2015)
Deep Learning (2015)
Improving Human Action Recognition Using Fusion of Depth Camera and Inertial Sensors (2015)
APT: Action localization proposals from dense trajectories (2015)
A survey of depth and inertial sensor fusion for human action recognition (2015)
Semantic human activity recognition: A literature review (2015)
A medication adherence monitoring system for pill bottles based on a wearable inertial sensor (2014)
Home-based Senior Fitness Test measurement system using collaborative inertial and depth sensors (2014)
Two-Stream Convolutional Networks for Action Recognition in Videos (2014)
A Vision-Based System for Intelligent Monitoring: Human Behaviour Analysis and Privacy by Context (2014)
Learning Spatiotemporal Features with 3D Convolutional Networks (2014)
G3D: A gaming action dataset and real time action recognition evaluation framework (2012)
uWave: Accelerometer-based personalized gesture recognition and its applications (2009)
Distributed recognition of human actions using wearable motion sensor networks (2009)
Using human body gestures as inputs for gaming via depth analysis (2008)
Action Recognition with Improved Trajectories (author manuscript, published in International Conference on Computer Vision, 2013)

CITED BY

Human Daily Indoor Action (HDIA) Dataset: Privacy-Preserving Human Action Recognition Using Infrared Camera and Wearable Armband Sensors (2025)
Skeletal joint image-based multi-channel fusion network for human activity recognition (2025)
Automated recognition of construction worker activities using multimodal decision-level fusion (2025)
Integrating Soft Computing and Multi-Agent for Action Recognition: Basics, Challenging and Future Directions (2025)
TriGait: Hybrid Fusion Strategy for Multimodal Alignment and Integration in Gait Recognition (2025)
RGB video and inertial sensing fusion method for human action recognition in human-robot collaborative manufacturing (2025)
Mamba-MHAR: An efficient multimodal framework for human action recognition (2025)
InfraTag: Customizable Infrastructure Interaction Activity Recognition on Resource-Constrained Hardware (2024, influential citation)
C3T: Cross-modal Transfer Through Time for Sensor-based Human Activity Recognition (2024)
Spatio-temporal attention modules in orientation-magnitude-response guided multi-stream CNNs for human action recognition (2024)
A Methodological and Structural Review of Hand Gesture Recognition Across Diverse Data Modalities (2024)
A Survey on Multimodal Wearable Sensor-based Human Action Recognition (2024)
Cross-Modality Gesture Recognition With Complete Representation Projection (2024, influential citation)
Deep Multimodal Habit Tracking System: A User-adaptive Approach for Low-power Embedded Systems (2023)
DSFNet: A Distributed Sensors Fusion Network for Action Recognition (2023)
Multimodal data-based deep learning model for sitting posture recognition toward office workers’ health promotion (2023)
Aggregation of Tennis Multivariate Time-Series Using the Choquet Integral and Its Generalizations (2023)
Intelligent ADL Recognition via IoT-Based Multimodal Deep Learning Framework (2023)
Towards Pervasive Sensing: A multimodal approach via CSI and RGB image modalities fusion (2023)
Physical-aware Cross-modal Adversarial Network for Wearable Sensor-based Human Action Recognition (2023)
A Novel Two Stream Decision Level Fusion of Vision and Inertial Sensors Data for Automatic Multimodal Human Activity Recognition System (2023)
Multi-stream CNNs with Orientation-Magnitude Response Maps and Weighted Inception Module for Human Action Recognition (2023, influential citation)
Optimization Simulation of Match between Technical Actions and Music of National Dance Based on Deep Learning (2023)
WEAR: A Multimodal Dataset for Wearable and Egocentric Video Activity Recognition (2023)
Identifying the “Dangshan” Physiological Disease of Pear Woolliness Response via Feature-Level Fusion of Near-Infrared Spectroscopy and Visual RGB Image (2023)
A Survey on Human Action Recognition (2022)
Crowd Abnormality Detection Using Optical Flow and GLCM-Based Texture Features (2022)
A Dance Somersault Pose Recognition Model Using Multifeature Fusion Algorithm (2022)
Sports Action Recognition Based on Deep Learning and Clustering Extraction Algorithm (2022)
A Comparative Analysis of Decision-Level Fusion for Multimodal Driver Behaviour Understanding (2022)
AMB-Wnet: Embedding attention model in multi-bridge Wnet for exploring the mechanics of disease. (2022)
Action recognition through fusion of sEMG and skeletal data in feature level (2022)
Inertial Hallucinations - When Wearable Inertial Devices Start Seeing Things (2022)
Progressive Cross-modal Knowledge Distillation for Human Action Recognition (2022)
ModSelect: Automatic Modality Selection for Synthetic-to-Real Domain Generalization (2022)
Application of Artificial Intelligence and Big Data Technology in Basketball Sports Training (2022)
Lite-3DCNN Combined with Attention Mechanism for Complex Human Movement Recognition (2022)
MMTSA: Multimodal Temporal Segment Attention Network for Efficient Human Activity Recognition (2022)
MMTSA (2022)
Robust Human Activity Recognition by Integrating Image and Accelerometer Sensor Data Using Deep Fusion Network (2021)
Novel Machine Learning for Human Actions Classification Using Histogram of Oriented Gradients and Sparse Representation (2021)
Continuous Human Action Detection Based on Wearable Inertial Data (2021)
Cross-Modal Knowledge Distillation For Vision-To-Sensor Action Recognition (2021)
Chronological Poor and Rich Tunicate Swarm Algorithm integrated Deep Maxout Network for human action and abnormality detection (2021)
The Variegated Applications of Deep Learning Techniques in Human Activity Recognition (2021)
Malicious Network Behavior Detection Using Fusion of Packet Captures Files and Business Feature Data (2021)
ChMusic: A Traditional Chinese Music Dataset for Evaluation of Instrument Recognition (2021)
Development and Validation of a 3-Dimensional Convolutional Neural Network for Automatic Surgical Skill Assessment Based on Spatiotemporal Video Analysis (2021)
Goaling recognition based on intelligent analysis of real-time basketball image of Internet of Things (2021)
A review of multimodal human activity recognition with special emphasis on classification, applications, challenges and future directions (2021)
Activity Recognition for Ambient Assisted Living with Videos, Inertial Units and Ambient Sensors (2021, influential citation)
C-MHAD: Continuous Multimodal Human Action Dataset of Simultaneous Video and Inertial Sensing (2020)
Human Action Recognition From Various Data Modalities: A Review (2020)
CNN-Based Multistage Gated Average Fusion (MGAF) for Human Action Recognition Using Depth and Inertial Sensors (2020)
A Hierarchical Learning Approach for Human Action Recognition (2020, influential citation)
Real-Time Abnormal Event Detection for Enhanced Security in Autonomous Shuttles Mobility Infrastructures (2020)
Semantics-Aware Adaptive Knowledge Distillation for Sensor-to-Vision Action Recognition (2020)
Deep Learning-Based Real-Time Multiple-Person Action Recognition System (2020)
Vision and Inertial Sensing Fusion for Human Action Recognition: A Review (2020)
Human Action Recognition Using Laban Movement Analysis and Dynamic Time Warping (2020)
Simultaneous Utilization of Inertial and Video Sensing for Action Detection and Recognition in Continuous Action Streams (2020, influential citation)
Spatiotemporal Interaction Residual Networks with Pseudo3D for Video Action Recognition (2020)
Action Recognition by an Attention-Aware Temporal Weighted Convolutional Neural Network (2018)