MMDP: A Mobile-IoT Based Multi-Modal Reinforcement Learning Service Framework

Puming Wang,L. Yang,Jintao Li,Xue Li,Xiaokang Zhou

Published 2020 in IEEE Transactions on Services Computing

ABSTRACT

With the development of GPS technology, a new Mobile Internet of Things (M-IoT) is emerging, which perceives the city's rhythm and pulse day and night to collect a large scale of city data. It is urgent to innovate M-IoT service system for these large-scale and heterogeneous data. To cope with the problem, this article proposes a Mobile-IoT based multi-modal reinforcement learning service framework from data perspective, which has three highlights, i) Developing Action-aware High-order Transition Tensor (<inline-formula><tex-math notation="LaTeX">$AHTT$</tex-math><alternatives><mml:math><mml:mrow><mml:mi>A</mml:mi><mml:mi>H</mml:mi><mml:mi>T</mml:mi><mml:mi>T</mml:mi></mml:mrow></mml:math><inline-graphic xlink:href="yang-ieq1-2964663.gif"/></alternatives></inline-formula>) to fuse the heterogeneous data from M-IoTs in a unified form. ii) Developing Multi-modal Markov Decision Process (<inline-formula><tex-math notation="LaTeX">$MMDP$</tex-math><alternatives><mml:math><mml:mrow><mml:mi>M</mml:mi><mml:mi>M</mml:mi><mml:mi>D</mml:mi><mml:mi>P</mml:mi></mml:mrow></mml:math><inline-graphic xlink:href="yang-ieq2-2964663.gif"/></alternatives></inline-formula>) to model the multi-modal reinforcement learning for M-IoT service framework. iii) Developing Tensor Policy Iteration algorithm (<inline-formula><tex-math notation="LaTeX">$TPIA$</tex-math><alternatives><mml:math><mml:mrow><mml:mi>T</mml:mi><mml:mi>P</mml:mi><mml:mi>I</mml:mi><mml:mi>A</mml:mi></mml:mrow></mml:math><inline-graphic xlink:href="yang-ieq3-2964663.gif"/></alternatives></inline-formula>) to solve the optimal tensor policy. Due to using tensor keeps the multi-modal relations of the context information in the process of solving the optimal policy. The proposed M-IoT service system provides more personalized service for taxi drivers. The experiment results shows that most taxi drivers earn more revenue according to the tensor policy.

PUBLICATION RECORD

Publication year
2020
Venue
IEEE Transactions on Services Computing
Publication date
2020-07-01
Fields of study
Computer Science, Engineering
Identifiers
DOI 10.1109/tsc.2020.2964663
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

HO-OTSVD: A Novel Tensor Decomposition and Its Incremental Decomposition for Cyber–Physical–Social Networks (CPSN)
2020cited by this paper
Data fusion in cyber-physical-social systems: State-of-the-art and perspectives
2019cited by this paper
Deep Convolutional Computation Model for Feature Learning on Big Data in Internet of Things
2018cited by this paper
High-order possibilistic c-means algorithms based on tensor decompositions for big data in IoT
2018cited by this paper
Privacy-Preserving Double-Projection Deep Computation Model With Crowdsourcing on Cloud for Big Data Feature Learning
2018cited by this paper
In Situ Mutation for Active Things in the IoT Context
2018cited by this paper
Towards fog driven IoT healthcare: challenges and framework of fog computing in healthcare
2018cited by this paper
An Edge Cloud-Assisted CPSS Framework for Smart City
2018cited by this paper
IoT-Fog Optimal Workload via Fog Offloading
2018cited by this paper
Measurement and Classification of Smart Systems Data Traffic Over 5G Mobile Networks
2018cited by this paper
Comparison Data Traffic Scheduling Techniques for Classifying QoS over 5G Mobile Networks
2017cited by this paper
A System-Level Modeling and Design for Cyber-Physical-Social Systems
2016cited by this paper
iGeoRec: A Personalized and Efficient Geographical Location Recommendation Framework
2015cited by this paper
A cost-effective recommender system for taxi drivers
2014cited by this paper
Big Data Real-Time Processing Based on Storm
2013cited by this paper
Solving Multilinear Systems via Tensor Inversion
2013cited by this paper
T-Finder: A Recommender System for Finding Passengers and Vacant Taxis
2013cited by this paper
T-drive: driving directions based on taxi trajectories
2010cited by this paper
Optimal Control and Viscosity Solutions of Hamilton-Jacobi-Bellman Equations
1997cited by this paper

CITED BY

MAID: Mobility-aware information dissemination in mobile IoT using temporal point processes
2025cites this paper
The Supply Chain Transportation and Route Planning Under Deep Reinforcement Learning
2025cites this paper
Lightweight Tensor-Enabled GRU for Trustworthy and Communication Efficient Federated Learning in Industrial IoT
2025cites this paper
A multi-objective deep reinforcement learning algorithm for spatio-temporal latency optimization in mobile IoT-enabled edge computing networks
2025cites this paper
Truncated Lanczos-TSVD: An Effective Dimensionality Reduction Algorithm for Detecting DDoS Attacks in Large-Scale Networks
2024cites this paper
Failure Analysis in Next-Generation Critical Cellular Communication Infrastructures
2024cites this paper
Reinforcement learning architecture for cyber–physical–social AI: state-of-the-art and perspectives
2023cites this paper
Edge-Enabled Two-Stage Scheduling Based on Deep Reinforcement Learning for Internet of Everything
2023cites this paper
Agile Services Provisioning for Learning-Based Applications in Fog Computing Networks
2023cites this paper
Task Recommendation via Heterogeneous Multi-modal Features and Decision Fusion in Mobile Crowdsensing
2023cites this paper
Automatic Discovery of Multi-perspective Process Model using Reinforcement Learning
2022cites this paper
Privacy-Preserving Tucker Train Decomposition Over Blockchain-Based Encrypted Industrial IoT Data
2021cites this paper
The Applicability of Reinforcement Learning Methods in the Development of Industry 4.0 Applications
2021cites this paper
DL Multi-sensor information fusion service Selective Information Scheme for Improving the Internet of Things based User Responses
2021cites this paper
Automatic Hierarchical Reinforcement Learning for Reusing Service Process Fragments
2021cites this paper
Image Colorization: A Survey and Dataset
2020cites this paper