Acoustic communications are the most widely exploited technology in the so-called Internet of Underwater Things (IoUT). UnderWater (UW) environments are often characterized by harsh propagation conditions, limited bandwidth, fast-varying channels, and long propagation delays. Moreover, IoUT nodes are usually battery-powered devices with limited processing capabilities. It is therefore necessary to design optimization algorithms that address the challenging propagation features while respecting the limited device capabilities. Given the nodes' energy and processing constraints, it is crucial to adjust the transmission parameters to the channel conditions while keeping the communication procedures lightweight and energy-efficient. In this work, we introduce a novel Multi-Player Multi-Armed Bandit (MP-MAB) framework for modulation adaptation in multi-hop IoUT acoustic networks. As opposed to the widely used, computation-demanding Deep Reinforcement Learning (DRL) techniques, MP-MAB algorithms are simple and lightweight, and make decisions iteratively by selecting one among multiple choices, or arms. The framework is fully distributed and dynamically selects the best modulation technique at each IoUT node by leveraging high-level statistics (e.g., network throughput), without the need to extract hard-to-obtain channel features (e.g., channel state). We evaluate the performance of the proposed framework using the DESERT UW simulator and compare it with state-of-the-art centralized DRL-based solutions for cognitive and heterogeneous networks, namely DRL-MCS, DRL-AM, PPO, and SAC, as well as with a multi-agent, distributed version of PPO. The results highlight that, despite its simplicity and fully distributed nature, the proposed framework achieves superior performance in UW networks in terms of throughput, convergence speed, and energy efficiency.
Compared to DRL-MCS and DRL-AM, our approach improves network throughput by up to 33% and 20%, respectively, and reduces energy consumption by up to 18% and 16%. When compared to PPO, SAC, and Multi-PPO, the proposed solution achieves up to 11%, 34%, and 38% higher throughput, and up to 7%, 17%, and 33% lower energy consumption, respectively.
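The arm-selection idea sketched in the abstract can be illustrated with a generic UCB1 bandit where each arm is a candidate modulation scheme and the reward is a high-level statistic such as normalized throughput. This is a minimal, hypothetical sketch of the multi-armed-bandit concept, not the paper's MP-MAB algorithm; the arm names and the Bernoulli throughput proxy are illustrative assumptions.

```python
import math
import random

class UCBModulationSelector:
    """UCB1 bandit sketch: each arm is a candidate modulation scheme."""

    def __init__(self, arms):
        self.arms = list(arms)
        self.counts = {a: 0 for a in self.arms}   # times each arm was played
        self.means = {a: 0.0 for a in self.arms}  # running mean reward

    def select(self):
        # Play each arm once first, then pick the arm maximizing the UCB1 index.
        for a in self.arms:
            if self.counts[a] == 0:
                return a
        t = sum(self.counts.values())
        return max(
            self.arms,
            key=lambda a: self.means[a] + math.sqrt(2 * math.log(t) / self.counts[a]),
        )

    def update(self, arm, reward):
        # Reward is a high-level statistic (e.g., normalized throughput);
        # update the running mean incrementally.
        self.counts[arm] += 1
        self.means[arm] += (reward - self.means[arm]) / self.counts[arm]


# Toy usage on a synthetic channel where BPSK succeeds most often on average.
random.seed(0)
true_rates = {"BPSK": 0.7, "QPSK": 0.5, "8PSK": 0.3}
bandit = UCBModulationSelector(true_rates)
for _ in range(2000):
    arm = bandit.select()
    reward = float(random.random() < true_rates[arm])  # Bernoulli throughput proxy
    bandit.update(arm, reward)
```

In a distributed deployment, each node would run such a selector independently, which matches the lightweight, model-free flavor described above: only the observed reward feeds the decision, with no channel-state estimation.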
Bandits Under the Waves: A Fully-Distributed Multi-Armed Bandit Framework for Modulation Adaptation in the Internet of Underwater Things
F. Busacca, L. Galluccio, S. Palazzo, Andrea Panebianco, R. Raftopoulos
Published 2026 in IEEE Transactions on Network and Service Management
PUBLICATION RECORD
- Fields of study
Computer Science, Engineering, Environmental Science
- Source metadata
Semantic Scholar