Uncertainty in Deep Learning for EEG under Dataset Shifts

Objective. As artificial intelligence (AI) is increasingly integrated into medical diagnostics, it is essential that predictive models provide not only accurate outputs but also reliable estimates of uncertainty. In clinical applications, where decisions have significant consequences, understanding the confidence behind each prediction is as critical as the prediction itself. Uncertainty modelling plays a key role in improving trust, guiding decision-making, and identifying unreliable outputs, particularly under dataset shift or in out-of-distribution settings. The primary aim of uncertainty metrics is to align model confidence closely with actual predictive performance, ensuring confidence estimates dynamically adjust to reflect increasing errors or decreasing reliability of predictions. This study investigates how different ensemble learning strategies affect both performance and uncertainty estimation in a clinically relevant task: classifying Normal, Mild Cognitive Impairment, and Dementia from electroencephalography (EEG) data.

Approach. We evaluated the performance and uncertainty of ensemble methods and Monte Carlo dropout on a large EEG dataset. The models were assessed in three settings: (1) in-distribution performance on a held-out test set, (2) generalisation to three out-of-distribution datasets, and (3) performance under gradual, EEG-specific dataset shifts simulating noise, drift, and frequency perturbation.

Main results. Ensembles consisting of multiple independently trained models, such as deep ensembles, consistently achieved higher performance in both the in-distribution test set and the out-of-distribution datasets. These models also produced more informative and responsive uncertainty estimates under various types of EEG dataset shifts.

Significance. These results highlight the benefits of ensemble diversity and independent training to build robust and uncertainty-aware EEG classification models. The findings are particularly relevant for clinical applications, where reliability under distribution shift and transparent uncertainty are essential for safe deployment.
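
As an illustration of how uncertainty can be read off an ensemble or repeated Monte Carlo dropout passes, the sketch below averages per-member class probabilities and computes predictive entropy and mutual information. These two measures are common choices and are assumed here for illustration; the abstract does not state which uncertainty metrics the study used. The function names and the example probabilities for the three classes (Normal, MCI, Dementia) are hypothetical.

```python
import math
from typing import Dict, List, Sequence


def ensemble_mean(member_probs: Sequence[Sequence[float]]) -> List[float]:
    """Average class probabilities over ensemble members (or MC dropout passes)."""
    n_members = len(member_probs)
    n_classes = len(member_probs[0])
    return [sum(p[c] for p in member_probs) / n_members for c in range(n_classes)]


def entropy(probs: Sequence[float]) -> float:
    """Shannon entropy in nats; zero-probability classes contribute nothing."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def uncertainty_summary(member_probs: Sequence[Sequence[float]]) -> Dict[str, object]:
    """Return the mean prediction plus two illustrative uncertainty measures."""
    mean = ensemble_mean(member_probs)
    total = entropy(mean)  # entropy of the averaged prediction
    expected = sum(entropy(p) for p in member_probs) / len(member_probs)
    return {
        "mean_probs": mean,
        "predictive_entropy": total,
        # Disagreement between members: total entropy minus average member entropy.
        "mutual_information": total - expected,
    }


# Hypothetical softmax outputs of three members for one EEG recording,
# ordered as (Normal, MCI, Dementia). These are made-up numbers.
agree = [[0.90, 0.07, 0.03], [0.88, 0.08, 0.04], [0.92, 0.05, 0.03]]
disagree = [[0.80, 0.15, 0.05], [0.10, 0.80, 0.10], [0.15, 0.15, 0.70]]

if __name__ == "__main__":
    for name, probs in (("agree", agree), ("disagree", disagree)):
        summary = uncertainty_summary(probs)
        print(name, summary["predictive_entropy"], summary["mutual_information"])
```

When the members agree, both measures stay low; when they disagree, as might happen on shifted or out-of-distribution input, both rise. This is the kind of responsiveness to shift that the abstract describes for independently trained ensembles.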

