Machine learning-based risk factor analysis and prediction model construction for mortality in chronic heart failure

Background Chronic heart failure (CHF) carries a high global mortality burden, and traditional risk prediction tools are limited in accuracy and comprehensiveness. Machine learning (ML) has the potential to improve prediction performance, and a health ecology framework can systematically identify multi-dimensional risk factors. We therefore aimed to develop an ML-based mortality risk prediction model for CHF and to analyse its risk factors using a health ecology framework.
Methods We enrolled 489 CHF patients from the Jackson Heart Database, with all-cause mortality during a 10-year follow-up period as the outcome measure. Guided by a five-layer health ecology framework (individual traits, behavioural characteristics, interpersonal relationships, work/living conditions, and macro policies), we selected 58 variables for analysis. The cohort was split into training and validation sets at a 7:3 ratio. Five oversampling techniques were applied to address class imbalance, after which random forest (RF) and k-nearest neighbour (KNN) models were used to identify mortality predictors. We trained seven ML algorithms, validated them via 10-fold cross-validation, and compared them using accuracy, the area under the curve (AUC), and other metrics.
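The 7:3 split and 10-fold cross-validation described in the Methods can be illustrated with a minimal sketch. It assumes a simple random (unstratified) split and cross-validation carried out within the training set; the abstract confirms neither detail, and the function names `train_validation_split` and `k_fold` are hypothetical.

```python
import random

def train_validation_split(n_samples, train_fraction=0.7, seed=0):
    """Shuffle indices 0..n_samples-1 and split them into training and validation lists."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    cut = round(n_samples * train_fraction)
    return indices[:cut], indices[cut:]

def k_fold(indices, k=10):
    """Yield (fit, held_out) index lists; every index is held out exactly once."""
    folds = [indices[i::k] for i in range(k)]
    for i, held_out in enumerate(folds):
        fit = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield fit, held_out

# 489 patients, as reported in the abstract; the seed is an arbitrary assumption.
train_idx, valid_idx = train_validation_split(489)
folds = list(k_fold(train_idx, k=10))
print(len(train_idx), len(valid_idx), len(folds))  # 342 147 10
```

With 489 patients, a 7:3 split gives 342 training and 147 validation records. Each of the ten folds then holds out roughly a tenth of the training records while a model is fitted on the rest.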
Results We identified 24 key factors: 19 for individual traits (age, body mass index (BMI), antihypertensive medication, hypoglycaemic medication, antiarrhythmic medication, systolic blood pressure, glycated haemoglobin, glomerular filtration rate, left ventricular ejection fraction, left ventricular diastolic diameter, left ventricular mass, high-density lipoproteins, low-density lipoproteins, triglycerides, total cholesterol, cardiovascular surgical history, and mitral annular early diastolic peak velocity); three for individual behavioural characteristics (dark greens intake, egg intake, and night-time sleep duration); and two for living and working conditions (favourite food shops within a three-kilometre radius, and the proportion of poor people in the place of residence). The optimal model combined the synthetic minority over-sampling technique with edited nearest neighbours (SMOTE-ENN) for resampling and extreme gradient boosting (XGBoost) for classification. It predicted mortality at 10-year follow-up with an accuracy of 81.58%, an AUC of 0.83, a precision of 0.87, a recall of 0.84, and an F1 score of 0.86.
Conclusions We systematically categorised CHF mortality risk factors by integrating health ecology theory and ML. The model combining SMOTE-ENN and XGBoost demonstrated high accuracy, though further optimisation is needed to enhance its clinical utility in CHF risk prediction.
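To make the SMOTE-ENN resampling step named in the Results concrete, the following is a minimal pure-Python sketch on synthetic two-dimensional toy data. It is not the authors' implementation. The function names, the choice of k = 3, the majority-vote editing rule, and the toy data are all assumptions made for illustration; library implementations may use different defaults.

```python
import math
import random
from collections import Counter

def nearest(point, candidates, k):
    """Return indices of the k candidates closest to point (Euclidean distance)."""
    order = sorted(range(len(candidates)), key=lambda i: math.dist(point, candidates[i]))
    return order[:k]

def smote(X, y, minority, k=3, rng=None):
    """Add synthetic minority samples until both classes are equal in size (binary case).

    Assumes at least two minority samples exist.
    """
    rng = rng or random.Random(0)
    minority_pts = [x for x, label in zip(X, y) if label == minority]
    n_new = (len(y) - len(minority_pts)) - len(minority_pts)
    synthetic = []
    for _ in range(max(n_new, 0)):
        base = rng.choice(minority_pts)
        # Skip position 0: the closest minority point to `base` is `base` itself.
        neighbour = minority_pts[rng.choice(nearest(base, minority_pts, k + 1)[1:])]
        gap = rng.random()  # interpolate at a random position between base and neighbour
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbour)])
    return X + synthetic, y + [minority] * len(synthetic)

def edited_nearest_neighbours(X, y, k=3):
    """Drop every sample whose label disagrees with the majority of its k nearest neighbours."""
    keep = []
    for i, point in enumerate(X):
        others = [j for j in range(len(X)) if j != i]
        neighbours = sorted(others, key=lambda j: math.dist(point, X[j]))[:k]
        majority = Counter(y[j] for j in neighbours).most_common(1)[0][0]
        if y[i] == majority:
            keep.append(i)
    return [X[i] for i in keep], [y[i] for i in keep]

# Toy imbalanced data (an assumption, not study data): 40 majority vs 10 minority points.
rng = random.Random(42)
X = [[rng.gauss(0, 1), rng.gauss(0, 1)] for _ in range(40)]
X += [[rng.gauss(2, 1), rng.gauss(2, 1)] for _ in range(10)]
y = [0] * 40 + [1] * 10

X_bal, y_bal = smote(X, y, minority=1, rng=rng)          # oversample the minority class
X_res, y_res = edited_nearest_neighbours(X_bal, y_bal)   # then clean noisy or overlapping samples
print(Counter(y), Counter(y_bal), Counter(y_res))
```

The oversampling step balances the classes by interpolating between neighbouring minority samples. The editing step then removes samples from either class that sit among neighbours of the other class, which is intended to give a downstream classifier a cleaner class boundary.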

