CaliForest: calibrated random forest for health data

Published 2020 in ACM Conference on Health, Inference, and Learning

ABSTRACT

Real-world predictive models in healthcare should be evaluated in terms of discrimination, the ability to differentiate between high- and low-risk events, and calibration, the accuracy of the risk estimates. Unfortunately, calibration is often neglected and only discrimination is analyzed. Calibration is crucial for personalized medicine, as risk estimates play an increasing role in the decision-making process. Since random forest is a popular model for many healthcare applications, we propose CaliForest, a new calibrated random forest. Unlike existing calibration methodologies, CaliForest utilizes the out-of-bag samples to avoid the explicit construction of a calibration set. We evaluated CaliForest on two risk prediction tasks obtained from the publicly available MIMIC-III database. Evaluation on these binary prediction tasks demonstrates that CaliForest can achieve the same discriminative power as random forest while obtaining a better-calibrated model evaluated across six different metrics. CaliForest will be published on the standard Python software repository and the code will be openly available on GitHub.
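
The out-of-bag idea in the abstract can be illustrated with a minimal sketch. This is not the authors' CaliForest implementation: it assumes scikit-learn, a synthetic dataset in place of MIMIC-III, isotonic regression as the calibrator, and AUROC and the Brier score as stand-ins for the discrimination and calibration metrics. All variable names are illustrative.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.isotonic import IsotonicRegression
from sklearn.metrics import brier_score_loss, roc_auc_score
from sklearn.model_selection import train_test_split

# Synthetic imbalanced binary task (assumption: stands in for a risk prediction task).
X, y = make_classification(
    n_samples=2000, n_features=20, weights=[0.8, 0.2], random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0
)

# oob_score=True makes scikit-learn keep out-of-bag class probabilities.
forest = RandomForestClassifier(n_estimators=300, oob_score=True, random_state=0)
forest.fit(X_train, y_train)

# Each training row is scored only by trees whose bootstrap sample excluded it,
# so these probabilities act like held-out predictions without a separate split.
oob_prob = forest.oob_decision_function_[:, 1]

# Guard against rows that were never out-of-bag (reported as NaN).
mask = ~np.isnan(oob_prob)

# Fit the calibrator on out-of-bag predictions instead of a calibration set.
calibrator = IsotonicRegression(y_min=0.0, y_max=1.0, out_of_bounds="clip")
calibrator.fit(oob_prob[mask], y_train[mask])

# Compare raw and calibrated probabilities on the untouched test split.
raw_prob = forest.predict_proba(X_test)[:, 1]
cal_prob = calibrator.predict(raw_prob)

auc_raw = roc_auc_score(y_test, raw_prob)
auc_cal = roc_auc_score(y_test, cal_prob)
brier_raw = brier_score_loss(y_test, raw_prob)
brier_cal = brier_score_loss(y_test, cal_prob)

print(f"AUROC  raw={auc_raw:.3f}  calibrated={auc_cal:.3f}")
print(f"Brier  raw={brier_raw:.3f}  calibrated={brier_cal:.3f}")
```

One caveat of this simplification: an out-of-bag probability averages only the trees that did not see the row, while test-time probabilities average all trees, so the two score distributions can differ slightly.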

PUBLICATION RECORD

Publication year
2020
Venue
ACM Conference on Health, Inference, and Learning
Publication date
2020-04-01
Fields of study
Medicine, Computer Science
Identifiers
DOI 10.1145/3368555.3384461 PMID 34308443 PMCID PMC8299436
Source metadata
Semantic Scholar, PubMed

REFERENCES

Calibrating Probability Estimation Trees using Venn-Abers Predictors
2019, cited by this paper
MIMIC-Extract: a data extraction, preprocessing, and representation pipeline for MIMIC-III
2019, cited by this paper
Machine learning for phenotyping opioid overdose events
2019, cited by this paper
Feature Robustness in Non-stationary Health Records: Caveats to Deployable Model Performance in Common Clinical Machine Learning Tasks
2019, cited by this paper
Benchmarking deep learning models on large healthcare datasets
2018, cited by this paper
Learning from Longitudinal Data in Electronic Health Record and Genetic Data to Improve Cardiovascular Event Prediction
2018, cited by this paper
Discrimination and Calibration of Clinical Prediction Models: Users’ Guides to the Medical Literature
2017, cited by this paper
Multitask learning and benchmarking with clinical time series data
2017, cited by this paper
Beyond discrimination: A comparison of calibration methods and clinical usefulness of predictive models of readmission risk
2017, cited by this paper
MIMIC-III, a freely accessible critical care database
2016, cited by this paper
Recurrent Neural Networks for Multivariate Time Series with Missing Values
2016, cited by this paper
Predicting the Future - Big Data, Machine Learning, and Clinical Medicine.
2016, cited by this paper
A Machine Learning-based Framework to Identify Type 2 Diabetes through Electronic Health Records
2016, cited by this paper
Online Prediction of Health Care Utilization in the Next Six Months Based on Electronic Health Record Information: A Cohort and Validation Study
2015, cited by this paper
Electronic health record phenotyping improves detection and screening of type 2 diabetes in the general United States population: A cross-sectional, unselected, retrospective study
2015, cited by this paper
Predicting changes in hypertension control using electronic health records from a chronic disease management program
2014, cited by this paper
Big data in health care: using analytics to identify and manage high-risk and high-cost patients.
2014, cited by this paper
Clinical Prediction Models
2013, cited by this paper
Doubly Optimized Calibrated Support Vector Machine (DOC-SVM): An Algorithm for Joint Optimization of Discrimination and Calibration
2012, cited by this paper
Scikit-learn: Machine Learning in Python
2011, cited by this paper
Calibrating predictive model estimates to support personalized medicine
2011, influential reference
Breast cancer risk estimation with artificial neural networks revisited
2010, cited by this paper
Breast Cancer: Risk Assessment and Prevention
2010, cited by this paper
Predicting Disease Risks from Highly Unbalanced Data using Random Forest
2010, cited by this paper
Use of Brier score to assess binary predictions.
2010, cited by this paper
Assessing the Performance of Prediction Models: A Framework for Traditional and Novel Measures
2010, cited by this paper
Calibrating Random Forests
2008, influential reference
Breast
2002, cited by this paper
Obtaining calibrated probability estimates from decision trees and naive Bayesian classifiers
2001, cited by this paper
Probabilistic Outputs for Support Vector Machines and Comparisons to Regularized Likelihood Methods
1999, cited by this paper
On the Optimality of the Simple Bayesian Classifier under Zero-One Loss
1997, cited by this paper
A comparison of goodness-of-fit tests for the logistic regression model.
1997, cited by this paper
Probabilistic prediction in patient management and clinical trials.
1986, cited by this paper
External correspondence: Decompositions of the mean probability score
1982, cited by this paper

CITED BY

Random Forest vs Elastic-Net Penalized Logistic Regression for Patient Discharge Classification in BPJS Primary Care
2026, cites this paper
D4Care: A Deep Dynamic Memory-Driven Cross-Modal Feature Representation Network for Clinical Outcome Prediction
2025, cites this paper
From Uncertainty to Precision: Enhancing Binary Classifier Performance through Calibration
2024, cites this paper
Developing clinical prediction models: a step-by-step guide
2024, cites this paper
Multimodal Fusion Artificial Intelligence Model to Predict Risk for MACE and Myocarditis in Cancer Patients Receiving Immune Checkpoint Inhibitor Therapy
2024, cites this paper
Predictive Analysis of Internship and Job Placement Success in Computer Science Education
2024, cites this paper
M3GNAS: Multi-modal Multi-view Graph Neural Architecture Search for Medical Outcome Predictions
2024, cites this paper
Enhancing Clinical Outcome Predictions through Auxiliary Loss and Sentence-Level Self-Attention
2023, cites this paper
Density-Aware Personalized Training for Risk Prediction in Imbalanced Medical Data
2022, influential citation
PM2F2N: Patient Multi-view Multi-modal Feature Fusion Networks for Clinical Outcome Prediction
2022, cites this paper
Machine learning for the life-time risk prediction of Alzheimer’s disease: a systematic review
2021, cites this paper
Prediction of central venous catheter-associated deep venous thrombosis in pediatric critical care settings
2021, cites this paper
Machine Learning for the life-time risk prediction of Alzheimer’s disease: A Systematic Review
2021, cites this paper
An Overview of Sensors, Design and Healthcare Challenges in Smart Homes: Future Design Questions
2021, cites this paper
Post-Calibration Techniques: Balancing Calibration and Score Distribution Alignment
year unknown, cites this paper