Decision tree methods: applications for classification and prediction

Published 2015 in Shanghai Archives of Psychiatry

ABSTRACT

Summary Decision tree methodology is a commonly used data mining method for establishing classification systems based on multiple covariates or for developing prediction algorithms for a target variable. This method classifies a population into branch-like segments that construct an inverted tree with a root node, internal nodes, and leaf nodes. The algorithm is non-parametric and can efficiently deal with large, complicated datasets without imposing a complicated parametric structure. When the sample size is large enough, study data can be divided into training and validation datasets. Using the training dataset to build a decision tree model and a validation dataset to decide on the appropriate tree size needed to achieve the optimal final model. This paper introduces frequently used algorithms used to develop decision trees (including CART, C4.5, CHAID, and QUEST) and describes the SPSS and SAS programs that can be used to visualize tree structure.

PUBLICATION RECORD

Publication year
2015
Venue
Shanghai Archives of Psychiatry
Publication date
2015-04-25
Fields of study
Mathematics, Computer Science, Medicine
Identifiers
DOI 10.11919/j.issn.1002-0829.215044 PMID 26120265 PMCID 4466856
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar, PubMed

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Fifty Years of Classification and Regression Trees
2014cited by this paper
Comments on Fifty Years of Classification and Regression Trees.
2014cited by this paper
Shanghai Archives of Psychiatry
2014cited by this paper
Opportunities for prevention and intervention with young children: lessons from the Canadian incidence study of reported child abuse and neglect
2013cited by this paper
Classification and regression trees
2012cited by this paper
Tree-structured subgroup analysis of receiver operating characteristic curves for diagnostic tests.
2012cited by this paper
Study of Various Decision Tree Pruning Methods with their Empirical Comparison in WEKA
2012cited by this paper
Decision Tree Induction: An Approach for Data Classification Using AVL-Tree
2010cited by this paper
Modifiable risk factors predicting major depressive disorder at four year follow-up: a decision tree approach
2009cited by this paper
Discussions
2009cited by this paper
The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Second Edition by Trevor Hastie, Robert Tibshirani, Jerome Friedman
2009cited by this paper
A Procedure for Determining Whether a Simple Combination of Diagnostic Tests May Be Noninferior to the Theoretical Optimum Combination
2008cited by this paper
CHI-Squared Test of Independence
2007cited by this paper
Tree-Based Methods and Their Applications
2006cited by this paper
The elements of statistical learning: data mining, inference and prediction
2005cited by this paper
Classification Algorithms for Hip Fracture Prediction Based on Recursive Partitioning Methods
2004cited by this paper
Alternative Tree-Structured Survival Analysis Based on Variance of Survival Time
2004cited by this paper
Residual‐based tree‐structured survival analysis
2002cited by this paper
The Elements of Statistical Learning: Data Mining, Inference, and Prediction
2001cited by this paper
Classification and Regression Trees
2000cited by this paper
Mastering Data Mining: The Art and Science of Customer Relationship Management
1999influential reference
Mastering Data Mining: The Art and Science of Customer Relationship Management
1999influential reference
SPLIT SELECTION METHODS FOR CLASSIFICATION TREES
1997cited by this paper
Features of Tree‐Structured Survival Analysis
1997cited by this paper
C4.5: Programs for Machine Learning (書評)
1995cited by this paper
Splitting Criteria in Survival Trees
1995cited by this paper
Programs for Machine Learning
1994cited by this paper
Relative risk trees for censored survival data.
1992cited by this paper
Martingale-based residuals for survival models
1990cited by this paper
A comparison of estimated proportional hazards models and regression trees.
1989cited by this paper
Regression Trees for Censored Data
1988cited by this paper
An Exploratory Technique for Investigating Large Quantities of Categorical Data
1980cited by this paper

CITED BY

On the Reruns of GitHub Actions Workflows
2026cites this paper
Non-Invasive Detection of Lead Connection Pipe with Machine Learning Model.
2026cites this paper
Predicting the settling velocity of solid particles using machine learning
2026cites this paper
Real-time and explainable non-destructive nut classification using spike-triggered acoustic sensing
2026cites this paper
Security Solutions for the Internet of Things Using Machine Learning and Deep Learning: Current Trends and Future Directions
2026cites this paper
Early detection of air leakage in IoT-connected compressors using enhanced data sampling with deep learning
2026cites this paper
Traditional sex estimation with parameters obtained from the distal end of the humerus: an example of using machine learning algorithms in morphometric studies
2026cites this paper
MALTree: Maliciously Secure Decision Tree Inference Using MPC With a Helper
2026cites this paper
Predicting Surgical Outcomes in Breast Reconstruction With Machine Learning: A Systematic Review.
2026cites this paper
Computational design and screening of novel dipeptide analogues as potent antileprotic agents against alanine racemase and ligase
2026cites this paper
Bridging Expert Reasoning and LLM Detection: A Knowledge-Driven Framework for Malicious Packages
2026cites this paper
A Comparative Study of Traditional Machine Learning, Deep Learning, and Large Language Models for Mental Health Forecasting using Smartphone Sensing Data
2026cites this paper
Contrastive Learning-Based Deep Embedded Clustering and the TCN-DMAttention Model for Traffic Congestion Prediction
2026cites this paper
Learning-based robotic machining error prediction for high precision manufacturing
2026cites this paper
Prediction of local scour beneath vibrating submarine pipelines subjected to waves and currents based on CFD and machine learning approach
2026cites this paper
Innovative statistical method for longitudinal and hierarchical data modeling: the GMEXGBoost method
2026cites this paper
RADSCL: Representation augmentation integrated with dual supervised contrastive learning for low-resource text classification
2026cites this paper
Responsible AI Question Bank for Risk Assessment
2026cites this paper
Hazard-based seismic fragility functions for steel moment-resisting frame buildings through data-driven damage state identification
2026cites this paper
Early Remaining Useful Life Prediction of Lithium-Ion Batteries Based on a Hybrid Machine Learning Method with Time Series Augmentation.
2026cites this paper
Hybrid deep feature integration model for robust deepfake detection using transfer-learned neural networks
2026cites this paper
A Feature Fusion Framework for Improved Autism Spectrum Disorder Prediction Using sMRI and Phenotype Information
2026cites this paper
Who benefits most? Intervention-induced changes in the social networks of people living with dementia
2026cites this paper
Assessment of Human–Bear Conflict Through Time and Space: A Case Study from Ilgaz District, Türkiye
2026cites this paper
Predicting and optimising aluminium alloy compositions for target mechanical properties using machine learning
2026cites this paper
Non-destructive Classification of Harvested Terung Asam (Solanum lasiocarpum Dunal.) Utilising Thermal Imaging and Machine Learning Across Different Storage Days
2026cites this paper
Mild traumatic brain injury detection: uncovering neural interaction patterns through dynamic Hilbert warping features with EEG data
2026cites this paper
Machine learning methods for cold-formed stainless steel roof panel temperature prediction
2026cites this paper
Reinforced Dual-Flow Neural Network for Tabular Data Classification With Dynamical Transformer and Fuzzy Clustering
2026cites this paper
An artificial immune classification method with deep feature enhancement and dynamic memory cells optimization
2026cites this paper
GFFusion: Towards automated assessment of movement disorders from gait videos
2026cites this paper
An Investigation on Residual Stress Distribution in Cold-Formed Steel Channel Sections
2026cites this paper
Quantitative Morphological Profiling and Isolate-Specific Insensitivity of Cacao Pathogens to Novel Bio-Based Phenolic Amides
2026influential citation
Prediction of monthly precipitation and maximum 24 h precipitation using Random Forest, Decision Tree and XGBoost models
2026cites this paper
Data-driven discovery of copper alloys: resolving the strength-conductivity paradox
2026cites this paper
DrowsyDG-Phys: Generalizable driver drowsiness estimation in conditional automated vehicles using physiological signals.
2026cites this paper
Machine learning-based prediction of diabetic retinopathy from pupillary abnormalities in a South Indian population
2026cites this paper
Decoding environmental regimes and spring phytoplankton bloom occurrence in the central Yellow Sea
2026cites this paper
Socioeconomic and Engagement Barriers to Cardiovascular Risk Awareness Among Cancer Survivors
2026cites this paper
Gift recommendation with multilabel clustering
2026cites this paper
An Artificial Intelligence-Based Data-Driven Method for Predicting Soil Shear Strength
2026cites this paper
Early clinical indicators for predicting discharge destination from the acute stroke ward: A retrospective observational study
2026cites this paper
Quality Control of Herbal Medicine Based on Analytical Techniques and Machine Learning: Current Advances and Future Perspectives.
2026cites this paper
Causal Structure-Enhanced Branch Neural Networks for Interpretable and Robust Regression
2026cites this paper
The power of machine learning models in predicting gestational diabetes mellitus.
2026cites this paper
Comparison of Zn recovery prediction from carbonate ores with machine-learning methods
2026cites this paper
DeepFlaky: Deep hybrid representation learning for flaky test prediction
2026cites this paper
A Machine Learning-based Approach for Classifying Waveform Distortion Due to Misalignment in SHPB Experiments
2026cites this paper
Time2Graph: Dual Embedding and Nested-Graph Transformation for Performance Enhancement in Time Series Classification
2026cites this paper
Analysis of the CTS-6 Questionnaire and Development of a Carpal Tunnel Syndrome Decision Tree
2026cites this paper
From Damage to Dispute: A Decision Tree Analysis of Post-Disaster Government–Victim Conflict
2026cites this paper
A Comparative Study of Forensic File Type Identification Methods for Tool Type Identification
2026cites this paper
VESTA: A Secure and Efficient FHE-based Three-Party Vectorized Evaluation System for Tree Aggregation Models
2025cites this paper
Predicting Postgraduate Student Engagement Using Artificial Intelligence (AI)
2025cites this paper
An Improved Bagging of Machine Learning Algorithms to Predict Motif Structures From Protein-Protein Interaction Networks
2025cites this paper
Experimental assessment and hybrid machine learning-based feature importance analysis with the optimization of compressive strength of waste glass powder-modified concrete
2025cites this paper
Predictive model for daily risk alerts in sepsis patients in the ICU: visualization and clinical analysis of risk indicators
2025cites this paper
Radiomics of lung ventilation/perfusion tomographic imaging in pulmonary embolism diagnosis
2025cites this paper
Soft-Computing for 3D Failure Envelope of Rectangular Skirt Foundations in Heterogeneous Clays
2025cites this paper
Predicting Italian students’ mathematics outcomes: a decision tree regression analysis
2025cites this paper
Development of Intelligent Marine Logistics Models Using Machine Learning
2025cites this paper
Applying machine learning to predict bowel preparation adequacy in elderly patients for colonoscopy: development and validation of a web-based prediction tool
2025cites this paper
Development and Feasibility Study of HOPE Model for Prediction of Depression Among Older Adults Using Wi-Fi-based Motion Sensor Data: Machine Learning Study
2025cites this paper
An IoT-Enabled Wearable Device for Fetal Movement Detection Using Accelerometer and Gyroscope Sensors
2025cites this paper
Tabby: A Language Model Architecture for Tabular and Structured Data Synthesis
2025cites this paper
Environmental Effects on NDIR-Based CH4 Monitoring: Characterization and Correction.
2025cites this paper
Compositional modeling of solution gas–oil ratio (Rs): a comparative study of tree-based models, neural networks, and equations of state
2025cites this paper
Deep Learning in Heart Murmur Detection: Analyzing the Potential of FCNN vs. Traditional Machine Learning Models
2025cites this paper
Black-box and white-box machine learning tools to estimate the frost formation condition during cryogenic CO2 capture from natural gas blends
2025cites this paper
Interpretable wood chip moisture content prediction through texture analysis
2025cites this paper
Prediction of minimum miscibility pressure of CO2-oil systems using grey-box modeling for carbon dioxide capture, utilization, storage, and enhanced oil recovery
2025cites this paper
Machine Learning-Based Prediction of Algerian University Student Participation in Sports Activities
2025cites this paper
Unsupervised Autoencoders Combined with Multi-Model Machine Learning Fusion for Improving the Applicability of Aircraft Sensor and Engine Performance Prediction
2025cites this paper
Evaluating Discharge Coefficient of Rectangular Sharp Crested Weirs Using Machine Learning Models
2025cites this paper
Machine learning-based feature selection and classification for cerebral infarction screening: an experimental study
2025cites this paper
MS-NET-v2: modular selective network optimized by systematic generation of expert modules
2025cites this paper
AnxietyFaceTrack: A Smartphone-Based Non-Intrusive Approach for Detecting Social Anxiety Using Facial Features
2025cites this paper
Early Detection of Lung Cancer Using Predictive Modeling Incorporating CTGAN Features and Tree-Based Learning
2025cites this paper
Machine learning models for reinjury risk prediction using cardiopulmonary exercise testing (CPET) data: optimizing athlete recovery
2025cites this paper
Multiple Machine Learning Algorithms-based Credit Card Fraud Detection
2025cites this paper
Preliminary study: Data analytics for predicting medication adherence in Malaysian arthritis patients
2025cites this paper
Integrative analysis of seed morphology, geographic origin, and genetic structure in Medicago with implications for breeding and conservation
2025influential citation
Atomic Force Microscopy for Revealing Oncological Nanomechanobiology and Thermodynamics.
2025cites this paper
A critical analysis of compressive strength prediction of glass fiber and carbon fiber reinforced concrete over machine learning models
2025cites this paper
Machine Learning in Polymer Research
2025cites this paper
Numerical simulation and data-driven study on the axial compression bearing capacity of steel reactive powder concrete columns
2025cites this paper
Customer segmentation in the digital marketing using a Q-learning based differential evolution algorithm integrated with K-means clustering
2025cites this paper
Day-ahead statistical forecasting of algal bloom risk to support reservoir release decisions in a highly engineered watershed.
2025cites this paper
Nutritional intake of micronutrient and macronutrient and type 2 diabetes: machine learning schemes
2025cites this paper
Associations between consistency of current and preferred living arrangements and loneliness in older adults with multimorbidity: A nationwide cross-sectional study.
2025cites this paper
Deep learning and hyperspectral features for seedling stage identification of barnyard grass in paddy field
2025cites this paper
A parallel and distributed C4.5 algorithm in cloud computing environments
2025cites this paper
Predicting pregnancy loss and its determinants among reproductive-aged women using supervised machine learning algorithms in Sub-Saharan Africa
2025cites this paper
Comparison of Trivariate Copula-Based Conditional Quantile Regression Versus Machine Learning Methods for Estimating Copper Recovery
2025cites this paper
iDRKAN: Interpretable miRNA-Disease Association Prediction Based on Dual-Graph Representation Learning and Kolmogorov–Arnold Network
2025cites this paper
An Innovative Next Activity Prediction Approach Using Process Entropy and DAW-Transformer
2025cites this paper
Background subtraction in inelastic scattering measurements using machine learning
2025cites this paper
Artificial intelligence in stroke risk assessment and management via retinal imaging
2025cites this paper
Analyzing machine learning algorithms in predicting Ranikot swelling at different compaction pressures in presence of carbon supported TiO2 water based mud
2025cites this paper
163. Post-injury Local Controlled Cooling Significantly Improves Functional Recovery After Volumetric Muscle Loss in Rats
2025cites this paper