Interpretable Low-Dimensional Regression via Data-Adaptive Smoothing
Wesley Tansey, Jesse Thomason, James G. Scott
Published 2017 in arXiv: Machine Learning

ABSTRACT
We consider the problem of estimating a regression function in the common situation where the number of features is small, interpretability of the model is a high priority, and simple linear or additive models fail to provide adequate performance. To address this problem, we present Maximum Variance Total Variation denoising (MVTV), an approach conceptually related both to CART and to the more recent CRISP algorithm, a state-of-the-art alternative method for interpretable nonlinear regression. MVTV divides the feature space into blocks of constant value and fits the values of all blocks jointly via a convex optimization routine. Our method is fully data-adaptive, in that it incorporates highly robust routines for tuning all hyperparameters automatically. We compare our approach against CART and CRISP via both a complexity-accuracy tradeoff metric and a human study, demonstrating that MVTV is a more powerful and interpretable method.
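The core operation the abstract describes — fitting piecewise-constant block values jointly via a convex optimization routine — is an instance of total variation (TV) denoising. As a minimal illustrative sketch only (not the paper's 2-D grid algorithm, which also tunes its hyperparameters data-adaptively), here is 1-D TV denoising solved by projected gradient ascent on the dual; the function name `tv_denoise_1d`, the step size, and the iteration count are our own assumptions:

```python
import numpy as np

def tv_denoise_1d(y, lam, n_iter=500):
    """1-D total variation denoising:
        minimize 0.5 * ||x - y||^2 + lam * sum_i |x[i+1] - x[i]|
    solved by projected gradient ascent on the dual problem.
    """
    y = np.asarray(y, dtype=float)
    z = np.zeros(len(y) - 1)   # dual variable, one per adjacent pair
    step = 0.25                # safe step: 1 / ||D D^T|| for the 1-D difference operator

    def Dt(z):                 # D^T z, where (D x)[i] = x[i+1] - x[i]
        return np.concatenate(([0.0], z)) - np.concatenate((z, [0.0]))

    for _ in range(n_iter):
        x = y - Dt(z)                                    # primal iterate from current dual
        z = np.clip(z + step * np.diff(x), -lam, lam)    # gradient step + projection onto |z| <= lam
    return y - Dt(z)
```

With `lam = 0` the input is returned unchanged; as `lam` grows the fit flattens toward the global mean, with piecewise-constant solutions in between — the 1-D analogue of MVTV's constant-valued blocks.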
PUBLICATION RECORD
- Publication year: 2017
- Venue: arXiv: Machine Learning
- Publication date: 2017-06-16
- Fields of study: Mathematics, Computer Science
- Source metadata: Semantic Scholar