SuperTML: Two-Dimensional Word Embedding and Transfer Learning Using ImageNet Pretrained CNN Models for the Classifications on Tabular Data

Baohua Sun,Lin Yang,Wenhan Zhang,Michael Lin,Patrick Dong,Charles Young,Jason Dong

Published 2019 in Unknown venue

ABSTRACT

Tabular data is the most commonly used form of data in industry. Gradient Boosting Trees, Support Vector Machine, Random Forest, and Logistic Regression are typically used for classification tasks on tabular data. DNN models using categorical embeddings are also applied in this task, but all attempts thus far have used one-dimensional embeddings. The recent work of Super Characters method using two-dimensional word embeddings achieved the state of art result in text classification tasks, showcasing the promise of this new approach. In this paper, we propose the SuperTML method, which borrows the idea of Super Characters method and two-dimensional embeddings to address the problem of classification on tabular data. For each input of tabular data, the features are first projected into two-dimensional embeddings like an image, and then this image is fed into fine-tuned two-dimensional CNN models for classification. Experimental results have shown that the proposed SuperTML method had achieved state-of-the-art results on both large and small datasets.

PUBLICATION RECORD

Publication year
2019
Venue
Unknown venue
Publication date
2019-02-26
Fields of study
Computer Science
Identifiers
arXiv 1903.06246
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
2019cited by this paper
Squared English Word: A Method of Generating Glyph to Use Super Characters for Sentiment Analysis
2019cited by this paper
Super Characters: A Conversion from Sentiment Classification to Image Classification
2018cited by this paper
Neural Feature Learning From Relational Database
2018cited by this paper
Improving Language Understanding by Generative Pre-Training
2018cited by this paper
Ultra Power-Efficient CNN Domain Specific Accelerator with 9.3TOPS/Watt for Mobile and Embedded Applications
2018cited by this paper
Accelerated gradient boosting
2018cited by this paper
Deep Contextualized Word Representations
2018cited by this paper
CatBoost: unbiased boosting with categorical features
2017cited by this paper
Progressive Neural Architecture Search
2017influential reference
LightGBM: A Highly Efficient Gradient Boosting Decision Tree
2017cited by this paper
Squeeze-and-Excitation Networks
2017influential reference
Learning Transferable Architectures for Scalable Image Recognition
2017influential reference
Attention is All you Need
2017cited by this paper
Entity Embedding-Based Anomaly Detection for Heterogeneous Categorical Events
2016cited by this paper
XGBoost: A Scalable Tree Boosting System
2016influential reference
Entity Embeddings of Categorical Variables
2016cited by this paper
PolyNet: A Pursuit of Structural Diversity in Very Deep Networks
2016cited by this paper
Bag of Tricks for Efficient Text Classification
2016cited by this paper
Deep Residual Learning for Image Recognition
2015cited by this paper
Learning to discover: the Higgs boson machine learning challenge
2014cited by this paper
How transferable are features in deep neural networks?
2014cited by this paper
GloVe: Global Vectors for Word Representation
2014cited by this paper
CNN Features Off-the-Shelf: An Astounding Baseline for Recognition
2014cited by this paper
Higgs Boson Discovery with Boosted Trees
2014cited by this paper
The Higgs boson machine learning challenge
2014influential reference
Neural Machine Translation by Jointly Learning to Align and Translate
2014cited by this paper
Distributed Representations of Words and Phrases and their Compositionality
2013cited by this paper
DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition
2013cited by this paper
The MNIST Database of Handwritten Digit Images for Machine Learning Research [Best of the Web]
2012cited by this paper
A Survey on Transfer Learning
2010cited by this paper

CITED BY

The Impactful Analysis of DL Algorithms Isolated Observing Support Models Design: A Systematic Review
2024cites this paper
From Online Behaviours to Images: A Novel Approach to Social Bot Detection
2023cites this paper
Prediction of Cardio Vascular Disease by Deep Learning and Machine Learning-A Combined Data Science Approach
2022cites this paper
A Deep-Learned Embedding Technique for Categorical Features Encoding
2021cites this paper
On Anomaly Detection in Tabular Data
2021cites this paper
On the impact of selected modern deep-learning techniques to the performance and celerity of classification models in an experimental high-energy physics use case
2020cites this paper
SuperCaptioning: Image Captioning Using Two-dimensional Word Embedding
2019cites this paper
System Demo for Transfer Learning from Vision to Language using Domain Specific CNN Accelerator for On-Device NLP Applications
2019cites this paper
SuperChat: dialogue generation by transfer learning from vision to language using two-dimensional word embedding
2019cites this paper