Learning to Compose Domain-Specific Transformations for Data Augmentation

Alexander J. Ratner, Henry R. Ehrenberg, Zeshan Hussain, Jared A. Dunnmon, C. Ré

Published 2017 in Neural Information Processing Systems

ABSTRACT

Data augmentation is a ubiquitous technique for increasing the size of labeled training sets by leveraging task-specific data transformations that preserve class labels. While it is often easy for domain experts to specify individual transformations, constructing and tuning the more sophisticated compositions typically needed to achieve state-of-the-art results is a time-consuming manual task in practice. We propose a method for automating this process by learning a generative sequence model over user-specified transformation functions using a generative adversarial approach. Our method can make use of arbitrary, non-deterministic transformation functions, is robust to misspecified user input, and is trained on unlabeled data. The learned transformation model can then be used to perform data augmentation for any end discriminative model. In our experiments, we show the efficacy of our approach on both image and text datasets, achieving improvements of 4.0 accuracy points on CIFAR-10, 1.4 F1 points on the ACE relation extraction task, and 3.4 accuracy points when using domain-specific transformation operations on a medical imaging dataset as compared to standard heuristic augmentation approaches.
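The idea of composing user-specified transformation functions (TFs) into sequences can be illustrated with a minimal sketch. This is not the authors' implementation: the paper learns the sequence model adversarially, whereas here the distribution over TFs is a fixed, hand-set list of weights. Every name below (`shift`, `flip`, `rescale`, `sample_sequence`, `apply_sequence`, `augment`) is hypothetical and chosen only for illustration.

```python
import random
from typing import Callable, List, Sequence

Point = List[float]
# A transformation function (TF) takes a data point and an RNG, so it may be
# non-deterministic, and returns a transformed copy.
TF = Callable[[Point, random.Random], Point]


def shift(x: Point, rng: random.Random) -> Point:
    """Rotate the values one position to the right."""
    return x[-1:] + x[:-1]


def flip(x: Point, rng: random.Random) -> Point:
    """Reverse the order of the values."""
    return x[::-1]


def rescale(x: Point, rng: random.Random) -> Point:
    """Multiply every value by one random factor in [0.9, 1.1]."""
    factor = rng.uniform(0.9, 1.1)
    return [v * factor for v in x]


# Hypothetical TF set; a real task would use domain-specific operations.
TFS: List[TF] = [shift, flip, rescale]


def sample_sequence(weights: Sequence[float], length: int,
                    rng: random.Random) -> List[int]:
    """Draw `length` TF indices from a fixed categorical distribution.

    Assumption: this stands in for the learned sequence model, which the
    paper trains adversarially; here the weights are simply given.
    """
    return rng.choices(range(len(weights)), weights=weights, k=length)


def apply_sequence(x: Point, tfs: Sequence[TF], indices: Sequence[int],
                   rng: random.Random) -> Point:
    """Apply the TFs named by `indices` to `x`, in order."""
    for i in indices:
        x = tfs[i](x, rng)
    return x


def augment(dataset: Sequence[Point], tfs: Sequence[TF],
            weights: Sequence[float], length: int, copies: int,
            seed: int = 0) -> List[Point]:
    """Return each original point followed by `copies` transformed variants."""
    rng = random.Random(seed)
    out: List[Point] = []
    for x in dataset:
        out.append(list(x))
        for _ in range(copies):
            indices = sample_sequence(weights, length, rng)
            out.append(apply_sequence(list(x), tfs, indices, rng))
    return out
```

Calling `augment([[1.0, 2.0, 3.0]], TFS, [1, 1, 1], length=2, copies=3)` returns the original point plus three variants, each produced by a two-step TF sequence. Replacing the fixed weights with a trained model is where the method described in the abstract would differ from this sketch.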

PUBLICATION RECORD

Publication year: 2017
Venue: Neural Information Processing Systems
Publication date: 2017-09-06
Fields of study: Medicine, Computer Science, Mathematics
Identifiers: arXiv 1709.01643; PMID 29375240; PMCID PMC5786274
Source metadata: Semantic Scholar, PubMed

REFERENCES

Generative Adversarial Nets (2018; influential reference)
Adversarial Transformation Networks: Learning to Generate Adversarial Examples (2017; cited by this paper)
Improving music source separation based on deep neural networks through data augmentation and network blending (2017; cited by this paper)
Dataset Augmentation in Feature Space (2017; cited by this paper)
Enriching Word Vectors with Subword Information (2016; cited by this paper)
Improved Techniques for Training GANs (2016; cited by this paper)
RenderGAN: Generating Realistic Labeled Data (2016; cited by this paper)
Adaptive data augmentation for image classification (2016; cited by this paper)
Regularization With Stochastic Transformations and Perturbations for Deep Semi-Supervised Learning (2016; cited by this paper)
Densely Connected Convolutional Networks (2016; cited by this paper)
Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks (2015; cited by this paper)
Distributional Smoothing with Virtual Adversarial Training (2015; cited by this paper)
Gradient Estimation Using Stochastic Computation Graphs (2015; cited by this paper)
Unsupervised and Semi-supervised Learning with Categorical Generative Adversarial Networks (2015; cited by this paper)
Dreaming More Data: Class-dependent Distributions over Diffeomorphisms for Learned Data Augmentation (2015; cited by this paper)
Inverting Visual Representations with Convolutional Networks (2015; cited by this paper)
Deep Residual Learning for Image Recognition (2015; influential reference)
Fractional Max-Pooling (2014; influential reference)
Explaining and Harnessing Adversarial Examples (2014; cited by this paper)
Conditional Generative Adversarial Nets (2014; influential reference)
The Cancer Imaging Archive (TCIA): Maintaining and Operating a Public Information Repository (2013; cited by this paper)
Recurrent policy gradients (2010; cited by this paper)
Deep, Big, Simple Neural Nets for Handwritten Digit Recognition (2010; cited by this paper)
Learning Multiple Layers of Features from Tiny Images (2009; cited by this paper)
Convex Learning with Invariances (2007; cited by this paper)
The Digital Database for Screening Mammography (2007; cited by this paper)
Research Paper: Enhancing Text Categorization with Semantic-enriched Representation and Training Data Augmentation (2006; cited by this paper)
RCV1: A New Benchmark Collection for Text Categorization Research (2004; cited by this paper)
The Automatic Content Extraction (ACE) Program – Tasks, Data, and Evaluation (2004; cited by this paper)
Poisson image editing (2003; cited by this paper)
SMOTE: Synthetic Minority Over-sampling Technique (2002; cited by this paper)
Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning (2001; cited by this paper)
Gradient-based learning applied to document recognition (1998; influential reference)
Discriminative Unsupervised Feature Learning with Exemplar Convolutional Neural Networks (IEEE Transactions on Pattern Analysis and Machine Intelligence; year unknown; influential reference)

CITED BY

Rich and cross-fused feature embedding for few-shot point cloud semantic segmentation (2026; cites this paper)
Spatio-temporal Decoupled Knowledge Compensator for Few-Shot Action Recognition (2026; cites this paper)
Augment One With Others: Generalizing to Unforeseen Variations for Visual Tracking (2025; cites this paper)
Few-Shot 3D Point Cloud Segmentation via Relation Consistency-Guided Heterogeneous Prototypes (2025; cites this paper)
AugEvo: evolving augmentations to close the sim-to-real gap for AI (2025; cites this paper)
GSLTA-CDFSAR: Global Sequences and Local Tuples Alignment for Cross-Domain Few-Shot Action Recognition (2025; cites this paper)
OpenMAE: Efficient Masked Autoencoder for Vibration Sensing with Open-domain Data Enrichment (2025; cites this paper)
Cross-Domain Semantic Transfer for Domain Generalization (2025; cites this paper)
UFOS-Net leverages small-scale feature fusion for diabetic foot ulcer segmentation (2025; cites this paper)
Enhancing out-of-distribution learning in computer vision through dominant feature masking (2025; cites this paper)
Augmenting atmospheric turbulence effects on thermal-adapted deep object detection models (2025; cites this paper)
Sample-Aware RandAugment: Search-Free Automatic Data Augmentation for Effective Image Recognition (2025; cites this paper)
Reinforcement Learning Platform for Adversarial Black-box Attacks with Custom Distortion Filters (2025; cites this paper)
Boosting few-shot action recognition via time-enhanced multimodal adaptation learning (2025; cites this paper)
Steganographic Embeddings as an Effective Data Augmentation (2025; cites this paper)
Towards automated self-supervised learning for truly unsupervised graph anomaly detection (2025; cites this paper)
Brain-inspired semantic data augmentation for multi-style images (2024; cites this paper)
Unsupervised learning based object detection using Contrastive Learning (2024; cites this paper)
A systematic review of deep learning data augmentation in medical imaging: Recent advances and future research directions (2024; cites this paper)
Multi-view Distillation based on Multi-modal Fusion for Few-shot Action Recognition (CLIP-M2DF) (2024; cites this paper)
How to Augment for Atmospheric Turbulence Effects on Thermal Adapted Object Detection Models? (2024; cites this paper)
Few-Shot Action Recognition via Multi-View Representation Learning (2024; cites this paper)
Clean-image Backdoor Attacks (2024; cites this paper)
Augmented drug combination dataset to improve the performance of machine learning models predicting synergistic anticancer effects (2024; cites this paper)
Towards Safer Roads: A Deep Learning Based Object Detection Technique for Vehicle Safety (2024; cites this paper)
Machine Learning for Automated Sand Transport Monitoring in a Pipeline Using Distributed Acoustic Sensor Data (2024; cites this paper)
SELF-EXPERTISE: Knowledge-based Instruction Dataset Augmentation for a Legal Expert Language Model (2024; cites this paper)
A Survey of Synthetic Data Augmentation Methods in Machine Vision (2024; cites this paper)
Consistency Prototype Module and Motion Compensation for few-shot action recognition (CLIP-CPM2C) (2024; cites this paper)
Game Theory Meets Data Augmentation (2024; cites this paper)
Understanding the Role of Invariance in Transfer Learning (2024; cites this paper)
A Metric-Based Few-Shot Learning Method for Fish Species Identification with Limited Samples (2024; cites this paper)
Policy-driven Auto-Augmentation with Distillment Rewards for Scene Text Recognition (2024; cites this paper)
Few-shot SAR target classification via meta-learning with hybrid models (2024; cites this paper)
Multi-view distillation based on multi-modal fusion for few-shot action recognition (CLIP-MDMF) (2024; cites this paper)
Deep Adversarial Network Based Dental Inlay Restoration Using Point Cloud Segmentation (2024; cites this paper)
Meta generative image and text data augmentation optimization (2024; cites this paper)
DMSD-CDFSAR: Distillation from Mixed-Source Domain for Cross-Domain Few-shot Action Recognition (2024; cites this paper)
Data transformation review in deep learning (2024; cites this paper)
A Systematic Framework for Data Augmentation for Tropical Cyclone Intensity Estimation Using Deep Learning (2024; cites this paper)
CoViews: Adaptive Augmentation Using Cooperative Views for Enhanced Contrastive Learning (2024; cites this paper)
Learning Tree-Structured Composition of Data Augmentation (2024; influential citation)
Development of a cerebellar ataxia diagnosis model using conditional GAN-based synthetic data generation for visuomotor adaptation task (2024; cites this paper)
Explanatory Debiasing: Involving Domain Experts in the Data Generation Process to Mitigate Representation Bias in AI Systems (2024; cites this paper)
Meta Generative Data Augmentation Optimization (2023; cites this paper)
Task-Specific Alignment and Multiple Level Transformer for Few-Shot Action Recognition (2023; cites this paper)
SudokuSens: Enhancing Deep Learning Robustness for IoT Sensing Applications using a Generative Approach (2023; cites this paper)
AREA: Adaptive Reweighting via Effective Area for Long-Tailed Classification (2023; cites this paper)
Comparison of Transfer Style Using a CycleGAN Model with Data Augmentation (2023; cites this paper)
Differentiable Image Data Augmentation and Its Applications: A Survey (2023; cites this paper)
Collinear datasets augmentation using Procrustes validation sets (2023; cites this paper)
Understanding the Detrimental Class-level Effects of Data Augmentation (2023; cites this paper)
A Bi-Prototype BDC Metric Network With Lightweight Adaptive Task Attention for Few-Shot Fine-Grained Ship Classification in Remote Sensing Images (2023; cites this paper)
Consistency Prototype Module and Motion Compensation for Few-Shot Action Recognition (CLIP-CPM2C) (2023; cites this paper)
SMACK: Semantically Meaningful Adversarial Audio Attack (2023; cites this paper)
RaViTT: Random Vision Transformer Tokens (2023; cites this paper)
Data augmentation for recommender system: A semi-supervised approach using maximum margin matrix factorization (2023; cites this paper)
Something for (almost) nothing: Improving deep ensemble calibration using unlabeled data (2023; cites this paper)
Distribution-balanced augmentation for rough data driven object detection (2023; cites this paper)
Neural Transformation Network to Generate Diverse Views for Contrastive Learning (2023; cites this paper)
Equivariant Data Augmentation for Generalization in Offline Reinforcement Learning (2023; cites this paper)
Automatic Data Augmentation Learning using Bilevel Optimization for Histopathological Images (2023; cites this paper)
Semantic-aware Video Representation for Few-shot Action Recognition (2023; cites this paper)
Few-shot and meta-learning methods for image understanding: a survey (2023; cites this paper)
Automatic Aug-Aware Contrastive Proposal Encoding for Few-Shot Object Detection of Remote Sensing Images (2023; cites this paper)
A Survey of Automated Data Augmentation for Image Classification: Learning to Compose, Mix, and Generate (2023; influential citation)
LatentAugment: Data Augmentation via Guided Manipulation of GAN’s Latent Space (2023; cites this paper)
Data augmentation and refinement for recommender system: A semi-supervised approach using maximum margin matrix factorization (2023; cites this paper)
Augmented drug combination dataset to improve the performance of machine learning models predicting synergistic anticancer effects (2023; cites this paper)
Improving severity classification of Hebrew PET-CT pathology reports using test-time augmentation (2023; cites this paper)
Regularization for Unsupervised Learning of Optical Flow (2023; cites this paper)
Artificial intelligence applications in pediatric oncology diagnosis (2023; cites this paper)
Bi-Level Implicit Semantic Data Augmentation for Vehicle Re-Identification (2023; cites this paper)
GDA: Generative Data Augmentation Techniques for Relation Extraction Tasks (2023; cites this paper)
Learning Better with Less: Effective Augmentation for Sample-Efficient Visual Reinforcement Learning (2023; influential citation)
HyRSM++: Hybrid Relation Guided Temporal Set Matching for Few-shot Action Recognition (2023; cites this paper)
On Domain-Specific Pre-Training for Effective Semantic Perception in Agricultural Robotics (2023; cites this paper)
Evaluating semi-supervision methods for medical image segmentation: applications in cardiac magnetic resonance imaging (2023; cites this paper)
Progressive Target-Styled Feature Augmentation for Unsupervised Domain Adaptation on Point Clouds (2023; cites this paper)
Adapting Across Domains via Target-Oriented Transferable Semantic Augmentation Under Prototype Constraint (2023; cites this paper)
Evaluating Machine Learning Models with NERO: Non-Equivariance Revealed on Orbits (2023; cites this paper)
Equivariant Disentangled Transformation for Domain Generalization under Combination Shift (2022; cites this paper)
ColdGAN: an effective cold-start recommendation system for new users based on generative adversarial networks (2022; cites this paper)
Research Trends and Applications of Data Augmentation Algorithms (2022; cites this paper)
AADG: Automatic Augmentation for Domain Generalization on Retinal Image Segmentation (2022; cites this paper)
Data Augmentations for Cell-Type Generalizable Instance Segmentation (2022; cites this paper)
Data Augmentation vs. Equivariant Networks: A Theory of Generalization on Dynamics Forecasting (2022; cites this paper)
A survey of automated data augmentation algorithms for deep learning-based image classification tasks (2022; cites this paper)
Improving Diversity with Adversarially Learned Transformations for Domain Generalization (2022; cites this paper)
A Comprehensive Survey of Image Augmentation Techniques for Deep Learning (2022; cites this paper)
ReSmooth: Detecting and Utilizing OOD Samples When Training With Data Augmentation (2022; cites this paper)
Mitigating Data Heterogeneity in Federated Learning with Data Augmentation (2022; cites this paper)
Semi-Automatic Prostate Segmentation From Ultrasound Images Using Machine Learning and Principal Curve Based on Interpretable Mathematical Model Expression (2022; cites this paper)
AdaAug: Learning Class- and Instance-adaptive Data Augmentation Policies (2022; cites this paper)
Binary segmentation based on visual attention consistency under background-change (2022; cites this paper)
A Deep Learning Method for Pavement Crack Identification Based on Limited Field Images (2022; cites this paper)
H-ProSeg: Hybrid ultrasound prostate segmentation based on explainability-guided mathematical model (2022; cites this paper)
Recent advances on loss functions in deep learning for computer vision (2022; cites this paper)
Automated Data Augmentations for Graph Classification (2022; influential citation)
Scale-Aware Automatic Augmentations for Object Detection With Dynamic Training (2022; cites this paper)