Semi-Supervised Learning for Neural Machine Translation

Yong Cheng,W. Xu,Zhongjun He,W. He,Hua Wu,Maosong Sun,Yang Liu

Published 2016 in Annual Meeting of the Association for Computational Linguistics

ABSTRACT

While end-to-end neural machine translation (NMT) has made remarkable progress recently, NMT systems only rely on parallel corpora for parameter estimation. Since parallel corpora are usually limited in quantity, quality, and coverage, especially for low-resource languages, it is appealing to exploit monolingual corpora to improve NMT. We propose a semi-supervised approach for training NMT models on the concatenation of labeled (parallel corpora) and unlabeled (monolingual corpora) data. The central idea is to reconstruct the monolingual corpora using an autoencoder, in which the source-to-target and target-to-source translation models serve as the encoder and decoder, respectively. Our approach can not only exploit the monolingual corpora of the target language, but also of the source language. Experiments on the Chinese-English dataset show that our approach achieves significant improvements over state-of-the-art SMT and NMT systems.

PUBLICATION RECORD

Publication year
2016
Venue
Annual Meeting of the Association for Computational Linguistics
Publication date
2016-06-15
Fields of study
Linguistics, Computer Science
Identifiers
DOI 10.18653/v1/P16-1185 arXiv 1606.04596
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Domain Adaptation for Statistical Machine Translation
2018influential reference
Improving Neural Machine Translation Models with Monolingual Data
2015influential reference
On Using Monolingual Corpora in Neural Machine Translation
2015influential reference
Semi-supervised Sequence Learning
2015cited by this paper
Semi-supervised Learning with Deep Generative Models
2014cited by this paper
Addressing the Rare Word Problem in Neural Machine Translation
2014cited by this paper
On Using Very Large Target Vocabulary for Neural Machine Translation
2014cited by this paper
Conditional Random Field Autoencoders for Unsupervised Structured Prediction
2014cited by this paper
Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation
2014cited by this paper
Sequence to Sequence Learning with Neural Networks
2014cited by this paper
Neural Machine Translation by Jointly Learning to Align and Translate
2014influential reference
On the Properties of Neural Machine Translation: Encoder–Decoder Approaches
2014cited by this paper
Beyond Parallel Data: Joint Word Alignment and Decipherment Improves Machine Translation
2014cited by this paper
Recurrent Continuous Translation Models
2013influential reference
Learning a Phrase-based Translation Model from Monolingual Data with Application to Domain Adaptation
2013cited by this paper
Toward Statistical Machine Translation without Parallel Corpora
2012cited by this paper
Semi-Supervised Recursive Autoencoders for Predicting Sentiment Distributions
2011influential reference
Deciphering Foreign Language
2011cited by this paper
Stacked Denoising Autoencoders: Learning Useful Representations in a Deep Network with a Local Denoising Criterion
2010influential reference
Domain Adaptation for Statistical Machine Translation with Monolingual Resources
2009cited by this paper
Moses: Open Source Toolkit for Statistical Machine Translation
2007cited by this paper
Transductive learning for statistical machine translation
2007cited by this paper
Factored Translation Models
2007cited by this paper
A Hierarchical Phrase-Based Model for Statistical Machine Translation
2005influential reference
Minimum Error Rate Training in Statistical Machine Translation
2003cited by this paper
Statistical Phrase-Based Translation
2003cited by this paper
SRILM - an extensible language modeling toolkit
2002cited by this paper
Bleu: a Method for Automatic Evaluation of Machine Translation
2002cited by this paper
Long Short-Term Memory
1997cited by this paper
The Mathematics of Statistical Machine Translation: Parameter Estimation
1993cited by this paper

CITED BY

Transformer-Based Bidirectional Attention Network for Segmentation-Free Word-Level Text Recognition with Overlapping Characters
2025cites this paper
Applying convolutional attention mechanisms and Human Memory Search for effective English-Urdu translation
2025cites this paper
Seismic Porosity Prediction via Semi-Supervised Learning: Integrating a Low-Frequency Model and a Closed-Loop Network Structure
2025cites this paper
Fine-grained Video Dubbing Duration Alignment with Segment Supervised Preference Optimization
2025cites this paper
WORD LEVEL TEXT RECOGNITION FOR OVERLAPPING CHARACTERS USING BIDIRECTIONAL TEMPORAL CONVOLUTIONAL NETWORK AND WAVELET TRANSFORM
2025cites this paper
Distilling BERT knowledge into Seq2Seq with regularized Mixup for low-resource neural machine translation
2025cites this paper
Chem-FINESE: Validating Fine-Grained Few-shot Entity Extraction through Text Reconstruction
2024cites this paper
Semisupervised Neural Proto-Language Reconstruction
2024cites this paper
A Reinforcement Learning Approach to Improve Low-Resource Machine Translation Leveraging Domain Monolingual Data
2024cites this paper
SemiEvol: Semi-supervised Fine-tuning for LLM Adaptation
2024cites this paper
Deterministic Reversible Data Augmentation for Neural Machine Translation
2024cites this paper
Initial exploration into sarcasm and irony through machine translation
2024cites this paper
Using AI-Based Virtual Companions to Assist Adolescents with Autism in Recognizing and Addressing Cyberbullying
2024cites this paper
Toward Robust Self-Training Paradigm for Molecular Prediction Tasks
2024cites this paper
Semi-supervised Fine-tuning for Large Language Models
2024cites this paper
The Impact of Artificial Intelligence on Language Translation: A Review
2024cites this paper
Semi-Supervised Spoken Language Glossification
2024cites this paper
The Impact of Syntactic and Semantic Proximity on Machine Translation with Back-Translation
2024cites this paper
SAM as the Guide: Mastering Pseudo-Label Refinement in Semi-Supervised Referring Expression Segmentation
2024cites this paper
NITS-CNLP Low-Resource Neural Machine Translation Systems of English-Manipuri Language Pair
2023cites this paper
Semi-supervised Neural Machine Translation with Consistency Regularization for Low-Resource Languages
2023cites this paper
Scalable Learning of Latent Language Structure With Logical Offline Cycle Consistency
2023cites this paper
Learning from Multiple Sources for Data-to-Text and Text-to-Data
2023cites this paper
Learning from Mistakes: Towards Robust Neural Machine Translation for Disfluent L2 Sentences
2023cites this paper
Neural Machine Translation: A Survey of Methods used for Low Resource Languages
2023cites this paper
SWaCo: Safe Wafer Bin Map Classification With Self-Supervised Contrastive Learning
2023cites this paper
A Smaller and Better Word Embedding for Neural Machine Translation
2023cites this paper
Text Style Transfer Back-Translation
2023cites this paper
Integrating Reconstructor and Post-Editor into Neural Machine Translation
2023cites this paper
Unsupervised Neural Machine Translation between the Portuguese language and the Chinese and Korean languages
2023cites this paper
A deep autoencoder based approach for the inverse design of an acoustic-absorber
2023cites this paper
Transformer: A General Framework from Machine Translation to Others
2023influential citation
An Enhanced Method for Neural Machine Translation via Data Augmentation Based on the Self-Constructed English-Chinese Corpus, WCC-EC
2023cites this paper
Low resource machine translation of english-manipuri: A semi-supervised approach
2022cites this paper
Learning with Limited Text Data
2022cites this paper
Learning to Generalize to More: Continuous Semantic Augmentation for Neural Machine Translation
2022cites this paper
Transformers for Low-resource Neural Machine Translation
2022cites this paper
Flow-Adapter Architecture for Unsupervised Machine Translation
2022cites this paper
Low-resource Neural Machine Translation: Methods and Trends
2022cites this paper
An empirical study of low-resource neural machine translation of manipuri in multilingual settings
2022cites this paper
Self-Training Vision Language BERTs With a Unified Conditional Model
2022cites this paper
MMTAfrica: Multilingual Machine Translation for African Languages
2022cites this paper
Morphologically Motivated Input Variations and Data Augmentation in Turkish-English Neural Machine Translation
2022cites this paper
Retracted March 6, 2026: Study on Machine Translation Teaching Model Based on Translation Parallel Corpus and Exploitation for Multimedia Asian Information Processing
2022cites this paper
Deep Transfer Network of Heterogeneous Domain Feature in Machine Translation
2022cites this paper
Robust self-training strategy for various molecular biology prediction tasks
2022cites this paper
Applying attention-based BiLSTM and technical indicators in the design and performance analysis of stock trading strategies
2022cites this paper
Competency-Aware Neural Machine Translation: Can Machine Translation Know its Own Translation Quality?
2022cites this paper
Adapting Multilingual Models for Code-Mixed Translation
2022cites this paper
Multimodality information fusion for automated machine translation
2022cites this paper
Building Dialogue Understanding Models for Low-resource Language Indonesian from Scratch
2022cites this paper
End-to-End Training of Both Translation Models in the Back-Translation Framework
2022cites this paper
Study of Encoder-Decoder Architectures for Code-Mix Search Query Translation
2022cites this paper
CHIA: CHoosing Instances to Annotate for Machine Translation
2022cites this paper
Iterative Constrained Back-Translation for Unsupervised Domain Adaptation of Machine Translation
2022cites this paper
Meta Back-translation
2021cites this paper
Strengthening Low-resource Neural Machine Translation through Joint Learning: The Case of Farsi-Spanish
2021cites this paper
A2R2: Robust Unsupervised Neural Machine Translation With Adversarial Attack and Regularization on Representations
2021cites this paper
Robust Adaptive Semi-supervised Classification Method based on Dynamic Graph and Self-paced Learning
2021cites this paper
Token-wise Curriculum Learning for Neural Machine Translation
2021cites this paper
Hyperspherical Variational Co-embedding for Attributed Networks
2021cites this paper
Modeling the Interaction between Agents in Cooperative Multi-Agent Reinforcement Learning
2021cites this paper
An Evaluation Dataset Construction Approach for Task-Oriented Dialogue
2021cites this paper
Recent advances of low-resource neural machine translation
2021cites this paper
Comparison of 2D and 3D attention mechanisms for human (collective) activity recognition
2021cites this paper
Semi-Supervised Wide-Angle Portraits Correction by Multi-Scale Transformer
2021cites this paper
Survey of Low-Resource Machine Translation
2021cites this paper
Improving Data Augmentation for Low-Resource NMT Guided by POS-Tagging and Paraphrase Embedding
2021cites this paper
Deep Neural Transformer Model for Mono and Multi Lingual Machine Translation
2021cites this paper
Medical Imaging with Deep Learning for COVID- 19 Diagnosis: A Comprehensive Review
2021cites this paper
Coronavirus Pandemic Analysis Using Deep Learning Techniques A Study
2021cites this paper
A text-based multi-span network for reading comprehension
2021cites this paper
Self-supervised and Supervised Joint Training for Resource-rich Machine Translation
2021cites this paper
Aspect-Level Sentiment-Controllable Review Generation with Mutual Learning Framework
2021cites this paper
Self-Training Sampling with Monolingual Data Uncertainty for Neural Machine Translation
2021cites this paper
An Empirical Survey of Data Augmentation for Limited Data Learning in NLP
2021cites this paper
GX@DravidianLangTech-EACL2021: Multilingual Neural Machine Translation and Back-translation
2021cites this paper
Domain Adaptation for NMT via Filtered Iterative Back-Translation
2021cites this paper
Duplex Sequence-to-Sequence Learning for Reversible Machine Translation
2021cites this paper
Efficient English Translation Method and Analysis Based on the Hybrid Neural Network
2021cites this paper
Addressing the Vulnerability of NMT in Input Perturbations
2021cites this paper
Neural Machine Translation for Low-resource Languages: A Survey
2021cites this paper
Alternated Training with Synthetic and Authentic Data for Neural Machine Translation
2021cites this paper
Attentive fine-tuning of Transformers for Translation of low-resourced languages @LoResMT 2021
2021cites this paper
Pre-Training on Mixed Data for Low-Resource Neural Machine Translation
2021cites this paper
Extended Parallel Corpus for Amharic-English Machine Translation
2021cites this paper
Zero-Shot Language Transfer vs Iterative Back Translation for Unsupervised Machine Translation
2021cites this paper
Research on the Application of BERT in Mongolian-Chinese Neural Machine Translation
2021cites this paper
THUMT: An Open-Source Toolkit for Neural Machine Translation
2020cites this paper
Dual Learning: Theoretical Study and an Algorithmic Extension
2020cites this paper
Text Recognition in the Wild
2020cites this paper
Machine Learning:A Review
2020cites this paper
Generating Wikipedia Article Sections from Diverse Data Sources
2020cites this paper
A Diverse Data Augmentation Strategy for Low-Resource Neural Machine Translation
2020cites this paper
Neural machine translation: Challenges, progress and future
2020influential citation
Keeping Models Consistent between Pretraining and Translation for Low-Resource Neural Machine Translation
2020cites this paper
Neural Machine Translation: A Review of Methods, Resources, and Tools
2020cites this paper
Bidirectional Boost: On Improving Tibetan-Chinese Neural Machine Translation With Back-Translation and Self-Learning
2020cites this paper
Unsupervised Neural Machine Translation for English and Manipuri
2020cites this paper
Dictionary-based Data Augmentation for Cross-Domain Neural Machine Translation
2020cites this paper