Neural Machine Translation Transfer Model Based on Mutual Domain Guidance

Published 2022 in IEEE Access

ABSTRACT

Neural machine translation (NMT) models are data-hungry and domain-sensitive, yet it is almost impossible to obtain a large amount of labeled data for training them, which makes a domain transfer strategy necessary. To address domain data mismatch, this paper proposes an NMT transfer model based on mutual domain guidance, in which a mutual-guidance framework lets the domains influence each other continuously. At the same time, self-ensemble and self-knowledge-distillation are applied within each independent domain so that the model does not deviate too far from that domain. The model also benefits from how domain data are batched during training. The in-domain model is guided mainly by three components: a model pretrained out of domain, distillation of existing in-domain models, and data selection during training. These are unified in a single training framework, so that training is guided continuously and effectively both in and out of domain. Three typical experimental scenarios were comprehensively tested, and the model was compared with many conventional methods. The experimental results showed that the proposed “inter-domain transfer training” and “curriculum scheduling agent” were effective and robust. The main finding is that this comprehensive guided training framework (intra-domain and inter-domain) suits domain transfer in different scenarios and does not increase the decoding cost.
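
The abstract names self-knowledge-distillation as one of the guidance signals but gives no formula. As a rough illustration only, the sketch below shows a generic token-level distillation objective of the kind commonly used in NMT: the student is trained on a weighted mix of the gold-label loss and a cross-entropy term against a teacher's output distribution. The function names, the `alpha` mixing weight, and the `temperature` parameter are assumptions made for this example and are not taken from the paper, whose actual loss may differ.

```python
import math


def softmax(logits, temperature=1.0):
    """Turn raw scores into a probability distribution (numerically stable)."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits, gold_index,
                      alpha=0.5, temperature=1.0):
    """Hypothetical token-level distillation loss (illustrative, not the paper's).

    hard term: negative log-likelihood of the gold token under the student.
    soft term: cross-entropy of the student against the teacher distribution.
    alpha:     assumed mixing weight; 0 ignores the teacher, 1 ignores the gold label.
    """
    hard = -math.log(softmax(student_logits)[gold_index])
    student = softmax(student_logits, temperature)
    teacher = softmax(teacher_logits, temperature)
    soft = -sum(t * math.log(s) for t, s in zip(teacher, student))
    return (1.0 - alpha) * hard + alpha * soft


if __name__ == "__main__":
    # Toy three-word vocabulary; the teacher prefers token 0.
    teacher = [2.0, 0.5, 0.1]
    agreeing_student = [1.8, 0.4, 0.2]
    disagreeing_student = [0.1, 0.4, 1.8]
    print(distillation_loss(agreeing_student, teacher, gold_index=0))
    print(distillation_loss(disagreeing_student, teacher, gold_index=0))
```

In this sketch a student that agrees with the teacher receives a lower loss than one that disagrees, which is the sense in which an existing in-domain model can keep a newly trained model from drifting away from the domain.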

PUBLICATION RECORD

Publication year
2022
Venue
IEEE Access
Publication date
Unknown publication date
Fields of study
Computer Science
Identifiers
DOI 10.1109/ACCESS.2022.3208951
Source metadata
Semantic Scholar

CITED BY

KD4MT: A Survey of Knowledge Distillation for Machine Translation
2026, cites this paper