Learning Kernel-Smoothed Machine Translation with Retrieved Examples
Qingnan Jiang, Mingxuan Wang, Jun Cao, Shanbo Cheng, Shujian Huang, Lei Li
Published 2021 in Conference on Empirical Methods in Natural Language Processing

ABSTRACT
How can neural machine translation (NMT) models be effectively adapted to emerging cases without retraining? Despite the great success of NMT, updating deployed models online remains a challenge. Existing non-parametric approaches that retrieve similar examples from a database to guide the translation process are promising, but they are prone to overfitting the retrieved examples. In this work, we propose to learn Kernel-Smoothed Translation with Example Retrieval (KSTER), an effective approach to adapting NMT models online. Experiments on domain adaptation and multi-domain machine translation datasets show that, even without expensive retraining, KSTER achieves improvements of 1.1 to 1.5 BLEU over the best existing online adaptation methods. The code and trained models are released at https://github.com/jiangqn/KSTER.
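To make the idea in the abstract concrete, the sketch below shows one way kernel-smoothed example retrieval could look at a single decoding step: retrieved examples are turned into a vocabulary distribution via a Gaussian kernel over hidden-state distances, then interpolated with the base model's distribution. This is a minimal illustration under assumed shapes and names; the function names, the fixed bandwidth, and the fixed mixing weight are hypothetical and are not taken from the released KSTER code, which learns these quantities.

```python
import numpy as np

# Hypothetical sketch of kernel-smoothed example retrieval for one
# decoding step. Names, shapes, and constants are illustrative
# assumptions, not the released implementation.

def example_distribution(query, keys, values, vocab_size, bandwidth=10.0):
    """Convert k retrieved (hidden state, target token) pairs into a
    vocabulary distribution by Gaussian kernel smoothing.

    query:  decoder hidden state at the current step, shape (d,)
    keys:   hidden states of the k retrieved examples, shape (k, d)
    values: target-token ids of the retrieved examples, shape (k,)
    """
    sq_dist = np.sum((keys - query) ** 2, axis=1)  # (k,) squared distances
    weights = np.exp(-sq_dist / bandwidth)         # Gaussian kernel weights
    weights /= weights.sum()                       # normalize over examples
    dist = np.zeros(vocab_size)
    np.add.at(dist, values, weights)               # scatter weights onto vocab
    return dist

def mix_distributions(p_model, p_example, mix_weight):
    """Interpolate the base NMT distribution with the example-based one.
    A fixed mix_weight is used here for illustration only."""
    return (1.0 - mix_weight) * p_model + mix_weight * p_example

# Toy usage: 5 retrieved examples, vocabulary of size 8.
rng = np.random.default_rng(0)
query = rng.normal(size=4)
keys = rng.normal(size=(5, 4))
values = np.array([2, 2, 5, 7, 2])
p_model = np.full(8, 1.0 / 8)  # uniform base-model distribution
p_example = example_distribution(query, keys, values, vocab_size=8)
print(mix_distributions(p_model, p_example, mix_weight=0.5))
```

In this reading, the bandwidth controls how sharply the retrieved examples are trusted, and the mixing weight trades off the parametric model against the retrieved evidence, which is how such a method can avoid overfitting any single retrieved example.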
PUBLICATION RECORD
- Publication year
2021
- Venue
Conference on Empirical Methods in Natural Language Processing
- Publication date
2021-09-21
- Fields of study
Linguistics, Computer Science
- Source metadata
Semantic Scholar