Recurrent Continuous Translation Models

Published 2013 in Conference on Empirical Methods in Natural Language Processing

ABSTRACT

We introduce a class of probabilistic continuous translation models called Recurrent Continuous Translation Models that are purely based on continuous representations for words, phrases and sentences and do not rely on alignments or phrasal translation units. The models have a generation and a conditioning aspect. The generation of the translation is modelled with a target Recurrent Language Model, whereas the conditioning on the source sentence is modelled with a Convolutional Sentence Model. Through various experiments, we show first that our models obtain a perplexity with respect to gold translations that is > 43% lower than that of stateof-the-art alignment-based translation models. Secondly, we show that they are remarkably sensitive to the word order, syntax, and meaning of the source sentence despite lacking alignments. Finally we show that they match a state-of-the-art system when rescoring n-best lists of translations.

PUBLICATION RECORD

Publication year
2013
Venue
Conference on Empirical Methods in Natural Language Processing
Publication date
2013-10-01
Fields of study
Computer Science
Identifiers
DOI 10.18653/v1/d13-1176
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

The Role of Syntax in Vector Space Models of Compositional Semantics
2013cited by this paper
Recurrent Convolutional Neural Networks for Discourse Compositionality
2013influential reference
A Simple, Fast, and Effective Reparameterization of IBM Model 2
2013cited by this paper
Semantic Compositionality through Recursive Matrix-Vector Spaces
2012cited by this paper
Continuous Space Translation Models with Neural Networks
2012cited by this paper
Continuous Space Translation Models for Phrase-Based Statistical Machine Translation
2012cited by this paper
Context dependent recurrent neural network language model
2012cited by this paper
Generating Text with Recurrent Neural Networks
2011cited by this paper
Dynamic Pooling and Unfolding Recursive Autoencoders for Paraphrase Detection
2011cited by this paper
Extensions of recurrent neural network language model
2011cited by this paper
Adaptive Subgradient Methods for Online Learning and Stochastic Optimization
2011cited by this paper
Recurrent neural network based language model
2010influential reference
cdec: A Decoder, Alignment, and Learning Framework for Finite- State and Context-Free Translation Models
2010cited by this paper
Concrete Sentence Spaces for Compositional Distributional Models of Meaning
2010influential reference
A unified architecture for natural language processing: deep neural networks with multitask learning
2008cited by this paper
Continuous Space Language Models for Statistical Machine Translation
2006cited by this paper
A Neural Probabilistic Language Model
2003cited by this paper
The Mathematics of Statistical Machine Translation: Parameter Estimation
1993cited by this paper

CITED BY

Optimizing irrigation decisions with Seq2Seq modeling and deep reinforcement learning
2026cites this paper
Leveraging Sentence-oriented Augmentation and Transformer-Based Architecture for Vietnamese-Bahnaric Translation
2026cites this paper
Advanced computational models for urban traffic flow prediction: A comprehensive review and future directions
2026cites this paper
Integrating Code Metrics into Automated Documentation Generation for Computational Notebooks
2026cites this paper
A Survey on Tibetan-Chinese Machine Translation
2025cites this paper
Improving Low-Resource Kazakh-English and Turkish-English Neural Machine Translation Using Transfer Learning and Part of Speech Tags
2025cites this paper
Intelligent Bilingual Reading Translation System Based on Natural Language Processing
2025cites this paper
Research on Dynamic Curriculum Learning in Mongolian-Chinese Neural Machine Translation
2025cites this paper
Smruti: Grammatical Error Correction for Gujarati using LLMs with Non-Parametric Memory
2025cites this paper
Convergence analysis of recurrent neural networks based on sparse mechanism and its application to time series and multiple classification problems
2025cites this paper
Low-Resource Noisy Transliteration Normalization Using Large-Scale Language Model
2025cites this paper
Research on machine translation of ancient books in the era of large language model
2025cites this paper
Science Across Languages: Assessing LLM Multilingual Translation of Scientific Papers
2025cites this paper
A comparison of translation performance between DeepL and Supertext
2025cites this paper
A data-guided curriculum towards low-resource neural machine translation
2025cites this paper
Evaluating Free Legal Translation Tools between Arabic and English: A Comparative Study of Google Translate, ChatGPT, and Gemini
2025cites this paper
Cross-dimensional global interactive transformer for traffic forecasting
2025cites this paper
Machine learning methods for isolating indigenous language catalog descriptions
2025cites this paper
Two-stage effective attentional generative adversarial network
2025cites this paper
How Large Language Models Enhance Low-Resource Mongolian-Chinese Machine Translation?
2025cites this paper
Adversarial Robustness of Vision in Open Foundation Models
2025cites this paper
Enhancing online dispute resolution through natural language processing: a case study of kleros
2025cites this paper
JIBON++: AI Enabled Intelligent Voice Assistant for Blind People Understanding Negative Sentiments
2025influential citation
Adaptive English Translation Parameter Tuning via Particle Swarm Optimization and Attention Mechanism
2025cites this paper
A New NMT Model for Translating Clinical Texts from English to Spanish
2025cites this paper
Translative Neural Team Recommendation: From Multilabel Classification to Sequence Prediction
2025cites this paper
Hierarchical Local-Global Transformer With Dynamic Positional Encoding for Document-Level Machine Translation
2025cites this paper
Use of Transfer Learning for Affordable In-Context Fake Review Generation
2025cites this paper
InsurTech innovation using natural language processing
2025cites this paper
Improving Neural Machine Translation in the Field of Electrical Engineering by Using Sentence Backbone Information
2025cites this paper
Mathematical Engineering of Deep Learning
2024cites this paper
Research on Intelligent Translation System Based on Artificial Intelligence Algorithm
2024cites this paper
Modularized Multilingual NMT with Fine-grained Interlingua
2024cites this paper
Context-Aware Non-Autoregressive Document-Level Translation with Sentence-Aligned Connectionist Temporal Classification
2024cites this paper
Neural Machine Translation for the Arabic-English Language Pair
2024cites this paper
Contextualized dynamic meta embeddings based on Gated CNNs and self-attention for Arabic machine translation
2024cites this paper
Fine-Grained Reward Optimization for Machine Translation using Error Severity Mappings
2024influential citation
The metamorphosis of machine translation: The rise of neural machine translation and its challenges
2024cites this paper
A Study for Enhancing Low-resource Thai-Myanmar-English Neural Machine Translation
2024cites this paper
Improving translation between English, Assamese bilingual pair with monolingual data, length penalty and model averaging
2024cites this paper
Attention-Based Models for Multivariate Time Series Forecasting: Multi-step Solar Irradiation Prediction
2024cites this paper
Birdie: Advancing State Space Models with Reward-Driven Objectives and Curricula
2024cites this paper
Transformer Machine Translation Model Incorporating Word Alignment Structure
2024cites this paper
CNNs, RNNs and Transformers in human action recognition: a survey and a hybrid model
2024cites this paper
Automated Medical Image Captioning with Soft Attention-Based LSTM Model Utilizing YOLOv4 Algorithm
2024cites this paper
English as a lingua franca in academic publishing: using round-trip translation to estimate linguistic revision difficulty
2024cites this paper
A Deep Learning Based Translation Performance Analysis of International Languages
2024cites this paper
RePair My Queries: Personalized Query Reformulation via Conditional Transformers
2024cites this paper
TRINet: Team Role Interaction Network for automatic radiology report generation
2024cites this paper
MedBot: A Novel Sequential Pipeline for Context Recognition Based on SciSpacy and Med7 Entity Recognizer
2024cites this paper
Exploring Automated Assertion Generation via Large Language Models
2024cites this paper
Analysis of Errors at the Lexical Level in Post-editing for Medical Texts
2024cites this paper
Hybrid Feature Based Global Variational Transformer for Diverse Image Captioning
2024cites this paper
Consensus-Based Machine Translation for Code-Mixed Texts
2024cites this paper
STORM: A Spatio-Temporal Context-Aware Model for Predicting Event-Triggered Abnormal Crowd Traffic
2024cites this paper
Xiwu: A Basis Flexible and Learnable LLM for High Energy Physics
2024cites this paper
Sparse Domain Transfer via Elastic Net Regularization
2024cites this paper
Gradient Consistency-based Parameter Allocation for Multilingual Neural Machine Translation
2024cites this paper
Review of Hierarchical Transfer Learning Architecture in Low-Resource Machine Translation
2024cites this paper
First Tragedy, then Parse: History Repeats Itself in the New Era of Large Language Models
2024cites this paper
Prediction of remaining life of bearings based on integral correction and global attention mechanism
2024cites this paper
From Efficient Multimodal Models to World Models: A Survey
2024cites this paper
Neural Representation Learning in Linguistic Structured Prediction
2024cites this paper
LLMs-in-the-loop Part-1: Expert Small AI Models for Bio-Medical Text Translation
2024cites this paper
Recent Advances in Generative AI and Large Language Models: Current Status, Challenges, and Perspectives
2024cites this paper
Generative-Adversarial Networks for Low-Resource Language Data Augmentation in Machine Translation
2024cites this paper
A Hierarchical Korean-Chinese Machine Translation Model Based on Sentence Structure Segmentation
2024cites this paper
SITD-NMT: Synchronous Inference NMT with Turing Re-Translation Detection
2024cites this paper
SubMerge: Merging Equivalent Subword Tokenizations for Subword Regularized Models in Neural Machine Translation
2024cites this paper
A Survey of Research and Application of NLP-based Machine Translation
2024cites this paper
Generation of Indian Sign Language Letters, Numbers, and Words
2024cites this paper
Think Carefully and Check Again! Meta-Generation Unlocking LLMs for Low-Resource Cross-Lingual Summarization
2024cites this paper
Uji Nilai Akurasi pada Neural Machine Translation (NMT) Bahasa Indonesia ke Bahasa Tiochiu Pontianak dengan Mekanisme Attention Bahdanau
2023cites this paper
Refining History for Future-Aware Neural Machine Translation
2023cites this paper
The Effect of Clozapine and Novel Glutamate Modulator JNJ-46356479 on Nitrosative Stress in a Postnatal Murine Ketamine Model of Schizophrenia
2023cites this paper
An Improved LA-Transformer Machine Translation Model
2023cites this paper
Mention Attention for Pronoun Translation
2023cites this paper
Photovoltaic power forecasting using quantum machine learning
2023cites this paper
Dialogue Agents with Literary Character Personality Traits
2023cites this paper
Local Information fused Transformer model for Korean-Chinese Machine Translation
2023cites this paper
Biomedical Parallel Sentence Retrieval Using Large Language Models
2023influential citation
Transfer and Triangulation Pivot Translation Approaches for Burmese Dialects
2023cites this paper
Classification of Tabular Data by Text Processing
2023cites this paper
Heterogeneous Encoders Scaling in the Transformer for Neural Machine Translation
2023cites this paper
Data Augmentation for SentRev using Back-Translation of Lexical Bundles
2023cites this paper
Joint Sign Language Recognition and Translation Using Keypoint Estimation
2023cites this paper
Neural machine translation for limited resources English-Nyishi pair
2023cites this paper
Review of Machine Translation
2023cites this paper
Enhancing Neural Machine Translation with Semantic Units
2023cites this paper
ST-MoE: Spatio-Temporal Mixture-of-Experts for Debiasing in Traffic Prediction
2023cites this paper
Enabling Efficient Assertion Inference
2023cites this paper
Error Norm Truncation: Robust Training in the Presence of Data Noise for Text Generation Models
2023cites this paper
A Post-training Framework for Improving the Performance of Deep Learning Models via Model Transformation
2023cites this paper
A Text Classification Method of Network Public Opinion Based on Information Fusion
2023cites this paper
Chinese Text De-Colloquialization Technique Based on Back-Translation Strategy and End-to-End Learning
2023cites this paper
Zero-Shot Hate to Non-Hate Text Conversion Using Lexical Constraints
2023cites this paper
Low-Resource Neural Machine Translation: A Systematic Literature Review
2023cites this paper
A fuzzy model for NMT word alignment using quasi-perfect matching
2023cites this paper
Long-Term Prediction Model for NOx Emission Based on LSTM–Transformer
2023cites this paper
An empirical analysis on statistical and neural machine translation system for English to Mizo language
2023cites this paper