A Recursive Recurrent Neural Network for Statistical Machine Translation

Published 2014 in Annual Meeting of the Association for Computational Linguistics

ABSTRACT

In this paper, we propose a novel recursive recurrent neural network (R 2 NN) to model the end-to-end decoding process for statistical machine translation. R 2 NN is a combination of recursive neural network and recurrent neural network, and in turn integrates their respective capabilities: (1) new information can be used to generate the next hidden state, like recurrent neural networks, so that language model and translation model can be integrated naturally; (2) a tree structure can be built, as recursive neural networks, so as to generate the translation candidates in a bottom up manner. A semi-supervised training approach is proposed to train the parameters, and the phrase pair embedding is explored to model translation confidence directly. Experiments on a Chinese to English translation task show that our proposed R 2 NN can outperform the stateof-the-art baseline by about 1.5 points in BLEU.

PUBLICATION RECORD

Publication year
2014
Venue
Annual Meeting of the Association for Computational Linguistics
Publication date
2014-06-01
Fields of study
Computer Science
Identifiers
DOI 10.3115/v1/P14-1140
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Word Alignment Modeling with Context Dependent Deep Neural Network
2013influential reference
Additive Neural Networks for Statistical Machine Translation
2013cited by this paper
Max-Violation Perceptron and Forced Decoding for Scalable MT Training
2013cited by this paper
Joint Language and Translation Modeling with Recurrent Neural Networks
2013influential reference
Recursive Autoencoders for ITG-Based Translation
2013cited by this paper
Parsing with Compositional Vector Grammars
2013cited by this paper
Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition
2012cited by this paper
ImageNet classification with deep convolutional neural networks
2012cited by this paper
Natural Language Processing (Almost) from Scratch
2011influential reference
Parsing Natural Scenes and Natural Language with Recursive Neural Networks
2011influential reference
Recurrent neural network based language model
2010cited by this paper
Training Phrase Translation Models with Leaving-One-Out
2010cited by this paper
Learning Convolutional Feature Hierarchies for Visual Recognition
2010cited by this paper
An End-to-End Discriminative Approach to Machine Translation
2006cited by this paper
A Fast Learning Algorithm for Deep Belief Nets
2006cited by this paper
Maximum Entropy Based Phrase Reordering Model for Statistical Machine Translation
2006cited by this paper
Neural Probabilistic Language Models
2006cited by this paper
Statistical Significance Tests for Machine Translation Evaluation
2004cited by this paper
The Alignment Template Approach to Statistical Machine Translation
2004cited by this paper
Bleu: a Method for Automatic Evaluation of Machine Translation
2002cited by this paper
Stochastic Inversion Transduction Grammars and Bilingual Parsing of Parallel Corpora
1997cited by this paper

CITED BY

Transformer-based market trend prediction and enterprise digital development
2026cites this paper
Reducing Traffic Accident Impact: A Vision Transformer-Based Driver Distraction Detection Approach
2025cites this paper
Machine Learning in Prognostics and Health Management of Cyber-Physical Systems: A Review
2025cites this paper
Quantum Mixed-State Self-Attention Network
2024cites this paper
A Survey of Mamba
2024cites this paper
Improving Intra-Urban Prediction of Regional Pedestrian Flow Using a Hybrid Deep Learning Approach
2024cites this paper
Graph Neural Networks for Contextual ASR With the Tree-Constrained Pointer Generator
2023cites this paper
An empirical analysis on statistical and neural machine translation system for English to Mizo language
2023cites this paper
Uncertainty and sensitivity analysis of deep learning models for diurnal temperature range (DTR) forecasting over five Indian cities
2023cites this paper
Bidirectional Neural Network Model for Glaucoma Progression Prediction
2023cites this paper
Visual field prediction using a deep bidirectional gated recurrent unit network model
2023cites this paper
Non-Autoregressive Unsupervised Summarization with Length-Control Algorithms
2022cites this paper
Automatic Classification of Cancer Pathology Reports: A Systematic Review
2022cites this paper
Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications
2022cites this paper
A Two-phase Recommendation Framework for Consistent Java Method Names
2022cites this paper
Self-supervised Graph-based Point-of-interest Recommendation
2022cites this paper
Charging-Related State Prediction for Electric Vehicles Using the Deep Learning Model
2022cites this paper
Tree-constrained Pointer Generator with Graph Neural Network Encodings for Contextual Speech Recognition
2022cites this paper
Language Modeling on Location-Based Social Networks
2022cites this paper
Sliced Recursive Transformer
2021influential citation
Forecasting of COVID-19 Cases, Using an Evolutionary Neural Architecture Search Approach
2021cites this paper
A comprehensive E-commerce customer behavior analysis using convolutional methods
2021cites this paper
IELTS translation education corpus construction based on bilingual non-parallel data model
2021cites this paper
Visual Analytics of RNN for Thermal Power Control System Identification
2021cites this paper
Deep learning models' comparable assessment and uncertainty analysis for diurnal temperature range (DTR) predictions over indian urban cities
2021cites this paper
MS-TR: A Morphologically enriched sentiment Treebank and recursive deep models for compositional sem
2021cites this paper
Transformer Fault Prognosis Using Deep Recurrent Neural Network Over Vibration Signals
2021cites this paper
ALF: a fitness-based artificial life form for evolving large-scale neural networks
2021cites this paper
Understanding Deep Learning: Case Study Based Approach
2021cites this paper
MS-TR: A Morphologically enriched sentiment Treebank and recursive deep models for compositional semantics in Turkish
2021cites this paper
Bloom’s Learning Outcomes’ Automatic Classification Using LSTM and Pretrained Word Embeddings
2021cites this paper
Layer Flexible Adaptive Computation Time
2021cites this paper
Self-Instantiated Recurrent Units with Dynamic Soft Recursion
2021cites this paper
Graph neural networks in node classification: survey and evaluation
2021cites this paper
Interpretable Recurrent Neural Networks in Continuous-time Control Environments
2020cites this paper
Bayesian neural networks for flight trajectory prediction and safety assessment
2020cites this paper
Semantically Smooth Bilingual Phrase Embeddings Based on Recursive Autoencoders
2020cites this paper
Japanese translation teaching corpus based on bilingual non parallel data model
2020cites this paper
Using Recurrent Neural Networks for Part-of-Speech Tagging and Subject and Predicate Classification in a Sentence
2020cites this paper
Forecasting Rainfall with Recurrent Neural Network for irrigation equipment
2020cites this paper
The Dimension Effect on Adversarial Learning Phenomena
2020cites this paper
An automated snoring sound classification method based on local dual octal pattern and iterative hybrid feature selector
2020cites this paper
Single Space Object Image Super Resolution Reconstructing Using Convolutional Networks in Wavelet Transform Domain
2020cites this paper
Auditory implicit learning in machines versus humans
2020cites this paper
Sentiment analysis with deep neural networks: comparative study and performance assessment
2020cites this paper
The models and structure of neural networks
2020cites this paper
An integrated model for textual social media data with spatio-temporal dimensions
2020cites this paper
BITCOIN PRICE PREDICTION USING MACHINE LEARNING MODELS
2020cites this paper
How Well Do Change Sequences Predict Defects? Sequence Learning from Software Changes
2020cites this paper
JUMT at WMT2019 News Translation Task: A Hybrid Approach to Machine Translation for Lithuanian to English
2019cites this paper
Long Short-Term Memory Model for Classification of English-PtBR Cross-Lingual Hate Speech
2019cites this paper
Visual Field Prediction using Recurrent Neural Network
2019cites this paper
Dynamic modeling of NOX emission in a 660 MW coal-fired boiler with long short-term memory
2019cites this paper
A Study on an Effect of Using Deep Learning in Thai-English Machine Translation Processes
2019influential citation
Hyperparameter Optimization for Machine Learning Models Based on Bayesian Optimization
2019cites this paper
Sentiment Analysis of Twitter Data in Online Social Network
2019cites this paper
Layer Flexible Adaptive Computation Time for Recurrent Neural Networks
2019cites this paper
A novel Enhanced Collaborative Autoencoder with knowledge distillation for top-N recommender systems
2019cites this paper
MTIL2017: Machine Translation Using Recurrent Neural Network on Statistical Machine Translation
2019cites this paper
Deep recurrent neural network‐based residual control chart for autocorrelated processes
2019cites this paper
Key Process Protection of High Dimensional Process Data in Complex Production
2019cites this paper
Urban Land Use and Land Cover Change Prediction via Self-Adaptive Cellular Based Deep Learning With Multisourced Data
2019cites this paper
Named Entity Recognition With Deep Learning
2019cites this paper
MetaMT, a MetaLearning Method Leveraging Multiple Domain Data for Low Resource Machine Translation
2019cites this paper
How meaningful are similarities in deep trajectory representations?
2019cites this paper
Deep Learning Based Weighted Feature Fusion Approach for Sentiment Analysis
2019cites this paper
Demand-Prediction Architecture for Distribution Businesses Based on Multiple RNNs with Alternative Weight Update
2019cites this paper
Deep6mA: A deep learning framework for exploring similar patterns in DNA N6-methyladenine sites across different species
2019cites this paper
Automatic classification of speech overlaps: Feature representation and algorithms
2019cites this paper
Named Entity Recognition in Traditional Chinese Medicine Clinical Cases Combining BiLSTM-CRF with Knowledge Graph
2019cites this paper
Calibrating GloVe model on the principle of Zipf's law
2019cites this paper
JUCBNMT at WMT2018 News Translation Task: Character Based Neural Machine Translation of Finnish to English
2019cites this paper
Towards Automating Big Texts Security Classification
2018cites this paper
Improving Word Embedding Compositionality using Lexicographic Definitions
2018cites this paper
Toward meaningful notions of similarity in NLP embedding models
2018cites this paper
Data-driven forecasting of solar irradiance
2018cites this paper
Deep Learning for Text Classification in Azure Infrastructure April 15
2018cites this paper
Final Report for Final Year Project Deep Learning for Text Classification in Azure Infrastructure
2018cites this paper
Interim Report for Final Year Project Deep Learning for Text Classification in Azure Infrastructure
2018cites this paper
RUN: Residual U-Net for Computer-Aided Detection of Pulmonary Nodules without Candidate Selection
2018cites this paper
Alignment-consistent recursive neural networks for bilingual phrase embeddings
2018cites this paper
Research on optimization method of convolutional nerual network
2018cites this paper
Neural Networks Regularization Through Representation Learning
2018cites this paper
Multi-view Fusion with Deep Learning for 3D Shape Classification
2018cites this paper
Attention Enhanced Chinese Word Embeddings
2018cites this paper
Investigating the use of recurrent motion modelling for speech gesture generation
2018cites this paper
MGANet: A Robust Model for Quality Enhancement of Compressed Video
2018cites this paper
Layer Flexible Adaptive Computational Time for Recurrent Neural Networks
2018cites this paper
SMT vs NMT: A Comparison over Hindi and Bengali Simple Sentences
2018cites this paper
ALSTM: Adaptive LSTM for Durative Sequential Data
2018cites this paper
Stanza: Layer Separation for Distributed Training in Deep Learning
2018cites this paper
A recurrent neural network architecture for biomedical event trigger classification
2018cites this paper
Deep Learning MT and Logos Model
2018cites this paper
Influence analysis of emotional behaviors and user relationships based on Twitter data
2018cites this paper
Language and Complexity: Neurolinguistic Perspectives
2018cites this paper
The parallel corpus for information extraction based on natural language processing and machine translation
2018cites this paper
Application of Convolutional Neural Network in Natural Language Processing
2018cites this paper
Dynamic Content based Behavior Analysis Model for Users on Social Media
2018cites this paper
Unstructured text comprehension and question answering
2018cites this paper
Distributed Representation of Words in Vector Space for Kannada Language
2018cites this paper