Encoding Source Language with Convolutional Neural Network for Machine Translation

Fandong Meng,Zhengdong Lu,Mingxuan Wang,Hang Li,Wenbin Jiang,Qun Liu

Published 2015 in Annual Meeting of the Association for Computational Linguistics

ABSTRACT

The recently proposed neural network joint model (NNJM) (Devlin et al., 2014) augments the n-gram target language model with a heuristically chosen source context window, achieving state-of-the-art performance in SMT. In this paper, we give a more systematic treatment by summarizing the relevant source information through a convolutional architecture guided by the target information. With different guiding signals during decoding, our specifically designed convolution+gating architectures can pinpoint the parts of a source sentence that are relevant to predicting a target word, and fuse them with the context of entire source sentence to form a unified representation. This representation, together with target language words, are fed to a deep neural network (DNN) to form a stronger NNJM. Experiments on two NIST Chinese-English translation tasks show that the proposed model can achieve significant improvements over the previous NNJM by up to +1.08 BLEU points on average

PUBLICATION RECORD

Publication year
2015
Venue
Annual Meeting of the Association for Computational Linguistics
Publication date
2015-03-05
Fields of study
Linguistics, Computer Science
Identifiers
DOI 10.3115/v1/P15-1003 arXiv 1503.01838
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Sequence to Sequence Learning with Neural Networks
2014cited by this paper
Convolutional Neural Network Architectures for Matching Natural Language Sentences
2014cited by this paper
Fast and Robust Neural Network Joint Models for Statistical Machine Translation
2014influential reference
Neural Machine Translation by Jointly Learning to Align and Translate
2014cited by this paper
A Convolutional Neural Network for Modelling Sentences
2014cited by this paper
Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation
2014cited by this paper
Translation with Source Constituency and Dependency Trees
2013cited by this paper
Recurrent Continuous Translation Models
2013cited by this paper
Joint Language and Translation Modeling with Recurrent Neural Networks
2013cited by this paper
Efficient BackProp
2012cited by this paper
A novel dependency-to-string model for statistical machine translation
2011cited by this paper
A New String-to-Dependency Machine Translation Algorithm with a Target Dependency Language Model
2008cited by this paper
Hierarchical Phrase-Based Translation
2007influential reference
Moses: Open Source Toolkit for Statistical Machine Translation
2007cited by this paper
Forest Rescoring: Faster Decoding with Integrated Language Models
2007cited by this paper
Clause Restructuring for Statistical Machine Translation
2005cited by this paper
What’s in a translation rule?
2004cited by this paper
A Neural Probabilistic Language Model
2003cited by this paper
A Systematic Comparison of Various Statistical Alignment Models
2003cited by this paper
Minimum Error Rate Training in Statistical Machine Translation
2003cited by this paper
Statistical Phrase-Based Translation
2003cited by this paper
SRILM - an extensible language modeling toolkit
2002cited by this paper
Discriminative Training and Maximum Entropy Models for Statistical Machine Translation
2002cited by this paper
Fast Exact Inference with a Factored Model for Natural Language Parsing
2002cited by this paper

CITED BY

A joint learning classification for intent detection and slot filling from classical to deep learning: a review
2025cites this paper
DETERMINING ROCK FRAGMENT SIZE DISTRIBUTION USING A CONVOLUTIONAL NEURAL NETWORK
2024cites this paper
A Survey of Research and Application of NLP-based Machine Translation
2024cites this paper
Human-Machine Interaction Translation under Artificial Intelligence and Big Data: Analysis from the Perspective of Text Stratification and Corpus Construction
2024cites this paper
Domain Adaptation for Arabic Machine Translation: The Case of Financial Texts
2023cites this paper
Experimenting with UD Adaptation of an Unsupervised Rule-based Approach for Sentiment Analysis of Mexican Tourist Texts
2023cites this paper
Deep Learning for Natural Language Processing: A Survey
2023cites this paper
Music genre classification based on res-gated CNN and attention mechanism
2023cites this paper
An Improved LA-Transformer Machine Translation Model
2023cites this paper
Refining History for Future-Aware Neural Machine Translation
2023cites this paper
Unified Model Learning for Various Neural Machine Translation
2023cites this paper
Retrieval Augmented Convolutional Encoder-decoder Networks for Video Captioning
2022cites this paper
Genetic Algorithm-based Transformer Architecture Design for Neural Machine Translation
2022influential citation
Representation Learning
2022cites this paper
Task-Oriented Multi-User Semantic Communications
2021cites this paper
THINK: A Novel Conversation Model for Generating Grammatically Correct and Coherent Responses
2021cites this paper
Feature Extraction Method of Machine Translation Equivalent Pairs in Chinese-English Comparable Corpus based OCR Recognition
2021cites this paper
Learning Syllables Using Conv-LSTM Model for Swahili Word Representation and Part-of-speech Tagging
2021influential citation
Neural Machine Translation
2020cites this paper
PC-SAN: Pretraining-Based Contextual Self-Attention Model for Topic Essay Generation
2020cites this paper
Machine Translation Systems for Indian Languages: Review of Modelling Techniques, Challenges, Open Issues and Future Research Directions
2020cites this paper
Non-goal oriented dialogue agents: state of the art, dataset, and evaluation
2020cites this paper
Gated Attentive Convolutional Network Dialogue State Tracker
2020cites this paper
Vertical intent prediction approach based on Doc2vec and convolutional neural networks for improving vertical selection in aggregated search
2020cites this paper
A Survey of Deep Learning Techniques for Neural Machine Translation
2020cites this paper
A Deep Learning Approach to Predict Abdominal Aortic Aneurysm Expansion Using Longitudinal Data
2020cites this paper
Automatic Classification and Reporting of Multiple Common Thorax Diseases Using Chest Radiographs
2019cites this paper
Memory, attention and prediction: a deep learning architecture for car-following
2019cites this paper
Convolutional Neural Networks
2019cites this paper
Deep Learning and Convolutional Neural Networks for Medical Imaging and Clinical Informatics
2019cites this paper
CHAMELEON: A Deep Learning Meta-Architecture for News Recommender Systems [Phd. Thesis]
2019cites this paper
Temporal Deformable Convolutional Encoder-Decoder Networks for Video Captioning
2019cites this paper
A Sustainable Multi-Modal Multi-Layer Emotion-Aware Service at the Edge
2019cites this paper
Exploiting Sentential Context for Neural Machine Translation
2019cites this paper
Automatic syntactic analysis of learner English
2019cites this paper
Text Generation From Tables
2019cites this paper
Layer-Wise De-Training and Re-Training for ConvS2S Machine Translation
2019cites this paper
Text Classification based on Multiple Block Convolutional Highways
2018cites this paper
Tensor2Tensor for Neural Machine Translation
2018cites this paper
Fast Decoding in Sequence Models using Discrete Latent Variables
2018cites this paper
TieNet: Text-Image Embedding Network for Common Thorax Disease Classification and Reporting in Chest X-Rays
2018influential citation
Table-to-Text: Describing Table Region With Natural Language
2018cites this paper
cw2vec: Learning Chinese Word Embeddings with Stroke n-gram Information
2018cites this paper
A Neural Approach to Source Dependence Based Context Model for Statistical Machine Translation
2018influential citation
Pervasive Attention: 2D Convolutional Neural Networks for Sequence-to-Sequence Prediction
2018cites this paper
A Hybrid RNN-CNN Encoder for Neural Conversation Model
2018cites this paper
Question Generation With Doubly Adversarial Nets
2018cites this paper
Incorporating Statistical Machine Translation Word Knowledge Into Neural Machine Translation
2018cites this paper
Deep learning methods for knowledge base population
2018cites this paper
Interpretative Topic Categorization Via Deep Multiple Instance Learning
2018cites this paper
Question Rewrite Based Dialogue Response Generation
2018cites this paper
Predicting Stances in Twitter Conversations for Detecting Veracity of Rumors: A Neural Approach
2018cites this paper
Syntax-Based Context Representation for Statistical Machine Translation
2018influential citation
A fuzzy convolutional neural network for text sentiment analysis
2018cites this paper
From Feature to Paradigm: Deep Learning in Machine Translation (Extended Abstract)
2018cites this paper
Deep Learning MT and Logos Model
2018cites this paper
Deep Learning in Machine Translation
2018cites this paper
Automatic Extraction of Cognitive Features from Gaze Data
2018cites this paper
The parallel corpus for information extraction based on natural language processing and machine translation
2018cites this paper
Cognitively Inspired Natural Language Processing
2018cites this paper
One Model To Learn Them All
2017cites this paper
Deconvolutional Paragraph Representation Learning
2017cites this paper
Encoding syntactic representations with a neural network for sentiment collocation extraction
2017cites this paper
Attentive Convolutional Neural Network Based Speech Emotion Recognition: A Study on the Impact of Input Features, Signal Length, and Acted Speech
2017cites this paper
Improving Word Embeddings with Convolutional Feature Learning and Subword Information
2017cites this paper
Grammatical error correction in non-native English
2017cites this paper
Character-level deep conflation for business data analytics
2017cites this paper
Combine multi-features with deep learning for answer selection
2017cites this paper
Convolutional Neural Network with Word Embeddings for Chinese Word Segmentation
2017cites this paper
Deep Learning applied to NLP
2017cites this paper
Opinion Expression Detection via Deep Bidirectional C-GRUs
2017cites this paper
Translation Prediction with Source Dependency-Based Context Representation
2017cites this paper
Spacecraft power system fault diagnosis based on DNN
2017cites this paper
A Convolution-LSTM-Based Deep Neural Network for Cross-Domain MOOC Forum Post Classification
2017cites this paper
Question Answering and Question Generation as Dual Tasks
2017cites this paper
Deep convRNN for sentiment parsing of Chinese microblogging texts
2017cites this paper
Low-Resource Cross-Domain Product Review Sentiment Classification Based on a CNN with an Auxiliary Large-Scale Corpus
2017cites this paper
CNN-Based Sequence Labeling for Fine-Grained Opinion Mining of Microblogs
2017cites this paper
Convolutional over Recurrent Encoder for Neural Machine Translation
2017cites this paper
Feedforward sequential memory networks based encoder-decoder model for machine translation
2017cites this paper
Deep Learning Methods on Recommender System: A Survey of State-of-the-art
2017cites this paper
Exploring Different Granularity in Mongolian-Chinese Machine Translation Based on CNN
2017cites this paper
Convolutional Sequence to Sequence Learning
2017cites this paper
Detecting Multiple Coexisting Emotions in Microblogs with Convolutional Neural Networks
2017cites this paper
Depthwise Separable Convolutions for Neural Machine Translation
2017cites this paper
Semantic computation in geography question answering
2016cites this paper
Can Active Memory Replace Attention?
2016cites this paper
Mutual Information and Diverse Decoding Improve Neural Machine Translation
2016cites this paper
Multi-label Chinese Microblog Emotion Classification via Convolutional Neural Network
2016cites this paper
Neural Machine Translation with External Phrase Memory
2016cites this paper
Survey on the attention based RNN model and its applications in computer vision
2016cites this paper
Hashtag Recommendation Using Attention-Based Convolutional Neural Network
2016cites this paper
Encoding Dependency Representation with Convolutional Neural Network for Target-Polarity Word Collocation Extraction
2016cites this paper
Unsupervised Learning of Sentence Representations using Convolutional Neural Networks
2016cites this paper
Learning Generic Sentence Representations Using Convolutional Neural Networks
2016cites this paper
A Continuous Space Rule Selection Model for Syntax-based Statistical Machine Translation
2016cites this paper
Exploring Different Dimensions of Attention for Uncertainty Detection
2016influential citation
Attention-Based Convolutional Neural Networks for Sentence Classification
2016cites this paper
Relation Classification via Multi-Level Attention CNNs
2016cites this paper
A Survey and Critique of Deep Learning on Recommender Systems
2016cites this paper