A Hierarchical Neural Autoencoder for Paragraphs and Documents

Published 2015 in Annual Meeting of the Association for Computational Linguistics

ABSTRACT

Natural language generation of coherent long texts like paragraphs or longer documents is a challenging problem for recurrent network models. In this paper, we explore an important step toward this generation task: training an LSTM (long short-term memory) auto-encoder to preserve and reconstruct multi-sentence paragraphs. We introduce an LSTM model that hierarchically builds an embedding for a paragraph from embeddings for sentences and words, then decodes this embedding to reconstruct the original paragraph. We evaluate the reconstructed paragraph using standard metrics like ROUGE and Entity Grid, showing that neural models are able to encode texts in a way that preserves syntactic, semantic, and discourse coherence. While only a first step toward generating coherent text units from neural models, our work has the potential to significantly impact natural language generation and summarization.
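
The hierarchical encoding described in the abstract (words to sentence embeddings, sentence embeddings to a paragraph embedding) can be illustrated with a minimal sketch. This is not the authors' implementation: the weights are random and untrained, the decoder is omitted, and every name here (`LSTMCell`, `encode_paragraph`, `DIM`, the toy paragraph) is an assumption made for illustration only.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


class LSTMCell:
    """Minimal LSTM cell with randomly initialised (untrained) weights."""

    def __init__(self, input_size, hidden_size, rng):
        self.hidden_size = hidden_size
        cols = input_size + hidden_size
        # One weight matrix and bias per gate:
        # i = input, f = forget, o = output, g = candidate cell state.
        self.W = {gate: [[rng.uniform(-0.1, 0.1) for _ in range(cols)]
                         for _ in range(hidden_size)] for gate in "ifog"}
        self.b = {gate: [0.0] * hidden_size for gate in "ifog"}

    def _affine(self, gate, z):
        return [sum(w * v for w, v in zip(row, z)) + bias
                for row, bias in zip(self.W[gate], self.b[gate])]

    def step(self, x, h, c):
        z = x + h  # list concatenation: [input ; previous hidden state]
        i = [sigmoid(a) for a in self._affine("i", z)]
        f = [sigmoid(a) for a in self._affine("f", z)]
        o = [sigmoid(a) for a in self._affine("o", z)]
        g = [math.tanh(a) for a in self._affine("g", z)]
        c_new = [fj * cj + ij * gj for fj, cj, ij, gj in zip(f, c, i, g)]
        h_new = [oj * math.tanh(cj) for oj, cj in zip(o, c_new)]
        return h_new, c_new

    def encode(self, inputs):
        """Run the cell over a sequence of vectors; return the final hidden state."""
        h = [0.0] * self.hidden_size
        c = [0.0] * self.hidden_size
        for x in inputs:
            h, c = self.step(x, h, c)
        return h


def encode_paragraph(paragraph, embeddings, word_lstm, sent_lstm):
    # Level 1: each sentence (a list of words) becomes one sentence vector.
    sentence_vecs = [word_lstm.encode([embeddings[w] for w in sentence])
                     for sentence in paragraph]
    # Level 2: the sequence of sentence vectors becomes one paragraph vector.
    return sent_lstm.encode(sentence_vecs)


DIM = 4  # hypothetical embedding and hidden size, kept tiny for readability
rng = random.Random(0)
paragraph = [["the", "cat", "sat"], ["it", "purred"]]  # toy input
vocab = sorted({w for sentence in paragraph for w in sentence})
embeddings = {w: [rng.uniform(-0.5, 0.5) for _ in range(DIM)] for w in vocab}
word_lstm = LSTMCell(DIM, DIM, rng)
sent_lstm = LSTMCell(DIM, DIM, rng)
paragraph_vec = encode_paragraph(paragraph, embeddings, word_lstm, sent_lstm)
print(len(paragraph_vec))
```

In a full auto-encoder, a decoder would mirror this structure in reverse, unrolling the paragraph vector into sentence vectors and then into words, and the weights would be trained to reconstruct the input paragraph.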

PUBLICATION RECORD

Publication year
2015
Venue
Annual Meeting of the Association for Computational Linguistics
Publication date
2015-06-02
Fields of study
Computer Science
Identifiers
DOI 10.3115/v1/P15-1107 arXiv 1506.01057
Source metadata
Semantic Scholar

CITED BY

A survey on ordering of text at different granular levels
2026cites this paper
Synthesizing Time-Series Gene Expression Data to Enhance Network Inference Performance Using Autoencoder
2025cites this paper
Harnessing Structured Knowledge: A Concept Map-Based Approach for High-Quality Multiple Choice Question Generation with Effective Distractors
2025cites this paper
Transformer-enhanced hierarchical encoding with multi-decoder for diversified MCQ distractor generation
2025cites this paper
A Cascaded Architecture for Extractive Summarization of Multimedia Content via Audio-to-Text Alignment
2025cites this paper
Good Things Come in Pairs: Paired Autoencoders for Inverse Problems
2025cites this paper
Semantic-assisted report generation with memory enhanced transformer using context-aware visual extractor
2025cites this paper
A Study on Generative Text Summarization Methods Based on Improved Pointer Generation Networks
2025cites this paper
REN-GAN: Generative adversarial network-driven rebar clutter elimination network in GPR image for tunnel defect identification
2024cites this paper
Prefix tuning with prompt augmentation for efficient financial news summarization
2024cites this paper
Distractor Generation in Multiple-Choice Tasks: A Survey of Methods, Datasets, and Evaluation
2024cites this paper
Distractor Generation for Multiple-Choice Questions: A Survey of Methods, Datasets, and Evaluation
2024cites this paper
Intent Recognition in Dialogue Systems
2024cites this paper
Propositional Extraction from Natural Speech in Small Group Collaborative Tasks
2024cites this paper
ESMDNN-PPI: a new protein–protein interaction prediction model developed with protein language model of ESM2 and deep neural network
2024cites this paper
Dissecting learning and forgetting in language model finetuning
2024cites this paper
Linguistically Informed Language Generation: A Multi-faceted Approach
2024cites this paper
Recursively Autoregressive Autoencoder for Pyramidal Text Representation
2024cites this paper
An Effective Machine Learning-based Segmentation and Feature Extraction Technique for Muscular-Disorder
2023cites this paper
Yoga Pose Detection Using Long-Term Recurrent Convolutional Network
2023cites this paper
Open-ended Long Text Generation via Masked Language Modeling
2023cites this paper
Channel and temporal-frequency attention UNet for monaural speech enhancement
2023cites this paper
ProTIP: Progressive Tool Retrieval Improves Planning
2023cites this paper
Multi‐task learning with contextual hierarchical attention for Korean coreference resolution
2023cites this paper
Workload-Aware Query Recommendation Using Deep Learning
2023influential citation
A Systematic survey on automated text generation tools and techniques: application, evaluation, and challenges
2023cites this paper
Unsupervised Candidate Answer Extraction through Differentiable Masker-Reconstructor Model
2023cites this paper
Enhancing Generation through Summarization Duality and Explicit Outline Control
2023cites this paper
Text-based emotion recognition using contextual phrase embedding model
2023cites this paper
A Multilevel Center Embedding approach for Sentence Similarity having Complex structures
2023cites this paper
A Probabilistic Data-Driven Method For Response-Based Load Shedding Against Fault-Induced Delayed Voltage Recovery in Power Systems
2023cites this paper
Abstractive Financial News Summarization via Transformer-BiLSTM Encoder and Graph Attention-Based Decoder
2023cites this paper
Controllable Conversation Generation with Conversation Structures via Diffusion Models
2023cites this paper
Exploring Natural Language Processing Methods for Interactive Behaviour Modelling
2023cites this paper
Adversarial Conversational Shaping for Intelligent Agents
2023cites this paper
Time-feature attention-based convolutional auto-encoder for flight feature extraction
2023cites this paper
MCASP: Multi-Modal Cross Attention Network for Stock Market Prediction
2023cites this paper
Control Channel Isolation in SDN Virtualization: A Machine Learning Approach
2023cites this paper
An Interpretable LSTM Network for Solar Flare Prediction
2023cites this paper
Attention Based BiGRU-2DCNN with Hunger Game Search Technique for Low-Resource Document-Level Sentiment Classification
2023influential citation
Neural Natural Language Processing for Long Texts: A Survey of the State-of-the-Art
2023cites this paper
Representation Learning
2022influential citation
Dew-Cloud-Based Hierarchical Federated Learning for Intrusion Detection in IoMT
2022cites this paper
Generating Coherent Narratives by Learning Dynamic and Discrete Entity States with a Contrastive Framework
2022cites this paper
Beyond word embeddings: A survey
2022cites this paper
Dense Text Retrieval Based on Pretrained Language Models: A Survey
2022cites this paper
StoryTrans: Non-Parallel Story Author-Style Transfer with Discourse Representations and Content Enhancing
2022cites this paper
Coherent Long Text Generation by Contrastive Soft Prompt
2022cites this paper
Robust hierarchical model for joint span detection and aspect-based sentiment analysis in Vietnamese
2022cites this paper
Context Bucketed Text Responses using Generative Adversarial Neural Network in Android Application with Tens or Flow-Lite Framework
2022cites this paper
Unsupervised Inference of Data-Driven Discourse Structures using a Tree Auto-Encoder
2022cites this paper
Text Feature Adversarial Learning for Text Generation With Knowledge Transfer From GPT2
2022cites this paper
Document-aware Positional Encoding and Linguistic-guided Encoding for Abstractive Multi-document Summarization
2022cites this paper
A Comprehensive Survey of Abstractive Text Summarization Based on Deep Learning
2022cites this paper
Context-aware ranking refinement with attentive semi-supervised autoencoders
2022cites this paper
Enhanced Story Comprehension for Large Language Models through Dynamic Document-Based Knowledge Graphs
2022cites this paper
QDG: A unified model for automatic question-distractor pairs generation
2022cites this paper
Autoencoders reloaded
2022cites this paper
Enhancing Text Generation via Parse Tree Embedding
2022cites this paper
A Proposal for a Method of Determining Contextual Semantic Frames by Understanding the Mutual Objectives and Situations Between Speech Recognition and Interlocutors
2022cites this paper
Forecasting Covid-19 Transmission with ARIMA and LSTM Techniques in Morocco
2022cites this paper
Implicit n-grams Induced by Recurrence
2022cites this paper
RCE-GAN: A Rebar Clutter Elimination Network to Improve Tunnel Lining Void Detection from GPR Images
2022cites this paper
Keywords and Instances: A Hierarchical Contrastive Learning Framework Unifying Hybrid Granularities for Text Generation
2022cites this paper
Dual Scene Graph Convolutional Network for Motivation Prediction
2022cites this paper
Classical Planning in Deep Latent Space
2021cites this paper
Interventional Assays for the Latent Space of Autoencoders
2021cites this paper
Modeling and augmenting of fMRI data using deep recurrent variational auto-encoder
2021cites this paper
Does Structure Matter? Encoding Documents for Machine Reading Comprehension
2021cites this paper
Long Text Generation by Modeling Sentence-Level and Discourse-Level Coherence
2021cites this paper
Hierarchical, Feature-Based Text Generation
2021cites this paper
Power Grid Stability Prediction Model Based on BiLSTM with Attention
2021cites this paper
SACNN: Self-attentive Convolutional Neural Network Model for Natural Language Inference
2021cites this paper
Generating Instructive Questions from Multiple Articles to Guide Reading in E-Bibliotherapy
2021cites this paper
Is There a Place for Responsible Artificial Intelligence in Pandemics? A Tale of Two Countries
2021cites this paper
BookSum: A Collection of Datasets for Long-form Narrative Summarization
2021cites this paper
Dominant motion identification of multi-particle system using deep learning from video
2021cites this paper
Prediction of CRISPR/Cas9 single guide RNA cleavage efficiency and specificity by attention-based convolutional neural networks
2021cites this paper
Knowledge transfer for adapting pre-trained deep neural models to predict different greenhouse environments based on a low quantity of data
2021cites this paper
Getting Your Conversation on Track: Estimation of Residual Life for Conversations
2021cites this paper
Scene Graphs: A Survey of Generations and Applications
2021cites this paper
HITS-based attentional neural model for abstractive summarization
2021cites this paper
Few-Shot Learning of an Interleaved Text Summarization Model by Pretraining with Synthetic Data
2021cites this paper
Improving Information Extraction from Visually Rich Documents using Visual Span Representations
2021cites this paper
Health Monitoring of Air Compressors Using Reconstruction-Based Deep Learning for Anomaly Detection with Increased Transparency
2021cites this paper
A citation recommendation method based on context correlation
2021cites this paper
A Novel Deep-learning Pipeline for Light Field Image Based Material Recognition
2021cites this paper
Medical Abridgement for Enhancing Physicians’ Accuracy
2021cites this paper
Scene Graphs: A Review of Generations and Applications
2021cites this paper
A Hierarchical Long Short-Term Memory Encoder-Decoder Model for Abstractive Summarization
2021cites this paper
SPUCL (Scientific Publication Classifier): A Human-Readable Labelling System for Scientific Publications
2021cites this paper
Exploring the Latent Space of Autoencoders with Interventional Assays
2021cites this paper
Entailment Method Based on Template Selection for Chinese Text Few-shot Learning
2021cites this paper
A Scheme for Efficient Question Answering with Low Dimension Reconstructed Embeddings
2021cites this paper
A Comprehensive Survey of Scene Graphs: Generation and Application
2021cites this paper
KI-HABS: Key Information Guided Hierarchical Abstractive Summarization
2021influential citation
Towards Learning Language Agnostic Features for NLP in Low-resource Languages
2021cites this paper
Review Summary Generation in Online Systems: Frameworks for Supervised and Unsupervised Scenarios
2021cites this paper
Unsupervised Syntactic Structure Induction in Natural Language Processing
2021cites this paper
Skeleton-based human activity recognition using ConvLSTM and guided feature learning
2021cites this paper