Optimal Multi-Paragraph Text Segmentation by Dynamic Programming

Published 1998 in Annual Meeting of the Association for Computational Linguistics

ABSTRACT

There exist several methods of calculating a similarity curve, or a sequence of similarity values, representing the lexical cohesion of successive text constituents, e.g., paragraphs. Methods for deciding the locations of fragment boundaries are, however, scarce. We propose a fragmentation method based on dynamic programming. The method is theoretically sound and guaranteed to provide an optimal splitting on the basis of a similarity curve, a preferred fragment length, and a cost function defined. The method is especially useful when control on fragment size is of importance.

PUBLICATION RECORD

Publication year
1998
Venue
Annual Meeting of the Association for Computational Linguistics
Publication date
1998-08-10
Fields of study
Computer Science
Identifiers
DOI 10.3115/980691.980814 arXiv cs/9812005
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Text Tiling: Segmenting Text into Multi-paragraph Subtopic Passages
1997cited by this paper
Segmentation of Expository Texts by Hierarchical Agglomerative Clustering
1997cited by this paper
Introduction to algorithms
1996cited by this paper
Passage-level evidence in document retrieval
1994cited by this paper
Multi-Paragraph Segmentation Expository Text
1994cited by this paper
Comparison of Fragmentation Schemes for Document Retrieval
1994cited by this paper
Text Segmentation Based on Similarity between Words
1993cited by this paper
A New Tool for Discourse Analysis: The Vocabulary-Management Profile.
1991cited by this paper
Lexical Cohesion Computed by Thesaural relations as an indicator of the structure of text
1991cited by this paper
Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer
1989cited by this paper

CITED BY

LSPNet: A Local Semantic Perception Network for Topic Segmentation
2025cites this paper
Broadcast news story segmentation using sticky hierarchical dirichlet process
2022cites this paper
Citation Data of Czech Apex Courts
2020cites this paper
Combining Information Extraction and Text Segmentation methods in Greek Texts
2018cites this paper
Learning distributed sentence representations for story segmentation
2018cites this paper
Texts Segmentation and Semantic Comparison: Method and Results of its Application
2018cites this paper
A Domain-Independent Text Segmentation Method for Educational Course Content
2018cites this paper
Topic embedding of sentences for story segmentation
2017cites this paper
A hybrid neural network hidden Markov model approach for automatic story segmentation
2017cites this paper
ePubWU Institutional Repository
2017cites this paper
A Semi-automatic Approach to Identify Business Process Elements in Natural Language Texts
2017cites this paper
Applying named entity recognition and co-reference resolution for segmenting English texts
2017cites this paper
Recognition of Business Process Elements in Natural Language Texts
2017cites this paper
An end-to-end neural network approach to story segmentation
2017cites this paper
A DNN-HMM Approach to Story Segmentation
2016cites this paper
Text Segmentation using Named Entity Recognition and Co-reference Resolution
2016influential citation
Use of named entity recognition and co-reference resolution tools for segmenting english texts
2015cites this paper
Supporting Process Model Validation through Natural Language Generation
2014cites this paper
A hybrid linear text segmentation algorithm using hierarchical agglomerative clustering and discrete particle swarm optimization
2014cites this paper
Topic segmentation on spoken documents using self-validated acoustic cuts
2014cites this paper
Natural Language in Business Process Models
2013cites this paper
Improving Text Segmentation with Clustering Cohesion
2013cites this paper
Laplacian Eigenmaps for Automatic Story Segmentation of Broadcast News
2012cites this paper
Advances on information processing and management
2011influential citation
New approach for collecting high quality parallel corpora from multilingual websites
2011cites this paper
ClustSeg : A Method for Text Segmentation
2010cites this paper
An Incremental Text Segmentation by Clustering Cohesion
2010cites this paper
Text Segmentation by Clustering Cohesion
2010cites this paper
Hierarchical Text Segmentation from Multi-Scale Lexical Cohesion
2009cites this paper
Efficient linear text segmentation based on information retrieval techniques
2009cites this paper
Segmentation of Greek Texts by Dynamic Programming
2008influential citation
Blogging activity among cancer patients and their companions: Uses, gratifications, and predictors of outcomes
2008cites this paper
A Dynamic Programming Algorithm for the Segmentation of
2008cites this paper
A Dynamic Programming Model for Text Segmentation Based on Min-Max Similarity
2008cites this paper
Question-driven segmentation of lecture speech text: Towards intelligent e-learning systems
2008cites this paper
Text Segmentation Model Based on Multiple Discriminant Analysis
2007cites this paper
An Improved Model of Dotplotting for Text Segmentation
2007influential citation
Word Distribution Based Methods for Minimizing Segment Overlaps
2007cites this paper
Segmentation of Greek Text by Dynamic Programming
2007influential citation
Informations morpho-syntaxiques et adaptation thématique pour améliorer la reconnaissance de la parole
2007cites this paper
Segmentation of Greek Text by Dynamic Programming
2007influential citation
TextLec: A Novel Method of Segmentation by Topic Using Lower Windows and Lexical Cohesion
2007cites this paper
Squibs and Discussions: Improving Text Segmentation Using Latent Semantic Analysis: A Reanalysis of Choi, Wiemer-Hastings, and Moore (2001)
2006cites this paper
Auto-Segmentation Based Partitioning and Clustering Approach to Robust Endpointing
2006cites this paper
Density Estimation via Optimal Segmentation
2005cites this paper
Using Multiple Discriminant Analysis Approach for Linear Text Segmentation
2005cites this paper
Amélioration de la segmentation automatique des textes grâce aux connaissances acquises par l’analyse sémantique latente
2005cites this paper
Improvement of the dotplotting method for linear text segmentation
2005cites this paper
Automated Video Segmentation for Lecture Videos: A Linguistics-Based Approach
2005cites this paper
Segmentation of lecture videos based on text: a method combining multiple linguistic features
2004cites this paper
A Dynamic Programming Algorithm for Linear Text Segmentation
2004influential citation
Domain-independent text segmentation using anisotropic diffusion and dynamic programming
2003influential citation
Text Segmentation by Product Partition Models and Dynamic Programming
2003cites this paper
Segmentation by Product Partition Models and Dynamic Programming
2003cites this paper
An algorithm for optimal partitioning of data on an interval
2003cites this paper
Linear Text Segmentation using a Dynamic Programming Algorithm
2003influential citation
A Dynamic Programming Algorithm for the Segmentation of Greek Texts
2003cites this paper
A Critique and Improvement of an Evaluation Metric for Text Segmentation
2002cites this paper
What have you done for me lately? The fickle alignment of NLP and CALL
2002cites this paper
A Statistical Model for Domain-Independent Text Segmentation
2001cites this paper
Title Segmentation of lecture videos based on text : A methodcombining multiple linguistic features
2001cites this paper
Lexical Segments in Text 1
2001cites this paper
Text Segmentation into Paragraphs Based on Local Text Cohesion
2001cites this paper
Advances in domain independent linear text segmentation
2000cites this paper
Generalization of Document Structures and Document Assembly
2000cites this paper
Úò Blockin Blockin× Ò Óñññò Òòòôòòòòø Ððòòòö Øøüø ×××ññòøøøøóò
2000cites this paper
Knowledge Discovery in Documents by Extracting Frequent Word Sequences
1999cites this paper
Rakenteisten dokumenttien koostamismalli ja koostamisjärjestelmä SAW
1998cites this paper