Shortest triplet clustering: reconstructing large phylogenies using representative sets

Published 2005 in BMC Bioinformatics

ABSTRACT

Background: Understanding the evolutionary relationships among species based on their genetic information is one of the primary objectives in phylogenetic analysis. Reconstructing phylogenies for large data sets is still a challenging task in bioinformatics.

Results: We propose a new distance-based clustering method, the shortest triplet clustering algorithm (STC), to reconstruct phylogenies. The main idea is the introduction of a natural definition of so-called k-representative sets. Based on k-representative sets, shortest triplets are reconstructed and serve as building blocks for the STC algorithm, which agglomerates sequences for tree reconstruction in O(n^2) time for n sequences. Simulations show that STC gives better topological accuracy than the other tested methods that also build a first starting tree, so STC appears to be a very good method for starting the tree reconstruction. However, all tested methods give similar results if balanced nearest neighbor interchange (BNNI) is applied as a post-processing step. BNNI leads to an improvement in all instances. The program is available at http://www.bi.uni-duesseldorf.de/software/stc/.

Conclusion: The results demonstrate that the new approach efficiently reconstructs phylogenies for large data sets. We found that BNNI boosts the topological accuracy of all methods, including STC; therefore, one should use BNNI as a post-processing step to get better topological accuracy.
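To make the notion of a triplet as a building block more concrete, here is a minimal Python sketch. It is not the STC algorithm: the abstract does not define k-representative sets, so the sketch simply enumerates all triplets by brute force in O(n^3) time, whereas STC uses representative sets to stay within O(n^2). The function names, the example matrix, and the reading of "shortest" as "smallest total branch length of the three-leaf star tree" are assumptions made for illustration. The branch lengths come from the standard three-point formula for tree distances.

```python
from itertools import combinations

def triplet_branch_lengths(d, x, y, z):
    """Three-point formula: branch lengths from x, y, z to the centre
    of the star tree joining them, given pairwise distances d."""
    bx = (d[x][y] + d[x][z] - d[y][z]) / 2
    by = (d[x][y] + d[y][z] - d[x][z]) / 2
    bz = (d[x][z] + d[y][z] - d[x][y]) / 2
    return bx, by, bz

def shortest_triplet(d):
    """Brute-force search (assumption: 'shortest' means minimal total
    branch length). Returns the indices of the best triplet."""
    n = len(d)
    return min(
        combinations(range(n), 3),
        key=lambda t: sum(triplet_branch_lengths(d, *t)),
    )

# Hypothetical additive distances on the tree ((A,B),(C,D)) with leaf
# branches A=1, B=2, C=3, D=4 and an internal branch of length 1.
D = [
    [0, 3, 5, 6],
    [3, 0, 6, 7],
    [5, 6, 0, 7],
    [6, 7, 7, 0],
]

best = shortest_triplet(D)
lengths = triplet_branch_lengths(D, *best)
print(best, lengths)
```

For the triplet (A, B, C), the formula recovers the leaf branches of A and B and the path from C to the point where it joins them, which is why triplets can serve as local building blocks when sequences are agglomerated.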

PUBLICATION RECORD

Publication year
2005
Venue
BMC Bioinformatics
Publication date
2005-04-08
Fields of study
Biology, Medicine, Computer Science
Identifiers
DOI 10.1186/1471-2105-6-92 PMID 15819989 PMCID 1097715
Source metadata
Semantic Scholar, PubMed

REFERENCES

Maximum-Likelihood Analysis Using TREE-PUZZLE
2007, cited by this paper
RAxML-III: a fast program for maximum likelihood-based inference of large phylogenetic trees
2005, cited by this paper
PhyNav: A Novel Approach to Reconstruct Large Phylogenies
2004, cited by this paper
IQPNNI: moving fast through tree space and stopping in time.
2004, influential reference
Inferring Phylogenies.—Joseph Felsenstein. 2003. Sinauer Associates, Sunderland, Massachusetts.
2004, cited by this paper
A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood.
2003, cited by this paper
TREE-PUZZLE: maximum likelihood phylogenetic analysis using quartets and parallel computing
2002, cited by this paper
Current Protocols in Bioinformatics
2002, cited by this paper
Genetic algorithms and parallel processing in maximum-likelihood phylogeny inference.
2002, cited by this paper
Fast and Accurate Phylogeny Reconstruction Algorithms Based on the Minimum-Evolution Principle
2002, influential reference
Hitch-Hiking: A Parallel Heuristic Search Strategy, Applied to the Phylogeny Problem
2001, cited by this paper
Fast Recovery of Evolutionary Trees with Thousands of Nodes
2001, influential reference
Weighted neighbor joining: a likelihood-based approach to distance-based phylogeny reconstruction.
2000, influential reference
Disk-Covering, a Fast-Converging Method for Phylogenetic Tree Reconstruction
1999, cited by this paper
Seq-Gen: an application for the Monte Carlo simulation of DNA sequence evolution along phylogenetic trees
1997, cited by this paper
PSeq-Gen: an application for the Monte Carlo simulation of protein sequence evolution along phylogenetic trees
1997, cited by this paper
BIONJ: an improved version of the NJ algorithm based on a simple model of sequence data.
1997, cited by this paper
Quartet Puzzling: A Quartet Maximum-Likelihood Method for Reconstructing Tree Topologies
1996, cited by this paper
The neighbor-joining method: a new method for reconstructing phylogenetic trees.
1987, influential reference
Simple method for constructing phylogenetic trees from distance matrices.
1981, cited by this paper
Comparison of phylogenetic trees
1981, cited by this paper
A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences
1980, influential reference
Calculation of evolutionary trees from sequence data.
1979, cited by this paper
On the Phenetic Approach to Vertebrate Classification
1977, cited by this paper
Clustering Algorithms
1975, cited by this paper
The Design and Analysis of Computer Algorithms
1974, cited by this paper
Mathematics in the Archaeological and Historical Sciences
1971, cited by this paper
The probabilities of rooted tree-shapes generated by random bifurcation
1971, influential reference
The Recovery of Trees from Measures of Dissimilarity
1971, cited by this paper
PHYLOGENETIC ANALYSIS: MODELS AND ESTIMATION PROCEDURES
1967, cited by this paper
Construction of phylogenetic trees.
1967, cited by this paper
Phylogenetic analysis. Models and estimation procedures.
1967, cited by this paper

CITED BY

Phyloformer: Fast, Accurate, and Versatile Phylogenetic Reconstruction with Deep Neural Networks
2024, cites this paper
Rooting Phylogenetic Trees from Protein Alignments
2023, cites this paper
A tutorial on the balanced minimum evolution problem
2021, cites this paper
A protein alignment partitioning method for protein phylogenetic inference
2020, cites this paper
Phylogenetic and Phylogenomic Analyses for Large Datasets
2019, cites this paper
CLASSIFICATION TECHNIQUE FOR DRUG DISCOVERY IN MEDICAL IMAGE PROCESSING
2017, cites this paper
Technique for Drug Discovery in Medical Image Processing
2017, cites this paper
Computational Phylogenetics: An Introduction to Designing Methods for Phylogeny Estimation
2017, cites this paper
Dealing with propositions, not with the characters: the ability of three-taxon statement analysis to recognise groups based solely on ‘reversals’, under the maximum-likelihood criteria
2016, cites this paper
Distance-Based Phylogenetic Inference
2016, cites this paper
ChemTreeMap: an interactive map of biochemical similarity in molecular datasets
2016, cites this paper
Distance-based methods in phylogenetics
2015, cites this paper
FastME 2.0: A Comprehensive, Accurate, and Fast Distance-Based Phylogeny Inference Program
2015, influential citation
A NEW TEST TO BUILD CONFIDENCE REGIONS USING BALANCED MINIMUM EVOLUTION
2013, cites this paper
Combinatorics of distance-based tree inference
2012, cites this paper
Évolution du VIH : méthodes, modèles et algorithmes
2012, cites this paper
EM-Coffee: An Improvement of M-Coffee
2010, cites this paper
Robustness of Phylogenetic Inference Based on Minimum Evolution
2010, cites this paper
Consistency of Topological Moves Based on the Balanced Minimum Evolution Principle of Phylogenetic Inference
2009, cites this paper
A phylogenetic potpourri: computational methods for analysing genome scale data
2009, cites this paper
Phylogenetic Inference with Weighted Codon Evolutionary Distances
2009, cites this paper
Maximum Similarity: A New Formulation of Phylogenetic Reconstruction
2009, influential citation
Distance-Based Phylogeny Reconstruction (Optimal Radius)
2008, cites this paper
Fast NJ-like algorithms to deal with incomplete distance matrices
2008, cites this paper
SimulFold: Simultaneously Inferring RNA Structures Including Pseudoknots, Alignments, and Trees Using a Bayesian MCMC Framework
2007, cites this paper
Inferring Phylogenies from LCA-Distances
2006, cites this paper
Genome BLAST distance phylogenies inferred from whole plastid and whole mitochondrion genome sequences
2006, influential citation
Neighbor-joining revealed.
2006, cites this paper
Phylogenetic supermatrix analysis of GenBank sequences from 2228 papilionoid legumes.
2006, cites this paper
Parallel Reconstruction of Large Maximum Likelihood Phylogenies
2005, cites this paper
Getting a tree fast: Neighbor Joining, FastME, and distance-based methods.
2003, influential citation