Using coevolution to improve protein subfamily classification

Franco L. Simonetti,M. Banchero,A. Berenstein,A. Chernomoretz,Cristina Marino Buslje

Published 2015 in BMC Bioinformatics

ABSTRACT

Background The common approach for protein subfamily classification relies on grouping protein sequences according to their degree of similarity. However, there is no single sequence similarity threshold for accurately grouping sequences into isofunctional groups. Current subfamily classification methods use bottom-up clustering to construct a cluster hierarchy, then cut the hierarchy at the most appropriate locations to obtain a single partitioning. These methods usually integrate data such as protein sequence similarity, residue conservation within groups and HMM profiles. Despite this straightforward approach, results usually predict a great number of subfamilies with few members and limited biological meaning. The goal of this study is to identify subsets of functionally related sequences within a given superfamily. Since all proteins within a superfamily share a common ancestor, we hypothesize that functional diversity within superfamilies has arisen through a series of concerted changes that must have left an identifiable coevolutionary signal.

PUBLICATION RECORD

Publication year
2015
Venue
BMC Bioinformatics
Publication date
2015-04-30
Fields of study
Biology, Computer Science
Identifiers
DOI 10.1186/1471-2105-16-S8-A6 PMCID 4423731
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

No references are available for this paper.

CITED BY

Attractor Stability in Finite Asynchronous Biological System Models
2019cites this paper
Griffin: A Tool for Symbolic Inference of Synchronous Boolean Molecular Networks
2018cites this paper
Imaging of DNA and Protein by SFM and Combined SFM-TIRF Microscopy.
2018cites this paper
Curve computation by geodesics and graph modelling for polymer analysis
2017cites this paper
A Network Model to Explore the Effect of the Micro-environment on Endothelial Cell Behavior during Angiogenesis
2017cites this paper
Experimental Investigation of Frequency Chaos Game Representation for in Silico and Accurate Classification of Viral Pathogens from Genomic Sequences
2017cites this paper
Curve Extraction by Geodesics Fusion: Application to Polymer Reptation Analysis
2016cites this paper
Organogenesis of the C. elegans Vulva and Control of Cell Fusion
2016cites this paper
Highlights from the 1st ISCB Latin American Student Council Symposium 2014
2015cites this paper