Assessing Coverage of Protein Interaction Data Using Capture–Recapture Models

Published 2012 in Bulletin of Mathematical Biology

ABSTRACT

Protein interaction networks comprise thousands of individual binary links between distinct proteins. Whilst these data have attracted considerable attention and been the focus of many different studies, the networks, their structure, function, and how they change over time are still not fully known. More importantly, there is still considerable uncertainty regarding their size, and the quality of the available data continues to be questioned. Here, we employ statistical models of the experimental sampling process, in particular capture–recapture methods, in order to assess the false discovery rate and size of protein interaction networks. We uses these methods to gauge the ability of different experimental systems to find the true binary interactome. Our model allows us to obtain estimates for the size and false-discovery rate from simple considerations regarding the number of repeatedly interactions, and provides suggestions as to how we can exploit this information in order to reduce the effects of noise in such data. In particular our approach does not require a reference dataset. We estimate that approximately more than half of the true physical interactome has now been sampled in yeast.

PUBLICATION RECORD

Publication year
2012
Venue
Bulletin of Mathematical Biology
Publication date
2012-02-01
Fields of study
Biology, Medicine, Computer Science
Identifiers
DOI 10.1007/s11538-011-9680-2 PMID 21870201
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar, PubMed

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Biological Networks
2013cited by this paper
Prediction of putative protein interactions through evolutionary analysis of osmotic stress response in the model yeast Saccharomyces cerevisae.
2011cited by this paper
Topology of protein interaction network shapes protein abundances and strengths of their functional and nonspecific interactions
2011cited by this paper
Sub-Modular Resolution Analysis by Network Mixture Models
2010cited by this paper
Statistical inference of the time-varying structure of gene-regulation networks
2010cited by this paper
Trees on networks: resolving statistical patterns of phylogenetic similarities among interacting proteins
2010cited by this paper
Protein-protein interactions: from global to local analyses.
2008cited by this paper
Deducing topology of protein-protein interaction networks from experimentally measured sub-networks
2008cited by this paper
Estimating the size of the human interactome
2008influential reference
Estimating collection size with logistic regression
2007cited by this paper
Coverage and error models of protein-protein interaction data by directed graph analysis
2007influential reference
Where Have All the Interactions Gone? Estimating the Coverage of Two-Hybrid Protein Interaction Maps
2007influential reference
Making the most of high-throughput protein-interaction data
2007influential reference
The effects of incomplete protein interaction data on structural and evolutionary inferences
2006cited by this paper
How complete are current yeast and human protein-interaction networks?
2006cited by this paper
Capturing collection size for distributed non-cooperative retrieval
2006cited by this paper
Subnets of scale-free networks are not scale-free: sampling properties of networks.
2005cited by this paper
Modelling gene networks at different organisational levels
2005cited by this paper
Complex networks and simple models in biology
2005cited by this paper
Derivation of genetic interaction networks from quantitative phenotype data
2005cited by this paper
Genome Snapshot: a new resource at the Saccharomyces Genome Database (SGD) presenting an overview of the Saccharomyces cerevisiae genome
2005cited by this paper
Estimating and improving protein interaction error rates
2004cited by this paper
Gaining confidence in high-throughput protein interaction networks
2004cited by this paper
On the number of protein-protein interactions in the yeast proteome.
2003cited by this paper
Functional classification of proteins for the prediction of cellular function from a protein-protein interaction network
2003cited by this paper
Biological networks.
2003cited by this paper
Comparative assessment of large-scale data sets of protein–protein interactions
2002cited by this paper
An overview of closed capture-recapture models
2001cited by this paper
Estimating the Number of Species: A Review
1993cited by this paper
Estimation of the size of a closed population when capture probabilities vary among animals
1978cited by this paper

CITED BY

Position Matters: Network Centrality Considerably Impacts Rates of Protein Evolution in the Human Protein–Protein Interaction Network
2017cites this paper
Development of network-based analysis methods with application to the genetic component of asthma
2017cites this paper
Quantifying noise in mass spectrometry and yeast two-hybrid protein interaction detection experiments
2015cites this paper
Inferring Functional Divergence in Protein Sequences
2014cites this paper
Protein stickiness, rather than number of functional protein-protein interactions, predicts expression noise and plasticity in yeast
2012cites this paper