SCUMBLE: a method for systematic and accurate detection of codon usage bias by maximum likelihood estimation

Published 2008 in Nucleic Acids Research

ABSTRACT

The genetic code is degenerate: most amino acids can be encoded by two to six different codons. Synonymous codons are not used with equal frequency. Some codons are favored over others, and their usage can vary significantly from species to species and between different genes in the same organism. Known causes of codon bias include differences in mutation rates as well as selection pressure related to the expression level of a gene, but the standard analysis methods can account for only a fraction of the observed codon usage variation. Here we introduce an explicit model of codon usage bias, inspired by statistical physics. Combining this model with a maximum likelihood approach, we are able to clearly identify different sources of bias in various genomes. We have applied the algorithm to Saccharomyces cerevisiae as well as 325 prokaryote genomes, and in most cases our model explains essentially all observed variance.
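To make the notion of unequal synonymous codon usage concrete, the following is a minimal sketch that counts codons in a coding sequence and reports, for each amino acid, the relative frequency of each synonymous codon. Under a simple multinomial model with one probability vector per amino acid, these observed frequencies are the maximum likelihood estimates. This is only an illustration and is not the SCUMBLE model, which the abstract does not specify; the function name `codon_frequencies`, the two-family `SYNONYMS` table, and the toy sequence are assumptions made for this example.

```python
from collections import Counter

# Two synonymous codon families from the standard genetic code.
# (Illustrative subset only; a real analysis would cover all amino acids.)
SYNONYMS = {
    "Lys": ["AAA", "AAG"],                              # two codons
    "Leu": ["TTA", "TTG", "CTT", "CTC", "CTA", "CTG"],  # six codons
}

def codon_frequencies(cds):
    """Return relative synonymous codon frequencies per amino acid.

    For each amino acid, count / total is the maximum likelihood
    estimate of the codon probabilities under a multinomial model.
    """
    # Split the sequence into complete, in-frame codons.
    usable = len(cds) - len(cds) % 3
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    counts = Counter(codons)

    freqs = {}
    for amino_acid, family in SYNONYMS.items():
        total = sum(counts[c] for c in family)
        if total == 0:
            continue  # amino acid absent from this sequence
        freqs[amino_acid] = {c: counts[c] / total for c in family}
    return freqs

# Hypothetical toy sequence: AAG AAG AAG AAA CTG CTG TTA
toy_cds = "AAGAAGAAGAAACTGCTGTTA"
freqs = codon_frequencies(toy_cds)
for amino_acid, table in freqs.items():
    print(amino_acid, table)
```

In the toy sequence, lysine is encoded by AAG three times out of four, so the two synonymous codons are clearly not used equally. Per-gene tables of this kind are the raw data that any codon bias analysis starts from.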

PUBLICATION RECORD

Publication year: 2008
Venue: Nucleic Acids Research
Publication date: 2008-05-21
Fields of study: Biology, Medicine, Computer Science
Identifiers: DOI 10.1093/nar/gkn288; PMID 18495752; PMCID 2441815
Source metadata: Semantic Scholar, PubMed
