corseq: fast and efficient identification of favoured codons from next generation sequencing reads

Published 2018 in PeerJ

ABSTRACT

Background Optimization of transgene expression can be achieved by designing coding sequences with the synonymous codon usage of genes which are highly expressed in the host organism. The identification of the so-called “favoured codons” generally requires the access to either the genome or the coding sequences and the availability of expression data. Results Here we describe corseq, a fast and reliable software for detecting the favoured codons directly from RNAseq data without prior knowledge of genomic sequence or gene annotation. The presented tool allows the inference of codons that are preferentially used in highly expressed genes while estimating the transcripts abundance by a new kmer based approach. corseq is implemented in Python and runs under any operating system. The software requires the Biopython 1.65 library (or later versions) and is available under the ‘GNU General Public License version 3’ at the project webpage https://sourceforge.net/projects/corseq/files. Conclusion corseq represents a faster and easy-to-use alternative for the detection of favoured codons in non model organisms.

PUBLICATION RECORD

Publication year
2018
Venue
PeerJ
Publication date
2018-07-04
Fields of study
Biology, Medicine, Computer Science
Identifiers
DOI 10.7717/peerj.5099 PMID 30013827 PMCID 6035725
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar, PubMed

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

The Evolutionary Basis of Translational Accuracy in Plants
2017cited by this paper
Ancient DNA and the rewriting of human history: be sparing with Occam’s razor
2016cited by this paper
A survey of best practices for RNA-seq data analysis
2016cited by this paper
Codon and Amino Acid Usage Are Shaped by Selection Across Divergent Model Organisms of the Pancrustacea
2015cited by this paper
Estimating Gene Expression and Codon-Specific Translational Efficiencies, Mutation Biases, and Selection Coefficients from Genomic Data Alone‡
2015cited by this paper
GtRNAdb 2.0: an expanded database of transfer RNA genes identified in complete and draft genomes
2015cited by this paper
Codon Bias as a Means to Fine-Tune Gene Expression.
2015cited by this paper
A critical analysis of codon optimization in human therapeutics.
2014cited by this paper
Codon-by-Codon Modulation of Translational Speed and Accuracy Via mRNA Folding
2014cited by this paper
Seforta, an integrated tool for detecting the signature of selection in coding sequences
2014cited by this paper
The Signatures of Selection for Translational Accuracy in Plant Genes
2013cited by this paper
Streaming fragment assignment for real-time analysis of sequencing experiments
2012cited by this paper
Optimal Codon Identities in Bacteria: Implications from the Conflicting Results of Two Different Methods
2011cited by this paper
Explaining complex codon usage patterns with selection for translational efficiency, mutation bias, and genetic drift
2011cited by this paper
Synonymous but not the same: the causes and consequences of codon bias
2011cited by this paper
Fast and accurate long-read alignment with Burrows–Wheeler transform
2010cited by this paper
Biopython: freely available Python tools for computational molecular biology and bioinformatics
2009cited by this paper
Translationally optimal codons associate with structurally sensitive sites in proteins.
2009cited by this paper
Divergence times in Caenorhabditis and Drosophila inferred from direct estimates of the neutral mutation rate.
2008cited by this paper
Molecular evolution of synonymous codon usage in Populus
2008cited by this paper
Codon optimization reveals critical factors for high level expression of two rare codon genes in Escherichia coli: RNA stability and secondary structure but not tRNA abundance.
2004cited by this paper
Analysis of codon usage.
2000cited by this paper
Expression pattern and, surprisingly, gene length shape codon usage in Caenorhabditis, Drosophila, and Arabidopsis.
1999cited by this paper
Characterization of the promoter region and genomic organization of GLI, a member of the Sonic hedgehog-Patched signaling pathway.
1998cited by this paper
Codon usage and gene function are related in sequences of Arabidopsis thaliana.
1998cited by this paper
Evolution of codon usage patterns: the extent and nature of divergence between Candida albicans and Saccharomyces cerevisiae.
1992cited by this paper
The selection-mutation-drift theory of synonymous codon usage.
1991cited by this paper
"Silent" sites in Drosophila genes are not neutral: evidence of selection among synonymous codons.
1988cited by this paper

CITED BY

Visualizing Codon Usage Within and Across Genomes: Concepts and Tools
2020cites this paper