NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins

Published 2006 in Nucleic Acids Research

ABSTRACT

NCBI's reference sequence (RefSeq) database () is a curated non-redundant collection of sequences representing genomes, transcripts and proteins. The database includes 3774 organisms spanning prokaryotes, eukaryotes and viruses, and has records for 2 879 860 proteins (RefSeq release 19). RefSeq records integrate information from multiple sources, when additional data are available from those sources and therefore represent a current description of the sequence and its features. Annotations include coding regions, conserved domains, tRNAs, sequence tagged sites (STS), variation, references, gene and protein product names, and database cross-references. Sequence is reviewed and features are added using a combined approach of collaboration and other input from the scientific community, prediction, propagation from GenBank and curation by NCBI staff. The format of all RefSeq records is validated, and an increasing number of tests are being applied to evaluate the quality of sequence and annotation, especially in the context of complete genomic sequence.

PUBLICATION RECORD

Publication year
2006
Venue
Nucleic Acids Research
Publication date
2006-11-27
Fields of study
Not labeled
Identifiers
DOI 10.1093/nar/gkl842 PMCID 1716718
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

No references are available for this paper.

CITED BY

High‐Resolution Metabarcoding Reveals the Microbiome Dynamics of Mangrove Oysters (Crassostrea gasar) and Their Habitat
2026cites this paper
5′ UTR length regulates alternative N-terminal protein isoform production in health and disease
2026cites this paper
Integrative genome-wide association study and transcriptomic analyses unveil key candidate genes regulating fruit shape diversity in mango
2026cites this paper
Molecular clock evidence for an Archean diversification of heme-copper oxygen reductase enzymes
2026cites this paper
Telomere to telomere level genome assembly of the Yarkand hare (Lepus yarkandensis).
2026cites this paper
Host whole genome sequence data represent an untapped resource for characterising affiliated parasite diversity.
2026cites this paper
Diversity and ecological roles of hidden viral players in groundwater microbiomes
2026cites this paper
Complete mitochondrial genome of the stingless bee Geniotrigona thoracica (Hymenoptera, Apidae, Meliponini): presence of genome duplication, heteroplasmy and inverted repeats
2026cites this paper
Diversity, transfer potential, and transcriptional activity of virus‐carried antibiotic resistance genes in global estuaries
2026cites this paper
Graph Alignment Methods for the Development of Interoperable Gene Regulation Knowledge Graphs
2026cites this paper
Mapping antibiotic resistance determinants in oral streptococci
2026cites this paper
Structure, evolution, phylogeny, and analysis of domain-deficient genes in the IQD gene family of Brassica juncea.
2026cites this paper
Emergence and Tandem Repeat-Mediated Elongation of a Translated De Novo Open Reading Frame in Human Oncogenic RNA Gene VPS9D1-AS1 (MYU)
2026cites this paper
Rejuvenation-Responsive and Senolytic-Sensitive Muscle Stem Cells Unveiled by CD200 and CD63 in Geriatric Muscle
2026influential citation
Predicting beef diet nutritional composition and intake from rumen metagenomic profiles
2026cites this paper
Insight into cryopreservation mechanism of natural deep eutectic solvents (NADESs) from the transcriptomics perspective
2026cites this paper
The role of quorum sensing in rhizosphere community regulation during bacterial wilt pathogen invasion.
2026cites this paper
NP-CAM: Efficient and Scalable DNA Classification using a NoC-Partitioned CAM Architecture
2026cites this paper
Molecular Phylogeny of “Chemosensory Proteins” in Bacteria and Arthropods: CSP as an Extremely Ancient Gene
2026cites this paper
Bacteria sense the antibiotic rifampicin through a widespread dual-promoter based alarm system
2026cites this paper
Elucidating Scent and Color Variation in White and Pink-Flowered Hydrangea arborescens ‘Annabelle’ Through Multi-Omics Profiling
2026cites this paper
Bioinformatic identification of putative intergenic small open-reading frames in the grapevine genome and their roles in responding to biotic and abiotic stress
2026cites this paper
Host range and antibiotic resistance dissemination are shaped by distinct survival strategies of conjugative plasmids
2026cites this paper
Transmission strategy modulates parasite biogeography in an island-colonising bird
2026cites this paper
Chromosome-level genome assembly of the medicinal plant Ophiorrhiza japonica Blume.
2026cites this paper
Global Metagenomics Reveals Hidden Protist Diversity
2026cites this paper
AntiPan: a genome-informed in silico pipeline for advancing subunit vaccine discovery against Staphylococcus aureus
2026cites this paper
Izzy: a high-throughput metagenomic read simulator
2025cites this paper
Classification models distinguish functional and trafficking effects of KCNQ1 variants to enhance variant interpretation
2025cites this paper
Chromosome-Level Genome Assembly and Annotation of the Japanese Cutlassfish (Trichiurus japonicus): A High-Quality Genomic Resource Featuring Nuclear and Mitochondrial Completeness for Future Studies
2025influential citation
Barbell Resolves Demultiplexing and Trimming Issues in Nanopore Data
2025cites this paper
A chromosome-level genome assembly of Verticillium albo-atrum, an dangerous quarantine pathogen known for causing verticillium wilt
2025influential citation
Comprehensive multi omics explore the microbial function in metabolic pathway flow during altered diet
2025cites this paper
Sewers to Seas: exploring pathogens and antimicrobial resistance on microplastics from hospital wastewater to marine environments.
2025cites this paper
Telomere-to-telomere gapless genome assembly of Acorus tatarinowii
2025cites this paper
Genome assembly and insights into globally invasive Red-vented Bulbul (Pycnonotus cafer)
2025cites this paper
PAHG: the database of human multi-gene families
2025cites this paper
Cataloging the potential functional diversity of Cacna1e splice variants using long-read sequencing
2025cites this paper
Chromosome-level genome assembly of the caddisfly Stenopsyche angustata (Insecta: Trichoptera)
2025cites this paper
Genomic and metabonomic insights into the lignin-degrading potential of a novel halophilic bacterial strain Salinicoccus sp. HZC-1
2025cites this paper
Pathways and Risks of Antibiotic Resistance Genes Along a Rural to Urban River Gradient
2025cites this paper
Blini: lightweight nucleotide sequence search and dereplication
2025cites this paper
The dynamics of the gut microbiota in prediabetes during a four-year follow-up among European patients—an IMI-DIRECT prospective study
2025cites this paper
Amoxicillin Resistance: An In Vivo Study on the Effects of an Approved Formulation on Antibiotic Resistance in Broiler Chickens
2025influential citation
Chromosome-level genome assembly of starry flounder (Platichthys stellatus)
2025cites this paper
Efficient selection of pyruvate decarboxylase sequences from database for high ethanol productivity in Synechocystis sp. PCC 6803.
2025cites this paper
Leveraging the enrichment analysis from a genome-wide association study against epilepsy—focusing on the role of tryptophan catabolites pathway in patients with drug-resistant epilepsy
2025cites this paper
Transcriptome sequencing reveals the evolutionary histories and gene expression evolution in two related Pagurus species
2025cites this paper
BSA-Seq-Based Discovery of Functional InDel Markers for Seed Size Selection in Litchi (Litchi chinensis Sonn.)
2025cites this paper
Pangenomic Framework and Comparative Genomics of Carbapenem-Resistant Acinetobacter baumannii Clinical Isolates Across Asia: Unraveling the Molecular Trajectories of Antimicrobial Resistance and Virulence
2025cites this paper
Genetic signatures of exceptional longevity: a comprehensive analysis of coding region single nucleotide polymorphisms (SNPs) in centenarians and supercentenarians
2025cites this paper
Human gut microbiome study through metagenomics: Recent advances and challenges for clinical implementation.
2025cites this paper
De novo transcriptome assembly and gene expression analysis of Cnidium officinale under high-temperature conditions
2025influential citation
Multi-tissue transcriptome of Brycon amazonicus (Spix & Agassiz, 1829): insights into lipid metabolism in an Amazonian fish.
2025cites this paper
Gap-free telomere-to-telomere genome assembly of marbled flounder (Pseudopleuronectes yokohamae)
2025cites this paper
Structure-guided drug repurposing and dynamics simulation reveal anti-viral candidates for Bourbon virus
2025cites this paper
Epigenetic characterization of pseudogenes across human tissues
2025cites this paper
Exploring the evolutionary divergence of cyclic di-nucleotide signaling in diverse mycobacterial species
2025cites this paper
Parallel Evolution of Bacteroidota as Long-Term Endosymbionts of Insects
2025cites this paper
Peptide Mass Fingerprinting of South American Xenarthrans: A New Resource for Zooarcheology and Palaeontology
2025cites this paper
Horizontal transfer of matrix metalloproteinase genes links early animal and microbial evolution
2025cites this paper
Multi-omics profiling of Curcuma Wenyujin under salt-alkali stress reveals functional genes and associated metabolites
2025cites this paper
Crucial roles of intracellular cyclic di-GMP in impacting the genes important for extracellular electron transfer by Geobacter metallireducens
2025cites this paper
Genetic regulation of lncRNA expression in whole human brain and their contribution to CNS disorders
2025cites this paper
Subtype reclassification and viral mRNA expression profiling of Red Seabream Iridovirus (RSIV) via comparative genomic and transcriptomic analysis
2025cites this paper
Sociobiome signals by high income for increased mobile genetic elements in the gut microbiome of Chinese individuals
2025cites this paper
Haplotype-resolved chromosome-level genome sequence of Elsholtzia splendens (Nakai ex F.Maek.)
2025influential citation
High-quality reference genome and population analysis of allotetraploid Elymus sibiricus provide insight into genome origin and environmental adaptations to the Qinghai-Tibetan Plateau
2025cites this paper
HPD-Kit: a comprehensive toolkit for pathogen detection and analysis
2025cites this paper
Bakta Web – rapid and standardized genome annotation on scalable infrastructures
2025cites this paper
Comparative Global Metabolome Profile and Transcriptome Sequence Analysis of the Rough and Smooth Peel of the Orah Mandarin (Citrus reticulata)
2025cites this paper
Accelerated Pseudogenization in the Ancient Endosymbionts of Giant Scale Insects
2025cites this paper
Ancestral Sequence Reconstruction of the Ethylene-Forming Enzyme
2025cites this paper
Common non-antibiotic drugs enhance selection for antimicrobial resistance in mixture with ciprofloxacin
2025cites this paper
A chromosomal-level genome assembly of the American shad: insights into phylogenetic relationships
2025cites this paper
Ayu: a machine intelligence tool for identification of extracellular proteins in the marine secretome
2025cites this paper
Targeting HIV-1 conserved regions: An immunoinformatic pathway to vaccine innovation for the Asia
2025cites this paper
Pyridoxine dehydrogenase SePdx regulates photosynthesis via an association with the phycobilisome in a cyanobacterium
2025cites this paper
Chromosome-level genome assembly and annotation of Chinese herring (Ilisha elongata)
2025cites this paper
Community‐Level Metabolic Shifts Following Land Use Change in the Amazon Rainforest Identified by a Supervised Machine Leaning Approach
2025influential citation
Progressing microbial genomics: Artificial intelligence and deep learning driven advances in genome analysis and therapeutics
2025cites this paper
Screening and transcriptomic analysis of anti-Sporothrix globosa targeting AbaA
2025cites this paper
In Silico Characterization of Resistance and Virulence Genes in Aeromonas jandaei Strains Isolated from Oreochromis niloticus in Brazil
2025cites this paper
Proteome trade-off between primary and secondary metabolism shapes acid stress induced bacterial exopolysaccharide production.
2025cites this paper
Scalable and Maintainable Distributed Sequence Alignment Using Spark
2025cites this paper
Host range and ARG dissemination are shaped by distinct survival strategies of conjugative plasmids
2025cites this paper
Hidden viral players: Diversity and ecological roles of viruses in groundwater microbiomes
2025influential citation
The high-quality telomere-to-telomere genome assembly of the earthworm (Amynthas aspergillum)
2025cites this paper
Acidification alters anxiety-like behaviour and brain gene expression in zebrafish.
2025cites this paper
Genomic evidence for flies as carriers of zoonotic pathogens on dairy farms
2025cites this paper
Genome-wide and transcriptome analysis of PdWRKY transcription factors in date palm (Phoenix dactylifera) revealing insights into heat and drought stress tolerance
2025cites this paper
Identification of candidate genes associated with bipolar disorder by whole-exome sequencing of a Chinese multi-affected pedigree
2025cites this paper
In Silico and In Vitro development of novel small interfering RNAs (siRNAs) to inhibit SARS-CoV-2
2025cites this paper
Effects of Gallic Acid on In Vitro Ruminal Fermentation, Methane Emission, Microbial Composition, and Metabolic Functions
2025influential citation
Estudio del microbioma intestinal humano mediante metagenómica: avances recientes y desafíos para su implementación clínica
2025cites this paper
Bag-of-words is competitive with sum-of-embeddings language-inspired representations on protein inference
2025cites this paper
Genome annotation, comparative genomics and transcriptomic analysis of Eucalyptus cloeziana reveal insights into genome evolution and wood formation in Eucalyptus
2025cites this paper
Analysis wheat wild relatives Thinopyrum intermedium and Roegneria kamoji genomes reveal different polyploid evolution paths
2025cites this paper
Non-coding RNA profiling in BRAFV600E-mutant cutaneous melanoma before and after Spry1 depletion
2025cites this paper
Haptophyte-infecting viruses change the genome condensing proteins of dinoflagellates
2025cites this paper