GSA: Genome Sequence Archive*

Yanqing Wang,Fuhai Song,Junwei Zhu,Sisi Zhang,Yadong Yang,Tingting Chen,Bixia Tang,Lili Dong,N. Ding,Qian Zhang,Z. Bai,Xunong Dong,Huanxin Chen,Mingyuan Sun,S. Zhai,Yubin Sun,Lei Yu,Li Lan,Jingfa Xiao,Xiangdong Fang,Hongxing Lei

Published 2017 in Genomics, Proteomics & Bioinformatics

ABSTRACT

With the rapid development of sequencing technologies towards higher throughput and lower cost, sequence data are generated at an unprecedentedly explosive rate. To provide an efficient and easy-to-use platform for managing huge sequence data, here we present Genome Sequence Archive (GSA; http://bigd.big.ac.cn/gsa or http://gsa.big.ac.cn), a data repository for archiving raw sequence data. In compliance with data standards and structures of the International Nucleotide Sequence Database Collaboration (INSDC), GSA adopts four data objects (BioProject, BioSample, Experiment, and Run) for data organization, accepts raw sequence reads produced by a variety of sequencing platforms, stores both sequence reads and metadata submitted from all over the world, and makes all these data publicly available to worldwide scientific communities. In the era of big data, GSA is not only an important complement to existing INSDC members by alleviating the increasing burdens of handling sequence data deluge, but also takes the significant responsibility for global big data archive and provides free unrestricted access to all publicly available data in support of research activities throughout the world.

PUBLICATION RECORD

Publication year
2017
Venue
Genomics, Proteomics & Bioinformatics
Publication date
2017-02-01
Fields of study
Biology, Medicine, Computer Science, Environmental Science
Identifiers
DOI 10.1016/j.gpb.2017.01.001 PMID 28387199 PMCID 5339404
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar, PubMed

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Database resources of the National Center for Biotechnology Information
2017cited by this paper
The BIG Data Center: from deposition to integration to translation
2016cited by this paper
Precision Medicine: What Challenges Are We Facing?
2016cited by this paper
A new initiative on precision medicine.
2015cited by this paper
The International Nucleotide Sequence Database Collaboration
2015cited by this paper
The European Bioinformatics Institute in 2016: Data growth and integration
2015cited by this paper
Large-scale whole-genome sequencing of the Icelandic population
2015cited by this paper
Whole-genome sequence-based analysis of thyroid function
2015cited by this paper
DNA data bank of Japan (DDBJ) progress report
2015cited by this paper
DoGSD: the dog and wolf genome SNP database
2014cited by this paper
Database resources of the National Center for Biotechnology Information
2014cited by this paper
The International Nucleotide Sequence Database Collaboration
2011cited by this paper
Data Integration in Bioinformatics: Current Efforts and Challenges
2011cited by this paper
The International Nucleotide Sequence Database Collaboration
2010cited by this paper

CITED BY

The Role of Boredom Proneness in Moderation of The Relationship between FoMO and Phubbing Behavior of Yogyakarta's Gen-Z
2026cites this paper
Characteristics of microbial communities in water, sediment, and fish intestines: A comparison between lotus-fish co-culture and intensive pond culture systems and their potential ecological impacts
2026cites this paper
CD19 exon 2 skipping is a potential prognostic correlate of anti-CD19 CAR-T therapy relapse.
2026cites this paper
Genome Variation Map: a platform for the analysis and integration of genomic variation
2025cites this paper
The GSA Family in 2025: A Broadened Sharing Platform for Multi-omics and Multimodal Data
2025cites this paper
A biGWAS strategy reveals the genetic architecture of the interaction between wheat and Blumeria graminis f. sp. tritici
2025cites this paper
BAP31-ELAVL1-SPINK6 axis induces loss of cell polarity and promotes metastasis in hepatocellular carcinoma
2025cites this paper
Single‐Cell Analysis Reveals that Vitamin C Inhibits Bone Metastasis of Renal Cancer via Cell Cycle Arrest and Microenvironment Remodeling
2025cites this paper
Database resources of the National Genomics Data Center, China National Center for Bioinformation in 2026
2025cites this paper
Evaluation of Raw Cell-Free DNA Sequences for Gastric Cancer Detection.
2025cites this paper
Loss of med14 causes developmental malformations characteristic of VACTERL association by disrupting the Mediator complex
2025cites this paper
Multi-dimensional annotation of porcine variants using genomic and epigenomic features in pigs
2025cites this paper
SEdb 3.0: a comprehensive super-enhancer database across multiple species
2025cites this paper
A Systematic Method to Detect Next-Generation Sequencing-Based Microsatellite Instability in Plasma Cell-Free DNA: plasmaMSI.
2025cites this paper
Data-driven identification of core tumor-secreted factors associated with cachexia prevalence
2025cites this paper
Single-cell transcriptomes reveal cell-type-specific and sample-specific gene function in human cancer
2025cites this paper
Systematically Revealing Quantitative Multi‐Target Integrative Effects of Plants With Artificial Intelligence Method
2025cites this paper
A comprehensive omics resource and genetic tools for functional genomics research and genetic improvement of sorghum.
2025cites this paper
High-quality chromosome-level genome assembly of the snake Pseudoxenodon stejnegeri (Squamata: Colubridae)
2025cites this paper
CIRCpedia v3: an interactive database for circular RNA characterization and functional exploration
2025cites this paper
Bacterial population dynamics and biosafety assessment in a closed Bioregenerative Life Support System during the “Lunar Palace 365” experiment
2025cites this paper
Artificial intelligence in gut microbiome research: Toward predictive diagnostics for neurodegenerative disorders.
2025cites this paper
Genomic and phenotypic characterization of antimicrobial resistance in clinical Nocardia species isolates
2025cites this paper
Targeting ferroptosis in prostate cancer management: molecular mechanisms, multidisciplinary strategies and translational perspectives
2025cites this paper
Legacy effects of preceding crops improve flue-cured tobacco productivity in southwest China by optimizing soil structure, nutrients, and microbial interactions
2024cites this paper
Reprogramming of 3D genome structure underlying HSPC development in zebrafish
2024cites this paper
Genome-wide association study, population structure, and genetic diversity of the tea plant in Guizhou Plateau
2024cites this paper
An ultra-dense linkage map identified quantitative trait loci corresponding to fruit quality- and size-related traits in red goji berry
2024cites this paper
Dog10K: an integrated Dog10K database summarizing canine multi-omics
2024cites this paper
Genomic and transcriptomic analysis of breast cancer identifies novel signatures associated with response to neoadjuvant chemotherapy
2024cites this paper
Metabolomic and Transcriptomic Analyses of Flavonoid Biosynthesis in Different Colors of Soybean Seed Coats
2024cites this paper
Deep learning can predict subgenome dominance in ancient but not in neo/synthetic polyploidized genomes.
2024cites this paper
Unveiling axolotl transcriptome for tissue regeneration with high-resolution annotation via long-read sequencing
2024cites this paper
scTML: a pan-cancer single-cell landscape of multiple mutation types
2024cites this paper
Systematic identification and functional analysis of root meristem growth factors (RGFs) reveals role of PgRGF1 in modulation of root development and ginsenoside production in Panax ginseng.
2024cites this paper
Tomato root specialized metabolites evolved through gene duplication and regulatory divergence within a biosynthetic gene cluster
2024cites this paper
Methylisothiazolinone pollution inhibited root stem cells and regeneration through auxin transport modification in Arabidopsis thaliana.
2024cites this paper
Disruption of Super‐Enhancers in Activated Pancreatic Stellate Cells Facilitates Chemotherapy and Immunotherapy in Pancreatic Cancer
2024cites this paper
Core Bacterial Taxa Determine Formation of Forage Yield in Fertilized Soil
2024cites this paper
GEPREP: A comprehensive data atlas of RNA-seq-based gene expression profiles of exercise responses
2024cites this paper
BIG to CNCB: An Exploratory Journey from Genomics to Bioinformation
2024influential citation
iDog: a multi-omics resource for canids study
2024cites this paper
Screening and validation of reference genes in Dracaena cochinchinensis using quantitative real-time PCR
2024cites this paper
High-quality reference genome decoding and population evolution analysis of prickly Sechium edule
2024cites this paper
Effect of myristic acid supplementation on triglyceride synthesis and related genes in the pectoral muscles of broiler chickens
2024cites this paper
Bioinformatics software development: Principles and future directions
2024cites this paper
Database Resources of the National Genomics Data Center, China National Center for Bioinformation in 2025
2024cites this paper
Ruminant microbiome data are skewed and unFAIR, undermining their usefulness for sustainable production improvement
2024cites this paper
Integrated multi-omics analysis reveals molecular changes associated with chronic lipid accumulation following contusive spinal cord injury.
2024cites this paper
Metabolite of Clostridium perfringens type A, palmitic acid, enhances porcine enteric coronavirus porcine epidemic diarrhea virus infection
2024cites this paper
The Molecular Characteristics and Therapeutic Implications of O-Glycan Synthesis in Pancreatic Cancer by Integrating Transcriptome and Single-Cell Data.
2024cites this paper
Methylation entropy landscape of Chinese long‐lived individuals reveals lower epigenetic noise related to human healthy aging
2024cites this paper
Integration of ATAC-Seq and RNA-Seq reveals FOSL2 drives human liver progenitor-like cell aging by regulating inflammatory factors
2023cites this paper
Detection of multiple types of cancer driver mutations using targeted RNA sequencing in non–small cell lung cancer
2023cites this paper
Population structure analysis to explore genetic diversity and geographical distribution characteristics of wild tea plant in Guizhou Plateau
2023cites this paper
Population structure and genome-wide evolutionary signatures reveal putative climate-driven habitat change and local adaptation in the large yellow croaker
2023influential citation
The Snapdragon Genomes Reveal the Evolutionary Dynamics of the S-Locus Supergene
2023cites this paper
Restoring carboxypeptidase E rescues BDNF maturation and neurogenesis in aged brains
2023cites this paper
Characteristics of bacterial community and ARG profiles in the surface and air environments in a spacecraft assembly cleanroom.
2023cites this paper
Gut Microbiome Variation Along A Lifestyle Gradient Reveals Threats Faced by Asian Elephants
2023cites this paper
Evidence of the predominance of passive symplastic phloem loading and sugar transport with leaf ageing in Camellia oleifera
2023cites this paper
Multilineage contribution of CD34+ cells in cardiac remodeling after ischemia/reperfusion injury
2023cites this paper
Human genetic history on the Tibetan Plateau in the past 5100 years
2023cites this paper
Enhanced insulin‐regulated phagocytic activities support extreme health span and longevity in multiple populations
2023cites this paper
BarleyExpDB: an integrative gene expression database for barley
2023cites this paper
Full-length circular RNA profiling by nanopore sequencing with CIRI-long
2023cites this paper
SEanalysis 2.0: a comprehensive super-enhancer regulatory network analysis tool for human and mouse
2023cites this paper
Metagenome sequencing to unveil the occurrence and distribution of antibiotic resistome and in a wastewater treatment plant
2023cites this paper
Phenotypes and genetic etiology of spontaneous polycystic kidney and liver disease in cynomolgus monkey
2023cites this paper
Microbial Virulence Factors, Antimicrobial Resistance Genes, Metabolites, and Synthetic Chemicals in Cabins of Commercial Aircraft
2023cites this paper
Resetting histone modifications during human prenatal germline development
2023cites this paper
Polymorphism analysis of the chloroplast and mitochondrial genomes in soybean
2023cites this paper
Effect of the Different Fertilization Treatments Application on Paddy Soil Enzyme Activities and Bacterial Community Composition
2023cites this paper
G9a Inhibition Promotes Neuroprotection through GMFB Regulation in Alzheimer’s Disease
2023cites this paper
Associations between environmental characteristics, high-resolution indoor microbiome, metabolome and allergic and non-allergic rhinitis symptoms for junior high school students.
2023cites this paper
Integrated multi-omics profiling to dissect the spatiotemporal evolution of metastatic hepatocellular carcinoma.
2023cites this paper
Multi-Omics Analysis of Genes Encoding Proteins Involved in Alpha-Linolenic Acid Metabolism in Chicken
2023cites this paper
Toward A New Paradigm of Genomics Research
2023cites this paper
Seq2science: an end-to-end workflow for functional genomics analysis
2023cites this paper
AGIDB: a versatile database for genotype imputation and variant decoding across species
2023cites this paper
Metabolome and transcriptome analyses identify the underground rhizome growth through the regulation of rhizome apices in Panax ginseng
2023cites this paper
The Armillaria response to Gastrodia elata is partially mediated by strigolactone-induced changes in reactive oxygen species.
2023cites this paper
Database Resources of the National Genomics Data Center, China National Center for Bioinformation in 2024
2023cites this paper
The high-quality sequencing of the Brassica rapa ‘XiangQingCai’ genome and exploration of genome evolution and genes related to volatile aroma
2023cites this paper
Comparative analysis of organellar genomes between diploid and tetraploid Chrysanthemum indicum with its relatives
2023cites this paper
OBIA: An Open Biomedical Imaging Archive
2023cites this paper
Efficient Management of Database Resources in Large-Scale Data Centers
2023cites this paper
NETosis promotes chronic inflammation and fibrosis in systemic lupus erythematosus and COVID-19.
2023cites this paper
Divergence and convergence of gut microbiomes of wild insect pollinators
2023cites this paper
The Role of Indoor Microbiome and Metabolites in Shaping Children’s Nasal and Oral Microbiota: A Pilot Multi-Omic Analysis
2023cites this paper
Spatiotemporal transcriptome atlas reveals the regional specification of the developing human brain.
2023cites this paper
Genomic signature of MTOR could be an immunogenicity marker in human colorectal cancer
2022cites this paper
Towards A Data Repository for Educational Factories
2022cites this paper
Distance-dependent inhibition of translation initiation by downstream out-of-frame AUGs is consistent with a Brownian ratchet process of ribosome scanning
2022cites this paper
Application of earthworm and silicon can alleviate antibiotic resistance in soil-Chinese cabbage system with ARGs contamination.
2022cites this paper
Database Resources of the National Genomics Data Center, China National Center for Bioinformation in 2023
2022cites this paper
Clinical and Genetic Characterization of EBV-associated T/NK lymphoproliferative diseases.
2022cites this paper
Frequent exacerbators of chronic obstructive pulmonary disease have distinguishable sputum microbiome signatures during clinical stability
2022cites this paper
Fine Mapping and Identification of a Candidate Gene of Downy Mildew Resistance, RPF2, in Spinach (Spinacia oleracea L.)
2022cites this paper
The genome of a hadal sea cucumber reveals novel adaptive strategies to deep-sea environments
2022cites this paper