Multiset sparse redundancy analysis for high‐dimensional omics data

Published 2018 in Biometrical journal. Biometrische Zeitschrift

ABSTRACT

Redundancy Analysis (RDA) is a well‐known method used to describe the directional relationship between related data sets. Recently, we proposed sparse Redundancy Analysis (sRDA) for high‐dimensional genomic data analysis to find explanatory variables that explain the most variance of the response variables. As more and more biomolecular data become available from different biological levels, such as genotypic and phenotypic data from different omics domains, a natural research direction is to apply an integrated analysis approach in order to explore the underlying biological mechanism of certain phenotypes of the given organism. We show that the multiset sparse Redundancy Analysis (multi‐sRDA) framework is a prominent candidate for high‐dimensional omics data analysis since it accounts for the directional information transfer between omics sets, and, through its sparse solutions, the interpretability of the result is improved. In this paper, we also describe a software implementation for multi‐sRDA, based on the Partial Least Squares Path Modeling algorithm. We test our method through simulation and real omics data analysis with data sets of 364,134 methylation markers, 18,424 gene expression markers, and 47 cytokine markers measured on 37 patients with Marfan syndrome.

PUBLICATION RECORD

Publication year: 2018
Venue: Biometrical journal. Biometrische Zeitschrift
Publication date: 2018-12-03
Fields of study: Biology, Medicine, Computer Science
Identifiers: DOI 10.1002/bimj.201700248; PMID 30506971; PMCID 6587877
Source metadata: Semantic Scholar, PubMed

CITED BY

Ginsenoside Rg5 Activates the LKB1/AMPK/mTOR Signaling Pathway and Modifies the Gut Microbiota to Alleviate Nonalcoholic Fatty Liver Disease Induced by a High-Fat Diet
2024
Data integration through canonical correlation analysis and its application to OMICs research
2023
Redundancy Analysis to Reduce the High-Dimensional Near-Infrared Spectral Information to Improve the Authentication of Olive Oil
2022
Transcriptome Profiling and Metagenomic Analysis Help to Elucidate Interactions in an Inflammation-Associated Cancer Mouse Model
2021
Analysis of Megavariate Data in Functional Omics
2020
Considerations for mass spectrometry-based multi-omic analysis of clinical samples
2020
Multiset sparse partial least squares path modeling for high dimensional omics data analysis
2020
Multivariate Statistical Methods for High-Dimensional Multiset Omics Data Analysis
2019