Efficient test for nonlinear dependence of two continuous variables
Yi Wang, Yi Li, Hongbao Cao, M. Xiong, Y. Shugart, Li Jin
Published 2015 in BMC Bioinformatics
ABSTRACT
Testing the dependence or correlation of two variables is one of the fundamental tasks in statistics. In this work, we proposed a new way of testing nonlinear dependence between two continuous variables (X and Y). We addressed this research question using CANOVA (continuous analysis of variance, software available at https://sourceforge.net/projects/canova/). In the CANOVA framework, we first defined a neighborhood for each data point based on its X value, and then calculated the variance of the Y values within that neighborhood. Finally, we performed permutations to evaluate the significance of the observed within-neighborhood variance. To evaluate the performance of CANOVA against six other methods, we performed extensive simulations to explore the relationships among the methods, and compared false positive rates and statistical power using both simulated and real datasets (a kidney cancer RNA-seq dataset). We concluded that CANOVA is an efficient method for testing nonlinear correlation, with several advantages in real data applications.
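The three steps named in the abstract (a neighborhood defined by X, the variance of Y inside it, and a permutation test) can be illustrated with a minimal Python sketch. This is not the CANOVA software itself. The function names, the window of `k` consecutive points in X-sorted order, the use of the mean within-window variance as the statistic, and the one-sided p-value are all assumptions made for illustration; the abstract does not specify them.

```python
import numpy as np


def neighborhood_variance(x, y, k=5):
    """Mean variance of Y over windows of k points that are adjacent in X.

    Assumption: a "neighborhood" is k consecutive points after sorting by X.
    Requires len(x) == len(y) >= k.
    """
    order = np.argsort(x, kind="stable")
    y_sorted = np.asarray(y, dtype=float)[order]
    # Each row of `windows` holds the Y values of one neighborhood.
    windows = np.lib.stride_tricks.sliding_window_view(y_sorted, k)
    return float(windows.var(axis=1).mean())


def permutation_test(x, y, k=5, n_perm=1000, seed=0):
    """Permutation p-value for the within-neighborhood variance.

    Assumption: if Y depends on X, points close in X have similar Y, so the
    observed statistic is unusually small compared with shuffled data.
    """
    rng = np.random.default_rng(seed)
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    observed = neighborhood_variance(x, y, k)
    count = 0
    for _ in range(n_perm):
        # Shuffling Y breaks any dependence on X while keeping its marginal.
        if neighborhood_variance(x, rng.permutation(y), k) <= observed:
            count += 1
    # Add-one correction keeps the estimated p-value strictly positive.
    return observed, (count + 1) / (n_perm + 1)


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    x = np.linspace(-3, 3, 200)
    y = x**2 + rng.normal(scale=0.1, size=x.size)  # nonlinear, non-monotone
    stat, p = permutation_test(x, y, k=5, n_perm=200)
    print(f"statistic={stat:.4f}, p-value={p:.4f}")
```

A quadratic relationship such as the one in the usage example has near-zero linear correlation, which is the kind of dependence a neighborhood-based statistic is meant to detect. The window size `k` and the number of permutations are tuning choices here, not values taken from the paper.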
PUBLICATION RECORD
- Publication year
2015
- Venue
BMC Bioinformatics
- Publication date
2015-08-19
- Fields of study
Mathematics, Medicine, Computer Science
- Source metadata
Semantic Scholar, PubMed