Back to Search Start Over

MODMatcher: Multi-Omics Data Matcher for Integrative Genomic Analysis.

Authors :
Yoo, Seungyeul
Huang, Tao
Campbell, Joshua D.
Lee, Eunjee
Tu, Zhidong
Geraci, Mark W.
Powell, Charles A.
Schadt, Eric E.
Spira, Avrum
Zhu, Jun
Source :
PLoS Computational Biology; Aug2014, Vol. 10 Issue 8, p1-14, 14p, 2 Diagrams, 1 Chart, 8 Graphs
Publication Year :
2014

Abstract

Errors in sample annotation or labeling often occur in large-scale genetic or genomic studies and are difficult to avoid completely during data generation and management. For integrative genomic studies, it is critical to identify and correct these errors. Different types of genetic and genomic data are inter-connected by cis-regulations. On that basis, we developed a computational approach, Multi-Omics Data Matcher (MODMatcher), to identify and correct sample labeling errors in multiple types of molecular data, which can be used in further integrative analysis. Our results indicate that inspection of sample annotation and labeling error is an indispensable data quality assurance step. Applied to a large lung genomic study, MODMatcher increased statistically significant genetic associations and genomic correlations by more than two-fold. In a simulation study, MODMatcher provided more robust results by using three types of omics data than two types of omics data. We further demonstrate that MODMatcher can be broadly applied to large genomic data sets containing multiple types of omics data, such as The Cancer Genome Atlas (TCGA) data sets. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
1553734X
Volume :
10
Issue :
8
Database :
Complementary Index
Journal :
PLoS Computational Biology
Publication Type :
Academic Journal
Accession number :
97810572
Full Text :
https://doi.org/10.1371/journal.pcbi.1003790