1. Incorporating epistasis interaction of genetic susceptibility single nucleotide polymorphisms in a lung cancer risk prediction model
- Author
-
Stephen W. Duffy, Raewyn J. Hopkins, John K. Field, Olaide Y. Raji, Michael W. Marcus, and Robert P. Young
- Subjects
Adult ,Male ,epistasis ,0301 basic medicine ,Oncology ,Cancer Research ,medicine.medical_specialty ,Lung Neoplasms ,multifactor dimensionality reduction ,Single-nucleotide polymorphism ,Biology ,Logistic regression ,Polymorphism, Single Nucleotide ,single nucleotide polymorphisms ,03 medical and health sciences ,0302 clinical medicine ,Risk Factors ,Internal medicine ,medicine ,Genetic predisposition ,Humans ,Genetic Predisposition to Disease ,Lung cancer ,Genetics ,Multifactor dimensionality reduction ,Receptors, Dopamine D2 ,Interleukin-18 ,Area under the curve ,Case-control study ,Epistasis, Genetic ,risk models ,Articles ,Middle Aged ,medicine.disease ,lung cancer ,Logistic Models ,030104 developmental biology ,Area Under Curve ,Case-Control Studies ,030220 oncology & carcinogenesis ,Epistasis ,Female ,Integrin alpha Chains ,random forest - Abstract
Incorporation of genetic variants such as single nucleotide polymorphisms (SNPs) into risk prediction models may account for a substantial fraction of attributable disease risk. Genetic data, from 2385 subjects recruited into the Liverpool Lung Project (LLP) between 2000 and 2008, consisting of 20 SNPs independently validated in a candidate-gene discovery study was used. Multifactor dimensionality reduction (MDR) and random forest (RF) were used to explore evidence of epistasis among 20 replicated SNPs. Multivariable logistic regression was used to identify similar risk predictors for lung cancer in the LLP risk model for the epidemiological model and extended model with SNPs. Both models were internally validated using the bootstrap method and model performance was assessed using area under the curve (AUC) and net reclassification improvement (NRI). Using MDR and RF, the overall best classifier of lung cancer status were SNPs rs1799732 (DRD2), rs5744256 (IL-18), rs2306022 (ITGA11) with training accuracy of 0.6592 and a testing accuracy of 0.6572 and a cross-validation consistency of 10/10 with permutation testing P
- Published
- 2016
- Full Text
- View/download PDF