Back to Search
Start Over
Accounting for measurement error in biomarker data and misclassification of subtypes in the analysis of tumor data
- Source :
- Statistics in Medicine. 35:5686-5700
- Publication Year :
- 2016
- Publisher :
- Wiley, 2016.
-
Abstract
- A common paradigm in dealing with heterogeneity across tumors in cancer analysis is to cluster the tumors into subtypes using marker data on the tumor, and then to analyze each of the clusters separately. A more specific target is to investigate the association between risk factors and specific subtypes and to use the results for personalized preventive treatment. This task is usually carried out in two steps–clustering and risk factor assessment. However, two sources of measurement error arise in these problems. The first is the measurement error in the biomarker values. The second is the misclassification error when assigning observations to clusters. We consider the case with a specified set of relevant markers and propose a unified single-likelihood approach for normally distributed biomarkers. As an alternative, we consider a two-step procedure with the tumor type misclassification error taken into account in the second-step risk factor analysis. We describe our method for binary data and also for survival analysis data using a modified version of the Cox model. We present asymptotic theory for the proposed estimators. Simulation results indicate that our methods significantly lower the bias with a small price being paid in terms of variance. We present an analysis of breast cancer data from the Nurses' Health Study to demonstrate the utility of our method. Copyright © 2016 John Wiley & Sons, Ltd.
- Subjects :
- Statistics and Probability
Observational error
Epidemiology
Computer science
Proportional hazards model
Estimator
Risk factor (finance)
Variance (accounting)
Asymptotic theory (statistics)
computer.software_genre
01 natural sciences
010104 statistics & probability
03 medical and health sciences
0302 clinical medicine
030220 oncology & carcinogenesis
Statistics
Binary data
Data mining
0101 mathematics
Cluster analysis
computer
Subjects
Details
- ISSN :
- 02776715
- Volume :
- 35
- Database :
- OpenAIRE
- Journal :
- Statistics in Medicine
- Accession number :
- edsair.doi...........6763d66922cf42d8bb3b40766b85d836
- Full Text :
- https://doi.org/10.1002/sim.7083