1. Disease progression based feature screening for ultrahigh‐dimensional survival‐associated biomarkers
- Author
-
Mengjiao Peng, Liming Xiang, and School of Physical and Mathematical Sciences
- Subjects
Mathematics [Science] ,Statistics and Probability ,Epidemiology ,Conditional Survival Function ,Correlation Rank - Abstract
The increased availability of ultrahigh-dimensional biomarker data and the high demand of identifying biomarkers importantly related to survival outcomes made feature screening methods commonplace in the analysis of cancer genome data. When survival outcomes include endpoints of overall survival (OS) and time-to-progression (TTP), a high concordance is typically found in both endpoints in cancer studies, namely, patients' OS would most likely be extended when tumour progression is delayed. Existing screening procedures are often performed on a single survival endpoint only and may result in biased selection of features for OS in ignorance of disease progression. We propose a novel feature screening method by incorporating information of TTP into the selection of important biomarker predictors for more accurate inference of OS subsequent to disease progression. The proposal is based on the rank of correlation between individual features and the conditional distribution of OS given observations of TTP. It is advantageous for its flexible model nature, which requires no marginal model assumption for each endpoint, and its minimal computational cost for implementation. Theoretical results show its ranking consistency, sure screening and false rate control properties. Simulation results demonstrate that the proposed screener leads to more accurate feature selection than the method without considering the prior observations of disease progression. An application to breast cancer genome data illustrates its practical utility and facilitates disease classification using selected biomarker predictors. Ministry of Education (MOE) Xiang’s research was partially supported by the Singapore Ministry of Education Academic Research Fund Tier 1 Grant (RG98/20) and Tier 2 Grant (MOE-T2EP20121-0004), and Peng’s research was partially supported by the National Key R&D Program of China (Nos. 2021YFA1000100 and 2021YFA1000101) and the National Natural Science Foundation of China (NSFC grant no. 92046005).
- Published
- 2023