Back to Search
Start Over
How are missing data in covariates handled in observational time-to-event studies in oncology? A systematic review
- Source :
- BMC Medical Research Methodology, Vol 20, Iss 1, Pp 1-15 (2020), BMC Medical Research Methodology
- Publication Year :
- 2020
- Publisher :
- Springer Science and Business Media LLC, 2020.
-
Abstract
- Background Missing data in covariates can result in biased estimates and loss of power to detect associations. It can also lead to other challenges in time-to-event analyses including the handling of time-varying effects of covariates, selection of covariates and their flexible modelling. This review aims to describe how researchers approach time-to-event analyses with missing data. Methods Medline and Embase were searched for observational time-to-event studies in oncology published from January 2012 to January 2018. The review focused on proportional hazards models or extended Cox models. We investigated the extent and reporting of missing data and how it was addressed in the analysis. Covariate modelling and selection, and assessment of the proportional hazards assumption were also investigated, alongside the treatment of missing data in these procedures. Results 148 studies were included. The mean proportion of individuals with missingness in any covariate was 32%. 53% of studies used complete-case analysis, and 22% used multiple imputation. In total, 14% of studies stated an assumption concerning missing data and only 34% stated missingness as a limitation. The proportional hazards assumption was checked in 28% of studies, of which, 17% did not state the assessment method. 58% of 144 multivariable models stated their covariate selection procedure with use of a pre-selected set of covariates being the most popular followed by stepwise methods and univariable analyses. Of 69 studies that included continuous covariates, 81% did not assess the appropriateness of the functional form. Conclusion While guidelines for handling missing data in epidemiological studies are in place, this review indicates that few report implementing recommendations in practice. Although missing data are present in many studies, we found that few state clearly how they handled it or the assumptions they have made. Easy-to-implement but potentially biased approaches such as complete-case analysis are most commonly used despite these relying on strong assumptions and where often more appropriate methods should be employed. Authors should be encouraged to follow existing guidelines to address missing data, and increased levels of expectation from journals and editors could be used to improve practice.
- Subjects :
- Oncology
medicine.medical_specialty
Survival
Epidemiology
Computer science
Missing data
MEDLINE
Health Informatics
Medical Oncology
01 natural sciences
010104 statistics & probability
03 medical and health sciences
0302 clinical medicine
Internal medicine
Covariate
medicine
Humans
Time-to-event
030212 general & internal medicine
0101 mathematics
Observational studies
Set (psychology)
Selection (genetic algorithm)
Proportional Hazards Models
lcsh:R5-920
Proportional hazards model
Research
Multivariable calculus
Data Interpretation, Statistical
Multiple imputation
Observational study
lcsh:Medicine (General)
Research Article
Subjects
Details
- ISSN :
- 14712288
- Volume :
- 20
- Database :
- OpenAIRE
- Journal :
- BMC Medical Research Methodology
- Accession number :
- edsair.doi.dedup.....8584bf57360c6cf681fa6dfe670395bc
- Full Text :
- https://doi.org/10.1186/s12874-020-01018-7