1. Determine the therapeutic role of radiotherapy in administrative data: a data mining approach.
- Author
-
Zhang-Salomons J and Salomons G
- Subjects
- Data Mining classification, Data Mining methods, Decision Trees, Hospital Records classification, Hospital Records standards, Humans, Medical Records classification, Medical Records standards, Ontario, Outcome Assessment, Health Care methods, Outcome Assessment, Health Care statistics & numerical data, Radiation Oncology methods, Radiation Oncology statistics & numerical data, Radiotherapy methods, Reproducibility of Results, Retrospective Studies, Data Mining statistics & numerical data, Hospital Records statistics & numerical data, Medical Records statistics & numerical data, Neoplasms radiotherapy, Radiotherapy statistics & numerical data
- Abstract
Background: Clinical data gathered for administrative purposes often lack sufficient information to separate the records of radiotherapy given for palliation from those given for cure. An absence, incompleteness, or inaccuracy of such information could hinder or bias the study of the utilization and outcome of radiotherapy. This study has three specific purposes: 1) develop a method to determine the therapeutic role of radiotherapy (TRR); 2) assess the accuracy of the method; 3) report the quality of the information on treatment "intent" recorded in the clinical data in Ontario, Canada. A general purpose is to use this study as a prototype to demonstrate and test a method to assess the quality of administrative data., Methods: This is a population based retrospective study. A random sample was drawn from the treatment records with "intent" assigned in treating hospitals. A decision tree is grown using treatment parameters as predictors and "intent" as outcome variable to classify the treatments into curative or palliative. The tree classifier was applied to the entire dataset, and the classification results were compared with those identified by "intent". A manual audit was conducted to assess the accuracy of the classification., Results: The following parameters predicted the TRR, from the strongest to the weakest: radiation dose per fraction, treated body-region, disease site, and time of treatment. When applied to the records of treatments given between 1990 and 2008 in Ontario, Canada, the classification rules correctly classified 96.1% of the records. The quality of the "intent" variable was as follows: 77.5% correctly classified, 3.7% misclassified, and 18.8% did not have an "intent" assigned., Conclusions: The classification rules derived in this study can be used to determine the TRR when such information is unavailable, incomplete, or inaccurate in administrative data. The study demonstrates that data mining approach can be used to effectively assess and improve the quality of large administrative datasets. more...
- Published
- 2015
- Full Text
- View/download PDF