1. SMURF: Systematic Methodology for Unveiling Relevant Factors in Retrospective Data on Chronic Disease Treatments
- Author
-
Marina Ruiz Pinero, Franklin Parrales Bravo, Angel Guerrero Peral, Saso Dzeroski, Ana Beatriz Gago Veiga, Alberto Antonio Del Barrio Garcia, José L. Ayala, and Maria Mercedes Gallego De La Sacristana
- Subjects
General Computer Science ,Computer science ,02 engineering and technology ,Machine learning ,computer.software_genre ,Quality of life ,Encoding (memory) ,0202 electrical engineering, electronic engineering, information engineering ,medicine ,General Materials Science ,Migraine treatment ,Multi-target prediction ,business.industry ,Medical record ,020208 electrical & electronic engineering ,General Engineering ,020206 networking & telecommunications ,data mining ,medicine.disease ,Chronic disease ,Migraine ,classification algorithms ,Simulated annealing ,Artificial intelligence ,simulated annealing ,lcsh:Electrical engineering. Electronics. Nuclear engineering ,business ,computer ,lcsh:TK1-9971 ,Medical literature - Abstract
Deciding on the continuous treatment of chronic diseases is vital in terms of economy, quality of life, and time. We present a holistic data mining approach that addresses the prediction of the therapeutic response in a panoramic and feedback way while unveiling relevant medical factors. Panoramic prediction makes it possible to decide whether the treatment will be beneficial without using previous knowledge and without involving unnecessary treatments. Feedback prediction can be more accurate prediction since it considers the results of previous stages of the treatment. A novel label encoding called simulated annealing and rounding (SAR) encoding is also proposed to help improve the accuracy of prediction in both approaches. To unveil the medical factors that make the treatment effective for patients, various techniques are applied to the prediction models found through the proposed approaches. Finally, this methodology is applied in the realistic scenario of analyzing electronic medical records of migraineurs under BoNT-A treatment. The results show a significant improvement in accuracy due to the use of SAR encoding, from close to 60% (baseline) to 75% with panoramic prediction, and up to around 90% when using feedback prediction. Furthermore, the following factors have been found to be relevant when predicting the migraine treatment responses: migraine time evolution, unilateral pain, analgesic abuse, headache days, and the retroocular component. According to doctors, these factors are also medically relevant and in alignment with the medical literature.
- Published
- 2019