Privacy-preserving federated machine learning on FAIR health data: A real-world application

Authors :
European Commission
Instituto de Salud Carlos III
Parra-Calderón, Carlos Luis [0000-0003-2609-575X]
Consejo Superior de Investigaciones Científicas [https://ror.org/02gfc7t72]
Sinaci, A. Anil
Gencturk, Mert
Álvarez-Romero, Celia
Laleci Erturkmen, Gokce Banu
Martínez-García, Alicia
Escalona-Cuaresma, María José
Parra-Calderón, Carlos Luis
Publication Year :
2024

Abstract

[Objective] This paper introduces a privacy-preserving federated machine learning (ML) architecture built upon Findable, Accessible, Interoperable, and Reusable (FAIR) health data. It aims to devise an architecture for executing classification algorithms in a federated manner, enabling collaborative model-building among health data owners without sharing their datasets.

[Materials and methods] Utilizing an agent-based architecture, a privacy-preserving federated ML algorithm was developed to create a global predictive model from various local models. This involved formally defining the algorithm in two steps (data preparation and federated model training on FAIR health data) and constructing the architecture from multiple components that facilitate algorithm execution. The solution was validated by five healthcare organizations using their specific health datasets.

[Results] Five organizations transformed their datasets into Health Level 7 Fast Healthcare Interoperability Resources (HL7 FHIR) via a common FAIRification workflow and software set, thereby generating FAIR datasets. Each organization deployed a Federated ML Agent within its secure network, connected to a cloud-based Federated ML Manager. System testing was conducted on a use case aiming to predict 30-day readmission risk for chronic obstructive pulmonary disease patients, and the federated model achieved an accuracy rate of 87%.

[Discussion] The paper demonstrated a practical application of privacy-preserving federated ML among five distinct healthcare entities, highlighting the value of FAIR health data in machine learning when utilized in a federated manner that ensures privacy protection without sharing data.

[Conclusion] This solution effectively leverages FAIR datasets from multiple healthcare organizations for federated ML while safeguarding sensitive health datasets, meeting legislative privacy and security requirements.
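
The abstract describes local agents that train on private data and a central manager that combines the local models into a global one, but it does not state the aggregation rule. The following is a minimal sketch of that general pattern, assuming sample-size-weighted averaging of logistic regression weights (federated averaging). This is a swapped-in, commonly used technique and not necessarily the paper's algorithm. The function names `local_update`, `aggregate`, and `federated_train` are hypothetical, and the five sites hold synthetic data, not health records.

```python
import math
import random


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def local_update(weights, rows, labels, lr=0.1, epochs=5):
    """Hypothetical agent step: refine the global weights on local data only."""
    w = list(weights)
    for _ in range(epochs):
        for x, y in zip(rows, labels):
            err = sigmoid(sum(wi * xi for wi, xi in zip(w, x))) - y
            w = [wi - lr * err * xi for wi, xi in zip(w, x)]
    # Only the weights and the sample count leave the site, never the rows.
    return w, len(rows)


def aggregate(updates):
    """Hypothetical manager step: sample-size-weighted average of local weights."""
    total = sum(n for _, n in updates)
    dim = len(updates[0][0])
    return [sum(w[i] * n for w, n in updates) / total for i in range(dim)]


def federated_train(sites, dim, rounds=20):
    """Alternate local training and central aggregation for a fixed number of rounds."""
    weights = [0.0] * dim
    for _ in range(rounds):
        updates = [local_update(weights, rows, labels) for rows, labels in sites]
        weights = aggregate(updates)
    return weights


def predict(weights, x):
    """Classify one feature vector with the global model."""
    score = sum(wi * xi for wi, xi in zip(weights, x))
    return 1 if sigmoid(score) >= 0.5 else 0


def make_site(rng, n):
    """Synthetic site data: [bias, feature], label 1 when the feature is positive."""
    rows, labels = [], []
    for _ in range(n):
        f = rng.uniform(-1, 1)
        rows.append([1.0, f])
        labels.append(1 if f > 0 else 0)
    return rows, labels


rng = random.Random(0)
# Five synthetic sites of different sizes, standing in for five organizations.
sites = [make_site(rng, n) for n in (40, 60, 80, 50, 70)]
global_weights = federated_train(sites, dim=2)
```

In this sketch each site returns only its updated weight vector and its sample count, so the raw rows stay inside the site. A real deployment would need further safeguards that the sketch omits, such as secure transport and authentication between agents and the manager.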

Details

Database :
OAIster
Notes :
English
Publication Type :
Electronic Resource
Accession number :
edsoai.on1442728465
Document Type :
Electronic Resource