TY - GEN
T1 - Neural gradient boosting in federated learning for hemodynamic instability prediction
T2 - towards a distributed and scalable deep learning-based solution
AU - Manni, Francesca
AU - Bukharev, Aleksandr
AU - Jain, Anshul
AU - Moorthy, Shiva
AU - Rahman, Asif
AU - Bucur, Anca
PY - 2022
Y1 - 2022
N2 - Federated learning (FL) is a privacy preserving approach to learning that overcome issues related to data access, privacy, and security, which represent key challenges in the healthcare sector. FL enables hospitals to collaboratively learn a shared prediction model without moving the data outside their secure infrastructure. To do so, after having sent model updates to a central server, an update aggregation is performed, and the model is sent back to the sites for further training. Although widely applied on neural networks, the deployment of FL architectures is lacking scalability and support for machine learning techniques such as decision tree-based models. The latter, when embedded in FL, suffer from costly encryption techniques applied for sharing sensitive information such as the splitting decisions within the trees. In this work, we focus on predicting hemodynamic instability on ICU patients by enabling distributed gradient boosting in FL. We employ a clinical dataset from 25 hospitals generated based on the Philips eICU database and we design a FL pipeline that supports neural-based boosting models as well as conventional neural networks. This enhancement enables decision tree models in FL, which represent the state-of-the-art approach for classification tasks involving tabular clinical data. Comparable performances in terms of accuracy, precision, recall and F1 score have been reached when detecting hemodynamic instability in FL, and in a centralized setup. In summary, we demonstrate the feasibility of a scalable FL for detecting hemodynamic instability in ICU data, which preserves privacy and holds the deployment benefits of a neural-based architecture.
AB - Federated learning (FL) is a privacy preserving approach to learning that overcome issues related to data access, privacy, and security, which represent key challenges in the healthcare sector. FL enables hospitals to collaboratively learn a shared prediction model without moving the data outside their secure infrastructure. To do so, after having sent model updates to a central server, an update aggregation is performed, and the model is sent back to the sites for further training. Although widely applied on neural networks, the deployment of FL architectures is lacking scalability and support for machine learning techniques such as decision tree-based models. The latter, when embedded in FL, suffer from costly encryption techniques applied for sharing sensitive information such as the splitting decisions within the trees. In this work, we focus on predicting hemodynamic instability on ICU patients by enabling distributed gradient boosting in FL. We employ a clinical dataset from 25 hospitals generated based on the Philips eICU database and we design a FL pipeline that supports neural-based boosting models as well as conventional neural networks. This enhancement enables decision tree models in FL, which represent the state-of-the-art approach for classification tasks involving tabular clinical data. Comparable performances in terms of accuracy, precision, recall and F1 score have been reached when detecting hemodynamic instability in FL, and in a centralized setup. In summary, we demonstrate the feasibility of a scalable FL for detecting hemodynamic instability in ICU data, which preserves privacy and holds the deployment benefits of a neural-based architecture.
M3 - Conference contribution
SP - 729
EP - 738
BT - AMIA Annual Symposium Proceedings
PB - American Medical Informatics Association (AMIA)
ER -