A multi-modal data fusion and real-time monitoring on stroke risk prediction using federated learning

Scritto il 06/04/2026

da Ramya Sree K

PLoS One. 2026 Apr 6;21(4):e0330244. doi: 10.1371/journal.pone.0330244. eCollection 2026.

ABSTRACT

Predicting the risk of stroke is one of the critical problems in healthcare, which necessitates efficient solutions for providing accurate and prompt risk assessments while preserving data confidentiality. This work proposes a new framework using Federated Learning (FL) to combine Multi-Layer Perceptron (MLP) and Gated Recurrent Unit (GRU) models that are essential in analyzing multimodal data. Implemented in Python, the approach incorporates two datasets: Dataset 1, which consists of Demographic data medical history, and lifestyle data, and the second dataset, which includes the normal condition and the affected stroke condition CT scan images. Imputation of missing values, feature normalization by Min-Max scaling, and handling of imbalanced classes with SMOTE make the data pre-processing procedures exhaustive. In FL architecture three clients -Client A, Client B, and Client C - process a split multimodal dataset containing static and sequential information. Each client independently trains an MLP-GRU model. Each is applied with MLP handling static features from Dataset 1 and GRU handling sequential features from Dataset 2. To update models, Federated Averaging is used on a central server, to create a global model that is then returned to the clients for further refinement. The accuracy of the proposed method averages 99.00% and surpasses other models by 2.5% including CNN, LSTM, Random Forest, and SVM. By enhancing MLP with GRU and applying them to a privacy-preserving FL framework,The study addresses the fragmented use of multimodal medical data, where clinical records and imaging are generally evaluated separately, resulting in inadequate diagnostic support. The strategy integrates complementary modalities to create a more comprehensive perspective of patient health, enhancing healthcare predictive accuracy and decision-making. This incentive is essential for improving computational methods and linking technical advancement with medical objectives like fast diagnosis and therapy planning. The introduction emphasises the therapeutic necessity of harmonising organized and unstructured data to reduce diagnostic ambiguity. A translational approach is used to discuss how multimodal integration might improve clinical workflows, develop collaborative healthcare systems, and support sustainable medical practices. This repeated emphasis links methodological advances to real-world healthcare issues, boosting the study's academic relevance sets.

PMID:41941431 | DOI:10.1371/journal.pone.0330244