Development of a multimodal obstructive sleep apnea diagnostic prediction model using two-dimensional facial images and clinical data

Scritto il 30/03/2026
da Hakje Yoo

IEEE J Biomed Health Inform. 2026 Mar 30;PP. doi: 10.1109/JBHI.2026.3678664. Online ahead of print.

ABSTRACT

Obstructive sleep apnea (OSA) is a prevalent disorder in middle-aged and obese men with increased risk of cardiovascular disease, metabolic dysfunction, and neurocognitive impairments. Delays in diagnosis and treatment of patients with OSA increase long-term all-cause mortality. In this study, we developed a multimodal artificial intelligence (AI) model that utilizes two-dimensional facial photograph, cephalometric radiograph, and clinical data to enhance OSA screening. Clinical parameters of sex, age, body mass index, neck circumference, and abdominal circumference and sleep questionnaires were assessed. Data from 710 patients who underwent polysomnography were used to train and validate a deep learning model combining ShuffleNet-V2 for image feature extraction and a deep neural network for clinical data analysis. The model was evaluated using five-fold cross-validation and a holdout test set. Performance metrics were area under the curve (AUC), sensitivity, specificity, accuracy, and F1 score. The trimodal AI model significantly outperformed uni- and bimodal approaches, achieving an AUC of 0.859 in distinguishing moderate-severe OSA from normal-mild OSA. Subgroup analyses of data from obese, elderly, and male patients showed higher classification accuracy, and data from patients with smaller abdominal circumferences had the lowest sensitivity compared to other subgroups. The Grad-CAM analysis demonstrated that the model focused on airway structures and low part of face, aligning with clinical expectations. This study presents a novel AI-driven OSA screening approach that integrates data from facial images and clinical data. This AI-based OSA screening model may facilitate early diagnosis of OSA and improve patient outcomes.

PMID:41911139 | DOI:10.1109/JBHI.2026.3678664