Can J Cardiol. 2025 Dec 18:S0828-282X(25)01589-2. doi: 10.1016/j.cjca.2025.12.024. Online ahead of print.
ABSTRACT
BACKGROUND: Treatment decision-making for patients with coronary artery disease (CAD) can benefit from accurate patient outcome prediction. While previous studies have employed machine learning (ML) to develop prediction models, they were mostly based on small patient cohorts with strict inclusion and exclusion criteria, limited features, and only internal validation. We aimed to develop and externally validate ML models to predict short- and long-term outcomes for patients with obstructive CAD using large-scale multi-center patient data.
METHODS: We used a comprehensive data set from patients with obstructive CAD who underwent coronary angiography at three hospitals in Alberta, Canada between 2009 and 2019. To predict all-cause mortality and major adverse cardiovascular events at 90 days, 1 year, 3 years, and 5 years, over 12,000 features were considered in an extensive ML framework. In addition to traditional ML models, we employed a generative transformer-based tabular foundation model, TabPFN. To study real-time feasibility, secondary analyses limited feature sets to commonly available pre-angiography data.
RESULTS: A total of 44,462 catheterizations from 38,767 patients were included. The median areas under the receiver operating characteristic curves of the best models, mostly TabPFNs, in external validation ranged 0.796-0.845 and 0.694-0.755 for mortality and MACE, respectively. The minimum deployable pre-angiography feature set led to slightly lower but still reasonable performance.
CONCLUSIONS: The large sample size, extensive feature set, external validation, and transformer architecture led to personalized models with robust prediction performance. Our models have the potential to improve CAD treatment decision-making via accurate prognosis.
PMID:41421637 | DOI:10.1016/j.cjca.2025.12.024