Circ Genom Precis Med. 2026 Jan 28:e004853. doi: 10.1161/CIRCGEN.124.004853. Online ahead of print.
ABSTRACT
BACKGROUND: Atrial fibrillation (AF) is a common and clinically heterogeneous arrhythmia. Machine learning algorithms can define data-driven disease subtypes in an unbiased fashion, but whether these AF subgroups align with underlying mechanisms, such as polygenic liability to AF or inflammation, and associate with clinical outcomes is unclear.
METHODS: We identified individuals with AF in a large biobank linked to electronic health records and genome-wide genotyping. We applied an unsupervised coclustering machine learning algorithm to 35 curated and uncorrelated clinical features to identify distinct phenotypic AF clusters. The clinical inflammatory status of the clusters was defined using measured biomarkers (CRP, ESR, WBC, Neutrophil %, Platelet count, RDW) within 6 months of first AF mention. Polygenic risk scores for AF and for cytokine levels were used to assess the genetic liability of clusters to AF and inflammation, respectively. Clinical outcomes were collected from electronic health records up to the last medical contact.
RESULTS: The analysis included 23 271 subjects with AF, of which 6023 had available genome-wide genotyping. The machine learning algorithm identified 3 phenotypic clusters that were distinguished by increasing prevalence of comorbidities, particularly renal disease and coronary artery disease. Polygenic liability to AF across clusters was highest in the low comorbidity cluster. Clinically measured inflammatory biomarkers were highest in the high comorbidity cluster. There was no difference between groups in genetically predicted levels of inflammatory biomarkers. Cluster assignment was associated with mortality, stroke, bleeding, and use of cardiac implantable electronic devices after AF diagnosis.
CONCLUSIONS: Patients with AF subgroups identified by clustering were distinguished by comorbidity burden and associated with risk of clinically important outcomes, polygenic liability to AF, and clinical inflammation, but not with genetically predicted inflammatory cytokine levels. Our study empirically demonstrates the complementary roles of comorbidity and genetic liability as major drivers of AF phenotypic variability using hypothesis-free methods.
PMID:41603049 | DOI:10.1161/CIRCGEN.124.004853