J Genet Genomics. 2026 Mar 14:S1673-8527(26)00096-2. doi: 10.1016/j.jgg.2026.03.010. Online ahead of print.
ABSTRACT
Protein kinases are pivotal regulators of cellular signaling, and their genetic variations are frequently implicated in diseases. Although numerous kinase mutations have been identified as drivers of altered activity, with a few successfully targeted therapeutically, the functional impact of most variants remains uncharacterized. To bridge this gap, we curate a comprehensive dataset that contains 2553 experimentally validated kinase activity-related key alterations (KAKAs) from the literature. While many mutations outside canonical functional regions are known to affect kinase activity, systematic methods to predict their functional consequences are lacking. Consequently, we develop a computational method to predict potential KAKAs, leveraging transfer learning on the pre-trained protein language model ProtBert. Our model, termed pKAKA, achieves an impressive AUC score of 0.9593 and outperforms the AlphaMissense benchmark in comparative testing. Systematic analysis of kinase missense mutations underscores the critical role of KAKAs in pathogenesis, with highlights including JAK2 V617F in atherosclerotic cardiovascular disease, LRRK2 G2385R in Parkinson's disease, EGFR L858R in lung adenocarcinoma, and EGFR G598V in glioma. Overall, this study significantly advances our understanding of how mutations that influence kinase activity contribute to disease mechanisms.
PMID:41839313 | DOI:10.1016/j.jgg.2026.03.010

