Machine learning-assisted GWAS integrates classical genome-wide association testing with machine learning models to improve the detection of genetic variants associated with complex traits. Where traditional GWAS tests each single nucleotide polymorphism (SNP) independently using linear or logistic regression, ML-GWAS captures non-linear interactions and epistasis, ranks candidate loci more accurately, and reduces the false discovery burden in large biobank datasets. The approach has become increasingly prominent as sample sizes and genomic complexity outpace the assumptions of conventional single-SNP tests.
Beam, A. L., & Kohane, I. S. (2018). Big data and machine learning in health care. JAMA, 319(13), 1317-1318. link ↗
Szymanski, M., Holland-Letz, T., & Kneib, T. (2022). Machine learning approaches to GWAS: methods, pitfalls, and applications. Briefings in Bioinformatics, 23(3), bbac068. link ↗