Process / pipelineBioinformatics / omics

Machine Learning-Assisted Variant Calling — ML-Based Genomic Variant Detection

Machine learning-assisted variant calling uses statistical learning models — most notably convolutional neural networks — to distinguish genuine genomic variants (SNPs, indels) from sequencing artifacts in aligned short- or long-read data. Unlike heuristic callers that rely on hand-crafted filters, ML-based approaches learn directly from large labeled datasets of validated variants, improving sensitivity and specificity across diverse sequencing platforms and coverage depths. Google's DeepVariant (2018) is the landmark implementation that brought deep learning into mainstream variant calling.

Open in MethodMindSoonVideoSoon

Read the full method

Members only

Sign in with a free account to read this section.

Sign in

Sources

  1. Poplin, R., Chang, P. C., Alexander, D., Schwartz, S., Colthurst, T., Ku, A., Newburger, D., Dijamco, J., Nguyen, N., Afshar, P. T., Gross, S. S., Dorfman, L., McLean, C. Y., & DePristo, M. A. (2018). A universal SNP and small-indel variant caller using deep neural networks. Nature Biotechnology, 36(10), 983–987. DOI: 10.1038/nbt.4235
  2. Krusche, P., Trigg, L., Boutros, P. C., Mason, C. E., De La Vega, F. M., Moore, B. L., Gonzalez-Porta, M., Eberle, M. A., Tezak, Z., Lababidi, S., Truty, R., Asimenos, G., Funke, B., Fleharty, M., Salit, M., Goldfeder, R. L., & Zook, J. M. (2019). Best practices for benchmarking germline small-variant calls in human genomes. Nature Biotechnology, 37(5), 555–560. DOI: 10.1038/s41587-019-0054-x

Referenced by

ScholarGateMachine learning-assisted variant calling (Machine Learning-Assisted Genomic Variant Calling). Retrieved 2026-06-04 from https://scholargate.app/en/bioinformatics/machine-learning-assisted-variant-calling