Process / pipelineBioinformatics / omics
Machine Learning-Assisted Variant Calling — ML-Based Genomic Variant Detection
Machine learning-assisted variant calling uses statistical learning models — most notably convolutional neural networks — to distinguish genuine genomic variants (SNPs, indels) from sequencing artifacts in aligned short- or long-read data. Unlike heuristic callers that rely on hand-crafted filters, ML-based approaches learn directly from large labeled datasets of validated variants, improving sensitivity and specificity across diverse sequencing platforms and coverage depths. Google's DeepVariant (2018) is the landmark implementation that brought deep learning into mainstream variant calling.
Open in MethodMindSoonVideoSoon
Read the full method
Members only
Sign inSign in with a free account to read this section.
Sources
- Poplin, R., Chang, P. C., Alexander, D., Schwartz, S., Colthurst, T., Ku, A., Newburger, D., Dijamco, J., Nguyen, N., Afshar, P. T., Gross, S. S., Dorfman, L., McLean, C. Y., & DePristo, M. A. (2018). A universal SNP and small-indel variant caller using deep neural networks. Nature Biotechnology, 36(10), 983–987. DOI: 10.1038/nbt.4235 ↗
- Krusche, P., Trigg, L., Boutros, P. C., Mason, C. E., De La Vega, F. M., Moore, B. L., Gonzalez-Porta, M., Eberle, M. A., Tezak, Z., Lababidi, S., Truty, R., Asimenos, G., Funke, B., Fleharty, M., Salit, M., Goldfeder, R. L., & Zook, J. M. (2019). Best practices for benchmarking germline small-variant calls in human genomes. Nature Biotechnology, 37(5), 555–560. DOI: 10.1038/s41587-019-0054-x ↗