Whole-Genome Sequencing
Whole-genome sequencing (WGS) determines the near-complete nucleotide sequence of an organism's genome in a single assay, rather than targeting selected genes or regions. By reading both coding and non-coding DNA, it provides the most comprehensive primary genomic dataset and serves as the input for assembly, variant calling, and downstream genomic analysis.
Definition
Whole-genome sequencing is the laboratory and computational process of determining the order of essentially all nucleotides across an organism's genome, typically by fragmenting the DNA, reading the fragments at redundant coverage, and reconstructing or aligning them to recover the full sequence.
Scope
The entry covers what WGS measures, the shotgun strategy of fragmenting and reading the genome at high coverage, the contrast with targeted approaches such as whole-exome sequencing, and the role of sequencing depth in determining sensitivity. It is a methodological topic and does not provide clinical or testing recommendations.
Core questions
- What does whole-genome sequencing capture that targeted sequencing does not?
- How does the shotgun strategy reconstruct a whole genome from many short fragments?
- How does sequencing depth affect what variants can be detected?
Key concepts
- Shotgun sequencing
- Sequencing depth and coverage
- Whole-genome versus whole-exome sequencing
- Coding and non-coding regions
- Reversible terminator chemistry
- Variant calling input
Mechanisms
In WGS, genomic DNA is fragmented and read many times over so that every position is covered by multiple independent reads; the redundancy (depth) lets base calls be cross-checked and supports detection of variants. The whole-genome shotgun approach, demonstrated at human scale by Venter and colleagues, breaks the genome into random fragments, sequences them, and reassembles them computationally. Reversible-terminator chemistry later enabled accurate massively parallel reading of whole human genomes at far lower cost. Because WGS reads non-coding as well as coding DNA, it captures regulatory and structural variation that targeted assays miss.
Clinical relevance
Whole-genome sequencing is increasingly used in research and clinical genomics to characterise an individual's complete genetic makeup, supporting variant discovery across coding and non-coding regions. This entry describes the method and its data characteristics; it is educational reference material and not a recommendation for any specific test or clinical action.
Evidence & guidelines
The foundational evidence is a set of landmark primary studies: the two human genome sequences published in 2001 (Venter et al.; International Human Genome Sequencing Consortium) and the demonstration of accurate massively parallel whole-genome sequencing by Bentley et al. (2008). Methodological reviews such as Sims et al. (2014) document how depth and coverage shape analytical sensitivity.
History
Whole-genome sequencing at human scale was first achieved in 2001 through two parallel efforts, one using hierarchical clone-based sequencing and one using whole-genome shotgun assembly. The 2008 demonstration of accurate sequencing with reversible-terminator chemistry made population-scale WGS feasible, and depth and coverage became central design parameters as the method matured.
Key figures
- J. Craig Venter
- Eric Lander
- David Bentley
Related topics
Seminal works
- venter-2001
- ihgsc-2001-wgs
- bentley-2008
Frequently asked questions
- How is whole-genome sequencing different from whole-exome sequencing?
- Whole-genome sequencing reads essentially the entire genome, including non-coding and regulatory regions, whereas whole-exome sequencing targets only the protein-coding portion (the exome), which is a small fraction of the genome.
- Why is sequencing depth important in whole-genome sequencing?
- Depth (how many reads cover each position) determines how confidently base calls and variants can be made; higher depth improves sensitivity and accuracy, especially for detecting low-frequency or heterozygous variants.