How does structural variation differ from a single-nucleotide variant?

A single-nucleotide variant changes one base, whereas structural variation alters a whole segment of DNA—deleting, duplicating, inserting, inverting, or relocating it—so it affects many bases at once and is detected by different methods.

Is structural variation always harmful?

No. Most structural variants are common polymorphisms carried by healthy people; only a subset, often involving dosage-sensitive genes or specific genomic regions, is associated with disease.

Structural Genomics and Structural Variation

Structural variation is the class of genetic differences in which segments of DNA—typically larger than a single base change—are deleted, duplicated, inserted, inverted, or translocated, altering the copy number, orientation, or position of stretches of the genome. Together with the chromosomal aberrations from which the field grew, structural variants account for a large share of the base pairs that differ between any two human genomes and contribute substantially to both normal variation and disease.

Definition

Structural variation comprises genomic alterations affecting DNA segments—conventionally those of roughly one kilobase or larger, though detection now extends smaller—including deletions, duplications, copy-number variants, insertions, inversions, and translocations, considered as a complement to single-nucleotide and short-indel variation.

Scope

This area orients the reader to how the genome varies at the scale of segments rather than single nucleotides. It groups the operational essentials of detecting and classifying copy-number variants, the catalogue of structural-variant types and the mutational mechanisms that create them, the repeat architecture (segmental duplications) that predisposes specific genomic regions to recurrent rearrangement, the mobile elements that actively reshape the genome, and the population-level organization of variation into haplotypes. It is a reference orientation, not clinical guidance.

Sub-topics

Core questions

What types of structural variants exist, and by what mechanisms do they arise?
How is copy-number and structural variation detected, sized, and classified from genomic data?
Which features of genome architecture predispose particular regions to recurrent rearrangement?
How does structural variation distribute across populations and relate to haplotype structure?

Key concepts

Structural variant (SV)
Copy-number variant (CNV)
Balanced vs. unbalanced rearrangement
Recurrent vs. non-recurrent variants
Segmental duplication and genomic architecture
Mobile element insertion
Haplotype and linkage disequilibrium

Mechanisms

Structural variants arise through several broad routes. Recombination-based mechanisms—chiefly non-allelic homologous recombination between highly similar repeat copies such as segmental duplications—produce recurrent deletions and duplications with shared breakpoints. Replication-based mechanisms, including fork stalling and template switching and microhomology-mediated break-induced replication, generate non-recurrent and often complex rearrangements. Non-homologous end joining repairs double-strand breaks in ways that can delete or insert sequence, and retrotransposition copies mobile elements into new sites. These processes change copy number, sequence orientation, or genomic location, and their footprints are read back from sequencing and array data to detect and classify the variants.

Clinical relevance

Structural variation underlies a meaningful fraction of genetic disease, from recurrent microdeletion and microduplication syndromes mediated by genomic architecture to somatic rearrangements in cancer. Understanding how such variants are detected and classified is foundational for interpreting genomic findings in the health sciences. This area describes how structural variation is conceptualized and measured and is not a basis for individual diagnostic or treatment decisions.

Epidemiology

Large reference projects have shown that structural variants, though far fewer in count than single-nucleotide variants, affect more total base pairs of difference between genomes. The 1000 Genomes structural-variation map catalogued tens of thousands of structural variants across thousands of individuals from multiple populations, establishing baseline frequencies and the relative abundance of deletions, duplications, and mobile-element insertions across human ancestries.

History

The study of large-scale genomic change began with cytogenetics and the recognition of visible chromosomal aberrations. Array-based and sequencing technologies in the 2000s revealed that submicroscopic copy-number and structural variation is pervasive even among healthy individuals, reframing structural variation as a routine component of the genome rather than a rare pathological event. Reference catalogues from the HapMap and 1000 Genomes efforts and from focused structural-variation studies then placed this variation on a population footing.

Debates

How completely can short-read sequencing capture structural variation?: Short-read data detects deletions and many copy-number changes well but systematically misses insertions, inversions, and variants embedded in repetitive or duplicated regions; the field continues to debate how much of the structural-variant landscape remains hidden until long-read and assembly-based methods are applied universally.

Key figures

Stephen W. Scherer
Evan E. Eichler
James R. Lupski
Charles Lee
Matthew Hurles

Seminal works

feuk-2006
alkan-2011
sudmant-2015
1000genomes-2015

Frequently asked questions

How does structural variation differ from a single-nucleotide variant?: A single-nucleotide variant changes one base, whereas structural variation alters a whole segment of DNA—deleting, duplicating, inserting, inverting, or relocating it—so it affects many bases at once and is detected by different methods.
Is structural variation always harmful?: No. Most structural variants are common polymorphisms carried by healthy people; only a subset, often involving dosage-sensitive genes or specific genomic regions, is associated with disease.