Why is the GWAS significance threshold set near 5 x 10^-8?

It approximates a Bonferroni correction for the roughly one million effectively independent common variants in the human genome, keeping the genome-wide false-positive rate near the conventional 0.05 level.

Why must a GWAS finding be replicated?

A single study can produce spurious associations from subtle quality-control problems, residual confounding, or chance at the edge of significance; independent replication in a separate sample is the standard check that a signal is real.

GWAS Design, Execution, and Statistical Methods

Designing and analysing a genome-wide association study is a disciplined pipeline: assemble well-phenotyped cases and controls (or a quantitative-trait cohort), genotype and impute variants genome-wide, scrub the data through stringent quality control, test each variant for association while adjusting for ancestry, and judge signals against a genome-wide significance threshold before seeking replication. Each step exists to keep the enormous number of statistical tests from producing false discoveries.

Găsește o temă cu PaperMindÎn curândFind papers & topics

Tools & resources

Descarcă prezentarea

Learn & explore

VideoÎn curând

Definition

GWAS design and analysis is the set of study-design choices and statistical procedures by which variant-phenotype associations are tested genome-wide, false positives controlled across millions of comparisons, and credible signals distinguished from artefacts of genotyping, relatedness, or ancestry.

Scope

This topic covers the methodological backbone of a GWAS: sample and phenotype definition, genotyping and imputation, quality-control filters, the single-marker association model, multiple-testing correction and genome-wide significance, diagnostics such as the genomic inflation factor and QQ/Manhattan plots, and replication. It is a methods reference and not a protocol for clinical genetic testing.

Core questions

What sample size and phenotype definition give adequate power to detect small-effect variants?
Which quality-control filters remove unreliable variants and samples before testing?
What regression model is used for a single-marker association test, and how is ancestry adjusted?
What significance threshold controls genome-wide false positives, and why is it near 5 x 10^-8?
How is a genuine signal distinguished from genomic inflation, and why is replication required?

Key concepts

Case-control and quantitative-trait designs
Genotype calling and imputation
Quality control (call rate, MAF, Hardy-Weinberg equilibrium filters)
Single-marker association test (logistic or linear regression)
Additive genetic model and per-allele effect (odds ratio or beta)
Genome-wide significance threshold (~5 x 10^-8)
Genomic inflation factor (lambda) and QQ plots
Manhattan plot and replication

Mechanisms

Each variant is typically tested with a regression model - logistic for binary disease status, linear for quantitative traits - in which the variant is coded under an additive (per-allele) model and principal components of ancestry plus other covariates are included to control confounding. The result per variant is an effect estimate (odds ratio or beta) and a p-value. Because hundreds of thousands to millions of largely independent common variants are tested, significance is judged against a genome-wide threshold of about 5 x 10^-8, derived from Bonferroni-style correction for the effective number of independent tests. Before testing, quality control removes samples and variants with low call rates, extreme deviation from Hardy-Weinberg equilibrium in controls, very low minor-allele frequency, or evidence of relatedness and population outliers. The genomic inflation factor and QQ plots flag residual confounding; Manhattan plots display signals across the genome; and independent replication guards against design-specific artefacts. Software such as PLINK standardised these steps.

Clinical relevance

Understanding GWAS design and analysis is part of appraising the genetic evidence cited in disease research and in the construction of polygenic scores. This topic explains how associations are generated and validated and is descriptive; it is not a procedure for individual genetic diagnosis or for clinical decision-making.

Evidence & guidelines

Analytic conventions were consolidated through consortium experience and methodological reviews rather than formal clinical guidelines. The Wellcome Trust Case Control Consortium (2007) demonstrated shared-control design and rigorous quality control at scale; PLINK (Purcell et al., 2007) became a standard analysis toolkit; and reviews by McCarthy et al. (2008) and Bush and Moore (2012) lay out widely accepted expectations for power, quality control, significance thresholds, and replication.

History

The pipeline crystallised with the first large genome-wide scans in the mid-2000s, when affordable arrays and HapMap-based imputation made whole-genome testing practical. The 2007 Wellcome Trust Case Control Consortium study set influential precedents for shared controls, quality control, and the 5 x 10^-8 threshold, while the release of PLINK gave the community a common analytic toolset. Methodological reviews subsequently codified best practice, and the analytic toolkit later expanded to mixed models, summary-statistic methods, and very large biobank cohorts.

Debates

Is a fixed 5 x 10^-8 threshold appropriate across study designs and ancestries?: The conventional genome-wide threshold was calibrated for common variation in European-ancestry samples; denser sequencing, rarer variants, and other ancestries imply a different effective number of independent tests, so whether the threshold should be design-specific is debated.

Key figures

Shaun Purcell
Mark McCarthy
Jason Moore
William Bush
Peter Visscher

Seminal works

wtccc-2007
purcell-2007
mccarthy-2008

Frequently asked questions

Why is the GWAS significance threshold set near 5 x 10^-8?: It approximates a Bonferroni correction for the roughly one million effectively independent common variants in the human genome, keeping the genome-wide false-positive rate near the conventional 0.05 level.
Why must a GWAS finding be replicated?: A single study can produce spurious associations from subtle quality-control problems, residual confounding, or chance at the edge of significance; independent replication in a separate sample is the standard check that a signal is real.