What is the difference between a systematic review and the statistical methods used in it?

A systematic review is the whole process of identifying, appraising, and summarising studies; the statistical methods in evidence synthesis are the quantitative steps within it that pool results, measure heterogeneity, and test robustness.

Does every systematic review include a meta-analysis?

No. When studies are too clinically or methodologically diverse to combine meaningfully, a review may synthesise findings narratively, and these statistical methods are applied only when pooling is judged appropriate.

Statistical Methods in Evidence Synthesis

Statistical methods in evidence synthesis are the quantitative techniques used to combine results from multiple studies into an overall estimate of effect, to quantify and explain differences among studies, and to test how robust that combined conclusion is. They form the analytic engine of systematic reviews and underpin evidence-based practice and health technology assessment.

用 PaperMind 寻找选题即将推出Find papers & topics

Tools & resources

下载幻灯片

Learn & explore

视频即将推出

Definition

Statistical methods in evidence synthesis comprise the procedures for estimating a pooled effect across studies (typically under fixed-effect or random-effects models), for measuring and investigating between-study heterogeneity, and for examining the robustness of the synthesised estimate to analytic choices and to individual studies.

Scope

This area orients the reader to the family of methods that turn a body of individual studies into a synthesised quantitative answer: meta-analysis (the pooling step), the assessment of heterogeneity (how much studies differ), meta-regression (explaining that difference with study-level covariates), and sensitivity analysis (testing whether conclusions hold under different assumptions). It frames these as methodological reference topics, not as clinical instructions.

Sub-topics

Core questions

How should results from separate studies be weighted and combined into a single estimate?
When is a fixed-effect model appropriate and when is a random-effects model needed?
How much do studies differ beyond chance, and does that difference change what we can conclude?
Can study-level characteristics explain the variation in observed effects?
How sensitive is the pooled conclusion to individual studies, to modelling assumptions, and to risk of bias?

Key concepts

Pooled (summary) effect estimate
Study weighting (inverse-variance weighting)
Fixed-effect vs random-effects models
Between-study heterogeneity
Meta-regression and subgroup analysis
Sensitivity and robustness analysis
Confidence and prediction intervals

Mechanisms

The shared logic is to treat each study's effect estimate as a data point with a known uncertainty, then to combine those points with weights that reflect their precision. A fixed-effect analysis assumes all studies estimate one common true effect and weights by inverse variance alone; a random-effects analysis assumes true effects vary across studies and adds a between-study variance component, so the DerSimonian-Laird method and its successors widen the weights and the interval accordingly. Heterogeneity statistics summarise how much observed variation exceeds sampling error; meta-regression and subgroup analysis attempt to explain that variation with study-level covariates; and sensitivity analyses re-run the synthesis under alternative assumptions to check that the headline result is not an artefact of one study or one modelling choice.

Clinical relevance

These methods generate much of the high-level evidence that guidelines and health technology assessments draw on, so understanding how a pooled estimate, its heterogeneity, and its sensitivity analyses were produced is central to appraising a systematic review. This area describes how synthesised evidence is generated and interpreted; it is not a source of individual diagnostic or treatment advice.

Evidence & guidelines

Reporting standards for the statistical conduct of evidence synthesis are set out in the PRISMA statement (Moher et al., 2009) and in the Cochrane Handbook (Higgins & Green, 2008), which describe expected practice for model choice, heterogeneity assessment, and sensitivity analysis in systematic reviews.

History

Quantitative pooling of study results grew out of agricultural and social-science statistics in the early twentieth century and was named meta-analysis by Gene Glass in 1976. Its adaptation to clinical trials was crystallised by DerSimonian and Laird's 1986 random-effects method, and the subsequent development of heterogeneity statistics, meta-regression, and standardised reporting (PRISMA, the Cochrane Handbook) turned evidence synthesis into a structured statistical discipline supporting evidence-based practice and health technology assessment.

Debates

Fixed-effect versus random-effects as the default model: Whether a synthesis should assume a single common effect or allow true effects to vary changes both the estimate and its uncertainty; commentators argue the choice should reflect the clinical and methodological diversity of the included studies rather than a fixed convention.

Key figures

Rebecca DerSimonian
Nan Laird
Julian Higgins
Simon Thompson
Michael Borenstein
Larry Hedges

Seminal works

dersimonian-laird-1986
higgins-thompson-2002
higgins-handbook-2008

Frequently asked questions

What is the difference between a systematic review and the statistical methods used in it?: A systematic review is the whole process of identifying, appraising, and summarising studies; the statistical methods in evidence synthesis are the quantitative steps within it that pool results, measure heterogeneity, and test robustness.
Does every systematic review include a meta-analysis?: No. When studies are too clinically or methodologically diverse to combine meaningfully, a review may synthesise findings narratively, and these statistical methods are applied only when pooling is judged appropriate.