Statistical Methods in Evidence Synthesis
Statistical methods in evidence synthesis are the quantitative techniques used to combine results from multiple studies into an overall estimate of effect, to quantify and explain differences among studies, and to test how robust that combined conclusion is. They form the analytic engine of systematic reviews and underpin evidence-based practice and health technology assessment.
Definition
Statistical methods in evidence synthesis comprise the procedures for estimating a pooled effect across studies (typically under fixed-effect or random-effects models), for measuring and investigating between-study heterogeneity, and for examining the robustness of the synthesised estimate to analytic choices and to individual studies.
Scope
This area orients the reader to the family of methods that turn a body of individual studies into a synthesised quantitative answer: meta-analysis (the pooling step), the assessment of heterogeneity (how much studies differ), meta-regression (explaining that difference with study-level covariates), and sensitivity analysis (testing whether conclusions hold under different assumptions). It frames these as methodological reference topics, not as clinical instructions.
Sub-topics
Core questions
- How should results from separate studies be weighted and combined into a single estimate?
- When is a fixed-effect model appropriate and when is a random-effects model needed?
- How much do studies differ beyond chance, and does that difference change what we can conclude?
- Can study-level characteristics explain the variation in observed effects?
- How sensitive is the pooled conclusion to individual studies, to modelling assumptions, and to risk of bias?
Key concepts
- Pooled (summary) effect estimate
- Study weighting (inverse-variance weighting)
- Fixed-effect vs random-effects models
- Between-study heterogeneity
- Meta-regression and subgroup analysis
- Sensitivity and robustness analysis
- Confidence and prediction intervals
Mechanisms
The shared logic is to treat each study's effect estimate as a data point with a known uncertainty, then to combine those points with weights that reflect their precision. A fixed-effect analysis assumes all studies estimate one common true effect and weights by inverse variance alone; a random-effects analysis assumes true effects vary across studies and adds a between-study variance component, so the DerSimonian-Laird method and its successors widen the weights and the interval accordingly. Heterogeneity statistics summarise how much observed variation exceeds sampling error; meta-regression and subgroup analysis attempt to explain that variation with study-level covariates; and sensitivity analyses re-run the synthesis under alternative assumptions to check that the headline result is not an artefact of one study or one modelling choice.
Clinical relevance
These methods generate much of the high-level evidence that guidelines and health technology assessments draw on, so understanding how a pooled estimate, its heterogeneity, and its sensitivity analyses were produced is central to appraising a systematic review. This area describes how synthesised evidence is generated and interpreted; it is not a source of individual diagnostic or treatment advice.
Evidence & guidelines
Reporting standards for the statistical conduct of evidence synthesis are set out in the PRISMA statement (Moher et al., 2009) and in the Cochrane Handbook (Higgins & Green, 2008), which describe expected practice for model choice, heterogeneity assessment, and sensitivity analysis in systematic reviews.
History
Quantitative pooling of study results grew out of agricultural and social-science statistics in the early twentieth century and was named meta-analysis by Gene Glass in 1976. Its adaptation to clinical trials was crystallised by DerSimonian and Laird's 1986 random-effects method, and the subsequent development of heterogeneity statistics, meta-regression, and standardised reporting (PRISMA, the Cochrane Handbook) turned evidence synthesis into a structured statistical discipline supporting evidence-based practice and health technology assessment.
Debates
- Fixed-effect versus random-effects as the default model
- Whether a synthesis should assume a single common effect or allow true effects to vary changes both the estimate and its uncertainty; commentators argue the choice should reflect the clinical and methodological diversity of the included studies rather than a fixed convention.
Key figures
- Rebecca DerSimonian
- Nan Laird
- Julian Higgins
- Simon Thompson
- Michael Borenstein
- Larry Hedges
Related topics
Seminal works
- dersimonian-laird-1986
- higgins-thompson-2002
- higgins-handbook-2008
Frequently asked questions
- What is the difference between a systematic review and the statistical methods used in it?
- A systematic review is the whole process of identifying, appraising, and summarising studies; the statistical methods in evidence synthesis are the quantitative steps within it that pool results, measure heterogeneity, and test robustness.
- Does every systematic review include a meta-analysis?
- No. When studies are too clinically or methodologically diverse to combine meaningfully, a review may synthesise findings narratively, and these statistical methods are applied only when pooling is judged appropriate.