Bayesian vs Frequentist Inference
Two philosophies of statistics
Frequentist and Bayesian inference are the two dominant philosophies of statistics, differing in how they define probability and treat parameters. The frequentist approach treats parameters as fixed and probability as long-run frequency, using p-values and confidence intervals as its main tools. The Bayesian approach treats parameters as uncertain quantities, combining prior beliefs with observed data via likelihood to produce a posterior distribution. The two frameworks answer subtly different questions and are increasingly used together in modern research practice.
Core Idea: What Is Probability?
Frequentist statistics defines probability as the long-run relative frequency of outcomes across many hypothetical repetitions of an experiment. Under this view, a population parameter (such as a mean or effect size) is fixed but unknown; uncertainty arises only from random sampling. Bayesian statistics interprets probability as a degree of belief, or state of knowledge, about the world. A parameter is not random per se, but we express our uncertainty about its value through a probability distribution. These differing interpretations profoundly shape how each approach formulates problems and how its results are interpreted.
How It Works: Computation and Key Formulas
In frequentist inference, a test statistic is compared to a sampling distribution to obtain a p-value, which expresses the probability of observing data at least as extreme as the sample if the null hypothesis were true. A 95% confidence interval does not mean the parameter lies in it with 95% probability; it means 95% of such intervals constructed over repeated sampling would contain the true parameter. Bayesian inference rests on Bayes' Theorem: Posterior ∝ Prior × Likelihood. By multiplying prior beliefs with the likelihood supplied by the data, we obtain the posterior distribution. A Bayesian 95% credible interval can be directly interpreted as a 95% probability that the parameter lies within that range.
Common Misconceptions and Misuses
The most prevalent frequentist misconception is interpreting the p-value as the probability that the null hypothesis is true, or as the probability that a finding is a false positive; the p-value is neither of these. Confidence intervals are similarly misread as giving a 95% probability that the parameter lies within the interval. On the Bayesian side, a common problem is choosing a prior arbitrarily or without transparency, which can unduly influence the posterior, especially with small samples. Another error is interpreting Bayesian outputs as if they were frequentist quantities, or vice versa. Clearly stating which framework is used and reporting results accordingly is essential to honest scientific communication.
Why It Matters in Research Practice
The choice of framework is not merely philosophical; it has practical consequences for how studies are designed, analyzed, and reported. The frequentist approach remains dominant across social, medical, and biological sciences due to its standardized procedures and broad software support. The Bayesian approach can offer more reliable estimates with small samples, integrates naturally with hierarchical and complex models, and conveys uncertainty as a full posterior distribution rather than a single interval. Today, many researchers treat the two as complementary: frequentist tools for routine decisions and Bayesian models for situations requiring richer uncertainty quantification. Understanding the tension between them is a core component of statistical literacy.
Sources
- Gelman, A., Carlin, J. B., Stern, H. S., Dunson, D. B., Vehtari, A., & Rubin, D. B. (2013). Bayesian Data Analysis (3rd ed.). CRC Press. ISBN: 978-1-4398-4095-5