Internal Validity
The soundness of a causal claim
Internal validity refers to the degree to which an observed effect in a study can be attributed to the independent variable rather than to extraneous factors. Ruling out alternative explanations — that is, confounds — is essential for a credible causal claim. Random assignment and the use of a control group are the strongest safeguards. Without internal validity, no causal conclusion is warranted, regardless of sample size.
Defining the Concept
Internal validity is an assessment of whether a study can genuinely attribute the observed change to the independent variable. Shadish, Cook, and Campbell (2002) define it as the capacity to draw valid causal inferences. To claim 'X caused Y,' a researcher must demonstrate that X preceded Y, that a relationship exists between them, and that alternative explanations have been ruled out. The true experimental design most robustly satisfies these three conditions, yet the concern for internal validity extends across all research types.
Key Threats and Control Mechanisms
The main threats that undermine internal validity include: history effects (external events occurring during the study), maturation (natural participant change over time), selection bias (non-equivalent groups at baseline), attrition (systematic dropout), testing effects (pre-test exposure influencing later performance), and instrumentation (inconsistent application of measures). The most powerful remedy is random assignment, which distributes both known and unknown confounds equally across conditions. Matching, blinding, and the use of control groups serve as additional safeguards.
A Concrete Example
In a teaching method study, a researcher examines whether a new curriculum raises test scores. If classes are not randomly assigned to experimental and control conditions, baseline score differences, teacher quality, or socioeconomic variation could explain any observed improvement. By randomly assigning classes and keeping the control group on the standard curriculum, the researcher substantially controls for selection bias and history effects. The resulting difference in learning outcomes can then be more safely attributed to the instructional method itself.
Common Misconceptions and Best Practices
A common misconception is that a large sample guarantees internal validity. However, a systematic confound can undermine a study with thousands of participants. Another error is treating statistical significance as proof of causation; the existence of an association and causal inference are fundamentally different questions. Best practices include proactively listing threats at the design stage, explicitly reporting which threats have been controlled, and honestly acknowledging in the limitations section any threats that could not be ruled out.
Key terms
- Confounding Variable
- A third variable related to both the independent and dependent variables, threatening causal inference.
- Random Assignment
- Allocating participants to conditions by chance; the strongest method for balancing confounds.
- Selection Bias
- Systematic error arising from non-equivalent groups at the outset of a study.
- History Effect
- External events occurring during the study period that affect the dependent variable.
- Attrition
- Systematic dropout of participants from a study; can distort results if non-random.
Further reading
- Shadish, W. R., Cook, T. D., & Campbell, D. T. (2002). Experimental and Quasi-Experimental Designs for Generalized Causal Inference. Houghton Mifflin. ISBN: 978-0-395-61556-1