Measures of Central Tendency
Mean, median and mode
Measures of central tendency summarise the centre of a distribution with a single value. The arithmetic mean uses all observations; the median is the middle value of an ordered dataset; the mode is the most frequently occurring value. Which measure to choose depends on the level of measurement and the shape of the distribution. When outliers are present, the median provides a more robust estimate of the centre than the mean.
Core Definitions
The arithmetic mean is obtained by dividing the sum of all values by the number of observations: x̄ = (Σxᵢ) / n. The median is the value that splits an ordered dataset exactly in half; for an even number of observations, it is the average of the two middle values. The mode is the value or set of values that appears most frequently in the distribution; a distribution may have more than one mode. These three measures converge in symmetric distributions and diverge noticeably in skewed ones.
Computation and Level of Measurement
The choice of central tendency measure is largely determined by the level of measurement. At the nominal level, only the mode is meaningful; an average cannot be computed for categories that carry no ordering. At the ordinal level, the median and mode are appropriate because intervals are not equal. At interval and ratio levels, the arithmetic mean is preferred, as all algebraic operations are meaningful. Even at the ratio level, if the distribution is skewed, the median may be more representative; income distribution is a classic example.
Common Misconceptions and Misuses
The most common error is treating the mean as the default measure in every situation. In skewed distributions or datasets containing outliers, the mean does not represent the typical observation; it is pulled toward extreme values. For instance, one exceptionally high grade in a class can raise the group mean substantially. Similarly, applying the mode to continuous variables is misleading because exact repetitions are rare in continuous data. Finally, measures of central tendency convey no information about the spread of a distribution; they should always be reported alongside variability measures such as the standard deviation or interquartile range.
Importance in Research Practice
Measures of central tendency form the foundation of descriptive statistics reporting in every quantitative study. Choosing the correct measure directly affects how findings are interpreted and how groups are compared. Whether evaluating treatment effects in clinical research, comparing group achievement in educational science, or describing demographic patterns in social sciences, explaining which measure was chosen and why is a requirement of methodological transparency. APA publication standards mandate that central tendency values be reported together with measures of variability.
Sources
- Field, A. (2018). Discovering Statistics Using IBM SPSS Statistics (5th ed.). SAGE. ISBN: 978-1-5264-1951-4