Hypothesis test

Fleiss' Kappa for Multiple Rater Agreement

Fleiss' Kappa is a non-parametric statistic for measuring the degree of agreement among three or more raters who classify items into mutually exclusive nominal categories. Introduced by Joseph L. Fleiss in 1971 as a generalization of Cohen's Kappa beyond two raters, it corrects observed agreement for the level of agreement expected by chance alone, making it the standard reliability index in medical diagnosis studies, content analysis, and multi-coder research.

Apply with StatMindSoonVideoSoon

Read the full method

Members only

Sign in with a free account to read this section.

Sign in

Sources

  1. Fleiss, J.L. (1971). Measuring Nominal Scale Agreement Among Many Raters. Psychological Bulletin, 76(5), 378–382. DOI: 10.1037/h0031619

Related methods

Referenced by

ScholarGateFleiss' Kappa (Fleiss' Kappa for Multiple Rater Agreement). Retrieved 2026-06-04 from https://scholargate.app/en/statistics/fleiss-kappa