方法对比

并排查看您选择的方法；存在差异的行会高亮显示。

	Cohen's Kappa Coefficient ×	卡方独立性检验 ×	Fleiss Kappa多评分者一致性系数 ×	麦克尼马尔检验 ×
领域	统计学	统计学	统计学	统计学
方法族	Hypothesis test	Hypothesis test	Hypothesis test	Hypothesis test
起源年份≠	1960	1900	1971	1947
提出者≠	Jacob Cohen	Karl Pearson	Joseph L. Fleiss	Quinn McNemar
类型≠	Inter-rater reliability coefficient	Nonparametric test of association	Non-parametric agreement measure	Nonparametric test for paired binary data
开创性文献≠	Cohen, J. (1960). A Coefficient of Agreement for Nominal Scales. Educational and Psychological Measurement, 20(1), 37–46. DOI ↗	Pearson, K. (1900). On the criterion that a given system of deviations from the probable in the case of a correlated system of variables is such that it can be reasonably supposed to have arisen from random sampling. Philosophical Magazine, 50(302), 157–175. DOI ↗	Fleiss, J.L. (1971). Measuring Nominal Scale Agreement Among Many Raters. Psychological Bulletin, 76(5), 378–382. DOI ↗	McNemar, Q. (1947). Note on the sampling error of the difference between correlated proportions or percentages. Psychometrika, 12(2), 153–157. DOI ↗
别名≠	kappa coefficient, kappa statistic, Cohen's Kappa (Değerlendiriciler Arası Uyum)	chi-squared test, Pearson's chi-square test, test of independence, ki-kare bağımsızlık testi	multi-rater kappa, Fleiss kappa, Fleiss' Kappa (Çoklu Değerlendirici Uyumu)	McNemar chi-square test, test for correlated proportions, paired binary test, McNemar Testi
相关≠	3	2	2	5
摘要≠	Cohen's kappa (κ) is a statistical measure of inter-rater reliability for categorical classifications, introduced by Jacob Cohen in 1960. Unlike simple percent agreement, kappa corrects for the level of agreement that would be expected purely by chance, making it the standard metric when two raters independently assign observations to the same set of mutually exclusive categories.	The chi-square test of independence is a nonparametric hypothesis test that examines whether two categorical variables are associated by comparing observed and expected frequencies in a cross-tabulation. It rests on the chi-square criterion introduced by Karl Pearson in 1900.	Fleiss' Kappa is a non-parametric statistic for measuring the degree of agreement among three or more raters who classify items into mutually exclusive nominal categories. Introduced by Joseph L. Fleiss in 1971 as a generalization of Cohen's Kappa beyond two raters, it corrects observed agreement for the level of agreement expected by chance alone, making it the standard reliability index in medical diagnosis studies, content analysis, and multi-coder research.	McNemar's test is a nonparametric hypothesis test that compares two paired (correlated) binary proportions, such as a yes/no measurement taken on the same subjects before and after an intervention. It was introduced by Quinn McNemar in 1947 and works on the 2×2 table of matched outcomes.
ScholarGate数据集 ↗	v1 2 来源 PUBLISHED	v1 2 来源 PUBLISHED	v1 1 来源 PUBLISHED	v1 1 来源 PUBLISHED

前往搜索 → 下载幻灯片