Method Comparison
View your selected methods side by side; rows that differ are marked with ≠.
| | Inter-rater reliability (Cohen's κ and ICC) | Cohen's Kappa Coefficient | Fleiss' Kappa (multi-rater agreement coefficient) |
|---|---|---|---|
| Field ≠ | Psychometrics | Statistics | Statistics |
| Method family ≠ | Latent structure | Hypothesis test | Hypothesis test |
| Year of origin ≠ | 1960 (kappa); 1979 (ICC) | 1960 | 1971 |
| Proposed by ≠ | Cohen (kappa, 1960); Shrout & Fleiss (ICC, 1979) | Jacob Cohen | Joseph L. Fleiss |
| Type ≠ | Reliability / agreement analysis | Inter-rater reliability coefficient | Non-parametric agreement measure |
| Seminal reference ≠ | Cohen, J. (1960). A Coefficient of Agreement for Nominal Scales. Educational and Psychological Measurement, 20(1), 37–46. | Cohen, J. (1960). A Coefficient of Agreement for Nominal Scales. Educational and Psychological Measurement, 20(1), 37–46. | Fleiss, J.L. (1971). Measuring Nominal Scale Agreement Among Many Raters. Psychological Bulletin, 76(5), 378–382. |
| Aliases ≠ | inter-rater reliability, interrater agreement, rater agreement, Değerlendiriciler Arası Güvenilirlik (Cohen's κ, ICC) | kappa coefficient, kappa statistic, Cohen's Kappa (Değerlendiriciler Arası Uyum) | multi-rater kappa, Fleiss kappa, Fleiss' Kappa (Çoklu Değerlendirici Uyumu) |
| Related methods (count) ≠ | 6 | 3 | 2 |
| Summary ≠ | Interrater reliability quantifies the degree to which two or more independent raters produce consistent scores when evaluating the same individuals or products. The family encompasses Cohen's kappa, introduced in 1960 for categorical judgments, and the Intraclass Correlation Coefficient (ICC) for continuous ratings, together spanning most measurement scenarios encountered in behavioral, health, and educational research. | Cohen's kappa (κ) is a statistical measure of inter-rater reliability for categorical classifications, introduced by Jacob Cohen in 1960. Unlike simple percent agreement, kappa corrects for the level of agreement that would be expected purely by chance, making it the standard metric when two raters independently assign observations to the same set of mutually exclusive categories. | Fleiss' Kappa is a non-parametric statistic for measuring the degree of agreement among three or more raters who classify items into mutually exclusive nominal categories. Introduced by Joseph L. Fleiss in 1971 as a generalization of Cohen's Kappa beyond two raters, it corrects observed agreement for the level of agreement expected by chance alone, making it the standard reliability index in medical diagnosis studies, content analysis, and multi-coder research. |
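
The chance correction described in the summaries above can be illustrated with a short sketch. Both coefficients share the form κ = (P_o − P_e) / (1 − P_e), where P_o is observed agreement and P_e is agreement expected by chance; they differ in how those two quantities are estimated. The function names `cohen_kappa` and `fleiss_kappa` and the input layouts are assumptions made for this illustration, not part of the compared sources, and the code handles only the unweighted, nominal-category case.

```python
from collections import Counter


def cohen_kappa(rater_a, rater_b):
    """Unweighted Cohen's kappa for two raters labelling the same items.

    rater_a and rater_b are equal-length sequences of category labels
    (an assumed input layout for this sketch).
    """
    if len(rater_a) != len(rater_b) or len(rater_a) == 0:
        raise ValueError("both raters must label the same non-empty item set")
    n = len(rater_a)
    # Observed agreement: share of items given the same label by both raters.
    p_o = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement: product of each rater's marginal label proportions.
    freq_a, freq_b = Counter(rater_a), Counter(rater_b)
    p_e = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)
    # Undefined (division by zero) when p_e == 1, i.e. one label used throughout.
    return (p_o - p_e) / (1 - p_e)


def fleiss_kappa(counts):
    """Fleiss' kappa from an items x categories table of rating counts.

    counts[i][j] is the number of raters who put item i in category j;
    every row must sum to the same number of raters (at least 2).
    """
    n_items = len(counts)
    n_raters = sum(counts[0])
    if n_raters < 2 or any(sum(row) != n_raters for row in counts):
        raise ValueError("each item needs the same number (>= 2) of ratings")
    n_categories = len(counts[0])
    # Overall proportion of all ratings that fall in each category.
    p_j = [sum(row[j] for row in counts) / (n_items * n_raters)
           for j in range(n_categories)]
    # Per-item agreement: share of rater pairs that agree on that item.
    p_i = [(sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
           for row in counts]
    p_bar = sum(p_i) / n_items          # mean observed agreement
    p_e = sum(p * p for p in p_j)       # agreement expected by chance
    return (p_bar - p_e) / (1 - p_e)


if __name__ == "__main__":
    a = ["yes", "yes", "no", "no"]
    b = ["yes", "no", "no", "no"]
    print(cohen_kappa(a, b))                 # two raters, four items
    print(fleiss_kappa([[3, 0], [0, 3]]))    # three raters in full agreement
```

In the two-rater example the raters agree on three of four items (P_o = 0.75), but half of that agreement is expected by chance given their label frequencies (P_e = 0.5), so kappa is 0.5 rather than 0.75. For production use, established implementations such as those in scikit-learn or statsmodels are likely a better choice than this sketch.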