مقایسهٔ روشها
روشهای انتخابی خود را کنار هم مرور کنید؛ ردیفهای متفاوت برجسته شدهاند.
| ضریب کاپای کوهن× | آزمون استقلال کی-دو× | کاپای فلیس برای توافق بین چند ارزیاب× | |
|---|---|---|---|
| حوزه | آمار | آمار | آمار |
| خانواده | Hypothesis test | Hypothesis test | Hypothesis test |
| سال پیدایش≠ | 1960 | 1900 | 1971 |
| پدیدآور≠ | Jacob Cohen | Karl Pearson | Joseph L. Fleiss |
| نوع≠ | Inter-rater reliability coefficient | Nonparametric test of association | Non-parametric agreement measure |
| منبع بنیادین≠ | Cohen, J. (1960). A Coefficient of Agreement for Nominal Scales. Educational and Psychological Measurement, 20(1), 37–46. DOI ↗ | Pearson, K. (1900). On the criterion that a given system of deviations from the probable in the case of a correlated system of variables is such that it can be reasonably supposed to have arisen from random sampling. Philosophical Magazine, 50(302), 157–175. DOI ↗ | Fleiss, J.L. (1971). Measuring Nominal Scale Agreement Among Many Raters. Psychological Bulletin, 76(5), 378–382. DOI ↗ |
| نامهای دیگر≠ | kappa coefficient, kappa statistic, Cohen's Kappa (Değerlendiriciler Arası Uyum) | chi-squared test, Pearson's chi-square test, test of independence, ki-kare bağımsızlık testi | multi-rater kappa, Fleiss kappa, Fleiss' Kappa (Çoklu Değerlendirici Uyumu) |
| مرتبط≠ | 3 | 2 | 2 |
| خلاصه≠ | Cohen's kappa (κ) is a statistical measure of inter-rater reliability for categorical classifications, introduced by Jacob Cohen in 1960. Unlike simple percent agreement, kappa corrects for the level of agreement that would be expected purely by chance, making it the standard metric when two raters independently assign observations to the same set of mutually exclusive categories. | The chi-square test of independence is a nonparametric hypothesis test that examines whether two categorical variables are associated by comparing observed and expected frequencies in a cross-tabulation. It rests on the chi-square criterion introduced by Karl Pearson in 1900. | Fleiss' Kappa is a non-parametric statistic for measuring the degree of agreement among three or more raters who classify items into mutually exclusive nominal categories. Introduced by Joseph L. Fleiss in 1971 as a generalization of Cohen's Kappa beyond two raters, it corrects observed agreement for the level of agreement expected by chance alone, making it the standard reliability index in medical diagnosis studies, content analysis, and multi-coder research. |
| ScholarGateمجموعهداده ↗ |
|
|
|