Sammenlign metoder
Gjennomgå de valgte metodene side om side; rader som avviker, er uthevet.
| Mellomvurderer-reliabilitet (Cohens κ og ICC)× | Bland-Altman Metodekomparativ Analyse× | Cohens Kappa-koeffisient× | Cronbachs Alpha (Reliabilitetsanalyse)× | Fleiss' Kappa for samsvar mellom flere bedømmere× | |
|---|---|---|---|---|---|
| Fagfelt≠ | Psykometri | Statistikk | Statistikk | Statistikk | Statistikk |
| Familie≠ | Latent structure | Hypothesis test | Hypothesis test | Latent structure | Hypothesis test |
| Opprinnelsesår≠ | 1960 (kappa); 1979 (ICC) | 1986 | 1960 | 1951 | 1971 |
| Opphavsperson≠ | Cohen (kappa, 1960); Shrout & Fleiss (ICC, 1979) | J. Martin Bland & Douglas G. Altman | Jacob Cohen | Lee J. Cronbach | Joseph L. Fleiss |
| Type≠ | Reliability / agreement analysis | Graphical and statistical method comparison | Inter-rater reliability coefficient | Reliability / internal consistency coefficient | Non-parametric agreement measure |
| Opprinnelig kilde≠ | Cohen, J. (1960). A Coefficient of Agreement for Nominal Scales. Educational and Psychological Measurement, 20(1), 37–46. DOI ↗ | Bland, J.M. & Altman, D.G. (1986). Statistical Methods for Assessing Agreement Between Two Methods of Clinical Measurement. Lancet, 327(8476), 307–310. DOI ↗ | Cohen, J. (1960). A Coefficient of Agreement for Nominal Scales. Educational and Psychological Measurement, 20(1), 37–46. DOI ↗ | Cronbach, L. J. (1951). Coefficient alpha and the internal structure of tests. Psychometrika, 16(3), 297–334. DOI ↗ | Fleiss, J.L. (1971). Measuring Nominal Scale Agreement Among Many Raters. Psychological Bulletin, 76(5), 378–382. DOI ↗ |
| Alias≠ | inter-rater reliability, interrater agreement, rater agreement, Değerlendiriciler Arası Güvenilirlik (Cohen's κ, ICC) | Bland-Altman plot, limits of agreement analysis, method agreement analysis, Bland-Altman Uyum Analizi | kappa coefficient, kappa statistic, Cohen's Kappa (Değerlendiriciler Arası Uyum) | coefficient alpha, alpha reliability, internal consistency reliability, Güvenilirlik Analizi (Cronbach Alpha) | multi-rater kappa, Fleiss kappa, Fleiss' Kappa (Çoklu Değerlendirici Uyumu) |
| Relaterte≠ | 6 | 5 | 3 | 4 | 2 |
| Sammendrag≠ | Interrater reliability quantifies the degree to which two or more independent raters produce consistent scores when evaluating the same individuals or products. The family encompasses Cohen's kappa, introduced in 1960 for categorical judgments, and the Intraclass Correlation Coefficient (ICC) for continuous ratings, together spanning most measurement scenarios encountered in behavioral, health, and educational research. | The Bland-Altman analysis is a graphical and statistical technique for assessing agreement between two measurement methods applied to the same subjects. Introduced by J. Martin Bland and Douglas G. Altman in their landmark 1986 Lancet paper, it plots the difference between the two methods against their mean for each subject, and derives the bias (mean difference) along with limits of agreement (LoA) that capture 95% of differences in the population. | Cohen's kappa (κ) is a statistical measure of inter-rater reliability for categorical classifications, introduced by Jacob Cohen in 1960. Unlike simple percent agreement, kappa corrects for the level of agreement that would be expected purely by chance, making it the standard metric when two raters independently assign observations to the same set of mutually exclusive categories. | Cronbach's alpha is a coefficient of internal consistency that quantifies the degree to which a set of items on a scale measures the same underlying construct. Introduced by Lee J. Cronbach in 1951, it remains the most widely reported reliability index in social-science, health, and educational research. | Fleiss' Kappa is a non-parametric statistic for measuring the degree of agreement among three or more raters who classify items into mutually exclusive nominal categories. Introduced by Joseph L. Fleiss in 1971 as a generalization of Cohen's Kappa beyond two raters, it corrects observed agreement for the level of agreement expected by chance alone, making it the standard reliability index in medical diagnosis studies, content analysis, and multi-coder research. |
| ScholarGateDatasett ↗ |
|
|
|
|
|