Porównaj metody
Przeglądaj wybrane metody obok siebie; wiersze, które się różnią, są wyróżnione.
| Teoria generalizowalności (G-Theory)× | Dwuparametrowy logistyczny model teorii odpowiedzi na pozycje (2PL)× | Niezawodność międzyoceniająca (kappa Cohena i ICC)× | |
|---|---|---|---|
| Dziedzina | Psychometria | Psychometria | Psychometria |
| Rodzina | Latent structure | Latent structure | Latent structure |
| Rok powstania≠ | 1963 | 1980 | 1960 (kappa); 1979 (ICC) |
| Twórca≠ | Lee J. Cronbach and colleagues | Frederic M. Lord | Cohen (kappa, 1960); Shrout & Fleiss (ICC, 1979) |
| Typ≠ | ANOVA-based variance-component framework | Item response model / latent trait model | Reliability / agreement analysis |
| Źródło pierwotne≠ | Brennan, R. L. (2001). Generalizability Theory. Springer. link ↗ | Lord, F. M. (1980). Applications of Item Response Theory to Practical Testing Problems. Erlbaum. link ↗ | Cohen, J. (1960). A Coefficient of Agreement for Nominal Scales. Educational and Psychological Measurement, 20(1), 37–46. DOI ↗ |
| Inne nazwy≠ | Generalizability Theory, G-Study / D-Study framework, Genellenebilirlik Kuramı (G-Kuramı) | two-parameter logistic model, 2PL model, 2PL IRT — İki Parametreli Madde Tepki Modeli | inter-rater reliability, interrater agreement, rater agreement, Değerlendiriciler Arası Güvenilirlik (Cohen's κ, ICC) |
| Pokrewne | 6 | 6 | 6 |
| Podsumowanie≠ | Generalizability Theory, developed by Lee J. Cronbach and colleagues in the 1960s and formalised by Brennan (2001), is an ANOVA-based framework that extends Classical Test Theory by decomposing observed score variance into multiple, separately identified sources of measurement error — such as raters, tasks, occasions, or items — rather than bundling all error into a single undifferentiated term. | The two-parameter logistic item response model, formalised by Frederic Lord (1980), describes the probability that a respondent answers a binary test item correctly as a smooth S-shaped function of the respondent's latent ability. By estimating a separate discrimination parameter for each item alongside a difficulty parameter, 2PL allows items to differ in how sharply they distinguish high- from low-ability respondents — making it the standard model for large-scale educational and psychological assessments. | Interrater reliability quantifies the degree to which two or more independent raters produce consistent scores when evaluating the same individuals or products. The family encompasses Cohen's kappa, introduced in 1960 for categorical judgments, and the Intraclass Correlation Coefficient (ICC) for continuous ratings, together spanning most measurement scenarios encountered in behavioral, health, and educational research. |
| ScholarGateZbiór danych ↗ |
|
|
|