Compare methods
Review your selected methods side by side; rows that differ are highlighted (marked ≠).
| | Computerized Adaptive Test Test-Retest Reliability | Item Response Theory (IRT) |
|---|---|---|
| Field | Psychometrics | Psychometrics |
| Family | Latent structure | Latent structure |
| Year of origin ≠ | 1970s–1980s | 1952–1968 |
| Originator ≠ | David J. Weiss and colleagues (adaptive testing reliability literature) | Frederic M. Lord (and Allan Birnbaum for the 2PL/3PL models) |
| Type ≠ | Reliability estimation | Probabilistic measurement model |
| Original source ≠ | Weiss, D. J. (2004). Computerized adaptive testing for effective and efficient measurement in counseling and education. Measurement and Evaluation in Counseling and Development, 37(2), 70–84. | Lord, F. M. & Novick, M. R. (1968). Statistical Theories of Mental Test Scores. Addison-Wesley. |
| Aliases | CAT temporal stability, adaptive test retest reliability, CAT score consistency, computerized adaptive testing reliability | IRT, latent trait theory, item characteristic curve theory, modern test theory |
| Related ≠ | 4 | 5 |
| Summary ≠ | Computerized adaptive test (CAT) test-retest reliability quantifies the consistency of ability estimates obtained when the same examinees complete a CAT on two separate occasions. Because adaptive algorithms tailor each examinee's item set individually, traditional reliability frameworks must be adapted to account for non-overlapping item exposures across administrations. | Item response theory models the probability that a respondent answers an item correctly (or endorses it) as a function of the respondent's latent trait level and the item's own statistical properties: difficulty, discrimination, and guessing. Unlike classical test theory, IRT places persons and items on the same scale, yielding measurement that is sample-independent for items and test-independent for persons. |
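
The two summaries in the table can be made concrete with a small sketch. It assumes the common three-parameter logistic (3PL) form of the item response function, without the optional scaling constant some texts include, and it uses a plain Pearson correlation between ability estimates from two occasions as one simple test-retest index. The function names and the ability estimates below are hypothetical, chosen only for illustration; they come from neither Weiss (2004) nor Lord & Novick (1968).

```python
import math


def p_correct_3pl(theta, a, b, c=0.0):
    """Probability of a correct response under an assumed 3PL form.

    theta: respondent's latent trait level
    a: item discrimination, b: item difficulty, c: guessing (lower asymptote)
    """
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))


def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of numbers."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    syy = sum((yi - mean_y) ** 2 for yi in y)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical ability estimates for the same five examinees on two occasions.
theta_time1 = [-1.2, -0.4, 0.1, 0.8, 1.5]
theta_time2 = [-1.0, -0.5, 0.3, 0.6, 1.7]

# Response probability for an average respondent on a hypothetical item.
print(p_correct_3pl(theta=0.0, a=1.2, b=0.5, c=0.2))

# Simple test-retest index: correlation of the two sets of estimates.
print(pearson_r(theta_time1, theta_time2))
```

In an adaptive test each examinee may see different items on each occasion, so the correlation is taken over the ability estimates rather than over item-level scores; a full reliability analysis would also account for the standard error of each estimate.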