方法对比
并排查看您选择的方法;存在差异的行会高亮显示。
| 计算机化自适应测验重测信度× | 项目反应理论 (IRT)× | |
|---|---|---|
| 领域 | 心理测量学 | 心理测量学 |
| 方法族 | Latent structure | Latent structure |
| 起源年份≠ | 1970s–1980s | 1952–1968 |
| 提出者≠ | David J. Weiss and colleagues (adaptive testing reliability literature) | Frederic M. Lord (and Allan Birnbaum for the 2PL/3PL models) |
| 类型≠ | Reliability estimation | Probabilistic measurement model |
| 开创性文献≠ | Weiss, D. J. (2004). Computerized adaptive testing for effective and efficient measurement in counseling and education. Measurement and Evaluation in Counseling and Development, 37(2), 70–84. DOI ↗ | Lord, F. M. & Novick, M. R. (1968). Statistical Theories of Mental Test Scores. Addison-Wesley. link ↗ |
| 别名 | CAT temporal stability, adaptive test retest reliability, CAT score consistency, computerized adaptive testing reliability | IRT, latent trait theory, item characteristic curve theory, modern test theory |
| 相关≠ | 4 | 5 |
| 摘要≠ | Computerized adaptive test (CAT) test-retest reliability quantifies the consistency of ability estimates obtained when the same examinees complete a CAT on two separate occasions. Because adaptive algorithms tailor each examinee's item set individually, traditional reliability frameworks must be adapted to account for non-overlapping item exposures across administrations. | Item response theory models the probability that a respondent answers an item correctly (or endorses it) as a function of the respondent's latent trait level and the item's own statistical properties — difficulty, discrimination, and guessing. Unlike classical test theory, IRT places persons and items on the same scale, yielding measurement that is sample-independent for items and test-independent for persons. |
| ScholarGate数据集 ↗ |
|
|