手法を比較
選択した手法を並べて確認できます。異なる行はハイライト表示されます。
| コンピュータ適応型テストの再検査信頼性× | 項目応答理論 (IRT)× | |
|---|---|---|
| 分野 | 心理測定学 | 心理測定学 |
| 系統 | Latent structure | Latent structure |
| 提唱年≠ | 1970s–1980s | 1952–1968 |
| 提唱者≠ | David J. Weiss and colleagues (adaptive testing reliability literature) | Frederic M. Lord (and Allan Birnbaum for the 2PL/3PL models) |
| 種類≠ | Reliability estimation | Probabilistic measurement model |
| 原典≠ | Weiss, D. J. (2004). Computerized adaptive testing for effective and efficient measurement in counseling and education. Measurement and Evaluation in Counseling and Development, 37(2), 70–84. DOI ↗ | Lord, F. M. & Novick, M. R. (1968). Statistical Theories of Mental Test Scores. Addison-Wesley. link ↗ |
| 別名 | CAT temporal stability, adaptive test retest reliability, CAT score consistency, computerized adaptive testing reliability | IRT, latent trait theory, item characteristic curve theory, modern test theory |
| 関連≠ | 4 | 5 |
| 概要≠ | Computerized adaptive test (CAT) test-retest reliability quantifies the consistency of ability estimates obtained when the same examinees complete a CAT on two separate occasions. Because adaptive algorithms tailor each examinee's item set individually, traditional reliability frameworks must be adapted to account for non-overlapping item exposures across administrations. | Item response theory models the probability that a respondent answers an item correctly (or endorses it) as a function of the respondent's latent trait level and the item's own statistical properties — difficulty, discrimination, and guessing. Unlike classical test theory, IRT places persons and items on the same scale, yielding measurement that is sample-independent for items and test-independent for persons. |
| ScholarGateデータセット ↗ |
|
|