So sánh phương pháp
Xem các phương pháp đã chọn cạnh nhau; những hàng khác biệt được làm nổi bật.
| Độ tin cậy kiểm định lặp lại của bài kiểm tra thích ứng bằng máy tính× | Lý thuyết Ứng đáp Câu hỏi (IRT)× | |
|---|---|---|
| Lĩnh vực | Trắc lượng tâm lý | Trắc lượng tâm lý |
| Họ | Latent structure | Latent structure |
| Năm ra đời≠ | 1970s–1980s | 1952–1968 |
| Người khởi xướng≠ | David J. Weiss and colleagues (adaptive testing reliability literature) | Frederic M. Lord (and Allan Birnbaum for the 2PL/3PL models) |
| Loại≠ | Reliability estimation | Probabilistic measurement model |
| Công trình gốc≠ | Weiss, D. J. (2004). Computerized adaptive testing for effective and efficient measurement in counseling and education. Measurement and Evaluation in Counseling and Development, 37(2), 70–84. DOI ↗ | Lord, F. M. & Novick, M. R. (1968). Statistical Theories of Mental Test Scores. Addison-Wesley. link ↗ |
| Tên gọi khác | CAT temporal stability, adaptive test retest reliability, CAT score consistency, computerized adaptive testing reliability | IRT, latent trait theory, item characteristic curve theory, modern test theory |
| Liên quan≠ | 4 | 5 |
| Tóm tắt≠ | Computerized adaptive test (CAT) test-retest reliability quantifies the consistency of ability estimates obtained when the same examinees complete a CAT on two separate occasions. Because adaptive algorithms tailor each examinee's item set individually, traditional reliability frameworks must be adapted to account for non-overlapping item exposures across administrations. | Item response theory models the probability that a respondent answers an item correctly (or endorses it) as a function of the respondent's latent trait level and the item's own statistical properties — difficulty, discrimination, and guessing. Unlike classical test theory, IRT places persons and items on the same scale, yielding measurement that is sample-independent for items and test-independent for persons. |
| ScholarGateBộ dữ liệu ↗ |
|
|