Method comparison
View the selected methods side by side; rows that differ are marked with ≠.
| | Construct Validity in Computerized Adaptive Testing (CAT) | Computerized Adaptive Testing Based on Item Response Theory (CAT-IRT) |
|---|---|---|
| Field | Psychometrics | Psychometrics |
| Family | Latent structure | Latent structure |
| Year introduced ≠ | 1989–2000s | 1970s–1980s |
| Originator ≠ | Samuel Messick (unified validity framework); CAT application formalized by Wainer, van der Linden, and colleagues | Lord, F. M.; further developed by Wainer, van der Linden, and others |
| Type ≠ | Validity evaluation / psychometric evidence gathering | Adaptive measurement / sequential testing |
| Seminal work ≠ | Messick, S. (1989). Validity. In R. L. Linn (Ed.), Educational Measurement (3rd ed., pp. 13–103). American Council on Education / Macmillan. | Wainer, H. (Ed.). (2000). Computerized Adaptive Testing: A Primer (2nd ed.). Lawrence Erlbaum Associates. ISBN: 978-0805835113 |
| Also known as | CAT construct validity, adaptive test construct validation, CAT validity evidence, construct validity evidence in CAT | CAT-IRT, adaptive testing, IRT-based CAT, computerized adaptive testing |
| Related ≠ | 6 | 4 |
| Summary ≠ | Construct validity in computerized adaptive testing evaluates whether the latent trait estimates produced by a CAT instrument genuinely measure the intended psychological or educational construct. Because adaptive algorithms select items individually for each examinee, the validity evidence gathered must account for the variable item exposure and the IRT-based scoring that are unique to CAT administrations. | Computerized adaptive testing based on item response theory is a sequential measurement procedure in which a computer algorithm selects successive test items tailored to each examinee's estimated ability level. Drawing on IRT to model item characteristics and ability estimation, CAT delivers precise scores with far fewer items than fixed-length tests, making it efficient for high-stakes assessments, clinical screening, and large-scale surveys. |
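
The CAT-IRT summary describes a loop: estimate the examinee's ability, pick the next item to suit that estimate, score the response, and repeat. The sketch below illustrates one way such a loop might look. It assumes a two-parameter logistic (2PL) IRT model, maximum-information item selection, expected a posteriori (EAP) scoring on a grid with a standard normal prior, and a fixed-length stopping rule. None of these choices is specified in the table, and the item pool, function names, and `answer` callback are hypothetical.

```python
import math

# Each item is a hypothetical (a, b) pair: discrimination and difficulty.

def prob_correct(theta, a, b):
    """2PL probability of a correct response at ability theta."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def item_information(theta, a, b):
    """Fisher information of a 2PL item at ability theta."""
    p = prob_correct(theta, a, b)
    return a * a * p * (1.0 - p)

def select_item(theta, pool, used):
    """Pick the unused item with the highest information at theta."""
    candidates = [i for i in range(len(pool)) if i not in used]
    return max(candidates, key=lambda i: item_information(theta, *pool[i]))

# Ability grid from -4 to 4 in steps of 0.1 (an assumed resolution).
GRID = [-4.0 + 0.1 * k for k in range(81)]

def eap_estimate(responses, pool):
    """Posterior mean of theta given (item_index, 0/1) responses."""
    weights = []
    for theta in GRID:
        w = math.exp(-0.5 * theta * theta)  # unnormalized N(0, 1) prior
        for idx, u in responses:
            p = prob_correct(theta, *pool[idx])
            w *= p if u else (1.0 - p)
        weights.append(w)
    total = sum(weights)
    return sum(t * w for t, w in zip(GRID, weights)) / total

def run_cat(pool, answer, max_items):
    """Administer up to max_items adaptively; answer(idx) returns 0 or 1."""
    theta, used, responses = 0.0, set(), []
    for _ in range(min(max_items, len(pool))):
        idx = select_item(theta, pool, used)
        used.add(idx)
        responses.append((idx, answer(idx)))
        theta = eap_estimate(responses, pool)  # re-estimate after each item
    return theta, responses

# Usage: a simulated examinee who answers correctly whenever the item's
# difficulty is at or below a "true" ability of 0.8 (a deterministic toy rule).
demo_pool = [(1.2, -1.5), (1.0, -0.5), (1.4, 0.0), (0.9, 0.7), (1.1, 1.5)]
estimate, history = run_cat(demo_pool, lambda i: int(demo_pool[i][1] <= 0.8), 4)
print(estimate, history)
```

Operational CAT systems typically add exposure control, content balancing, and a precision-based stopping rule, all of which this sketch omits. Those omitted features are part of what the construct-validity column refers to when it mentions variable item exposure.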