ScholarGate

Method comparison

View the selected methods side by side; rows that differ are highlighted.

Methods compared: Bookmark Standard Setting; Item Response Theory (IRT); Vertical Scaling

Field
  Bookmark Standard Setting: Education
  Item Response Theory (IRT): Psychometrics
  Vertical Scaling: Education

Family
  Bookmark Standard Setting: Process / pipeline
  Item Response Theory (IRT): Latent structure
  Vertical Scaling: Latent structure

Year introduced
  Bookmark Standard Setting: 2001
  Item Response Theory (IRT): 1952–1968
  Vertical Scaling: 2014

Originator
  Bookmark Standard Setting: Howard Mitzel, Daniel Lewis, Richard Patz & Donald Ross Green (CTB/McGraw-Hill)
  Item Response Theory (IRT): Frederic M. Lord (and Allan Birnbaum for the 2PL/3PL models)
  Vertical Scaling: Educational measurement tradition (Thurstone; Kolen & Brennan synthesis)

Type
  Bookmark Standard Setting: IRT-based standard-setting procedure using ordered item booklets
  Item Response Theory (IRT): Probabilistic measurement model
  Vertical Scaling: Construction of a single developmental score scale spanning multiple grades

Foundational work
  Bookmark Standard Setting: Cizek, G. J., & Bunch, M. B. (2007). Standard Setting: A Guide to Establishing and Evaluating Performance Standards on Tests. Sage. ISBN: 9781412916820
  Item Response Theory (IRT): Lord, F. M., & Novick, M. R. (1968). Statistical Theories of Mental Test Scores. Addison-Wesley.
  Vertical Scaling: Kolen, M. J., & Brennan, R. L. (2014). Test Equating, Scaling, and Linking: Methods and Practices (3rd ed.). Springer. ISBN: 9781493903160

Also known as
  Bookmark Standard Setting: Bookmark Method, Bookmark Procedure, Item Mapping Standard Setting, Ordered Item Booklet Method
  Item Response Theory (IRT): IRT, latent trait theory, item characteristic curve theory, modern test theory
  Vertical Scaling: Developmental Scaling, Vertical Linking, Cross-Grade Scaling, Growth Scale Construction

Related
  Bookmark Standard Setting: 3
  Item Response Theory (IRT): 5
  Vertical Scaling: 4

Summary
  Bookmark Standard Setting: The Bookmark method is an item-response-theory-based standard-setting procedure in which test items are arranged in a booklet ordered from easiest to hardest. Panelists page through this ordered item booklet and place a "bookmark" at the point separating the items a borderline examinee would likely master from those the examinee would not, judged against a fixed response probability (commonly two-thirds). The latent ability at the bookmark defines the cut score. Developed at CTB/McGraw-Hill, it became one of the dominant methods for large-scale K-12 assessments.
  Item Response Theory (IRT): Item response theory models the probability that a respondent answers an item correctly (or endorses it) as a function of the respondent's latent trait level and the item's own statistical properties: difficulty, discrimination, and guessing. Unlike classical test theory, IRT places persons and items on the same scale, yielding measurement that is sample-independent for items and test-independent for persons.
  Vertical Scaling: Vertical scaling places tests written for different grade levels onto a single continuous score scale so that growth from one grade to the next can be measured in common units. Unlike horizontal equating, which links alternate forms intended to be interchangeable, vertical scaling deliberately links tests of differing difficulty and content to build a developmental continuum spanning, for example, grades 3 through 8. It is the measurement foundation that lets a fourth-grade score be subtracted from a fifth-grade score to express how much a student grew.
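
The summaries of IRT and the Bookmark method can be illustrated with a minimal Python sketch. It assumes the three-parameter logistic (3PL) form for the item response function, with discrimination a, difficulty b, and guessing c, and a response probability of two-thirds. The function names, the item parameters, and the rule that the cut score is the location of the last item before the bookmark are illustrative assumptions, not something specified on this page; operational programs differ in how they map a bookmark placement to a cut score.

```python
import math

def p_correct(theta, a, b, c=0.0):
    """3PL item response function: probability of a correct response
    for ability theta, discrimination a, difficulty b, guessing c."""
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))

def rp_location(a, b, c=0.0, rp=2 / 3):
    """Ability at which p_correct equals rp; used to order the booklet."""
    if not c < rp < 1.0:
        raise ValueError("rp must lie strictly between c and 1")
    # Invert rp = c + (1 - c) / (1 + exp(-a * (theta - b))) for theta.
    return b + math.log((rp - c) / (1.0 - rp)) / a

# Hypothetical item parameters, for illustration only.
items = [
    {"id": "Q3", "a": 0.8, "b": 1.0, "c": 0.25},
    {"id": "Q1", "a": 1.0, "b": -1.0, "c": 0.0},
    {"id": "Q2", "a": 1.2, "b": 0.0, "c": 0.2},
]

# Ordered item booklet: easiest to hardest by RP location.
booklet = sorted(items, key=lambda it: rp_location(it["a"], it["b"], it["c"]))

def cut_score(booklet, n_mastered):
    """Assumed convention: the cut score is the RP location of the last
    item the panelist judged a borderline examinee would master."""
    if not 1 <= n_mastered <= len(booklet):
        raise ValueError("n_mastered must be between 1 and len(booklet)")
    item = booklet[n_mastered - 1]
    return rp_location(item["a"], item["b"], item["c"])
```

For example, `cut_score(booklet, 2)` returns the ability at which the second item in the ordered booklet is answered correctly with probability two-thirds. Because persons and items share one scale under IRT, that ability value can be read directly as a cut score on the reporting scale.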
ScholarGate dataset
  Bookmark Standard Setting: v1, 2 sources, PUBLISHED
  Item Response Theory (IRT): v1, 2 sources, PUBLISHED
  Vertical Scaling: v1, 2 sources, PUBLISHED


ScholarGate. Method comparison: Bookmark Standard Setting · Item Response Theory · Vertical Scaling. Accessed 2026-06-25 from https://scholargate.app/vi/compare