Usporedite metode
Pregledajte odabrane metode jednu uz drugu; retci koji se razlikuju su istaknuti.
| Differential Item Functioning in Educational Testing× | Standardized Test Analysis× | |
|---|---|---|
| Područje | Education | Education |
| Obitelj | Latent structure | Latent structure |
| Godina nastanka≠ | 1993 | 2014 |
| Tvorac≠ | Educational measurement / test-fairness tradition (Holland, Wainer, Dorans, Thissen) | Educational measurement profession (AERA/APA/NCME Standards; Lord; Cronbach) |
| Vrsta≠ | Test-fairness analysis detecting items that function differently across groups | Psychometric evaluation of items, reliability, validity, and fairness of standardized tests |
| Temeljni izvor≠ | Holland, P. W., & Wainer, H. (Eds.). (1993). Differential Item Functioning. Lawrence Erlbaum Associates. ISBN: 9780805809725 | American Educational Research Association, American Psychological Association, & National Council on Measurement in Education. (2014). Standards for Educational and Psychological Testing. AERA. ISBN: 9780935302356 |
| Drugi nazivi | Educational DIF Analysis, Item Bias Detection in Tests, Test Fairness DIF, Mantel-Haenszel DIF | Standardized Testing Analysis, Test Score Analysis, Item and Test Analysis, Educational Test Psychometrics |
| Srodne≠ | 4 | 3 |
| Sažetak≠ | Differential item functioning (DIF) analysis is the central statistical tool for evaluating the fairness of test items in education. An item shows DIF when examinees of equal ability but different group membership — for example by gender, race/ethnicity, or language background — have unequal probabilities of answering it correctly. By conditioning on ability before comparing groups, DIF analysis separates genuine item bias from real group differences in proficiency, and flags items for expert review before they affect high-stakes decisions. | Standardized test analysis is the body of psychometric methods used to evaluate and score standardized educational tests: analyzing how items perform, estimating reliability and the standard error of measurement, scaling scores via classical or item response theory, and assembling validity and fairness evidence. Governed by the professional Standards for Educational and Psychological Testing and rooted in test theory synthesized by Lord and others, it is the disciplined work that turns a set of test questions into defensible scores carrying meaning, precision, and fairness. |
| ScholarGateSkup podataka ↗ |
|
|