Norm-Referenced and Criterion-Referenced Tests : Theory and Practice

Main Article Content

Rapin Posrie
Sukunya Rujimethabhas
Surachet Boonyarug

Abstract

Norm-referenced and criterion-referenced tests are both achievement or proficiency tests, each of which has a different theoretical background. Construction of these tests with incorrect applications of theoretical backgrounds will lead to incorrect interpretation of raw score, resulting in huge negative effects on students and all other stakeholders. Both approaches have long been developing for more than half a century, therefore, in this article they will be named “theories”.


            Norm-referenced test theory explains that in general, the test domain is large and heterogeneous.                                         A test measuring such a domain is only a sample of tasks. A raw score is not a true score and not dependable as a true score.  It cannot be interpreted as a percent correct as a true score. It is, then, interpreted through norm referencing. While the criterion-referenced test theory explains that in some cases, the test domain is small and homogeneous.  A small sample of tasks can represent the whole domain, just like a spoon of soup can represent the whole soup in a pot. Therefore, a raw score is a virtual true score and dependable as a true score. It, then, can be interpreted in terms of a percent correct referencing directly to the test domain.     

Article Details

How to Cite
Posrie, R., Rujimethabhas, S., & Boonyarug, S. (2024). Norm-Referenced and Criterion-Referenced Tests : Theory and Practice. Rajabhat Maha Sarakham University Journal, 18(1), 13–22. retrieved from https://so05.tci-thaijo.org/index.php/rmuj/article/view/272546
Section
Academic Articles

References

Brennan, R. L., & Kane, M. T. (1977). An index of dependability for mastery tests. Journal of Educational Measurement, 14, 277-289.

Brennan, R. L. (1983). Elements of Generalizability Theory. Iowa: ACT Publications.

Cronbach, L. J. (1951). Coefficient alpha and the internal structure of tests. Psychometrika, 16, 292-334.

Ebel, R. L. (1970). Essentials of Educational Measurement. New Jersey: Prentice-Hall.

Hambleton, R. K., Swaminathan, H., Algina, J., & Coulson, D. B. (1978). Criterion-referenced testing and measurement: A review of technical issues and developments. Review of Educational Research, 48(1), 1-47.

IELTS Liz. (2024.). IELTS band score. https://ieltsliz-band-scores/

Glaser, R. (1963). Instructional technology, and the measurement of learning outcomes, Some questions, American Psychologist, 18, 519-521

KAPOOK. (n.d.). สอบใบขับขี่รถยนต์.https://car.kapook.com/view173898.html.Lyman, H. B. (1978). Test scores and what they mean (3rd Ed.). Englewood Cliffs.NJ: Prentice Hall.

Net News. (2559). ใบรายงานผลโอเน็ตมีกี่แบบ.https://www.niets.or.th

PISA Technical Report. (2022). Proficiency Scale Construction for the Core Domains. https://www.oecd.org/pisa/data/pisa2022technicalreport/.

Popham, W. J. (1981). Modern Educational Measurement. Englewood Cliffs. NJ: Prentice Hall.

Popham, W. J. (2014). Criterion-Referenced Measurement: Half a Century Wasted. https://www.ascd.org/el/ articles/criterion-referenced-measurement-half-century-waste.

Spearman, C. (1904). The proof and measurement of association between two things. American Journal of Psychology, 15, 72-101.