Item response theory; maximum likelihood; weighted likelihood; standard error; exact distribution
Abstract :
[en] The maximum likelihood (ML) and the weighted likelihood (WL) estimators are commonly used to obtain proficiency level estimates with pre-calibrated item parameters. Both estimators have the same asymptotic standard error (ASE) that can be easily derived from the expected information function of the test. However, the accuracy of this asymptotic formula is uncertain with short tests when only a few items are administered. The purpose of this paper is to compare the ASE of these estimators to their exact values, evaluated at the proficiency level estimates. The exact SE is computed by generating the full exact sample distribution of the estimators, so its practical feasibility is limited to small tests (except under the Rasch model). A simulation study was conducted to compare the ASE and the exact SE of the ML and WL estimators, to the “true” SE (i.e., computed as the exact SE with the true proficiency levels). It is concluded that with small tests, the exact SEs are less biased and return smaller root mean squared error values than the asymptotic SEs, while as expected the two estimators return similar results with longer tests.
Disciplines :
Education & instruction
Author, co-author :
Magis, David ; Université de Liège - ULiège > Département d'éducation et formation > Psychométrie et édumétrie
Language :
English
Title :
Accuracy of asymptotic standard errors of the maximum and weighted likelihood estimators of proficiency levels with short tests
scite shows how a scientific paper has been cited by providing the context of the citation, a classification describing whether it supports, mentions, or contrasts the cited claim, and a label indicating in which section the citation was made.
Bibliography
Agresti A.Categorical data analysis. New York, NY: Wiley; 1990:.
Baker F. B.,Kim S.-H.Item response theory: Parameter estimation techniques. 2004). Item response theory: Parameter estimation techniques (2nd ed.). New York, NY: Marcel Dekker. doi:10.2307/2532822New York, NY: Marcel Dekker; 2004:.
Birnbaum A.Statistical theories of mental test scores. Lord F. M.Novick M. R., ed. Reading, MA: Addison-Wesley; 1968:17-20.
Bock R. D.,Mislevy R. J.Adaptive EAP estimation of ability in a microcomputer environment.Applied Psychological Measurement. 1982;6:431-444.
Donoghue J. R.,Allen N. L.Thin vs. thick matching in the Mantel-Haenszel procedure for detecting DIF.Journal of Educational Statistics. 1993;18:131-154.
Hambleton R. K.,Swaminathan H.Item response theory: Principles and applications. Boston, MA: Kluwer; 1985:.
Lord F. M.Applications of item response theory to practical testing problems. Hillsdale, NJ: Erlbaum; 1980:.
Lord F. M.Unbiased estimators of ability parameters, of their variance, and of their parallel-forms reliability.Psychometrika. 1983;48:233-245.
Lord F. M.Maximum likelihood and Bayesian parameter estimation in item response theory.Journal of Educational Measurement. 1986;23:157-162.
Lord F. M.,Wingersky M. S.Comparison of IRT true-score and equipercentile observed-score equatings.Applied Psychological Measurement. 1984;8:453-461.
Magis D.,Raîche G.Random generation of response patterns under computerized adaptive testing with the R package catR.Journal of Statistical Software. 2012;48:1-31.
Penfield R. D.Application of the Breslow-Day test of trend in odds ratio heterogeneity to the detection of nonuniform DIF.Alberta Journal of Educational Research. 2003;49:231-243.
Penfield R. D.,Bergeron J. M.Applying a weighted maximum likelihood latent trait estimator to the generalized partial credit model.Applied Psychological Measurement. 2005;29:218-233.
Rasch G.Probabilistic models for some intelligence and attainment tests. Copenhagen, Denmark: Danish Institute for Educational Research; 1960:.
Vienna, Austria: R Foundation for Statistical Computing; 2012:.
van der Linden W. J.,Pashley P. J.Computerized adaptive testing: Theory and practice. van der Linden W. J.Glas C. A. W., ed. Boston, MA: Kluwer; 2009:1-25.
Wainer H.Computerized adaptive testing: A primer. 2000). Computerized adaptive testing: A primer (2nd ed.). Mahwah, NJ: Erlbaum.Mahwah, NJ: Erlbaum; 2000:.
Warm T. A.Weighted likelihood estimation of ability in item response theory.Psychometrika. 1989;54:427-450.
Warm T. A.Warm weighted likelihood estimates (WLE) of Rasch measures.Rasch Measurement Transactions. 2007 1094;21:.
Wu M. L.,Adams R. J.,Wilson M. R.Camberwell, Australia: Australian Council for Educational Research; 1997:.
Zimowski M. F.,Muraki E.,Mislevy R. J.,Bock R. D.Chicago, IL: Scientific Software International; 1996:.
Zwick R.,Donoghue J. R.,Grima A.Assessment of differential item functioning for performance tasks.Journal of Educational Measurement. 1993;30:233-251.
Similar publications
Sorry the service is unavailable at the moment. Please try again later.
This website uses cookies to improve user experience. Read more
Save & Close
Accept all
Decline all
Show detailsHide details
Cookie declaration
About cookies
Strictly necessary
Performance
Strictly necessary cookies allow core website functionality such as user login and account management. The website cannot be used properly without strictly necessary cookies.
This cookie is used by Cookie-Script.com service to remember visitor cookie consent preferences. It is necessary for Cookie-Script.com cookie banner to work properly.
Performance cookies are used to see how visitors use the website, eg. analytics cookies. Those cookies cannot be used to directly identify a certain visitor.
Used to store the attribution information, the referrer initially used to visit the website
Cookies are small text files that are placed on your computer by websites that you visit. Websites use cookies to help users navigate efficiently and perform certain functions. Cookies that are required for the website to operate properly are allowed to be set without your permission. All other cookies need to be approved before they can be set in the browser.
You can change your consent to cookie usage at any time on our Privacy Policy page.