[en] In item response theory, the classical estimators of ability are highly sensitive to response disturbances and can return strongly biased estimates of the true underlying ability level. Robust methods were introduced to lessen the impact of such aberrant responses onto the estimation process. The computation of asymptotic (i.e., large sample) standard errors (ASE) for these robust estimators, however, has not been fully considered yet. This paper focuses on a broad class of robust ability estimators, defined by an appropriate selection of the weight function and the residual measure, for which the ASE is derived from the theory of estimating equations. The maximum likelihood (ML) and the robust estimators, together with their estimated ASE, are then compared through a simulation study. It is concluded that both the estimators and their ASE perform similarly in absence of response disturbances, while the robust estimator and its estimated ASE are less biased and outperform their ML counterparts in presence of response disturbances with large impact on the item response process.
Disciplines :
Mathematics
Author, co-author :
Magis, David ; Université de Liège - ULiège > Département d'éducation et formation > Psychométrie et édumétrie
Language :
English
Title :
On the asymptotic standard error of a class of robust estimators of ability in dichotomous item response models
Publication date :
2014
Journal title :
British Journal of Mathematical and Statistical Psychology
ISSN :
0007-1102
Publisher :
British Psychological Society, London, United Kingdom
scite shows how a scientific paper has been cited by providing the context of the citation, a classification describing whether it supports, mentions, or contrasts the cited claim, and a label indicating in which section the citation was made.
Bibliography
Birnbaum, A. (1968). Some latent trait models and their use in inferring an examinee's ability. In F. M. Lord & M. R. Novick (Eds.), Statistical theories of mental test scores (pp. 397-479). Reading, MA: Addison-Wesley.
Carroll, R. J., & Pederson, S. (1993). On robustness in the logistic regression model. Journal of the Royal Statistical Society, Series B, 55, 693-706.
Donoghue, J. R., & Allen, N. L. (1993). Thin vs. thick matching in the Mantel-Haenszel procedure for detecting DIF. Journal of Educational Statistics, 18, 131-154. doi:10.2307/1165084
Hambleton, R. K., & Swaminathan, H. (1985). Item response theory: Principles and applications. Boston: Kluwer.
Huber, P. J. (1981). Robust statistics. New York, NY: Wiley.
Hutton, P. (2006). Understanding student cheating and what educators can do about it. College Teaching, 54, 171-176. doi:10.3200/CTCH.54.1.171-176
Inagaki, N. (1973). Asymptotic relations between the likelihood estimating functions and the maximum likelihood estimators. Annals of the Institute of Statistical Mathematics, 25, 1-26. doi:10.1007/BF02479355
Karabatsos, G. (2003). Comparing the aberrant response detection performance of thirty six person-fit statistics. Applied Measurement in Education, 16, 277-298. doi:10.1207/S15324818AME1604_2
Lord, F. M. (1980). Applications of item response theory to practical testing problems. Hillsdale, NJ: Lawrence Erlbaum.
Lord, F. M. (1983). Unbiased estimators of ability parameters, of their variance, and of their parallel-forms reliability. Psychometrika, 48, 233-245. doi:10.1007/BF02294018
Magis, D., & De Boeck, P. (2011). Identification of differential item functioning in multiple-group settings: A multivariate outlier detection approach. Multivariate Behavioral Research, 46, 733-755. doi:10.1080/00273171.2011.606757
Maronna, R. A., Martin, D., & Yohai, V. J. (2006). Robust statistics: Theory and methods. Chichester, UK: Wiley.
Meijer, R., & Sijtsma, K. (2001). Methodology review: Evaluating person fit. Applied Psychological Measurement, 25, 107-135. doi:10.1177/01466210122031957
Mislevy, R. J., & Bock, R. D. (1982). Biweight estimates of latent ability. Educational and Psychological Measurement, 42, 725-737. doi:10.1177/001316448204200302
Molenaar, I. W., & Hoijtink, H. (1990). The many null distributions of person fit indices. Psychometrika, 55, 75-106. doi:10.1007/BF02294745
Mosteller, F., & Tukey, J. (1977). Exploratory data analysis and regression. Reading, MA: Addison-Wesley.
Penfield, R. D. (2003). Application of the Breslow-Day test of trend in odds ratio heterogeneity to the detection of nonuniform DIF. Alberta Journal of Educational Research, 49, 231-243.
Pregibon, D. (1982). Resistant fits for some commonly used logistic models with medical applications. Biometrics, 38, 485-498. doi:10.2307/2530463
Rao, M. M. (1984). Probability theory with applications. New York: Academic Press.
Rousseeuw, P. J., & Leroy, A. M. (1987). Robust regression and outlier detection. New York, NY: Wiley.
Schuster, C., & Yuan, K.-H. (2011). Robust estimation of latent ability in item response models. Journal of Educational and Behavioral Statistics, 36, 720-735. doi:10.3102/1076998610396890
Snijders, T. A. B. (2001). Asymptotic null distribution of person fit statistics with estimated person parameter. Psychometrika, 66, 331-342. doi:10.1007/BF02294437
Stefanski, L. A., & Boos, D. D. (2002). The calculus of M-estimation. American Statistician, 56, 29-38. doi:10.1198/000313002753631330
St-Onge, C., Valois, P., Abdous, B., & Germain, S. (2011). Accuracy of person-fit statistics: A Monte Carlo study of the influence of aberrance rates. Applied Psychological Measurement, 35, 419-432. doi:10.1177/0146621610391777
Van der Linden, W., & Glas, C. A. W. (2009). Elements of adaptive testing. New York, NY: Springer.
Wainer, H. (2000). Computerized adaptive testing: A primer. (2nd ed.) Mahwah, NJ: Lawrence Erlbaum Associates.
Wainer, H., & Wright, B. D. (1980). Robust estimation of ability in the Rasch model. Psychometrika, 45, 373-391. doi:10.1007/BF02293910
Warm, T. A. (1989). Weighted likelihood estimation of ability in item response models. Psychometrika, 54, 427-450. doi:10.1007/BF02294627
Yuan, K.-H. (1997). A theorem on uniform convergence of stochastic functions with applications. Journal of Multivariate Analysis, 62, 100-109. doi:10.1006/jmva.1997.1674
Yuan, K.-H., & Jennrich, R. I. (1998). Asymptotics of estimating equations under natural conditions. Journal of Multivariate Analysis, 65, 245-260. doi:10.1006/jmva.1997.1731
Zeileis, A. (2006). Object-oriented computation of sandwich estimators. Journal of Statistical Software, 16(9), 1-16.
Zwick, R., Donoghue, J. R., & Grima, A. (1993). Assessment of differential item functioning for performance tasks. Journal of Educational Measurement, 30, 233-251. doi:10.1111/j.1745-3984.1993.tb00425.x
This website uses cookies to improve user experience. Read more
Save & Close
Accept all
Decline all
Show detailsHide details
Cookie declaration
About cookies
Strictly necessary
Performance
Strictly necessary cookies allow core website functionality such as user login and account management. The website cannot be used properly without strictly necessary cookies.
This cookie is used by Cookie-Script.com service to remember visitor cookie consent preferences. It is necessary for Cookie-Script.com cookie banner to work properly.
Performance cookies are used to see how visitors use the website, eg. analytics cookies. Those cookies cannot be used to directly identify a certain visitor.
Used to store the attribution information, the referrer initially used to visit the website
Cookies are small text files that are placed on your computer by websites that you visit. Websites use cookies to help users navigate efficiently and perform certain functions. Cookies that are required for the website to operate properly are allowed to be set without your permission. All other cookies need to be approved before they can be set in the browser.
You can change your consent to cookie usage at any time on our Privacy Policy page.