Adams, R. J., Wilson, M. R., and Wang, W. (1997). The multidimensional random coefficients multinomial logit model. Applied Psychological Measurement, 21, 1-24.
Adams, R. J., and Wu, M. L. (Eds.) (2002). PISA 2000 Technical Report. Paris: OECD.
Adams, R. J., Wilson, M. R., and Wu, M. L. (1997). Multilevel item response modelling: An approach to errors in variables regression. Journal of Educational and Behavioral Statistics, 22, 47-76.
Beaton, A. E. (1987). Implementing the new design: The NAEP 1983-84 Technical Report (Report No. 15-TR-20). Princeton, NJ: Educational Testing Service.
Bryk, A. S., and Raudenbush, S. W. (1992). Hierarchical linear models: Applications and data analysis methods. Newbury Park, CA: Sage.
Chang, H., and Stout, W. (1993). The asymptotic posterior normality of the latent trait in an IRT model. Psychometrika, 58, 37-52.
Fuller, W. (1987). Measurement error models. New York: Wiley.
Goldstein, H. (1987). Multilevel statistical models. London: Edward Arnold.
Guilford, J. P. (1954). Psychometric methods. New York: McGraw-Hill.
Littell, R. C., Milliken, G. A., Stroup, W. W., and Wolfinger, R. D. (1999). SAS system for mixed models. Cary, NC: SAS Institute.
Macaskill, G., Adams, R. J., and Wu, M. L. (1998). Scaling methodology and procedures for the mathematics and science literacy, advanced mathematics and physics scales. In M. Martin and D. L. Kelly (Eds.), Third International Mathematics and Science Study. Technical Report Volume 3: Implementation and Analysis. Chestnut Hill, MA: Boston College.
Mislevy, R. J. (1991). Randomization-based inference about latent variables from complex samples. Psychometrika, 56, 177-196.
Mislevy, R. J., Beaton, A. E., Kaplan, B., and Sheehan, K. M. (1992). Estimating population characteristics from sparse matrix samples of item responses. Journal of Educational Measurement, 29, 133-161.
Mislevy, R. J., and Sheehan, K. M. (1989). Information matrices in latent-variable models. Journal of Educational Statistics, 14(4), 335-350.
Organisation for Economic Co-operation and Development. (2001). Knowledge and skills for life: First results from PISA 2000. Paris: OECD Publications.
Organisation for Economic Co-operation and Development. (2004). Learning for tomorrow's world: First results from PISA 2003. Paris: OECD Publications.
Organisation for Economic Co-operation and Development. (2007). Science competencies for tomorrow's world. Paris: OECD Publications.
Rubin, D. B. (1987). Multiple imputation for nonresponse in surveys. New York: John Wiley and Sons.
Thomas, N. (2000). Assessing model sensitivity of the imputation methods used in the National Assessment of Educational Progress. Journal of Educational and Behavioral Statistics, 25, 351-371.
Thomas, N., and Gan, N. (1997). Generating multiple imputations for matrix sampling data analyzed with item response models. Journal of Educational and Behavioral Statistics, 22, 425-446.
Warm, T. A. (1985). Weighted maximum likelihood estimation of ability in item response theory with tests of finite length. Technical Report CGI-TR-85-08. Oklahoma City, OK: U.S. Coast Guard Institute.
Wu, M. L., and Adams, R. J. (2002, April). Plausible values: Why they are important. Paper presented at the International Objective Measurement Workshop, New Orleans, LA.
Wu, M. L., Adams, R. J., and Wilson, M. R. (1997). ConQuest: Multi-aspect test software [Computer program]. Camberwell, VIC, Australia: Australian Council for Educational Research.