Free Access
Bibliography
R. Darrell Bock,
Robert D. Gibbons,
R. Darrell Bock
Search for more papers by this authorRobert D. Gibbons
Search for more papers by this author
Book Author(s):R. Darrell Bock,
Robert D. Gibbons,
R. Darrell Bock
Search for more papers by this authorRobert D. Gibbons
Search for more papers by this author
First published: 02 July 2021
References
- Achtyes, E.D., Halstead, S., Smart, L.A. et al. (2015). Validation of computerized adaptive testing in an outpatient nonacademic setting: the VOCATIONS trial. Psychiatric Services 66 (10): 1091–1096.
- Ackerman, T.A. (1994). Using multidimensional item response theory to understand what items and tests are measuring. Applied Measurement in Education 7 (4): 255–278.
10.1207/s15324818ame0704_1 Google Scholar
- Ackerman, T.A. (1996). Graphical representation of multidimensional item response theory analysis. Applied Psychological Measurement 20: 311–329.
- Aitchison, J. and Silvey, S.D. (1958). Maximum-likelihood estimation of parameters subject to restraints. The Annals of Mathematical Statistics 29 (3): 813–828.
10.1214/aoms/1177706538 Google Scholar
- Alegría, M., Alvarez, K., Ishikawa, R.Z. et al. (2016). Removing obstacles to eliminating racial and ethnic disparities in behavioral health care. Health Affairs 35 (6): 991–999.
- Andersen, E.B. (1977). Sufficient statistics and latent trait models. Psychometrika 42 (1): 69–81.
- Andersen, E.B. (1980). Discrete Statistical Models with Social Science Applications. Amsterdam: North Holland.
- Andersen, E. and Madsen, M. (1977). Estimating the parameters of the latent population distribution. Psychometrika 42 (3): 357–374.
- Anderson, T.W. (1984). An Introduction to Multivariate Statistical Analysis, 2e. New York: Wiley.
- Andreasen, NC. (1984). The Scale for the Assessment of Positive Symptoms (SAPS). Iowa City, IA: University of Iowa.
- Andrich, D. (1978). Application of a psychometric rating model to ordered categories which are scored with successive integers. Applied Psychological Measurement 2 (4): 581–594.
10.1177/014662167800200413 Google Scholar
- Andrich, D. (1988). A general form of Rasch's extended logistic model for partial credit scoring. Applied Measurement in Education 1 (4): 363–378.
10.1207/s15324818ame0104_7 Google Scholar
- Anscombe, F.J. (1956). On estimating binomial response relations. Biometrika 43 (3/4): 461.
- Ashford, J. and Sowden, R.R. (1970). Multi-variate probit analysis. Biometrics 26 (3): 535–546.
- Baek, S.-G. (1997). Computerized adaptive testing using the partial credit model for attitude measurement. In: Objective Measurement: Theory Into Practice. (ed. M. Wilson, G. Engelhard and K. Draney), 37–55.
- Baker, F.B. (1992). Item Response Theory: Parameter Estimation Techniques. New York: Marcel Dekker.
- Ban, J.C., Hanson, B.A., Yi, Q., and Harris, D.J. (2002). Data sparseness and on-line pretest item calibration-scaling methods in CAT. Journal of Educational Measurement 39 (3): 207–218.
- Ban, J.C., Hanson, B.A., Wang, T. et al. (2006). A comparative study of online pretest item calibration/scaling methods in computerized adaptive testing. American Educational Research Association 38 (3): 191–212.
- Bartholomew, D.J. and Tzamourani, P. (1999). The goodness of fit of latent trait models in attitude measurement. Sociological Methods and Research 27 (4): 525–546.
- Beiser, D., Vu, M., and Gibbons, R. (2016). Test-retest reliability of a computerized adaptive depression screener. Psychiatric Services 67 (9): 1039–1041.
- Beiser, D.G., Ward, C.E., Vu, M. et al. (2019). Depression in emergency department patients and association with health care utilization. Academic Emergency Medicine 26 (8): 878–888.
- Berkson, J. (1956). Estimation by least squares and by maximum likelihood. Proceedings of the Third Berkeley Symposium 1: 1–11.
- Berndt, E.R., Hall, B.H., Hall, R.E., and Hausman, J.A. (1974). Estimation and inference in nonlinear structural models. Annals of Economic and Social Measurement 3 (4): 653–665.
- Berona, J., Whitton, S., Newcomb, M.E. et al. Prospective risk and protective factors for the transition from suicide ideation to attempt among sexual and gender minority youth. Psychiatric Services, in press.
- Birnbaum, A. (1957). Probability and Statistics in Item Analysis and Classification Problems: Efficient Design and Use of Tests of Mental Ability for Various Decision-making. Technical report, Ser. Rep. No. 15. Randolph Air Force Base, TX: USAF School of Aviation Medicine.
- Birnbaum, A. (1958a). Further Considerations of Efficiency in Tests of a Mental Ability. Technical report, Ser. Rep. No. 17. Randolph Air Force Base, TX: USAF School of Aviation Medicine.
- Birnbaum, A. (1958b). On the Estimation of Mental Ability. Technical report, Ser. Rep. No. 17. Randolph Air Force Base, TX: USAF School of Aviation Medicine.
- Birnbaum, A. (1968). Some latent trait models and their use in inferring an examinee's ability. In: Statistical Theories of Mental Test Scores (ed. F.M. Lord and M.R. Novick), 397–479. Reading, MA: Addison-Wesley.
- Bishop, Y.M., Holland, P.W., and Fienberg, S.E. (1975). Discrete Multivariate Analysis Theory and Practice. Cambridge, MA: Massachusetts Institute of Technology Press.
- Black, D.W., Gunter, T., Loveless, P. et al. (2010). Antisocial personality disorder in incarcerated offenders: psychiatric comorbidity and quality of life. Annals of Clinical Psychiatry 22 (2): 113–120.
- Bliss, C.I. (1935). The calculation of the dosage-mortality curve. Annals of Applied Biology 22 (1): 134–167.
- Bock, R. (1972). Estimating item parameters and latent ability when responses are scored in two or more nominal categories. Psychometrika 37 (1): 29–51.
- Bock, R. (1975). Multivariate Statistical Methods in Behavioral Research. New York: McGraw-Hill.
- Bock, R.D. and Moore, E.G.J. (1986). Advantage and Disadvantage: A Profile of American Youth. Hillsdale, NJ: Erlbaum.
- Bock, R. (1989a). Addendum: measurement of human variation: a two-stage model. R. Darrell Bock. In: Multilevel Analysis of Educational Data, 319–342. Academic Press.
10.1016/B978-0-12-108840-8.50021-4 Google Scholar
- Bock, R. (1989b). Measurement of Human Variation: A Two-Stage Model. Academic Press.
- Bock, R.D. (1997). The nominal categories model. W.J. van der Linden; R.K. Hambleton. In: Handbook of Modern Item Response Theory, 33–49. New York: Springer.
10.1007/978-1-4757-2691-6_2 Google Scholar
- Bock, R.D. and Aitkin, M. (1981). Marginal maximum likelihood estimation of item parameters: application of an EM algorithm. Psychometrika 46 (4): 443–459.
- Bock, R.D. and Gibbons, R.D. (1996). High-dimensional multivariate probit analysis. Biometrics 52 (4): 1183–1194.
- Bock, R. and Gibbons, R. (2010). Factor analysis of categorical item responses. In: Handbook of Polytomous Item Response Theory Models (ed. M.L. Nering and R. Ostini). Florence, KY: Lawrence Erlbaum. 155–184.
- Bock, R.D. and Jones, L.V. (1968). The Measurement and Prediction of Judgment and Choice. San Francisco, CA: Holden-Day.
- Bock, R.D. and Lieberman, M. (1970). Fitting a response model for n dichotomously scored items. Psychometrika 35 (2): 179–197.
- Bock, R.D. and Mislevy, R.J. (1982). Adaptive EAP estimation of ability in a microcomputer environment. Applied Psychological Measurement 6 (4): 431–444.
- Bock, R.D. and Schilling, S. (1997). High-dimensional full-information item factor analysis. M. Berkane. In: Latent Variable Modeling and Applications to Causality, 163–176. New York: Springer.
- Bock, R.D. and Zimowski, M.F. (1997). Multiple group IRT. In: Handbook of Modern Item Response Theory (ed. W.J. van der Linden and R.K. Hambleton), 433–448. New York: Springer.
- Bock, R.D., Mislevy, R., and Woodson, C. (1982). The next stage in educational assessment. Educational Researcher 11 (3): 4–16.
10.3102/0013189X011003004 Google Scholar
- Bock, R.D., Muraki, E., and Pfeiffenberger, W. (1988). Item pool maintenance in the presence of item parameter drift. Journal of Educational Measurement 25 (4): 275–285.
- Bock, R.D., Thissen, D., and Zimowski, M.F. (1997). IRT estimation of domain scores. Journal of Educational Measurement 34 (3): 197–211.
- Böckenholt, U. (2001). Hierarchical modeling of paired comparison data. Psychological Methods 6 (1): 49–64.
- Bradley, R.A. and Terry, M.E. (1952). Rank analysis of incomplete block designs: I. The method of paired comparisons. Biometrika 39 (3/4): 324.
- Brennan, R. (2001). Generalizability Theory. New York: Springer.
10.1007/978-1-4757-3456-0 Google Scholar
- Brown, J. and Weiss, D. (1977). An Adaptive Testing Strategy for Achievement Test Batteries. Technical report (Research Rep. No. 77-6). Minneapolis, MN: University of Minnesota, Department of Psychology, Psychometric Methods Program, Computerized Adaptive Testing Laboratory.
- Browne, M.W. and Cudeck, R. (1993). Alternative ways of assessing model fit. In: Testing Structural Equation Models (ed. K.A. Bollen and J.S. Long), pp. 136–162. Beverly Hills, CA: Sage.
- Cai, L. (2010). A two-tier full-information item factor analysis model with applications. Psychometrika 75: 581–612.
- Cai, L. and Hansen, M. (2013). Limited-information goodness-of-fit testing of hierarchical item factor models. British Journal of Mathematical and Statistical Psychology 66 (2): 245–276.
- Cai, L., Maydeu-Olivares, A., Coffman, D.L., and Thissen, D. (2006). Limited-information goodness-of-fit testing of item response theory models for sparse 2P tables. British Journal of Mathematical and Statistical Psychology 59 (1): 173–194.
- Cai, L., Thissen, D., and du Toit, S.H. (2011). IRTPRO. Lincolnwood, IL: Scientific Software International.
- Camilli, G. and Shepard, L. (1994). Methods for Identifying Biased Test Items. Thousand Oaks, CA: Sage.
- Chang, H.H. (2004). Understanding computerized adaptive testing: from Robbins–Monro to Lord and beyond. In: The Sage Handbook of Quantitative Methodology for the Social Sciences (ed. D. Kaplan), pp. 117–133. Thousand Oaks, CA: Sage.
10.4135/9781412986311.n7 Google Scholar
- Chang, H.-H. and Ying, Z. (1996). A global information approach to computerized adaptive testing. Applied Psychological Measurement 20: 213–229.
- Chang, H.-H. and Ying, Z. (1999). A-stratified multistage computerized adaptive testing. Applied Psychological Measurement 23 (3): 211–222.
- Chang, H.-H. and Ying, Z. (2009). Nonlinear sequential designs for logistic item response theory models with applications to computerized adaptive tests. Annals of Statistics 37 (3): 1466–1488.
- Chang, H.-H., Qian, J., and Ying, Z. (2001). A-stratified multistage computerized adaptive testing with b blocking. Applied Psychological Measurement 25 (4): 333–341.
- Chapman, L. and Bock, R.D. (1958). Components of variance due to acquiescence and content in the F scale measure of authoritarianism. Psychological Bulletin 55 (5): 328–333.
- Chen, S.-Y., Ankenmann, R.D., and Chang, H.-H. (2000). A comparison of item selection rules at the early stages of computerized adaptive testing. Applied Psychological Measurement 24 (3): 241–255.
- Cochran, W. and Cox, G. (1957). Experimental Designs. New York: Wiley.
- Cooper, B.E. (1968). Algorithm AS 2: the normal integral. Applied Statistics 17 (2): 186.
10.2307/2985683 Google Scholar
- Creedon, T.B. and Lê Cook, B. (2016). Datawatch: access to mental health care increased but not for substance use, while disparities remain. Health Affairs 35 (6): 1017–1021.
- Cronbach, L. (1970). Essentials of Psychological Testing. New York: Harper & Row.
- Cronbach, L.J., Gleser, G.C., Nanda, N., and Rajaratnam, N. (1972). The Dependability of Behavioral Measurements: Theory of Generalizability for Scores and Profiles. New York: Wiley.
- Day, N.E. (1969). Estimating the components of a mixture of normal distributions. Biometrika 56 (3): 463.
- Dempster, A.P., Laird, N.M., and Rubin, D.B. (1977). Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society: Series B (Methodological) 39 (1): 1–22.
- Dempster, A.P., Rubin, D.B., and Tsutakawa, R.K. (1981). Estimation in covariance components models. Journal of the American Statistical Association 76 (374): 341–353.
- Divgi, D.R. (1979a). Calculation of the tetrachoric correlation coefficient. Psychometrika 44 (2): 169–172.
- Divgi, D.R. (1979b). Calculation of univariate and bivariate normal probability functions. The Annals of Statistics 7 (4): 903–910.
- Dodd, B.G., de Ayala, R.J., and Koch, W.R. (1995). Computerized adaptive testing with polytomous items. Applied Psychological Measurement 19 (1): 5–22.
- Dorans, N.J., Moses, T.P., and Eignor, D.R. (2010). Principles and practices of test score equating. ETS Research Report Series 2010 (2): i–41.
10.1002/j.2333-8504.2010.tb02214.x Google Scholar
- Dunnett, C. (1964). New tables for multiple comparisons with a control. Biometrics 20 (3): 482–491.
- DuToit, M. (2003), IRT from SSI: Bilog-MG, multilog, parscale, testfact, Scientific Software International, Chicago, IL.
- Edwards, A.L. and Thurstone, L.L. (1952). An internal consistency check for scale values determined by the method of successive intervals. Psychometrika 17 (2): 169–180.
10.1007/BF02288780 Google Scholar
- Elderon, L., Smolderen, K.G., Na, B., and Whooley, M.A. (2011). Accuracy and prognostic value of american heart association-recommended depression screening in patients with coronary heart disease. Circulation: Cardiovascular Quality and Outcomes 4 (5): 533–540.
- Embretson, S. and Reise, S. (2000). Item Response Theory for Psychologists. Mahway, NJ: Lawrence Erlbaum Associates.
10.1037/10519-153 Google Scholar
- Endicott, J. and Spitzer, R.L. (1978). A diagnostic interview: the schedule for affective disorders and schizophrenia. Archives of General Psychiatry 35 (7): 837–844.
- Fechner, G.T. (1966). Elements of Psychophysics (ed. D.H. Howes and E.G. Boring). Leipzig: Breitkopf und Härtel. First published in 1860, translated by Adler, H.E.
- Fechner, G.T. (1860). Elemente der psychophysik. Leipzig: Breitkopf und Härtel.
10.1002/andp.18601871114 Google Scholar
- Fedorov, V.V. and Hackl, P. (1997). Model-Oriented Design of Experiments, Lecture Notes in Statistics . New York: Springer-Verlag.
10.1007/978-1-4612-0703-0 Google Scholar
- de Finetti, B.D. (1972). Probability, Induction and Statistics: The Art of Guessing. New York: Wiley.
- Finney, D.J. (1952). Statistical Method in Biological Assay. New York: Hafner Publishing Co.
- Finney, D.J. (1964). Probit Analysis: A Statistical Treatment of the Sigmoid Response Curve. London: Cambridge University Press.
- First, M., Gibbon, M., Spitzer, R., and Williams, J. B. W. (1996). User's Guide for the Structured Clinical Interview for DSM-IV Axis I Disorders-Research Version. New York: Biometrics Research Department, New York State Psychiatric Institute.
- Fisher, R.A. (1922). On the mathematical foundations of theoretical statistics. Philosophical Transactions of the Royal Society of London. Series A, Containing Papers of a Mathematical or Physical Character 222 (594–604): 309–368.
10.1098/rsta.1922.0009 Google Scholar
- Fisher, R.A. and Yates, F. (1938). Statistical Tables for Biological, Agricultural and Medical Research. London: Oliver and Boyd.
- Fletcher, R. (1987). Practical Methods of Optimization, 2e. Chichester: Wiley.
- Fliege, H., Becker, J., Walter, O.B. et al. (2005). Development of a computer-adaptive test for depression (D-CAT). Quality of Life Research 14 (10): 2277–2291.
- Gardner, W., Shear, K., Kelleher, K.J. et al. (2004). Computerized adaptive measurement of depression: a simulation study. BMC Psychiatry 4.
- Garwood, F. (1941). The application of maximum likelihood to dosage-mortality curves. Biometrika 32 (1): 46.
10.1093/biomet/32.1.46 Google Scholar
- Gauss, C.F. (1809). Theoria Motus Corporum Coelestium in Sectionibus Conicis Solem Ambientium. Perthes et Besser.
- Gibbons, R.D. and Amatya, A. (2015). Statistical Methods for Drug Safety. Boca Raton, FL: Chapman and Hall.
10.1201/b18698 Google Scholar
- Gibbons, R.D. and Cai, L. (2017). Dimensionality Analysis From: Handbook of Item Response Theory: Applications, vol. 3. CRC Press.
- Gibbons, R.D. and Hedeker, D.R. (1992). Full-information item bi-factor analysis. Psychometrika 57 (3): 423–436.
- Gibbons, R.D. and Lavigne, J.V. (1998). Emergence of childhood psychiatric disorders: a multivariate probit analysis. Statistics in Medicine 17 (21): 2487–2499.
10.1002/(SICI)1097-0258(19981115)17:21<2487::AID-SIM937>3.0.CO;2-2 CASPubMedWeb of Science®Google Scholar
- Gibbons, R.D. and Wilcox-Gök, V. (1998). Health service utilization and insurance coverage: a multivariate probit analysis. Journal of the American Statistical Association 93 (441): 63–72.
- Gibbons, R.D., Bock, R.D., Hedeker, D. et al. (2007a). Full-information item bifactor analysis of graded response data. Applied Psychological Measurement 31 (1): 4–19.
- Gibbons, R.R.D., Immekus, J.J.C., and Bock, R.D. (2007b). The added value of multidimensional IRT models. Multidimensional and Hierarchical Modeling Monograph 1 (312): 1–49.
- Gibbons, R.D., Weiss, D.J., Kupfer, D.J. et al. (2008). Using computerized adaptive testing to reduce the burden of mental health assessment. Psychiatric Services 59 (4): 361–368.
- Gibbons, R.D., Weiss, D.J., Pilkonis, P.A. et al. (2012). Development of a computerized adaptive test for depression. Archives of General Psychiatry 69 (11): 1104–1112.
- Gibbons, R.D., Weiss, D.J., Pilkonis, P.A. et al. (2014). Development of the CAT-ANX: a computerized adaptive test for anxiety. American Journal of Psychiatry 171 (2): 187–194.
- Gibbons, R.D., Weiss, D.J., Frank, E., and Kupfer, D. (2016). Computerized adaptive diagnosis and testing of mental health disorders. Annual Review of Clinical Psychology 12 (1): 83–104.
- Gibbons, R.D., Kupfer, D., Frank, E. et al. (2017). Development of a computerized adaptive test suicide scale-The CAT-SS. Journal of Clinical Psychiatry 78 (9): 1376–1382.
- Gibbons, R.D., Alegría, M., Cai, L. et al. (2018). Successful validation of the CAT-MH scales in a sample of Latin American migrants in the United States and Spain. Psychological Assessment 30 (10): 1267–1276.
- Gibbons, R.D., Kupfer, D.J., Frank, E. et al. (2019). Computerized adaptive tests for rapid and accurate assessment of psychopathology dimensions in youth. Journal of the American Academy of Child & Adolescent Psychiatry. 1264–1273.
- Gibbons, R.D., Alegria, M., Markle, S. et al. (2020). Development of a computerized adaptive substance use disorder scale for screening and measurement: the CAT-SUD. Addiction 115 (7): 1382–1394.
- Gilks, W.R., Roberts, G.O., and Sahu, S.K. (1998). Adaptive markov chain monte carlo through regeneration. Journal of the American Statistical Association 93 (443): 1045–1054.
- Gill, P. and Murray, W. (1974). Numerical Methods for Constrained Optimization. New York: Academic Press.
- Glas, C.A. (1998). Detection of differential item functioning using lagrange multiplier tests. Statistica Sinica 8 (3): 647–667.
- Goff, A., Rose, E., Rose, S., and Purves, D. (2007). Does PTSD occur in sentenced prison populations? A systematic literature review. Criminal Behaviour and Mental Health 17 (3): 152–162.
- Goldstein, H. (1983). Measuring changes in educational attainment over time: problems and possibilities. Journal of Educational Measurement 20 (4): 369–377.
- Goodman, L.A. (1968). The analysis of cross-classified data: independence, quasi-independence, and interactions in contingency tables with or without missing entries. Journal of the American Statistical Association 63 (324): 1091–1131.
- Guilford, J. (1954). The constant methods. In: Psychometric Methods, 2e, 597. New York: McGraw-Hill.
- Guinart, D., de Filippis, R., Rosson, S. et al. (2020). Development and validation of a computerized adaptive assessment tool for discrimination and measurement of psychotic symptoms. Schizophrenia Bulletin 9: sbaa168.
- Gulliksen, H. (1950). Theory of Mental Tests. Wiley.
10.1037/13240-000 Google Scholar
- Gumbel, E.J. (1961). Bivariate logistic distributions. Journal of the American Statistical Association 56 (294): 335–349.
- Gupta, S.S. (1963). Probability integrals of multivariate normal and multivariate t. The Annals of Mathematical Statistics 34 (3): 792–828.
- Guttman, L. (1945). A basis for analyzing test-retest reliability. Psychometrika 10 (4): 255–282.
- Haberman, S.J. (1977). Log-linear models and frequency tables with small expected cell counts. Annals of Statistics 5: 1148–1169.
- Haberman, S. (1978). Analysis of Qualitative Data: Introductory Topics, vol. 1. New York: Academic Press, Incorporated.
- Haberman, S. (1979). Analysis of Qualitative Data: New Developments, vol. 2. New York: Academic Press, Incorporated.
- Haffajee, R.L. and Frank, R.G. (2018). Making the opioid public health emergency effective. JAMA Psychiatry 75 (8): 767–768.
- Haggard, E. (1958). Intraclass Correlation and the Analysis of Variance. New York: Dryden Press.
- Haley, D.C. (1952). Estimation of the Dosage Mortality Relationship When the Dose is Subject to Error. Technical report. Technical Report No. 15 (Office of Naval Research Contract No 25140, NR 342-022). Stanford, CA: Stanford University Applied Mathematics and Statistics Labs.
- Hambleton, R.K. and Swaminathan, H. (1985). Item Response Theory: Principles and Applications. Boston, MA: Kluwer-Nijhoff.
10.1007/978-94-017-1988-9 Google Scholar
- Hamilton, M. (1960). A rating scale for depression. Journal of Neurology, Neurosurgery, and Psychiatry 23: 56–62.
- Han, K.T. (2012). SimulCAT: windows software for simulating computerized adaptive test administration. Applied Psychological Measurement 36 (1): 64–66.
- Harman, H. (1967). Modern Factor Analysis. Chicago: University of Chicago Press.
- Hastings, C. (1955). Approximations for Digital Computers. Princeton, NJ: Princeton University Press.
10.1515/9781400875597 Google Scholar
- Hathaway, R.J. (1985). A constrained formulation of maximum-likelihood estimation for normal mixture distributions. The Annals of Statistics 13 (2): 795–800.
- Hausman, J.A. and Wise, D.A. (1978). A conditional probit model for qualitative choice: discrete decisions recognizing interdependence and heterogeneous preferences. Econometrica, Econometric Society 46 (2): 403–426.
- Hawton, K. and Fagg, J. (1992). Trends in deliberate self poisoning and self injury in Oxford, 1976–90. British Medical Journal 304 (6839): 1409–1411.
- Hedeker, D. (1989). Random regression models with autocorrelated errors. Unpublished PhD dissertation. University of Chicago.
- Hedeker, D. and Gibbons, R.D. (1996). MIXOR: a computer program for mixed-effects ordinal probit and logistic regression analysis. Computer Methods and Programs in Biomedicine 49: 157–176.
- Hedeker, D. and Gibbons, R. (2006). Longitudinal Data Analysis. New York: Wiley.
- Hedeker, D., Mermelstein, R.J., and Flay, B.R. (2006). Application of item response theory models for intensive longitudinal data. In: Models for Intensive Longitudinal Data (ed. T.A. Walls and J.L. Schafer), 84–108. Oxford: Oxford University Press.
10.1093/acprof:oso/9780195173444.003.0004 Google Scholar
- Henderson, H.V. and Searle, S.R. (1979). Vec and vech operators for matrices, with some uses in Jacobians and multivariate statistics. Canadian Journal of Statistics 7 (1): 65–81.
10.2307/3315017 Google Scholar
- Hendrickson, A.E. and White, P.O. (1964). Promax: a quick method for rotation to oblique simple structure. British Journal of Statistical Psychology 17 (1): 65–70.
- Holland, P. and Wainer, H. (1993). Differential Item Functioning. Hillsdale, NJ: Lawrence Erlbaum Associates.
- Holland, P.W., Dorans, N.J., and Petersen, N.S. (2006). Equating test scores. C. R. Rao and S. Sinharay. In: Handbook of Statistics, vol. 26, 169–203. Amsterdam, Netherlands: Elsevier.
- Holzinger, K.J. and Swineford, F. (1937). The Bi-factor method. Psychometrika 2 (1): 41–54.
10.1007/BF02287965 Google Scholar
- Householder, A.S. (1953). Principles of Numerical Analysis. New York: McGraw-Hill.
- Householder, A.S. (1964). Theory of Matrices in Numerical Analysis. New York: Blaisdell.
- Jeon, M., Rijmen, F., and Rabe-Hesketh, S. (2013). Modeling differential item functioning using a generalization of the multiple-group bifactor model. Journal of Educational and Behavioral Statistics 38 (1): 32–60.
- Joe, H. and Maydeu-Olivares, A. (2010). A general family of limited information goodness-of-fit statistics for multinomial data. Psychometrika 75 (3): 393–419.
- Joiner, T. (2010). Myths about suicide. Choice Reviews Online 48 (03): 48–1761.
- Jöreskog, K.G. (1969). A general approach to confirmatory maximum likelihood factor analysis. Psychometrika 34 (2): 183–202.
- Jöreskog, K.G. (1979). Basic ideas of factor and component analysis. In: Advances in Factor Analysis and Structural Equation Models, 5–20. Cambridge: Abt Books.
- Jöreskog, K.G. (1994). On the estimation of polychoric correlations and their asymptotic covariance matrix. Psychometrika 59: 381–390.
- Kaiser, H.F. (1958). The varimax criterion for analytic rotation in factor analysis. Psychometrika 23: 187–200.
- Kalb, L.G., Stapp, E.K., Ballard, E.D. et al. (2019). Trends in psychiatric emergency department visits among youth and young adults in the US. Pediatrics 143 (4).
- Kelley, T. (1947). Fundamentals of Statistics. Cambridge: Harvard University Press.
- Kennedy, W.J. and Gentle, E.J. (1980). Statistical Computing. New York: Marcel Dekker.
- Kiefer, J. and Wolfowitz, J. (1956). Consistency of the maximum likelihood estimator in the presence of infinitely many incidental parameters. The Annals of Mathematical Statistics 27 (4): 887–906.
- Kim, J.J., Silver, R.K., Elue, R. et al. (2016). The experience of depression, anxiety, and mania among perinatal women. Archives of Women's Mental Health 19 (5): 883–890.
- King, C.A., Brent, D., Grupp-Phelan, J. et al. (2021). The computerized adaptive screen for suicidal youth (CASSY) development and independent validation. JAMA Psychiatry, published online ahead of print.
- Kingsbury, G.G. and Weiss, D.J. (1980). An Alternate-Forms Reliability and Concurrent Validity Comparison of Bayesian Adaptive and Conventional Ability Tests. Research Report 80-5, Computerized Adaptive Testing Laboratory. Minneapolis, MN: University of Minnesota.
- Kingsbury, G.G. and Weiss, D.J. (1983). A comparison of IRT-based adaptive mastery testing and a sequential mastery testing procedure. D. J. Weiss. In: New Horizons in Testing, 257–283. New York: Academic Press.
10.1016/B978-0-12-742780-5.50024-X Google Scholar
- Kolakowski, D. and Bock, R.D. (1981). A multivariate generalization of probit analysis. Biometrics 37: 541–551.
- Kolen, M.J. (2006). Scaling and norming. In: Educational Measurement (ed. R.L. Brennan), 155–186. Westport, CT: American Council on Education/Prager.
- Kroenke, K., Spitzer, R.L., and Williams, J.B. (2001). The PHQ-9: validity of a brief depression severity measure. Journal of General Internal Medicine 16 (9): 606–613.
- La Porte, L.M., Kim, J.J., Adams, M.G. et al. (2020). Feasibility of perinatal mood screening and text messaging on patients' personal smartphones. Archives of Women's Mental Health 23 (2): 181–188.
- Laird, N. (1978). Nonparametric maximum likelihood estimation of a mixing distribution. Journal of the American Statistical Association 73 (364): 805–811.
- Lawley, D.N. (1943). XXIII.On Problems connected with item selection and test construction. Proceedings of the Royal Society of Edinburgh. Section A. Mathematical and Physical Sciences 61 (3): 273–287.
10.1017/S0080454100006282 Google Scholar
- Lazarsfeld, P. (1958). Evidence and inference in social research. Daedalus 87 (4): 99–130.
- Lazarsfeld, P. (1959). Latent structure analysis. In: Psychology: A Study of Science, S. Koch, 476–543. New York: McGraw-Hill.
- Lehman, A.F. (1988). A quality of life interview for the chronically mentally ill. Evaluation and Program Planning 11 (1): 51–62.
- Lehmann, E. and Casella, G. (1998). Theory of Point Estimation. Springer-Verlag.
- Leung, C.-K., Chang, H.-H., and Hau, K.-T. (2003). Computerized adaptive testing: a comparison of three content balancing methods. The Journal of Technology, Learning and Assessment 2 (5): 1–15.
- Likert, R. (1932). A technique for the measurement of attitudes. Archives of Psychology 22: 5–55.
- Lima Passos, V., Berger, M.P.F., and Tan, F.E. (2007). Test design optimization in CAT early stage with the nominal response model. Applied Psychological Measurement 31 (3): 213–232.
- Lindquist, E. (1953). Design and Analysis of Experiments in Psychology and Education. Boston, MA: Houghton Miffin.
- Linn, R.L., Rock, D.A., and Cleary, T.A. (1969). The development and evaluation of several programmed testing methods. Educational and Psychological Measurement 29 (1): 129–146.
- Linn, R., Levine, M., Hastings, C., and Wardrop, J. (1980). An Investigation of Item Bias in a Test of Reading Comprehension. Technical report (Technical Report No. 163). Urbana, IL: Center for the Study of Reading, University of Illinois.
- Longford, N. (1987). A fast scoring algorithm for maximum likelihood estimation in unbalanced mixed models with nested random effects. Biometrika 74 (4): 817–827.
- Lord, F. (1952). A theory of test scores. Psychometric Monographs 7: 84.
- Lord, F.M. (1953). The relation of test score to the trait underlying the test. Educational and Psychological Measurement 13 (4): 517–549.
- Lord, F.M. (1962). Estimating norms by item-sampling. Educational and Psychological Measurement 22 (2): 259–267.
- Lord, F.M. (1968). Some Test Theory for Tailored Testing. Research Bulletin, 69/4. Princeton, NJ: Educational Testing Service.
10.1002/j.2333-8504.1968.tb00562.x Google Scholar
- Lord, F.M. (1980). Applications of Item Response Theory to Practical Testing Problems. Mahwah, NJ: Lawrence Erlbaum Associates.
- Lord, F.M. (1983). Unbiased estimators of ability parameters, of their variance, and of their parallel-forms reliability. Psychometrika 48: 233–245.
- Lord, F.M. and Novick, M.R. (1968). Statistical Theories of Mental Test Scores. Reading, MA: Addison-Wesley Pub. Co.
- Louis, T.A. (1982). Finding the observed information matrix when using the EM algorithm. Journal of the Royal Statistical Society: Series B (Methodological) 44 (2): 226–233.
- Luce, R.D. (1959). On the possible psychophysical laws. Psychological Review 66 (2): 81–95.
- Magnus, J. and Neudecker, H. (1988). Matrix Differential Calculus with Applications in Statistics and Econometrics. New York: Wiley.
10.2307/2531754 Google Scholar
- Martínez-Plumed, F., Prudêncio, R.B., Martínez-Usó, A., and Hernández-Orallo, J. (2016). Making sense of item response theory in machine learning. Frontiers in Artificial Intelligence and Applications 285: 1140–1148.
- Masters, G.N. (1982). A Rasch model for partial credit scoring. Psychometrika 47 (2): 149–174.
- Mathers, C.D. and Loncar, D. (2006). Projections of global mortality and burden of disease from 2002 to 2030. PLoS Medicine 3 (11): 2011–2030.
- Matías-Carrelo, L.E., Chávez, L.M., Negrón, G. et al. (2003). The Spanish translation and cultural adaptation of five mental health outcome measures. Culture, Medicine and Psychiatry 27 (3): 291–313.
- Maydeu-Olivares, A. and Joe, H. (2005). Limited- and full-information estimation and goodness-of-fit testing in 2n contingency tables: a unified framework. Journal of the American Statistical Association 100 (471): 1009–1020.
- McBride, J.R. and Martin, J.T. (1983). Reliability and validity of adaptive ability tests in a military setting. In: New Horizons in Testing (ed. D.J. Weiss), 223–236. New York: Academic Press.
- McFadden, D. (1973). Conditional logit analysis of qualitative choice behavior. In: Frontiers in Econometrics (ed. P. Zarembka), 105–142. New York: Academic Press.
- Meredith, W. (1993). Measurement invariance, factor analysis and factorial invariance. Psychometrika 58 (4): 525–543.
- Mislevy, R.J. (1983). Item response models for grouped data. Journal of Educational Statistics 8 (4): 271–288.
10.3102/10769986008004271 Google Scholar
- Mislevy, R.J. (1984). Estimating latent distributions. Psychometrika 49 (3): 359–381.
- Mislevy, R.J. and Verhelst, N. (1990). Modeling item responses when different subjects employ different solution strategies. Psychometrika 55 (2): 195–215.
- Mitchell, A., Vaze, A., and Rao, S. (2009). Clinical diagnosis of depression in primary care: a meta-analysis. The Lancet 374 (9690): 609–619.
- Mosteller, F. and Tukey, J.W. (1977). Data Analysis and Regression: A Second Course in Statistics. Reading, MA: Addison-Wesley Pub. Co.
- Moustaki, I. (2000). A latent variable model for ordinal variables. Applied Psychological Measurement 24 (3): 211–223.
- Moustaki, I., Jöreskog, K.G., and Mavridis, D. (2004). Factor models for ordinal variables with covariate effects on the manifest and latent variables: a comparison of LISREL and IRT approaches. Structural Equation Modeling 11 (4): 487–513.
- Mulder, J. and van der Linden, W.J. (2009). Multidimensional adaptive testing with optimal design criteria for item selection. Psychometrika 74 (2): 273–296.
- Muraki, E. (1990). Fitting a polytomous item response model to likert-type data. Applied Psychological Measurement 14 (1): 59–71.
- Muraki, E. (1992). A generalized partial credit model: application of an EM algorithm. ETS Research Report Series 1992 (1): i–30.
10.1002/j.2333-8504.1992.tb01436.x Google Scholar
- Mustanski, B. and Espelage, D.L. (2020). Why are we not closing the gap in suicide disparities for sexual minority youth? Pediatrics 145 (3).
- Mustanski, B., Whitton, S., Newcomb, M. et al. Predicting suicidality using a computer adaptive test: Two longitudinal studies of sexual and gender minority youth. Journal of Clinical and Consulting Psychology, in press.
- Muthén, B. (1979). A structural probit model with latent variables. Journal of the American Statistical Association 74 (368): 807–811.
- Muthén, B.O. (1989). Latent variable modeling in heterogeneous populations. Psychometrika 54 (4): 557–585.
- Nandakumar, R. and Stout, W.F. (1993). Refinements of Stout's procedure for assessing latent trait unidimensionality. Journal of Educational Statistics 18: 41–68.
- National Institute of Mental Health (2017). Ask Suicide-Screening Questions (ASQ) Toolkit. https://www.nimh.nih.gov/research/research-conducted-at-nimh/asq-toolkit-materials/index.shtml (accessed 28 October 2020).
- National Leadership Forum on Behavioral Health/Criminal Justice Services (2009). Ending an American Tragedy: Addressing the Needs of Justice-Involved People with Mental Illnesses and Co-Occurring Disorders. Technical report. https://www.usf.edu/cbcs/mhlp/tac/documents/behavioral-healthcare/samh/ending-an-american-tragedy.pdf (accessed 28 October 2020).
- Naylor, J.C. and Smith, A.F.M. (1982). Applications of a method for the efficient computation of posterior distributions. Applied Statistics 31 (3): 214.
- Neyman, J. and Scott, E.L. (1948). Consistent estimates based on partially consistent observations. Econometrica 16 (1): 1.
- Nishisato, S. and Nishisato, I. (1994). Dual Scaling in a Nutshell. Toronto: MicroStats.
- Nunnally, J. (1967). Psychometric Theory. New York: McGraw Hill.
- Overall, J.E. and Gorham, D.R. (1988). The brief psychiatric rating scale (BPRS): recent developments in ascertainment and scaling. Psychopharmacology Bulletin 24 (1): 97–99.
- Pilkonis, P.A., Choi, S.W., Reise, S.P. et al. (2011). Item banks for measuring emotional distress from the patient-reported outcomes measurement information system (PROMIS®): depression, anxiety, and anger. Assessment 18 (3): 263–283.
- Priester, M.A., Browne, T., Iachini, A. et al. (2016). Treatment access barriers and disparities among individuals with co-occurring mental health and substance use disorders: an integrative literature review. Journal of Substance Abuse Treatment 61: 47–59.
- Radloff, L.S. (1977). The CES-D scale: a self-report depression scale for research in the general population. Applied Psychological Measurement 1 (3): 385–401.
10.1177/014662167700100306 Google Scholar
- Rao, C.R. (1973). Linear Statistical Inference and Its Applications, 2e. New York: Wiley.
10.1002/9780470316436 Google Scholar
- Rasch, G. (1960). Studies in Mathematical Psychology: I. Probabilistic Models for Some Intelligence and Attainment Tests. Copenhagen: Nielsen & Lydiche.
- Rasch, G. (1961). On general laws and the meaning of measurement in psychology. Proceedings of the 4th Berkeley Symposium on Mathematical Statistics and Probability, Volume 4: Contributions to Biology and Problems of Medicine, pp. 321–334.
- Reckase, M.D. (2009). Multidimensional Item Response Theory. New York: Springer.
10.1007/978-0-387-89976-3 Google Scholar
- Reckase, M.D. and McKinley, R.L. (1991). The discriminating power of items that measure more than one dimension. Applied Psychological Measurement 15: 361–374.
- Reise, S.P. and Waller, N.G. (1993). Traitedness and the assessment of response pattern scalability. Journal of Personality and Social Psychology 65 (1): 143–151.
- Rijmen, F., Tuerlinckx, F., De Boeck, P., and Kuppens, P. (2003). A nonlinear mixed model framework for item response theory. Psychological Methods 8 (2): 185–205.
- Roy, S.N. (1957). Some Aspects of Multivariate Analysis. Kolkata: Statistical Publishing Society.
- Rubin, D.B. (1976). Inference and missing data. Biometrika 63 (3): 581–592.
- Samejima, F. (1969). Estimation of latent ability using a response pattern of graded scores. Psychometrika 35 (1): 139.
10.1007/BF02290599 Google Scholar
- Samejima, F. (1972). A general model for free-response data. Psychometrika Monograph Supplement 37 (1): 68.
- Samejima, F. (1979). A New Family of Models for the Multiple Choice Item. Technical report (Research Report No. 79-4). Tennessee University Knoxville Department of Psychology.
- SAMHSA (2016). National Survey on Drug Use and Health. Technical report. Mental Health Services Administration. https://www.samhsa.gov/data/release/2016-national-survey-drug-use-and-health-nsduh-releases (accessed 28 October 2020).
- Sanathanan, L. and Blumenthal, S. (1978). The logistic model and estimation of latent structure. Journal of the American Statistical Association 73 (364): 794–799.
- Sani, S., Busnello, J., Kochanski, R. et al. (2017). High-frequency measurement of depressive severity in a patient treated for severe treatment-resistant depression with deep-brain stimulation. Translational Psychiatry 7 (8): e1207.
- Savage, L.J. (1954). The Foundations of Statistics. New York: Dover Press.
- Schwarz, Gideon (1978). Estimating the dimension of a model. Annals of Statistics 6 (2): 461–464.
- Schilling, S. and Bock, R.D. (2005). High-dimensional maximum marginal likelihood item factor analysis by adaptive quadrature. Psychometrika 70 (3): 533–555.
- Schott, J.R. (1997). Matrix Analysis for Statistics. Hoboken, NJ: Wiley.
- Schroeder, E. (1945). On Measurement of Motor Skills; An Approach Through a Statistical Analysis of Archery Scores. Oxford: King's Crown Press.
10.7312/schr92192 Google Scholar
- Segall, D.O. (1996). Multidimensional adaptive testing. Psychometrika 61 (2): 331–354.
- Segall, D.O. (2000). Principles of multidimensional adaptive testing. In: Computerized Adaptive Testing: Theory and Practice (ed. W.J. van der Linden and C.A.W. Glas), pp. 53–73. Boston, MA: Kluwer Academic.
10.1007/0-306-47531-6_3 Google Scholar
- Seo, D.G. and Weiss, D.J. (2015). Best design for multidimensional computerized adaptive testing with the bifactor model. Educational and Psychological Measurement 75 (6): 954–978.
- Shenton, L.R. and Bowman, K.O. (1977). Maximum Likelihood Estimation in Small Samples. London: Griffin.
- Silvey, S.D. (1980). Optimal Design. London: Chapman & Hall.
10.1007/978-94-009-5912-5 Google Scholar
- Skrondal, A. and Rabe-Hesketh, S. (2004). Generalized Latent Variable Modeling: Multilevel, Longitudinal, and Structural Equation Models. London: Chapman and Hall/CRC.
10.1201/9780203489437 Google Scholar
- Smith, M.R., Martinez, T., and Giraud-Carrier, C. (2014). An instance level analysis of data complexity. Machine Learning 95 (2): 225–256.
- Spearman, C. (1904). Measurement of association, Part II. Correction of 'systematic deviations'. American Journal of Psychology 15: 88–101.
10.2307/1412159 Google Scholar
- Spearman, C. (1907). Demonstration of formulae for true measurement of correlation. The American Journal of Psychology 18 (2): 161.
10.2307/1412408 Google Scholar
- Spielberger, C.D., Gorsuch, R.L., Lushene, R., et al. (1983). Manual for the State-Trait Anxiety Inventory. Palo Alto, CA: Consulting Psychologists Press.
- Stan, A.D., Tamminga, C.A., Han, K. et al. (2020). Associating psychotic symptoms with altered brain anatomy in psychotic disorders using multidimensional item response theory models. Cerebral Cortex 30 (5): 2939–2947.
- Steadman, H.J., Osher, F.C., Robbins, P.C. et al. (2009). Prevalence of serious mental illness among jail inmates. Psychiatric Services 60 (6): 761–765.
- Stevens, S.S. (1961). To honor Fechner and repeal his law. Science 133 (3446): 80–86.
- Stewart, W.F., Ricci, J.A., Chee, E. et al. (2003). Cost of lost productive work time among US workers with depression. Journal of the American Medical Association 289 (23): 3135–3144.
- Stocking, M.L. (1988). Scale drift in on-line calibration. ETS Research Report Series 1988 (1): i–122.
10.1002/j.2330-8516.1988.tb00313.x Google Scholar
- Stocking, M.L. and Lord, F.M. (1983). Developing a common metric in item response theory. Applied Psychological Measurement 7 (2): 201–210.
- Stout, W. (1987). A nonparametric approach for assessing latent trait unidimensionality. Psychometrika 52 (4): 589–617.
- Stout, W. (1990). A new item response theory modeling approach with applications to unidimensional assessment and ability estimation. Psychometrika 55: 293–326.
- Stroud, A.H. and Secrest, D. (1966). Gaussian Quadrature Formulas. Englewood Cliffs, NJ: Prentice Hall.
- Stuart, A. (1958). Equally correlated variates and the multinormal integral. Journal of the Royal Statistical Society: Series B (Methodological) 20: 373–378.
- Swaminathan, H. and Rogers, H. (1990). Detecting differential item functioning using logistic regression procedures. Journal of Educational Measurement 27 (4): 361–370.
- Tamminga, C.A., Ivleva, E.I., Keshavan, M.S. et al. (2013). Clinical phenotypes of psychosis in the Bipolar-Schizophrenia network on intermediate phenotypes (B-SNIP). American Journal of Psychiatry 170 (11): 1263–1274.
- Thissen, D. (1982). Marginal maximum likelihood estimation for the one-parameter logistic model. Psychometrika 47 (2): 175–186.
- Thissen, D. (1991). MULTILOG User's Guide: Multiple, Categorical Item Analysis and Test Scoring Using Item Response Theory. Lincolnwood, IL: Scientific Software, Inc.
- Thissen, D. and Steinberg, L. (1984). A response model for multiple choice items. Psychometrika 49 (4): 501–519.
- Thissen, D. and Steinberg, L. (1986). A taxonomy of item response models. Psychometrika 51 (4): 567–577.
- Thissen, D., Steinberg, L., and Wainer, H. (1993). Detection of differential item functioning using the parameters of item response models. In: Differential Item Functioning, 67–113. Hillsdale NJ: Lawrence Erlbaum Associates.
- Thissen, D., Cai, L., and Bock, R.D. (2010). The nominal categories item response model. In: Handbook of Polytomous Item Response Theory Models (ed. M.L. Nering and R. Ostini), 43–75. Routledge/Taylor & Francis Group.
- Thurstone, L.L. (1927). Psychophysical analysis. The American Journal of Psychology 38 (3): 368–389.
10.2307/1415006 Google Scholar
- Thurstone, L.L. (1928). Attitudes can be measured. American Journal of Sociology 33 (4): 529–554.
- Thurstone, L.L. (1929). Theory of attitude measurement. Psychological Review 36 (3): 222–241.
10.1037/h0070922 Google Scholar
- Thurstone, L.L. (1947). Multiple Factor Analysis. Chicago, IL: University of Chicago Press.
- Thurstone, L.L. (1959). The Measurement of Values. Chicago, IL: University of Chicago Press.
- Tsai, J.; Gu, X. (1958). Utilization of addiction treatment among U.S. adults with history of incarceration and substance use disorders. Addiction Science & Clinical Practice 14 (9).
- Tucker, L.R. (1958). An inter-battery method of factor analysis. Psychometrika 23 (2): 111–136.
- Vale, C.D. and Weiss, D.J. (1977). A Rapid Item-Search Procedure for Bayesian Adaptive Testing 77-4. Technical report 77-4. Minneapolis, MN: University of Minnesota, Department of Psychology, Psychometric Methods Program, Computerized Adaptive Testing Laboratory.
- van der Linden, W.J. (1999). Adaptive Testing with Equated Number-Correct Scoring. OMD research report; No. 99-02. Enschede, Netherlands: University of Twente, Faculty Educational Science and Technology.
- van der Linden, W.J. and Pashley, P.J. (2010). Item selection and ability estimation in adaptive testing. In: Elements of Adaptive Testing (ed. W.J. van der Linden and C.A.W. Glas), pp. 3–30. New York: Springer.
10.1007/978-0-387-85461-8 Google Scholar
- van der Linden, W.J. and Reese, L.M. (1998). A model for optimal constrained adaptive testing. Applied Psychological Measurement 22 (3): 259–270.
- Veerkamp, W.J.J. and Berger, M.P.F. (1997). Some new item selection criteria for adaptive testing. Journal of Educational and Behavioral Statistics 22 (2): 203–226.
- Wainer, H. and Mislevy, R. (1990). Item response theory, item calibration, and proficiency estimation. In: Computerized Adaptive Testing: A Primer (ed. H. Wainer), 81–99. Mahwah, NJ: Lawrence Erlbaum Associates.
- Wainer, H., Sireci, S.G., and Thissen, D. (1991). Differential testlet functioning definitions and detection. ETS Research Report Series 1991 (1): i–42.
10.1002/j.2333-8504.1991.tb01423.x Google Scholar
- Walker, H.M. and Lev, J. (1953). Statistical Inference. New York: Henry Holt & Company.
10.1037/11773-000 Google Scholar
- Warm, T.A. (1989). Weighted likelihood estimation of ability in item response theory. Psychometrika 54 (3): 427–450.
- Weiss, D.J. (1985). Adaptive testing by computer. Journal of Consulting and Clinical Psychology 53 (6): 774–789.
- Weiss, D.J. and Kingsbury, G.G. (1984). Application of computerized adaptive testing to educational problems. Journal of Educational Measurement 21 (4): 361–375.
- Weiss, D.J. and McBride, J.R. (1984). Bias and information of Bayesian adaptive testing. Applied Psychological Measurement 8 (3): 273–285.
- Whooley, M.A. (2012). Diagnosis and treatment of depression in adults with comorbid medical conditions: a 52-year-old man with depression. Journal of the American Medical Association 307 (17): 1848–1857.
- Woods, C.M., Cai, L., and Wang, M. (2013). The Langer-improved Wald test for DIF testing with multiple groups: evaluation and comparison to two-group IRT. Educational and Psychological Measurement 73 (3): 532–547.
- Yao, L. (2012). Multidimensional CAT item selection methods for domain scores and composite scores: theory and applications. Psychometrika 77: 495–523.
- Yao, L. (2013). Comparing the performance of five multidimensional CAT selection procedures with different stopping rules. Applied Psychological Measurement 37: 3–23.
- Yi, Q. and Chang, H. (2003). A-stratified CAT design with content blocking. British Journal of Mathematical and Statistical Psychology 56: 359–378.
- Zhang, J. and Stout, W. (1999). Conditional covariance structure of generalized compensatory multidimensional items. Psychometrika 64 (2): 129–152.
- Zimowski, M.F., Muraki, E., Mislevy, R.J., and Bock, R.D. (1996). BILOG-MG: Multiple-Group IRT Analysis and Test Maintenance for Binary Items. Chicago, IL: Scientific Software International.