Bibliography

Book Author(s): R. Darrell Bock, Robert D. Gibbons

First published: 02 July 2021

https://doi.org/10.1002/9781119716723.biblio

References

Achtyes, E.D., Halstead, S., Smart, L.A. et al. (2015). Validation of computerized adaptive testing in an outpatient nonacademic setting: the VOCATIONS trial. Psychiatric Services 66 (10): 1091–1096.
10.1176/appi.ps.201400390
Ackerman, T.A. (1994). Using multidimensional item response theory to understand what items and tests are measuring. Applied Measurement in Education 7 (4): 255–278.
10.1207/s15324818ame0704_1
Ackerman, T.A. (1996). Graphical representation of multidimensional item response theory analysis. Applied Psychological Measurement 20: 311–329.
10.1177/014662169602000402
Aitchison, J. and Silvey, S.D. (1958). Maximum-likelihood estimation of parameters subject to restraints. The Annals of Mathematical Statistics 29 (3): 813–828.
10.1214/aoms/1177706538
Alegría, M., Alvarez, K., Ishikawa, R.Z. et al. (2016). Removing obstacles to eliminating racial and ethnic disparities in behavioral health care. Health Affairs 35 (6): 991–999.
10.1377/hlthaff.2016.0029
Andersen, E.B. (1977). Sufficient statistics and latent trait models. Psychometrika 42 (1): 69–81.
10.1007/BF02293746
Andersen, E.B. (1980). Discrete Statistical Models with Social Science Applications. Amsterdam: North Holland.

Andersen, E. and Madsen, M. (1977). Estimating the parameters of the latent population distribution. Psychometrika 42 (3): 357–374.
10.1007/BF02293656
Anderson, T.W. (1984). An Introduction to Multivariate Statistical Analysis, 2e. New York: Wiley.

Andreasen, N.C. (1984). The Scale for the Assessment of Positive Symptoms (SAPS). Iowa City, IA: University of Iowa.

Andrich, D. (1978). Application of a psychometric rating model to ordered categories which are scored with successive integers. Applied Psychological Measurement 2 (4): 581–594.
10.1177/014662167800200413
Andrich, D. (1988). A general form of Rasch's extended logistic model for partial credit scoring. Applied Measurement in Education 1 (4): 363–378.
10.1207/s15324818ame0104_7
Anscombe, F.J. (1956). On estimating binomial response relations. Biometrika 43 (3/4): 461.
10.2307/2332926
Ashford, J. and Sowden, R.R. (1970). Multi-variate probit analysis. Biometrics 26 (3): 535–546.
10.2307/2529107
Baek, S.-G. (1997). Computerized adaptive testing using the partial credit model for attitude measurement. In: Objective Measurement: Theory Into Practice (ed. M. Wilson, G. Engelhard, and K. Draney), 37–55.

Baker, F.B. (1992). Item Response Theory: Parameter Estimation Techniques. New York: Marcel Dekker.

Ban, J.C., Hanson, B.A., Yi, Q., and Harris, D.J. (2002). Data sparseness and on-line pretest item calibration-scaling methods in CAT. Journal of Educational Measurement 39 (3): 207–218.
10.1111/j.1745-3984.2002.tb01174.x
Ban, J.C., Hanson, B.A., Wang, T. et al. (2006). A comparative study of online pretest item calibration/scaling methods in computerized adaptive testing. American Educational Research Association 38 (3): 191–212.

Bartholomew, D.J. and Tzamourani, P. (1999). The goodness of fit of latent trait models in attitude measurement. Sociological Methods and Research 27 (4): 525–546.
10.1177/0049124199027004003
Beiser, D., Vu, M., and Gibbons, R. (2016). Test-retest reliability of a computerized adaptive depression screener. Psychiatric Services 67 (9): 1039–1041.
10.1176/appi.ps.201500304
Beiser, D.G., Ward, C.E., Vu, M. et al. (2019). Depression in emergency department patients and association with health care utilization. Academic Emergency Medicine 26 (8): 878–888.
10.1111/acem.13726
Berkson, J. (1956). Estimation by least squares and by maximum likelihood. Proceedings of the Third Berkeley Symposium 1: 1–11.

Berndt, E.R., Hall, B.H., Hall, R.E., and Hausman, J.A. (1974). Estimation and inference in nonlinear structural models. Annals of Economic and Social Measurement 3 (4): 653–665.

Berona, J., Whitton, S., Newcomb, M.E. et al. Prospective risk and protective factors for the transition from suicide ideation to attempt among sexual and gender minority youth. Psychiatric Services, in press.

Birnbaum, A. (1957). Probability and Statistics in Item Analysis and Classification Problems: Efficient Design and Use of Tests of Mental Ability for Various Decision-making. Technical report, Ser. Rep. No. 15. Randolph Air Force Base, TX: USAF School of Aviation Medicine.

Birnbaum, A. (1958a). Further Considerations of Efficiency in Tests of a Mental Ability. Technical report, Ser. Rep. No. 17. Randolph Air Force Base, TX: USAF School of Aviation Medicine.

Birnbaum, A. (1958b). On the Estimation of Mental Ability. Technical report, Ser. Rep. No. 17. Randolph Air Force Base, TX: USAF School of Aviation Medicine.

Birnbaum, A. (1968). Some latent trait models and their use in inferring an examinee's ability. In: Statistical Theories of Mental Test Scores (ed. F.M. Lord and M.R. Novick), 397–479. Reading, MA: Addison-Wesley.

Bishop, Y.M., Holland, P.W., and Fienberg, S.E. (1975). Discrete Multivariate Analysis Theory and Practice. Cambridge, MA: Massachusetts Institute of Technology Press.

Black, D.W., Gunter, T., Loveless, P. et al. (2010). Antisocial personality disorder in incarcerated offenders: psychiatric comorbidity and quality of life. Annals of Clinical Psychiatry 22 (2): 113–120.

Bliss, C.I. (1935). The calculation of the dosage-mortality curve. Annals of Applied Biology 22 (1): 134–167.
10.1111/j.1744-7348.1935.tb07713.x
Bock, R. (1972). Estimating item parameters and latent ability when responses are scored in two or more nominal categories. Psychometrika 37 (1): 29–51.
10.1007/BF02291411
Bock, R. (1975). Multivariate Statistical Methods in Behavioral Research. New York: McGraw-Hill.

Bock, R.D. and Moore, E.G.J. (1986). Advantage and Disadvantage: A Profile of American Youth. Hillsdale, NJ: Erlbaum.

Bock, R. (1989a). Addendum: measurement of human variation: a two-stage model. In: Multilevel Analysis of Educational Data (ed. R.D. Bock), 319–342. Academic Press.
10.1016/B978-0-12-108840-8.50021-4
Bock, R. (1989b). Measurement of Human Variation: A Two-Stage Model. Academic Press.

Bock, R.D. (1997). The nominal categories model. In: Handbook of Modern Item Response Theory (ed. W.J. van der Linden and R.K. Hambleton), 33–49. New York: Springer.
10.1007/978-1-4757-2691-6_2
Bock, R.D. and Aitkin, M. (1981). Marginal maximum likelihood estimation of item parameters: application of an EM algorithm. Psychometrika 46 (4): 443–459.
10.1007/BF02293801
Bock, R.D. and Gibbons, R.D. (1996). High-dimensional multivariate probit analysis. Biometrics 52 (4): 1183–1194.
10.2307/2532834
Bock, R. and Gibbons, R. (2010). Factor analysis of categorical item responses. In: Handbook of Polytomous Item Response Theory Models (ed. M.L. Nering and R. Ostini), 155–184. Florence, KY: Lawrence Erlbaum.

Bock, R.D. and Jones, L.V. (1968). The Measurement and Prediction of Judgment and Choice. San Francisco, CA: Holden-Day.

Bock, R.D. and Lieberman, M. (1970). Fitting a response model for n dichotomously scored items. Psychometrika 35 (2): 179–197.
10.1007/BF02291262
Bock, R.D. and Mislevy, R.J. (1982). Adaptive EAP estimation of ability in a microcomputer environment. Applied Psychological Measurement 6 (4): 431–444.
10.1177/014662168200600405
Bock, R.D. and Schilling, S. (1997). High-dimensional full-information item factor analysis. In: Latent Variable Modeling and Applications to Causality (ed. M. Berkane), 163–176. New York: Springer.
10.1007/978-1-4612-1842-5_8
Bock, R.D. and Zimowski, M.F. (1997). Multiple group IRT. In: Handbook of Modern Item Response Theory (ed. W.J. van der Linden and R.K. Hambleton), 433–448. New York: Springer.
10.1007/978-1-4757-2691-6_25
Bock, R.D., Mislevy, R., and Woodson, C. (1982). The next stage in educational assessment. Educational Researcher 11 (3): 4–16.
10.3102/0013189X011003004
Bock, R.D., Muraki, E., and Pfeiffenberger, W. (1988). Item pool maintenance in the presence of item parameter drift. Journal of Educational Measurement 25 (4): 275–285.
10.1111/j.1745-3984.1988.tb00308.x
Bock, R.D., Thissen, D., and Zimowski, M.F. (1997). IRT estimation of domain scores. Journal of Educational Measurement 34 (3): 197–211.
10.1111/j.1745-3984.1997.tb00515.x
Böckenholt, U. (2001). Hierarchical modeling of paired comparison data. Psychological Methods 6 (1): 49–64.
10.1037/1082-989X.6.1.49
Bradley, R.A. and Terry, M.E. (1952). Rank analysis of incomplete block designs: I. The method of paired comparisons. Biometrika 39 (3/4): 324.
10.2307/2334029
Brennan, R. (2001). Generalizability Theory. New York: Springer.
10.1007/978-1-4757-3456-0
Brown, J. and Weiss, D. (1977). An Adaptive Testing Strategy for Achievement Test Batteries. Technical report (Research Rep. No. 77-6). Minneapolis, MN: University of Minnesota, Department of Psychology, Psychometric Methods Program, Computerized Adaptive Testing Laboratory.

Browne, M.W. and Cudeck, R. (1993). Alternative ways of assessing model fit. In: Testing Structural Equation Models (ed. K.A. Bollen and J.S. Long), 136–162. Beverly Hills, CA: Sage.

Cai, L. (2010). A two-tier full-information item factor analysis model with applications. Psychometrika 75: 581–612.
10.1007/s11336-010-9178-0
Cai, L. and Hansen, M. (2013). Limited-information goodness-of-fit testing of hierarchical item factor models. British Journal of Mathematical and Statistical Psychology 66 (2): 245–276.
10.1111/j.2044-8317.2012.02050.x
Cai, L., Maydeu-Olivares, A., Coffman, D.L., and Thissen, D. (2006). Limited-information goodness-of-fit testing of item response theory models for sparse 2P tables. British Journal of Mathematical and Statistical Psychology 59 (1): 173–194.
10.1348/000711005X66419
Cai, L., Thissen, D., and du Toit, S.H. (2011). IRTPRO. Lincolnwood, IL: Scientific Software International.

Camilli, G. and Shepard, L. (1994). Methods for Identifying Biased Test Items. Thousand Oaks, CA: Sage.

Chang, H.H. (2004). Understanding computerized adaptive testing: from Robbins–Monro to Lord and beyond. In: The Sage Handbook of Quantitative Methodology for the Social Sciences (ed. D. Kaplan), 117–133. Thousand Oaks, CA: Sage.
10.4135/9781412986311.n7
Chang, H.-H. and Ying, Z. (1996). A global information approach to computerized adaptive testing. Applied Psychological Measurement 20: 213–229.
10.1177/014662169602000303
Chang, H.-H. and Ying, Z. (1999). A-stratified multistage computerized adaptive testing. Applied Psychological Measurement 23 (3): 211–222.
10.1177/01466219922031338
Chang, H.-H. and Ying, Z. (2009). Nonlinear sequential designs for logistic item response theory models with applications to computerized adaptive tests. Annals of Statistics 37 (3): 1466–1488.
10.1214/08-AOS614
Chang, H.-H., Qian, J., and Ying, Z. (2001). A-stratified multistage computerized adaptive testing with b blocking. Applied Psychological Measurement 25 (4): 333–341.
10.1177/01466210122032181
Chapman, L. and Bock, R.D. (1958). Components of variance due to acquiescence and content in the F scale measure of authoritarianism. Psychological Bulletin 55 (5): 328–333.
10.1037/h0040659
Chen, S.-Y., Ankenmann, R.D., and Chang, H.-H. (2000). A comparison of item selection rules at the early stages of computerized adaptive testing. Applied Psychological Measurement 24 (3): 241–255.
10.1177/01466210022031705
Cochran, W. and Cox, G. (1957). Experimental Designs. New York: Wiley.

Cooper, B.E. (1968). Algorithm AS 2: the normal integral. Applied Statistics 17 (2): 186.
10.2307/2985683
Creedon, T.B. and Lê Cook, B. (2016). Datawatch: access to mental health care increased but not for substance use, while disparities remain. Health Affairs 35 (6): 1017–1021.
10.1377/hlthaff.2016.0098
Cronbach, L. (1970). Essentials of Psychological Testing. New York: Harper & Row.

Cronbach, L.J., Gleser, G.C., Nanda, N., and Rajaratnam, N. (1972). The Dependability of Behavioral Measurements: Theory of Generalizability for Scores and Profiles. New York: Wiley.
10.1126/science.176.4036.785
Day, N.E. (1969). Estimating the components of a mixture of normal distributions. Biometrika 56 (3): 463.
10.1093/biomet/56.3.463
Dempster, A.P., Laird, N.M., and Rubin, D.B. (1977). Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society: Series B (Methodological) 39 (1): 1–22.
10.1111/j.2517-6161.1977.tb01600.x
Dempster, A.P., Rubin, D.B., and Tsutakawa, R.K. (1981). Estimation in covariance components models. Journal of the American Statistical Association 76 (374): 341–353.
10.1080/01621459.1981.10477653
Divgi, D.R. (1979a). Calculation of the tetrachoric correlation coefficient. Psychometrika 44 (2): 169–172.
10.1007/BF02293968
Divgi, D.R. (1979b). Calculation of univariate and bivariate normal probability functions. The Annals of Statistics 7 (4): 903–910.
10.1214/aos/1176344739
Dodd, B.G., de Ayala, R.J., and Koch, W.R. (1995). Computerized adaptive testing with polytomous items. Applied Psychological Measurement 19 (1): 5–22.
10.1177/014662169501900103
Dorans, N.J., Moses, T.P., and Eignor, D.R. (2010). Principles and practices of test score equating. ETS Research Report Series 2010 (2): i–41.
10.1002/j.2333-8504.2010.tb02214.x
Dunnett, C. (1964). New tables for multiple comparisons with a control. Biometrics 20 (3): 482–491.
10.2307/2528490
DuToit, M. (2003). IRT from SSI: BILOG-MG, MULTILOG, PARSCALE, TESTFACT. Chicago, IL: Scientific Software International.

Edwards, A.L. and Thurstone, L.L. (1952). An internal consistency check for scale values determined by the method of successive intervals. Psychometrika 17 (2): 169–180.
10.1007/BF02288780
Elderon, L., Smolderen, K.G., Na, B., and Whooley, M.A. (2011). Accuracy and prognostic value of American Heart Association-recommended depression screening in patients with coronary heart disease. Circulation: Cardiovascular Quality and Outcomes 4 (5): 533–540.
10.1161/CIRCOUTCOMES.110.960302
Embretson, S. and Reise, S. (2000). Item Response Theory for Psychologists. Mahwah, NJ: Lawrence Erlbaum Associates.
10.1037/10519-153
Endicott, J. and Spitzer, R.L. (1978). A diagnostic interview: the schedule for affective disorders and schizophrenia. Archives of General Psychiatry 35 (7): 837–844.
10.1001/archpsyc.1978.01770310043002
Fechner, G.T. (1966). Elements of Psychophysics (ed. D.H. Howes and E.G. Boring). Leipzig: Breitkopf und Härtel. First published in 1860, translated by Adler, H.E.
10.1007/BF01330949
Fechner, G.T. (1860). Elemente der psychophysik. Leipzig: Breitkopf und Härtel.
10.1002/andp.18601871114
Fedorov, V.V. and Hackl, P. (1997). Model-Oriented Design of Experiments, Lecture Notes in Statistics. New York: Springer-Verlag.
10.1007/978-1-4612-0703-0
de Finetti, B.D. (1972). Probability, Induction and Statistics: The Art of Guessing. New York: Wiley.

Finney, D.J. (1952). Statistical Method in Biological Assay. New York: Hafner Publishing Co.

Finney, D.J. (1964). Probit Analysis: A Statistical Treatment of the Sigmoid Response Curve. London: Cambridge University Press.

First, M., Gibbon, M., Spitzer, R., and Williams, J.B.W. (1996). User's Guide for the Structured Clinical Interview for DSM-IV Axis I Disorders-Research Version. New York: Biometrics Research Department, New York State Psychiatric Institute.

Fisher, R.A. (1922). On the mathematical foundations of theoretical statistics. Philosophical Transactions of the Royal Society of London. Series A, Containing Papers of a Mathematical or Physical Character 222 (594–604): 309–368.
10.1098/rsta.1922.0009
Fisher, R.A. and Yates, F. (1938). Statistical Tables for Biological, Agricultural and Medical Research. London: Oliver and Boyd.

Fletcher, R. (1987). Practical Methods of Optimization, 2e. Chichester: Wiley.

Fliege, H., Becker, J., Walter, O.B. et al. (2005). Development of a computer-adaptive test for depression (D-CAT). Quality of Life Research 14 (10): 2277–2291.
10.1007/s11136-005-6651-9
Gardner, W., Shear, K., Kelleher, K.J. et al. (2004). Computerized adaptive measurement of depression: a simulation study. BMC Psychiatry 4.
10.1186/1471-244X-4-13
Garwood, F. (1941). The application of maximum likelihood to dosage-mortality curves. Biometrika 32 (1): 46.
10.1093/biomet/32.1.46
Gauss, C.F. (1809). Theoria Motus Corporum Coelestium in Sectionibus Conicis Solem Ambientium. Perthes et Besser.

Gibbons, R.D. and Amatya, A. (2015). Statistical Methods for Drug Safety. Boca Raton, FL: Chapman and Hall.
10.1201/b18698
Gibbons, R.D. and Cai, L. (2017). Dimensionality analysis. In: Handbook of Item Response Theory: Applications, vol. 3. CRC Press.

Gibbons, R.D. and Hedeker, D.R. (1992). Full-information item bi-factor analysis. Psychometrika 57 (3): 423–436.
10.1007/BF02295430
Gibbons, R.D. and Lavigne, J.V. (1998). Emergence of childhood psychiatric disorders: a multivariate probit analysis. Statistics in Medicine 17 (21): 2487–2499.
10.1002/(SICI)1097-0258(19981115)17:21<2487::AID-SIM937>3.0.CO;2-2
Gibbons, R.D. and Wilcox-Gök, V. (1998). Health service utilization and insurance coverage: a multivariate probit analysis. Journal of the American Statistical Association 93 (441): 63–72.
10.1080/01621459.1998.10474088
Gibbons, R.D., Bock, R.D., Hedeker, D. et al. (2007a). Full-information item bifactor analysis of graded response data. Applied Psychological Measurement 31 (1): 4–19.
10.1177/0146621606289485
Gibbons, R.D., Immekus, J.C., and Bock, R.D. (2007b). The added value of multidimensional IRT models. Multidimensional and Hierarchical Modeling Monograph 1 (312): 1–49.

Gibbons, R.D., Weiss, D.J., Kupfer, D.J. et al. (2008). Using computerized adaptive testing to reduce the burden of mental health assessment. Psychiatric Services 59 (4): 361–368.
10.1176/ps.2008.59.4.361
Gibbons, R.D., Weiss, D.J., Pilkonis, P.A. et al. (2012). Development of a computerized adaptive test for depression. Archives of General Psychiatry 69 (11): 1104–1112.
10.1001/archgenpsychiatry.2012.14
Gibbons, R.D., Weiss, D.J., Pilkonis, P.A. et al. (2014). Development of the CAT-ANX: a computerized adaptive test for anxiety. American Journal of Psychiatry 171 (2): 187–194.
10.1176/appi.ajp.2013.13020178
Gibbons, R.D., Weiss, D.J., Frank, E., and Kupfer, D. (2016). Computerized adaptive diagnosis and testing of mental health disorders. Annual Review of Clinical Psychology 12 (1): 83–104.
10.1146/annurev-clinpsy-021815-093634
Gibbons, R.D., Kupfer, D., Frank, E. et al. (2017). Development of a computerized adaptive test suicide scale: the CAT-SS. Journal of Clinical Psychiatry 78 (9): 1376–1382.
10.4088/JCP.16m10922
Gibbons, R.D., Alegría, M., Cai, L. et al. (2018). Successful validation of the CAT-MH scales in a sample of Latin American migrants in the United States and Spain. Psychological Assessment 30 (10): 1267–1276.
10.1037/pas0000569
Gibbons, R.D., Kupfer, D.J., Frank, E. et al. (2019). Computerized adaptive tests for rapid and accurate assessment of psychopathology dimensions in youth. Journal of the American Academy of Child & Adolescent Psychiatry. 1264–1273.

Gibbons, R.D., Alegria, M., Markle, S. et al. (2020). Development of a computerized adaptive substance use disorder scale for screening and measurement: the CAT-SUD. Addiction 115 (7): 1382–1394.
10.1111/add.14938
Gilks, W.R., Roberts, G.O., and Sahu, S.K. (1998). Adaptive Markov chain Monte Carlo through regeneration. Journal of the American Statistical Association 93 (443): 1045–1054.
10.1080/01621459.1998.10473766
Gill, P. and Murray, W. (1974). Numerical Methods for Constrained Optimization. New York: Academic Press.

Glas, C.A. (1998). Detection of differential item functioning using Lagrange multiplier tests. Statistica Sinica 8 (3): 647–667.

Goff, A., Rose, E., Rose, S., and Purves, D. (2007). Does PTSD occur in sentenced prison populations? A systematic literature review. Criminal Behaviour and Mental Health 17 (3): 152–162.
10.1002/cbm.653
Goldstein, H. (1983). Measuring changes in educational attainment over time: problems and possibilities. Journal of Educational Measurement 20 (4): 369–377.
10.1111/j.1745-3984.1983.tb00214.x
Goodman, L.A. (1968). The analysis of cross-classified data: independence, quasi-independence, and interactions in contingency tables with or without missing entries. Journal of the American Statistical Association 63 (324): 1091–1131.
10.1080/01621459.1968.10480916
Guilford, J. (1954). The constant methods. In: Psychometric Methods, 2e, 597. New York: McGraw-Hill.

Guinart, D., de Filippis, R., Rosson, S. et al. (2020). Development and validation of a computerized adaptive assessment tool for discrimination and measurement of psychotic symptoms. Schizophrenia Bulletin 9: sbaa168.

Gulliksen, H. (1950). Theory of Mental Tests. Wiley.
10.1037/13240-000
Gumbel, E.J. (1961). Bivariate logistic distributions. Journal of the American Statistical Association 56 (294): 335–349.
10.1080/01621459.1961.10482117
Gupta, S.S. (1963). Probability integrals of multivariate normal and multivariate t. The Annals of Mathematical Statistics 34 (3): 792–828.
10.1214/aoms/1177704004
Guttman, L. (1945). A basis for analyzing test-retest reliability. Psychometrika 10 (4): 255–282.
10.1007/BF02288892
Haberman, S.J. (1977). Log-linear models and frequency tables with small expected cell counts. Annals of Statistics 5: 1148–1169.
10.1214/aos/1176344001
Haberman, S. (1978). Analysis of Qualitative Data: Introductory Topics, vol. 1. New York: Academic Press, Incorporated.

Haberman, S. (1979). Analysis of Qualitative Data: New Developments, vol. 2. New York: Academic Press, Incorporated.

Haffajee, R.L. and Frank, R.G. (2018). Making the opioid public health emergency effective. JAMA Psychiatry 75 (8): 767–768.
10.1001/jamapsychiatry.2018.0611
Haggard, E. (1958). Intraclass Correlation and the Analysis of Variance. New York: Dryden Press.

Haley, D.C. (1952). Estimation of the Dosage Mortality Relationship When the Dose is Subject to Error. Technical Report No. 15 (Office of Naval Research Contract No 25140, NR 342-022). Stanford, CA: Stanford University Applied Mathematics and Statistics Labs.

Hambleton, R.K. and Swaminathan, H. (1985). Item Response Theory: Principles and Applications. Boston, MA: Kluwer-Nijhoff.
10.1007/978-94-017-1988-9
Hamilton, M. (1960). A rating scale for depression. Journal of Neurology, Neurosurgery, and Psychiatry 23: 56–62.
10.1136/jnnp.23.1.56
Han, K.T. (2012). SimulCAT: windows software for simulating computerized adaptive test administration. Applied Psychological Measurement 36 (1): 64–66.
10.1177/0146621611414407
Harman, H. (1967). Modern Factor Analysis. Chicago: University of Chicago Press.
10.2307/3151880
Hastings, C. (1955). Approximations for Digital Computers. Princeton, NJ: Princeton University Press.
10.1515/9781400875597
Hathaway, R.J. (1985). A constrained formulation of maximum-likelihood estimation for normal mixture distributions. The Annals of Statistics 13 (2): 795–800.
10.1214/aos/1176349557
Hausman, J.A. and Wise, D.A. (1978). A conditional probit model for qualitative choice: discrete decisions recognizing interdependence and heterogeneous preferences. Econometrica, Econometric Society 46 (2): 403–426.
10.2307/1913909
Hawton, K. and Fagg, J. (1992). Trends in deliberate self poisoning and self injury in Oxford, 1976–90. British Medical Journal 304 (6839): 1409–1411.
10.1136/bmj.304.6839.1409
Hedeker, D. (1989). Random regression models with autocorrelated errors. Unpublished PhD dissertation. University of Chicago.

Hedeker, D. and Gibbons, R.D. (1996). MIXOR: a computer program for mixed-effects ordinal probit and logistic regression analysis. Computer Methods and Programs in Biomedicine 49: 157–176.
10.1016/0169-2607(96)01720-8
Hedeker, D. and Gibbons, R. (2006). Longitudinal Data Analysis. New York: Wiley.

Hedeker, D., Mermelstein, R.J., and Flay, B.R. (2006). Application of item response theory models for intensive longitudinal data. In: Models for Intensive Longitudinal Data (ed. T.A. Walls and J.L. Schafer), 84–108. Oxford: Oxford University Press.
10.1093/acprof:oso/9780195173444.003.0004
Henderson, H.V. and Searle, S.R. (1979). Vec and vech operators for matrices, with some uses in Jacobians and multivariate statistics. Canadian Journal of Statistics 7 (1): 65–81.
10.2307/3315017
Hendrickson, A.E. and White, P.O. (1964). Promax: a quick method for rotation to oblique simple structure. British Journal of Statistical Psychology 17 (1): 65–70.
10.1111/j.2044-8317.1964.tb00244.x
Holland, P. and Wainer, H. (1993). Differential Item Functioning. Hillsdale, NJ: Lawrence Erlbaum Associates.

Holland, P.W., Dorans, N.J., and Petersen, N.S. (2006). Equating test scores. In: Handbook of Statistics, vol. 26 (ed. C.R. Rao and S. Sinharay), 169–203. Amsterdam, Netherlands: Elsevier.

Holzinger, K.J. and Swineford, F. (1937). The Bi-factor method. Psychometrika 2 (1): 41–54.
10.1007/BF02287965
Householder, A.S. (1953). Principles of Numerical Analysis. New York: McGraw-Hill.

Householder, A.S. (1964). Theory of Matrices in Numerical Analysis. New York: Blaisdell.

Jeon, M., Rijmen, F., and Rabe-Hesketh, S. (2013). Modeling differential item functioning using a generalization of the multiple-group bifactor model. Journal of Educational and Behavioral Statistics 38 (1): 32–60.
10.3102/1076998611432173
Joe, H. and Maydeu-Olivares, A. (2010). A general family of limited information goodness-of-fit statistics for multinomial data. Psychometrika 75 (3): 393–419.
10.1007/s11336-010-9165-5
Joiner, T. (2010). Myths about suicide. Choice Reviews Online 48 (03): 48–1761.

Jöreskog, K.G. (1969). A general approach to confirmatory maximum likelihood factor analysis. Psychometrika 34 (2): 183–202.
10.1007/BF02289343
Jöreskog, K.G. (1979). Basic ideas of factor and component analysis. In: Advances in Factor Analysis and Structural Equation Models, 5–20. Cambridge: Abt Books.

Jöreskog, K.G. (1994). On the estimation of polychoric correlations and their asymptotic covariance matrix. Psychometrika 59: 381–390.
10.1007/BF02296131
Kaiser, H.F. (1958). The varimax criterion for analytic rotation in factor analysis. Psychometrika 23: 187–200.
10.1007/BF02289233
Kalb, L.G., Stapp, E.K., Ballard, E.D. et al. (2019). Trends in psychiatric emergency department visits among youth and young adults in the US. Pediatrics 143 (4).
10.1542/peds.2018-2192
Kelley, T. (1947). Fundamentals of Statistics. Cambridge: Harvard University Press.

Kennedy, W.J. and Gentle, J.E. (1980). Statistical Computing. New York: Marcel Dekker.

Kiefer, J. and Wolfowitz, J. (1956). Consistency of the maximum likelihood estimator in the presence of infinitely many incidental parameters. The Annals of Mathematical Statistics 27 (4): 887–906.
10.1214/aoms/1177728066
Kim, J.J., Silver, R.K., Elue, R. et al. (2016). The experience of depression, anxiety, and mania among perinatal women. Archives of Women's Mental Health 19 (5): 883–890.
10.1007/s00737-016-0632-6
King, C.A., Brent, D., Grupp-Phelan, J. et al. (2021). The computerized adaptive screen for suicidal youth (CASSY) development and independent validation. JAMA Psychiatry, published online ahead of print.
10.1001/jamapsychiatry.2020.4576
Kingsbury, G.G. and Weiss, D.J. (1980). An Alternate-Forms Reliability and Concurrent Validity Comparison of Bayesian Adaptive and Conventional Ability Tests. Research Report 80-5, Computerized Adaptive Testing Laboratory. Minneapolis, MN: University of Minnesota.

Kingsbury, G.G. and Weiss, D.J. (1983). A comparison of IRT-based adaptive mastery testing and a sequential mastery testing procedure. In: New Horizons in Testing (ed. D.J. Weiss), 257–283. New York: Academic Press.
10.1016/B978-0-12-742780-5.50024-X
Kolakowski, D. and Bock, R.D. (1981). A multivariate generalization of probit analysis. Biometrics 37: 541–551.
10.2307/2530567
Kolen, M.J. (2006). Scaling and norming. In: Educational Measurement (ed. R.L. Brennan), 155–186. Westport, CT: American Council on Education/Praeger.

Kroenke, K., Spitzer, R.L., and Williams, J.B. (2001). The PHQ-9: validity of a brief depression severity measure. Journal of General Internal Medicine 16 (9): 606–613.
10.1046/j.1525-1497.2001.016009606.x
La Porte, L.M., Kim, J.J., Adams, M.G. et al. (2020). Feasibility of perinatal mood screening and text messaging on patients' personal smartphones. Archives of Women's Mental Health 23 (2): 181–188.
10.1007/s00737-019-00981-5
Laird, N. (1978). Nonparametric maximum likelihood estimation of a mixing distribution. Journal of the American Statistical Association 73 (364): 805–811.
10.1080/01621459.1978.10480103
Lawley, D.N. (1943). XXIII. On problems connected with item selection and test construction. Proceedings of the Royal Society of Edinburgh. Section A. Mathematical and Physical Sciences 61 (3): 273–287.
10.1017/S0080454100006282
Lazarsfeld, P. (1958). Evidence and inference in social research. Daedalus 87 (4): 99–130.

Lazarsfeld, P. (1959). Latent structure analysis. In: Psychology: A Study of Science (ed. S. Koch), 476–543. New York: McGraw-Hill.

Lehman, A.F. (1988). A quality of life interview for the chronically mentally ill. Evaluation and Program Planning 11 (1): 51–62.
10.1016/0149-7189(88)90033-X
Lehmann, E. and Casella, G. (1998). Theory of Point Estimation. New York: Springer-Verlag.

Leung, C.-K., Chang, H.-H., and Hau, K.-T. (2003). Computerized adaptive testing: a comparison of three content balancing methods. The Journal of Technology, Learning and Assessment 2 (5): 1–15.

Likert, R. (1932). A technique for the measurement of attitudes. Archives of Psychology 22: 5–55.

Lima Passos, V., Berger, M.P.F., and Tan, F.E. (2007). Test design optimization in CAT early stage with the nominal response model. Applied Psychological Measurement 31 (3): 213–232.
10.1177/0146621606291571
Lindquist, E. (1953). Design and Analysis of Experiments in Psychology and Education. Boston, MA: Houghton Mifflin.

Linn, R.L., Rock, D.A., and Cleary, T.A. (1969). The development and evaluation of several programmed testing methods. Educational and Psychological Measurement 29 (1): 129–146.
10.1177/001316446902900109
Linn, R., Levine, M., Hastings, C., and Wardrop, J. (1980). An Investigation of Item Bias in a Test of Reading Comprehension. Technical report (Technical Report No. 163). Urbana, IL: Center for the Study of Reading, University of Illinois.

Longford, N. (1987). A fast scoring algorithm for maximum likelihood estimation in unbalanced mixed models with nested random effects. Biometrika 74 (4): 817–827.
10.1093/biomet/74.4.817
Lord, F. (1952). A theory of test scores. Psychometric Monographs 7: 84.

Lord, F.M. (1953). The relation of test score to the trait underlying the test. Educational and Psychological Measurement 13 (4): 517–549.
10.1177/001316445301300401
Lord, F.M. (1962). Estimating norms by item-sampling. Educational and Psychological Measurement 22 (2): 259–267.
10.1177/001316446202200202
Lord, F.M. (1968). Some Test Theory for Tailored Testing. Research Bulletin, 69/4. Princeton, NJ: Educational Testing Service.
10.1002/j.2333-8504.1968.tb00562.x
Lord, F.M. (1980). Applications of Item Response Theory to Practical Testing Problems. Mahwah, NJ: Lawrence Erlbaum Associates.

Lord, F.M. (1983). Unbiased estimators of ability parameters, of their variance, and of their parallel-forms reliability. Psychometrika 48: 233–245.
10.1007/BF02294018
Lord, F.M. and Novick, M.R. (1968). Statistical Theories of Mental Test Scores. Reading, MA: Addison-Wesley Pub. Co.

Louis, T.A. (1982). Finding the observed information matrix when using the EM algorithm. Journal of the Royal Statistical Society: Series B (Methodological) 44 (2): 226–233.
10.1111/j.2517-6161.1982.tb01203.x
Luce, R.D. (1959). On the possible psychophysical laws. Psychological Review 66 (2): 81–95.
10.1037/h0043178
Magnus, J. and Neudecker, H. (1988). Matrix Differential Calculus with Applications in Statistics and Econometrics. New York: Wiley.
10.2307/2531754
Martínez-Plumed, F., Prudêncio, R.B., Martínez-Usó, A., and Hernández-Orallo, J. (2016). Making sense of item response theory in machine learning. Frontiers in Artificial Intelligence and Applications 285: 1140–1148.

Masters, G.N. (1982). A Rasch model for partial credit scoring. Psychometrika 47 (2): 149–174.
10.1007/BF02296272
Mathers, C.D. and Loncar, D. (2006). Projections of global mortality and burden of disease from 2002 to 2030. PLoS Medicine 3 (11): 2011–2030.
10.1371/journal.pmed.0030442
Matías-Carrelo, L.E., Chávez, L.M., Negrón, G. et al. (2003). The Spanish translation and cultural adaptation of five mental health outcome measures. Culture, Medicine and Psychiatry 27 (3): 291–313.
10.1023/A:1025399115023
Maydeu-Olivares, A. and Joe, H. (2005). Limited- and full-information estimation and goodness-of-fit testing in 2ⁿ contingency tables: a unified framework. Journal of the American Statistical Association 100 (471): 1009–1020.
10.1198/016214504000002069
McBride, J.R. and Martin, J.T. (1983). Reliability and validity of adaptive ability tests in a military setting. In: New Horizons in Testing (ed. D.J. Weiss), 223–236. New York: Academic Press.
10.1016/B978-0-12-742780-5.50022-6
McFadden, D. (1973). Conditional logit analysis of qualitative choice behavior. In: Frontiers in Econometrics (ed. P. Zarembka), 105–142. New York: Academic Press.

Meredith, W. (1993). Measurement invariance, factor analysis and factorial invariance. Psychometrika 58 (4): 525–543.
10.1007/BF02294825
Mislevy, R.J. (1983). Item response models for grouped data. Journal of Educational Statistics 8 (4): 271–288.
10.3102/10769986008004271
Mislevy, R.J. (1984). Estimating latent distributions. Psychometrika 49 (3): 359–381.
10.1007/BF02306026
Mislevy, R.J. and Verhelst, N. (1990). Modeling item responses when different subjects employ different solution strategies. Psychometrika 55 (2): 195–215.
10.1007/BF02295283
Mitchell, A., Vaze, A., and Rao, S. (2009). Clinical diagnosis of depression in primary care: a meta-analysis. The Lancet 374 (9690): 609–619.
10.1016/S0140-6736(09)60879-5
Mosteller, F. and Tukey, J.W. (1977). Data Analysis and Regression: A Second Course in Statistics. Reading, MA: Addison-Wesley Pub. Co.

Moustaki, I. (2000). A latent variable model for ordinal variables. Applied Psychological Measurement 24 (3): 211–223.
10.1177/01466210022031679
Moustaki, I., Jöreskog, K.G., and Mavridis, D. (2004). Factor models for ordinal variables with covariate effects on the manifest and latent variables: a comparison of LISREL and IRT approaches. Structural Equation Modeling 11 (4): 487–513.
10.1207/s15328007sem1104_1
Mulder, J. and van der Linden, W.J. (2009). Multidimensional adaptive testing with optimal design criteria for item selection. Psychometrika 74 (2): 273–296.
10.1007/s11336-008-9097-5
Muraki, E. (1990). Fitting a polytomous item response model to Likert-type data. Applied Psychological Measurement 14 (1): 59–71.
10.1177/014662169001400106
Muraki, E. (1992). A generalized partial credit model: application of an EM algorithm. ETS Research Report Series 1992 (1): i–30.
10.1002/j.2333-8504.1992.tb01436.x
Mustanski, B. and Espelage, D.L. (2020). Why are we not closing the gap in suicide disparities for sexual minority youth? Pediatrics 145 (3).
10.1542/peds.2019-4002
Mustanski, B., Whitton, S., Newcomb, M. et al. Predicting suicidality using a computer adaptive test: two longitudinal studies of sexual and gender minority youth. Journal of Consulting and Clinical Psychology, in press.

Muthén, B. (1979). A structural probit model with latent variables. Journal of the American Statistical Association 74 (368): 807–811.

Muthén, B.O. (1989). Latent variable modeling in heterogeneous populations. Psychometrika 54 (4): 557–585.
10.1007/BF02296397
Nandakumar, R. and Stout, W.F. (1993). Refinements of Stout's procedure for assessing latent trait unidimensionality. Journal of Educational Statistics 18: 41–68.
10.2307/1165182
National Institute of Mental Health (2017). Ask Suicide-Screening Questions (ASQ) Toolkit. https://www.nimh.nih.gov/research/research-conducted-at-nimh/asq-toolkit-materials/index.shtml (accessed 28 October 2020).

National Leadership Forum on Behavioral Health/Criminal Justice Services (2009). Ending an American Tragedy: Addressing the Needs of Justice-Involved People with Mental Illnesses and Co-Occurring Disorders. Technical report. https://www.usf.edu/cbcs/mhlp/tac/documents/behavioral-healthcare/samh/ending-an-american-tragedy.pdf (accessed 28 October 2020).

Naylor, J.C. and Smith, A.F.M. (1982). Applications of a method for the efficient computation of posterior distributions. Applied Statistics 31 (3): 214.
10.2307/2347995
Neyman, J. and Scott, E.L. (1948). Consistent estimates based on partially consistent observations. Econometrica 16 (1): 1.
10.2307/1914288
Nishisato, S. and Nishisato, I. (1994). Dual Scaling in a Nutshell. Toronto: MicroStats.

Nunnally, J. (1967). Psychometric Theory. New York: McGraw Hill.

Overall, J.E. and Gorham, D.R. (1988). The brief psychiatric rating scale (BPRS): recent developments in ascertainment and scaling. Psychopharmacology Bulletin 24 (1): 97–99.

Pilkonis, P.A., Choi, S.W., Reise, S.P. et al. (2011). Item banks for measuring emotional distress from the patient-reported outcomes measurement information system (PROMIS®): depression, anxiety, and anger. Assessment 18 (3): 263–283.
10.1177/1073191111411667
Priester, M.A., Browne, T., Iachini, A. et al. (2016). Treatment access barriers and disparities among individuals with co-occurring mental health and substance use disorders: an integrative literature review. Journal of Substance Abuse Treatment 61: 47–59.
10.1016/j.jsat.2015.09.006
Radloff, L.S. (1977). The CES-D scale: a self-report depression scale for research in the general population. Applied Psychological Measurement 1 (3): 385–401.
10.1177/014662167700100306
Rao, C.R. (1973). Linear Statistical Inference and Its Applications, 2e. New York: Wiley.
10.1002/9780470316436
Rasch, G. (1960). Studies in Mathematical Psychology: I. Probabilistic Models for Some Intelligence and Attainment Tests. Copenhagen: Nielsen & Lydiche.

Rasch, G. (1961). On general laws and the meaning of measurement in psychology. Proceedings of the 4th Berkeley Symposium on Mathematical Statistics and Probability, Volume 4: Contributions to Biology and Problems of Medicine, pp. 321–334.

Reckase, M.D. (2009). Multidimensional Item Response Theory. New York: Springer.
10.1007/978-0-387-89976-3
Reckase, M.D. and McKinley, R.L. (1991). The discriminating power of items that measure more than one dimension. Applied Psychological Measurement 15: 361–374.
10.1177/014662169101500407
Reise, S.P. and Waller, N.G. (1993). Traitedness and the assessment of response pattern scalability. Journal of Personality and Social Psychology 65 (1): 143–151.
10.1037/0022-3514.65.1.143
Rijmen, F., Tuerlinckx, F., De Boeck, P., and Kuppens, P. (2003). A nonlinear mixed model framework for item response theory. Psychological Methods 8 (2): 185–205.
10.1037/1082-989X.8.2.185
Roy, S.N. (1957). Some Aspects of Multivariate Analysis. Kolkata: Statistical Publishing Society.

Rubin, D.B. (1976). Inference and missing data. Biometrika 63 (3): 581–592.
10.1093/biomet/63.3.581
Samejima, F. (1969). Estimation of latent ability using a response pattern of graded scores. Psychometrika 35 (1): 139.
10.1007/BF02290599
Samejima, F. (1972). A general model for free-response data. Psychometrika Monograph Supplement 37 (1): 68.

Samejima, F. (1979). A New Family of Models for the Multiple Choice Item. Technical report (Research Report No. 79-4). Knoxville, TN: University of Tennessee, Department of Psychology.

SAMHSA (2016). National Survey on Drug Use and Health. Technical report. Substance Abuse and Mental Health Services Administration. https://www.samhsa.gov/data/release/2016-national-survey-drug-use-and-health-nsduh-releases (accessed 28 October 2020).

Sanathanan, L. and Blumenthal, S. (1978). The logistic model and estimation of latent structure. Journal of the American Statistical Association 73 (364): 794–799.
10.1080/01621459.1978.10480101
Sani, S., Busnello, J., Kochanski, R. et al. (2017). High-frequency measurement of depressive severity in a patient treated for severe treatment-resistant depression with deep-brain stimulation. Translational Psychiatry 7 (8): e1207.
10.1038/tp.2017.145
Savage, L.J. (1954). The Foundations of Statistics. New York: Dover Press.

Schwarz, G. (1978). Estimating the dimension of a model. Annals of Statistics 6 (2): 461–464.
10.1214/aos/1176344136
Schilling, S. and Bock, R.D. (2005). High-dimensional maximum marginal likelihood item factor analysis by adaptive quadrature. Psychometrika 70 (3): 533–555.

Schott, J.R. (1997). Matrix Analysis for Statistics. Hoboken, NJ: Wiley.

Schroeder, E. (1945). On Measurement of Motor Skills; An Approach Through a Statistical Analysis of Archery Scores. Oxford: King's Crown Press.
10.7312/schr92192
Segall, D.O. (1996). Multidimensional adaptive testing. Psychometrika 61 (2): 331–354.
10.1007/BF02294343
Segall, D.O. (2000). Principles of multidimensional adaptive testing. In: Computerized Adaptive Testing: Theory and Practice (ed. W.J. van der Linden and C.A.W. Glas), 53–73. Boston, MA: Kluwer Academic.
10.1007/0-306-47531-6_3
Seo, D.G. and Weiss, D.J. (2015). Best design for multidimensional computerized adaptive testing with the bifactor model. Educational and Psychological Measurement 75 (6): 954–978.
10.1177/0013164415575147
Shenton, L.R. and Bowman, K.O. (1977). Maximum Likelihood Estimation in Small Samples. London: Griffin.

Silvey, S.D. (1980). Optimal Design. London: Chapman & Hall.
10.1007/978-94-009-5912-5
Skrondal, A. and Rabe-Hesketh, S. (2004). Generalized Latent Variable Modeling: Multilevel, Longitudinal, and Structural Equation Models. London: Chapman and Hall/CRC.
10.1201/9780203489437
Smith, M.R., Martinez, T., and Giraud-Carrier, C. (2014). An instance level analysis of data complexity. Machine Learning 95 (2): 225–256.
10.1007/s10994-013-5422-z
Spearman, C. (1904). Measurement of association, Part II. Correction of 'systematic deviations'. American Journal of Psychology 15: 88–101.
10.2307/1412159
Spearman, C. (1907). Demonstration of formulae for true measurement of correlation. The American Journal of Psychology 18 (2): 161.
10.2307/1412408
Spielberger, C.D., Gorsuch, R.L., Lushene, R. et al. (1983). Manual for the State-Trait Anxiety Inventory. Palo Alto, CA: Consulting Psychologists Press.

Stan, A.D., Tamminga, C.A., Han, K. et al. (2020). Associating psychotic symptoms with altered brain anatomy in psychotic disorders using multidimensional item response theory models. Cerebral Cortex 30 (5): 2939–2947.
10.1093/cercor/bhz285
Steadman, H.J., Osher, F.C., Robbins, P.C. et al. (2009). Prevalence of serious mental illness among jail inmates. Psychiatric Services 60 (6): 761–765.
10.1176/ps.2009.60.6.761
Stevens, S.S. (1961). To honor Fechner and repeal his law. Science 133 (3446): 80–86.
10.1126/science.133.3446.80
Stewart, W.F., Ricci, J.A., Chee, E. et al. (2003). Cost of lost productive work time among US workers with depression. Journal of the American Medical Association 289 (23): 3135–3144.
10.1001/jama.289.23.3135
Stocking, M.L. (1988). Scale drift in on-line calibration. ETS Research Report Series 1988 (1): i–122.
10.1002/j.2330-8516.1988.tb00313.x
Stocking, M.L. and Lord, F.M. (1983). Developing a common metric in item response theory. Applied Psychological Measurement 7 (2): 201–210.
10.1177/014662168300700208
Stout, W. (1987). A nonparametric approach for assessing latent trait unidimensionality. Psychometrika 52 (4): 589–617.
10.1007/BF02294821
Stout, W. (1990). A new item response theory modeling approach with applications to unidimensional assessment and ability estimation. Psychometrika 55: 293–326.
10.1007/BF02295289
Stroud, A.H. and Secrest, D. (1966). Gaussian Quadrature Formulas. Englewood Cliffs, NJ: Prentice Hall.

Stuart, A. (1958). Equally correlated variates and the multinormal integral. Journal of the Royal Statistical Society: Series B (Methodological) 20: 373–378.
10.1111/j.2517-6161.1958.tb00301.x
Swaminathan, H. and Rogers, H. (1990). Detecting differential item functioning using logistic regression procedures. Journal of Educational Measurement 27 (4): 361–370.
10.1111/j.1745-3984.1990.tb00754.x
Tamminga, C.A., Ivleva, E.I., Keshavan, M.S. et al. (2013). Clinical phenotypes of psychosis in the Bipolar-Schizophrenia network on intermediate phenotypes (B-SNIP). American Journal of Psychiatry 170 (11): 1263–1274.
10.1176/appi.ajp.2013.12101339
Thissen, D. (1982). Marginal maximum likelihood estimation for the one-parameter logistic model. Psychometrika 47 (2): 175–186.
10.1007/BF02296273
Thissen, D. (1991). MULTILOG User's Guide: Multiple, Categorical Item Analysis and Test Scoring Using Item Response Theory. Lincolnwood, IL: Scientific Software, Inc.

Thissen, D. and Steinberg, L. (1984). A response model for multiple choice items. Psychometrika 49 (4): 501–519.
10.1007/BF02302588
Thissen, D. and Steinberg, L. (1986). A taxonomy of item response models. Psychometrika 51 (4): 567–577.
10.1007/BF02295596
Thissen, D., Steinberg, L., and Wainer, H. (1993). Detection of differential item functioning using the parameters of item response models. In: Differential Item Functioning (ed. P.W. Holland and H. Wainer), 67–113. Hillsdale, NJ: Lawrence Erlbaum Associates.

Thissen, D., Cai, L., and Bock, R.D. (2010). The nominal categories item response model. In: Handbook of Polytomous Item Response Theory Models (ed. M.L. Nering and R. Ostini), 43–75. Routledge/Taylor & Francis Group.

Thurstone, L.L. (1927). Psychophysical analysis. The American Journal of Psychology 38 (3): 368–389.
10.2307/1415006
Thurstone, L.L. (1928). Attitudes can be measured. American Journal of Sociology 33 (4): 529–554.
10.1086/214483
Thurstone, L.L. (1929). Theory of attitude measurement. Psychological Review 36 (3): 222–241.
10.1037/h0070922
Thurstone, L.L. (1947). Multiple Factor Analysis. Chicago, IL: University of Chicago Press.

Thurstone, L.L. (1959). The Measurement of Values. Chicago, IL: University of Chicago Press.

Tsai, J. and Gu, X. (2019). Utilization of addiction treatment among U.S. adults with history of incarceration and substance use disorders. Addiction Science & Clinical Practice 14 (9).

Tucker, L.R. (1958). An inter-battery method of factor analysis. Psychometrika 23 (2): 111–136.
10.1007/BF02289009
Vale, C.D. and Weiss, D.J. (1977). A Rapid Item-Search Procedure for Bayesian Adaptive Testing. Technical report 77-4. Minneapolis, MN: University of Minnesota, Department of Psychology, Psychometric Methods Program, Computerized Adaptive Testing Laboratory.

van der Linden, W.J. (1999). Adaptive Testing with Equated Number-Correct Scoring. OMD research report; No. 99-02. Enschede, Netherlands: University of Twente, Faculty Educational Science and Technology.

van der Linden, W.J. and Pashley, P.J. (2010). Item selection and ability estimation in adaptive testing. In: Elements of Adaptive Testing (ed. W.J. van der Linden and C.A.W. Glas), 3–30. New York: Springer.
10.1007/978-0-387-85461-8
van der Linden, W.J. and Reese, L.M. (1998). A model for optimal constrained adaptive testing. Applied Psychological Measurement 22 (3): 259–270.
10.1177/01466216980223006
Veerkamp, W.J.J. and Berger, M.P.F. (1997). Some new item selection criteria for adaptive testing. Journal of Educational and Behavioral Statistics 22 (2): 203–226.
10.3102/10769986022002203
Wainer, H. and Mislevy, R. (1990). Item response theory, item calibration, and proficiency estimation. In: Computerized Adaptive Testing: A Primer (ed. H. Wainer), 81–99. Mahwah, NJ: Lawrence Erlbaum Associates.

Wainer, H., Sireci, S.G., and Thissen, D. (1991). Differential testlet functioning definitions and detection. ETS Research Report Series 1991 (1): i–42.
10.1002/j.2333-8504.1991.tb01423.x
Walker, H.M. and Lev, J. (1953). Statistical Inference. New York: Henry Holt & Company.
10.1037/11773-000
Warm, T.A. (1989). Weighted likelihood estimation of ability in item response theory. Psychometrika 54 (3): 427–450.
10.1007/BF02294627
Weiss, D.J. (1985). Adaptive testing by computer. Journal of Consulting and Clinical Psychology 53 (6): 774–789.
10.1037/0022-006X.53.6.774
Weiss, D.J. and Kingsbury, G.G. (1984). Application of computerized adaptive testing to educational problems. Journal of Educational Measurement 21 (4): 361–375.
10.1111/j.1745-3984.1984.tb01040.x
Weiss, D.J. and McBride, J.R. (1984). Bias and information of Bayesian adaptive testing. Applied Psychological Measurement 8 (3): 273–285.
10.1177/014662168400800303
Whooley, M.A. (2012). Diagnosis and treatment of depression in adults with comorbid medical conditions: a 52-year-old man with depression. Journal of the American Medical Association 307 (17): 1848–1857.
10.1001/jama.2012.3466
Woods, C.M., Cai, L., and Wang, M. (2013). The Langer-improved Wald test for DIF testing with multiple groups: evaluation and comparison to two-group IRT. Educational and Psychological Measurement 73 (3): 532–547.
10.1177/0013164412464875
Yao, L. (2012). Multidimensional CAT item selection methods for domain scores and composite scores: theory and applications. Psychometrika 77: 495–523.
10.1007/s11336-012-9265-5
Yao, L. (2013). Comparing the performance of five multidimensional CAT selection procedures with different stopping rules. Applied Psychological Measurement 37: 3–23.
10.1177/0146621612455687
Yi, Q. and Chang, H. (2003). A-stratified CAT design with content blocking. British Journal of Mathematical and Statistical Psychology 56: 359–378.
10.1348/000711003770480084
Zhang, J. and Stout, W. (1999). Conditional covariance structure of generalized compensatory multidimensional items. Psychometrika 64 (2): 129–152.
10.1007/BF02294532
Zimowski, M.F., Muraki, E., Mislevy, R.J., and Bock, R.D. (1996). BILOG-MG: Multiple-Group IRT Analysis and Test Maintenance for Binary Items. Chicago, IL: Scientific Software International.

