Abstract
This article gives an overview of statistical analysis with latent variables. Using traditional structural equation modeling as a starting point, it shows how the idea of latent variables captures a wide variety of statistical concepts, including random effects, missing data, sources of variation in hierarchical data, finite mixtures. latent classes, and clusters. These latent variable applications go beyond the traditional latent variable useage in psychometrics with its focus on measurement error and hypothetical constructs measured by multiple indicators. The article argues for the value of integrating statistical and psychometric modeling ideas. Different applications are discussed in a unifying framework that brings together in one general model such different analysis types as factor models, growth curve models, multilevel models, latent class models and discrete-time survival models. Several possible combinations and extensions of these models are made clear due to the unifying framework.
Article PDF
Similar content being viewed by others
Change history
13 February 2021
A Correction to this paper has been published: https://doi.org/10.1007/s41237-020-00128-2
References
Agresti, A. (1990). Categorical data analysis. New York: John Wiley & Sons.
Angrist, J. D., Imbens, G. W., & Rubin, D. B. (1996). Identification of causal effects using instrumental variables. Journal of the American Statistical Association, 91, 444–445.
Arminger, G. & Stein, P. (1997). Finite mixtures of covariance structure models with regressors. Sociological Methods & Research, 26, 148–182.
Arminger, G., Stein, P., & Wittenberg, J. (1998). Mixtures of conditional mean- and covariance-structure models. Psychornetrika, 64, 475–494.
Asparouhov, T. & Muthén, B. (2002). Full-information maximum-likelihood estimation of general two-level latent variable models. Draft.
Bandeen-Roche, K., Miglioretti, D. L., Zeger, S. L., & Rathouz, P. J. (1997). Latent variable regression for multiple discrete outcomes. Journal of the American Statistical Association, 92, 1375–1386.
Bartholomew, D. J. (1987). Latent variable models and factor analysts. New York: Oxford University Press.
Blafield, E. (1980). Clustering of observations from finite mixtures with structural information. Unpublished doctoral dissertation, Jyväskyla studies in computer science, economics, and statistics, Jyväskyla, Finland.
Bollen, K. A. (1989). Structural equations with, latent variables. New York: John Wiley.
Browne, M. W. & Arminger, G. (1995). Specification and estimation of mean- and covariance-structure models. In G. Arminger, C. C. Clogg, & M. E. Sobel (Eds.), Handbook of statistical modeling for the social and behavioral sciences (pp. 311–359). New York: Plenum Press.
Bryk, A. S. & Raudenbush, S. W. (2002). Hierarchical linear models: Applications and data analysis methods, 2nd ed. Newbury Park, CA: Sage Publications.
Clogg, C. C. (1995). Latent class models. In G. Arminger, C. C. Clogg, & M. E. Sobel (Eds.), Handbook of statistical modeling for the social and behavioral sciences (pp. 311–359). New York: Plenum Press.
Clogg, C. C. & Goodman, L. A. (1985). Simultaneous latent structural analysis in several groups. In Tuma, N. B. (Ed.), Sociological Methodology (pp. 81–110). San Francisco: Jossey-Bass Publishers.
Collins, L. M. & Saver, A. (Eds.) (2001). New methods for the analysis of change. Washington, D.C.: APA.
Collins, L. M. & Wugalter, S. E. (1992). Latent class models for stage-sequential dynamic latent variables. Multivariate Behavioral Research, 27, 131–157.
Collins, L. M., Graham, J. W., Rousculp, S. S., & Hansen, W. B. (1997). Heavy caffeine use and the beginning of the substance use onset process: An illustration of latent transition analysis. In K. Bryant, M. Windle. & S. West (Eds.), The science of prevention: Methodological advances from alcohol and substance use research (pp. 79–99). Washington DC: American Psychological Association.
Dayton, C. M. & Macready, G. B. (1988). Concomitant variable latent class models. Journal of the American Statistical Association, 83, 173–178.
Dolan, C. V. & van der Maas, H. L. J. (1998). Fitting multivariate normal mixtures subject to structural equation modeling. Psychornetrika, 63, 227–253.
Everitt, B. S. & Hand, D. J. (1981). Finite mixture distributions. London: Chapman and Hall.
Follrnann, D. A. & Lambert, D. (1989). Generalizing logistic regression by nonparametric mixing. Journal of the American Statistical Association, 84, 295–300.
Formann, A. K. (1992). Linear logistic latent class analysis for polytomous data. Journal of the American Statistical Association, 87, 476–486.
Frangakis, C. E. & Baker, S. G. (2001). Compliance subsampling designs for comparative research: Estimation and optimal planning. Biometrics, 57, 899–908.
Gibson, W. A. (1959). Three multivariate models: factor analysis, latent structure analysis, and latent profile analysis. Psychornetrika, 24, 229–252.
Goldstein, H. (1995). Multilevel statistical models. London: Edward Arnold.
Goodman, L. A. (1974). Exploratory latent structure analysis using both identifiable and unidentifiable models. Biometrika, 61, 215–231.
Graham, J. W., Collins, L. M., Wugalter, S. E., Chung, N. K., & Hansen, W. B. (1991). Modeling transitions in latent stagesequential processes: A substance use prevention example. Journal of Consulting and Clinical Psychology, 59, 48–57.
Hecht, R. H. (2001). Multilevel modeling with SEM. In G. A. Marcoulides & R. E. Schumacker (Eds.). New developments and techniques in structural equation modeling (pp. 89–127). Mahaw, NJ: Lawrence Erlbaum Associates.
Hedeker, D. & Rose, J. S. (2000). The natural history of smoking: A pattern-mixture random-effects regression model. In J. Rose, L. Chassin, C. Presson, & J. Sherman (Eds.). Multivariate applications in substance use research (pp. 79–112). Hillsdale, NJ: Erlbaum.
Heijden, P. G. M., Dressens, J., & Bockenholt, U. (1996). Estimating the concomitantvariable latent-class model with the EM algorithm. Journal of Educational and Behavioral Statistics, 21, 215–229.
Hogan, J. W. & Laird, N. M. (1997). Mixture models for the joint distribution of repeated measures and event times. Statistics in Medicine, 16, 239–258.
Hoshino, T. (2001). Bayesian inference for finite mixtures in confirmatory factor analysis. Behaviormetrika, 28, 37–63.
Hosmer, D. W. (1973). A comparison of iterative maximum likelihood estimates of the parameters of a mixture of two normal distributions under three different types of sample. Biometrics, 29, 761–770.
Jedidi, K., Jagpal, H. S., & DeSarbo, W. S. (1997). Finite-mixture structural equation models for response-based segmentation and unobserved heterogeneity. Marketing Science, 16, 39–59.
Jedidi, K., Ramaswamy, V., DeSarbo, W. S., & Wedel, M. (1996). On estimating finite mixtures of multivariate regression and simultaneous equation models. Structural Equation Modeling, 3, 266–289.
Jennrich, R. I. & Schluchter, M. D. (1986). Unbalanced repeated-measures models with structured covariance matrices. Biometrics, 42, 805–820.
Jo, B. (2001a). Estimation of intervention effects with noncompliance: Alternative model specifications. Accepted for publication in the Journal of Educational and Behavioral Statistics.
Jo, B. (2001b). Model misspecification sensitivity analysis in estimating causal effects of interventions with noncompliance. Accepted for publication in Statistics in Medicine.
Jo, B. (2001c). Statistical power in randomized intervention studies with noncompliance. Forthcoming in Psychological Methods.
Jo, B. & Muthén, B. (2001). Modeling of intervention effects with noncompliance: A latent variable modeling approach for randomized trials. In G. A. Marcoulides & R. E. Schumacker (Eds.), New developments and techniques in structural equation modeling (pp. 57–87). Mahaw, NJ: Lawrence Erlbaum Associates.
Jöreskog, K. G. & Goldberger, A. S. (1975). Estimation of a model with multiple indicators and multiple causes of a single latent variable. Journal of the American Statistical Association, 70, 631–639.
Kandel, D. B., Yarnaguchi, K., & Chen, K. (1992). Stages of progression in drug involvement from adolescence to adulthood: Further evidence for the gateway theory. Journal of Studies of Alcohol, 53, 447–457.
Kaplan, D. & Elliott, P. R. (1997). A didactic example of multilevel structural equation modeling applicable to the studv of organizations. Structural Equation Modeling, 4, 1–23.
Khoo, S. T. & Muthén, B. (2000). Longitudinal data on families; Growth modeling alternatives. In J. Rose, L. Chassin, C. Presson, & J. Sherman (Eds.). Multivariate applications in substance use research (pp. 43–78). Hillsdale, NJ: Erlbaum.
Laird, N. M. & Ware, J. H. (1982). Random-effects models for longitudinal data. Biometrics, 38, 963–974.
Land, K. C. (2001). Introduction to the special issue on finite mixture models. Sociological Methods & Research, 29, 275–281.
Lazarsfeld, P. F. & Henry, N. W. (1968). Latent structure analysis. New York: Houghton Mifflin.
Li, F., Duncan, T. E. Duncan, S. C., & Acock, A. (2001). Latent growth modeling of longitudinal data: A finite growth mixture modeling approach. Structural Equation Modeling, 8, 493–530.
Little, R. J. & Wang, Y. (1996). Pattern-mixture models for multivariate incomplete data with covariates. Biometrics, 52, 98–111.
Little, R. J. & Yau, L. H. Y. (1998). Statistical techniques for analyzing data from prevention trials: Treatment of no-shows using Rubin’s causal model. Psychological Methods, 3, 147–159.
Lindstrom, M. J., & Bates, D. M. (1988). Newton-Raphson and EM algorithms for linear mixed-effects models for repeated-measures data. Journal of the American Statistical Association, 83, 1014–1022.
Lubke, G., Muthén, B., & Larsen, K. (2001). Global and local identifiability of factor mixture models. Submitted for publication. University of California, Los Angeles.
MacCallum, R. C. & Austin, J. T. (2000). Applications of structural equation modeling in psychological research. Annual Review of Psychology, 51, 201–226.
McDonald, R. & Goldstein, H. (1989). Balanced versus unbalanced designs for linear structural relations in two-level data. British Journal of Mathematical and Statistical Psychology, 42, 215–232.
McLachlan, G. J. & Krishnan, T. (1997). The EM algorithm and extensions. New York: John Wiley & Sons.
McLachlan, G. J. & Peel, D. (2000). Finite mixture, models. New York: Wiley & Sons.
Muthén, B. (1988). Some uses of structural equation modeling in validity studies: Extending IRT to external variables. In H. Wainer & H. Braun (Eds.). Test validity (pp. 213–238). Hillsdale, NJ: Erlbaum Associates.
Muthén, B.(1989). Latent variable modeling in heterogeneous populations. Psychometrika, 54, 557–585.
Muthén, B. (1990). Mean and covariance structure analysis of hierarchical data. LTCLA Statistics Series #62, August 1990.
Muthén, B. (1991). Multilevel factor analysis of class and student achievement components. Journal of Educational Measurement, 28, 338–354.
Muthén, B. (1994). Multilevel covariance structure analysis. In J. Hox & I. Kreft (Eds.), Multilevel Modeling, a special issue of Sociological Methods & Research, 22, 376–398.
Muthén, B. (1997). Latent variable modeling with longitudinal and multilevel data. In A. Raftery (Ed.), Sociological Methodology (pp. 453–480). Boston: Blackwell Publishers.
Muthén, B. (2001a). Second-generation structural equation modeling with a combination of categorical and continuous latent variables: New opportunities for latent class/latent growth modeling. In Collins, L. M. & Saver, A. (Eds.), New methods for the analysis of change (pp. 291–322). Washington, D.C.: APA.
Muthén, B. (2001b). Latent variable mixture modeling. In G. A. Marcoulides & R. E. Schuinacker (Eds.), New developments and techniques in structural equation modeling (pp. 1–33). Mahaw, NJ: Lawrence Erlbaum Associates.
Muthén, B. (2001c). Two-part growth mixture modeling. Draft. University of California, Los Angeles.
Muthén, B. & Brown, C. H. (2001). Non-ignorable missing data in a general latent variable framework. Draft. University of California, Los Angeles.
Muthén, B., Brown, C. H., Masyn, K., Jo, B., Khoo, S. T., Yang, C. C., Wang, C. P., Kellam, S., Carlin, J., & Liao, J. (in press). General growth mixture modeling for randomized preventive interventions. Forthcoming in Biostatistics.
Muthén, B. & Curran, P. (1997). General longitudinal modeling of individual differences in experimental designs: A latent variable framework for analysis arid power estimation. Psychological Methods, 2, 371–402.
Muthén, B., Kao, C. F., & Burstein, L. (1991). Instructional sensitivity in mathematics achievement test items: Applications of a new IRT-based detection technique. Journal of Educational Measurement, 28, 1–22.
Muthén, B. & Khoo, S. T. (1998). Longitudinal studies of achievement growth using latent variable modeling. In John B. Cooney (Ed.), Latent Growth Curve Analysis, a special issue of Learning and Individual Differences, 10, 73–101.
Muthén, B., Khoo, S. T., Francis, D., & Kim Boscardin, C. (in press). Analysis of reading skills development from Kindergarten through first grade: An application of growth mixture modeling to sequential processes. In S. R. Reise & N. Duan (Eds), Multilevel modeling: Methodological advances, issues, and applications. Mahaw, NJ: Lawrence Erlbaum Associates.
Muthén, B. & Masyn, K. (2001). Discrete-time survival mixture analysis. University of California, Los Angeles.
Muthén, B. & Muthén, L. (2000). Integrating person-centered and variable-centered analysis: Growth mixture modeling with latent trajectory classes. Alcoholism: Clinical and Experimental Research, 24, 882–891.
Muthén, L. & Muthén, B. (1998–2001). Mplus user’s guide. Los Angeles, CA: Muthén & Muthén.
Nagin, D. S. (1999). Analyzing developmental trajectories: A semi-parametric, group-based approach. Psychological Methods, 4, 139–157.
Nagin, D. S. & Land, K. C. (1993). Age, climinal careers, and population heterogeneity: Specification and estimation of a nonparametric, mixed Poisson model. Criminology, 31, 327–362.
Nagin, D. S. & Tremblay, R. E. (2001). Analyzing developmental trajectories of distinct but related behaviors: a group-based method. Psychological Methods, 6, 18–34.
Nestadt, G., Hanfelt, J., Liang, K. Y., Lamacz, M., Wolyniec, P., & Pulver, A. E. (1994). An evaluation of the structure of schizophrenia spectrum personality disorders. Journal of Personality Disorders, 8, 288–298.
Pearson, K. (1895). Contributions to the theory of mathematical evolution, II: skew variation. Philosophical Transactions of the Royal Society of London A, 186, 343–414.
Raudenbush, S. W. (2001). Toward a coherent framework for comparing trajectories of individual change. In Collins, L. M. & Saver, A. (Eds.), New methods for the analysis of change (pp. 35–64). Washington, D.C.: APA.
Reboussin, B. A., Reboussin, D. M., Liang, K. Y., & Anthony, J. C. (1998). Latent transition modeling of progression of health-risk behavior. Multivariate Behavioral Research, 33, 457–478.
Rindskopf, D. & Rindskopf, W. (1986). The value of latent class analysis in medical diagnosis. Statistics in Medicine, 5, 21–27.
Takane, Y. & DeLeeuw, J. (1987). On the relationship between item response theory and factor analysis of discretized variables. Psychometrika, 52, 393–408.
Uebersax, J. S., & Grove, W. M. (1990). Latent class analysis of diagnostic agreement. Statistics in Medicine, 9, 559–572.
Yung, Y. F. (1997). Finite mixtures in confirmatory factor-analysis models. Psychometrika., 62, 297–330.
Author information
Authors and Affiliations
Corresponding author
Additional information
The original online version of this article was revised due to the retrospective open access order.
Rights and permissions
This article is published under an open access license. Please check the 'Copyright Information' section either on this page or in the PDF for details of this license and what re-use is permitted. If your intended use exceeds what is permitted by the license or if you are unable to locate the licence and re-use information, please contact the Rights and Permissions team.
About this article
Cite this article
Muthén, B.O. Beyond SEM: General Latent Variable Modeling. Behaviormetrika 29, 81–117 (2002). https://doi.org/10.2333/bhmk.29.81
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.2333/bhmk.29.81