This page shows an example of a latent growth curve model (LGCM) with footnotes explaining the output. A LGCM can be similar to a multilevel model (a model many people have seen). To help you understand the LGCM and its output, first a multilevel model is shown using HLM and then using Stata, and then the same data is analyzed using Mplus using a LGCM. The Mplus output is related to the multilevel model results. We suggest that you view this page using two web browsers so you can show the page side by side showing the Stata output in one browser and the corresponding Mplus output in the other browser.
This example is drawn from the Mplus User’s Guide (example 6.10) and we suggest that you see the Mplus User’s Guide for more details about this example. We thank the kind people at Muthén & Muthén for permission to use examples from their manual.
Example Using HLM
Each subject is observed on the variable Y at four different times. A covariate called a is measured at each of the four time points. Also, the variables x1 and x2 are measured for each person. Conceptualized as a multilevel model, the variables time and a are level 1 variables. (Note that time is coded 0, 1, 2, and 3.) The variables x1 and x2 are level 2 variables. The model uses time and a to predict the values of y at level 1, and uses x1 and x2 to predict the intercept and slope of time at level 2. We can write this model using multiple equations as shown below. This uses the ex610.mdm file.
Level-1 Model Y = B0 + B1*(A) + B2*(TIME) + R Level-2 Model B0 = G00 + G01*(X1) + G02*(X2) + U0 B1 = G10 B2 = G20 + G21*(X1) + G B2 = G20 + G21*(X1) + G22*(X2) + U2
Here is the output from HLM, condensed to save space. Footnotes are included for relating the output to Mplus.
Sigma_squared = 0.54200I
Tau
INTRCPT1,B0 1.08757F 0.05079
TIME,B2 0.05079H 0.20495G
Tau (as correlations)
INTRCPT1,B0 1.000 0.108
TIME,B2 0.108 1.000
Final estimation of fixed effects:
----------------------------------------------------------------------------
Standard Approx.
Fixed Effect Coefficient Error T-ratio d.f. P-value
----------------------------------------------------------------------------
For INTRCPT1, B0
INTRCPT2, G00 0.570413A 0.054807 10.408 497 0.000
X1, G01 0.560548B 0.054574 10.271 497 0.000
X2, G02 0.716557B 0.055865 12.827 497 0.000
For A slope, B1
INTRCPT2, G10 0.296872E 0.021381 13.885 1993 0.000
For TIME slope, B2
INTRCPT2, G20 1.010207C 0.025332 39.879 497 0.000
X1, G21 0.263030D 0.025223 10.428 497 0.000
X2, G22 0.473419D 0.025819 18.336 497 0.000
----------------------------------------------------------------------------
Example Using Stata
Combining the two equations into one by substituting the level 2 equation into the level 1 equation, we have the equation below, with the random effects identified by placing them in square brackets.
Composite model Y = G00 + G01*(X1) + G02*(X2) + G10*A + G20*TIME + G21*X1*TIME + G22*X2*TIME + [ U0 + U2*TIME + r ]
Based on the composite model, this is the same example using Stata.
infile y1 y2 y3 y4 x1 x2 a1 a2 a3 a4 using https://stats.idre.ucla.edu/stat/mplus/output/ex6.10.dat
generate id = _n
reshape long y a, i(id) j(time)
replace time = time-1
generate timeBYx1 = time*x1
generate timeBYx2 = time*x2
mixed y x1 x2 a time timeBYx1 timeBYx2 || id: time, cov(un)
Mixed-effects ML regression Number of obs = 2000
Group variable: id Number of groups = 500
Obs per group: min = 4
avg = 4.0
max = 4
Wald chi2(6) = 2871.89
Log likelihood = -3075.8518 Prob > chi2 = 0.0000
------------------------------------------------------------------------------
y | Coef. Std. Err. z P>|z| [95% Conf. Interval]
-------------+----------------------------------------------------------------
x1 | .560547B .0544035 10.30 0.000 .4539181 .6671759
x2 | .716562B .0556897 12.87 0.000 .6074121 .8257118
a | .2967777E .0213597 13.89 0.000 .2549134 .3386419
time | 1.010207C .0252504 40.01 0.000 .9607168 1.059696
timeBYx1 | .2630303D .0251425 10.46 0.000 .2137518 .3123087
timeBYx2 | .4734171D .0257359 18.40 0.000 .4229756 .5238586
_cons | .5704131A .0546356 10.44 0.000 .4633293 .6774969
------------------------------------------------------------------------------
------------------------------------------------------------------------------
Random-effects Parameters | Estimate Std. Err. [95% Conf. Interval]
-----------------------------+------------------------------------------------
id: Unstructured |
var(time) | .2030399G .0203019 .1669053 .2469977
var(_cons) | 1.078679F .0937659 .9097041 1.27904
cov(time,_cons) | .0515093H .0314154 -.0100638 .1130825
-----------------------------+------------------------------------------------
var(Residual) | .5416011I .0242353 .496124 .5912467
------------------------------------------------------------------------------
LR test vs. linear regression: chi2(3) = 1344.80 Prob > chi2 = 0.0000
Note: LR test is conservative and provided only for reference
Mplus Example
Here is the same example analyzed as a multilevel using Mplus based on the ex6.10.dat data file.
Title:
Two level multilevel model in Mplus
Data:
File is ex6.10.dat ;
Variable:
Names are id time y x1 x2 a;
WITHIN = time a;
BETWEEN = x1 x2;
CLUSTER = id;
ANALYSIS:
TYPE = TWOLEVEL RANDOM;
MODEL:
%WITHIN%
s | y ON time;
y on a;
%BETWEEN%
y on x1 x2;
s on x1 x2;
y with s;
SUMMARY OF ANALYSIS
Number of observations 2000
SUMMARY OF DATA
Number of clusters 500
Average cluster size 4.000
Estimated Intraclass Correlations for the Y Variables
Intraclass Intraclass
Variable Correlation Variable Correlation
Y 0.615
TESTS OF MODEL FIT
Loglikelihood
H0 Value -3075.853
Information Criteria
Number of Free Parameters 11
Akaike (AIC) 6173.706
Bayesian (BIC) 6235.316
Sample-Size Adjusted BIC 6200.369
(n* = (n + 2) / 24)
MODEL RESULTS
Estimates S.E. Est./S.E.
Within Level
Y ON
A 0.297E 0.022 13.300
Residual Variances
Y 0.541I 0.024 22.178
Between Level
S ON
X1 0.263D 0.027 9.801
X2 0.473D 0.025 18.909
Y ON
X1 0.561B 0.054 10.296
X2 0.717B 0.054 13.264
Y WITH
S 0.050H 0.033 1.529
Intercepts
Y 0.570A 0.055 10.400
S 1.010C 0.025 39.763
Residual Variances
Y 1.081F 0.093 11.617
S 0.204G 0.020 10.236
- This is G00 in the multilevel model. It is the predicted value of y when time and a are both 0.
- This is G01 and G02 in the multilevel model. It is the predicted increase in the intercept for a one unit increase in x1 and x2, respectively.
- This is G20 in the multilevel model. It is the slope for time when x1 and x2 are held constant at 0.
- This is G21 and G22 in the multilevel model. It is the predicted increase in the time slope for a one unit increase in x1 and x2, respectively.
- This is G10 from the multilevel model, representing the regression of y on a.
- This is the variance of the intercept, the variance component for the intercept in the multilevel model.
- This is the variance of the slope for time, the variance component for the time slope in the multilevel model.
- This is the covariance of the intercept and slope, the covariance of B0 and B1 from the multilevel model.
- This is the residual variance.
