Censored Regression | Mplus Annotated Output

This page shows an example of censored regression with footnotes explaining the output. First an example is shown using Stata, and then an example is shown using Mplus, to help you relate the output you are likely to be familiar with (Stata) to output that may be new to you (Mplus). We suggest that you view this page using two web browsers so you can show the page side by side showing the Stata output in one browser and the corresponding Mplus output in the other browser.

This example is drawn from the Mplus User’s Guide (example 3.2) and we suggest that you see the Mplus User’s Guide for more details about this example. We thank the kind people at Muthén & Muthén for permission to use examples from their manual.

Example Using Stata

Here is a probit regression example using Stata with two continuous predictors x1 and x2 used to predict a binary outcome variable, u1.

infile u1 x1 x3 using https://stats.idre.ucla.edu/wp-content/uploads/2016/02/ex3.2.dat, clear

summarize u1

    Variable |       Obs        Mean    Std. Dev.       Min        Max
-------------+--------------------------------------------------------
          u1 |      1000    .9240341    1.113079          0^A   6.579389

tobit u1 x1 x3, ll(0)

Tobit regression                                  Number of obs   =       1000
                                                  LR chi2(2)      =     697.44
                                                  Prob > chi2     =     0.0000
Log likelihood = -1142.8851                       Pseudo R2       =     0.2338

------------------------------------------------------------------------------
          u1 |      Coef.   Std. Err.      t    P>|t|     [95% Conf. Interval]
-------------+----------------------------------------------------------------
          x1 |   1.074801^D   .0419657    25.61   0.000     .9924498    1.157152
          x3 |   .4947541^D   .0378985    13.05   0.000     .4203842     .569124
       _cons |   .5154865^E   .0405066    12.73   0.000     .4359986    .5949743
-------------+----------------------------------------------------------------
      /sigma |   1.071333^F   .0316242                      1.009276    1.133391
------------------------------------------------------------------------------
  Obs. summary:        376  left-censored observations at u1<=0
                       624     uncensored observations
                         0 right-censored observations

estat ic

------------------------------------------------------------------------------
       Model |    Obs    ll(null)   ll(model)     df          AIC         BIC
-------------+----------------------------------------------------------------
           . |   1000   -1491.605   -1142.885^B      4      2293.77^C    2313.401^C
------------------------------------------------------------------------------

The output is labeled with superscripts to help you relate the later Mplus output to this Stata output. To summarize the output, both predictors in this model, x1 and x2, are significantly related to the outcome variable, u1.

Mplus Example

Here is the same example illustrated in Mplus based on the https://stats.idre.ucla.edu/wp-content/uploads/2016/02/ex3.2.dat data file. Note that by using estimator=wls; (weighted least squares) the results are shown in a probit metric. Had we specified something like estimator=ml; (maximum likelihood) then the results would be shown in a logit scale.

TITLE:	
  this is an example of a censored 
  regression for a censored dependent
  variable with two covariates
DATA:
  FILE IS https://stats.idre.ucla.edu/wp-content/uploads/2016/02/ex3.2.dat;
VARIABLE:
  NAMES ARE y1 x1 x3;
  CENSORED ARE y1 (b);
ANALYSIS:	
  ESTIMATOR = MLR;
MODEL:	
  y1 ON x1 x3;

SUMMARY OF ANALYSIS

<some output omitted to save space>
Number of observations                                        1000

<some output omitted to save space>
SUMMARY OF CENSORED LIMITS
      Y1                 0.000^A

THE MODEL ESTIMATION TERMINATED NORMALLY

TESTS OF MODEL FIT

Loglikelihood

          H0 Value                       -1142.885^B

Information Criteria

          Number of Free Parameters              4
          Akaike (AIC)                    2293.770^C
          Bayesian (BIC)                  2313.401^C
          Sample-Size Adjusted BIC        2300.697
            (n* = (n + 2) / 24)

MODEL RESULTS
                   Estimates     S.E.  Est./S.E.
 Y1         ON
    X1                 1.075^D    0.043     25.101
    X3                 0.495^D    0.037     13.344

 Intercepts
    Y1                 0.515^E    0.040     12.810

 Residual Variances
    Y1                 1.148^F    0.067     17.235

This indicates that the variable y1 is censored at 0. This is derived from the data, where Mplus notes that the lowest value of y1 is 0 (it seeks the lowest value because the input specification indicated the censoring was from below). Note how this corresponds to the results of the Stata summarize command that found the minimum value of y1 to be 0.
This is the log likelihood of the model. Note how this corresponds to the ll(model) from the Stata estat ic command.
These are the AIC and BIC fit indices, and correspond to the values shown from the estat ic command from Stata.
These are the regression coefficients showing the relationship between x1 x2 and y1. Such coefficients are interpreted in the same way as an OLS regression coefficient. The difference is that these coefficients attempt to estimate how estimate how strong the coefficient would have been had the censoring not taken place. Note the correspondence between these coefficients and those from Stata.
This is the intercept, the predicted value when all predictors are held constant at 0. Note the correspondence to the value shown in the Stata output.
This is the residual variance in y1 after accounting for the predictors, and would be analogous to the MSE from an OLS regression. In the Stata output this is reported as /sigma and is reported as a standard deviation (as opposed to a variance). Squaring the value from Stata yields 1.071333^2 = 1.1477544, corresponding to the result from Mplus.