The interpretation of coefficients in an ordinal logistic regression varies by the software you use. In this FAQ page, we will focus on the interpretation of the coefficients in Stata but the results generalize to R, SPSS and Mplus.
Definitions
First let’s establish some notation and review the concepts involved in ordinal logistic regression. Let
for
Ordinal Logistic Regression Model
The ordinal logistic regression model can be defined as
How Stata parameterizes the ordinal regression model
In Stata the ordinal logistic regression model is parameterized as
where
Suppose we want to see whether a binary predictor parental education (pared
) predicts an ordinal outcome of students who are unlikely, somewhat likely and very likely to apply to a college (apply
).
Due to the parallel lines assumption, even though we have three categories, the coefficient of parental education (pared
) stays the same across the two categories. The the two equations for pared = 1
and pared = 0
are
Then
To run an ordinal logistic regression in Stata, first import the data and then use the ologit
command.
use "https://stats.idre.ucla.edu/stat/data/ologit.dta", clear ologit apply i.pared <... omitted output...> Ordered logistic regression Number of obs = 400 LR chi2(1) = 18.41 Prob > chi2 = 0.0000 Log likelihood = -361.39515 Pseudo R2 = 0.0248 ------------------------------------------------------------------------------ apply | Coef. Std. Err. z P>|z| [95% Conf. Interval] -------------+---------------------------------------------------------------- 1.pared | 1.127491 .2634324 4.28 0.000 .611173 1.643809 -------------+---------------------------------------------------------------- /cut1 | .3768424 .1103421 .1605758 .593109 /cut2 | 2.451855 .1825628 2.094039 2.809672 ------------------------------------------------------------------------------
The output shows that for students whose parents attended college, the log odds of being unlikely to apply to college (versus somewhat or very likely) is actually
To see the connection between the parallel lines assumption and the proportional odds assumption, exponentiate both sides of the equations above and use the property that pared
for each level of apply
.
From the odds of each level of pared, we can calculate the odds ratio of pared
for each level of apply
.
The proportional odds assumption ensures that the odds ratios across all
Interpreting the odds ratio
The proportional odds assumption is not simply that the odds are the same but that the odds ratios are the same across categories. These odds ratios can be derived by exponentiating the coefficients (in the log-odds metric), but the interpretation is a bit unexpected. Recall that the coefficient
Since the exponent is the inverse function of the log, we can simply exponentiate both sides of this equation, and by using the property that
For simplicity of notation and by the proportional odds assumption, let
However, as we will see in the output, this is not what we actually obtain from Stata!
To obtain the odds ratio in Stata, add the option or
to the ologit
command.
ologit apply i.pared, or <... omitted output...> Ordered logistic regression Number of obs = 400 LR chi2(1) = 18.41 Prob > chi2 = 0.0000 Log likelihood = -361.39515 Pseudo R2 = 0.0248 ------------------------------------------------------------------------------ apply | Odds Ratio Std. Err. z P>|z| [95% Conf. Interval] -------------+---------------------------------------------------------------- 1.pared | 3.087899 .8134527 4.28 0.000 1.842591 5.174843 -------------+---------------------------------------------------------------- /cut1 | .3768424 .1103421 .1605758 .593109 /cut2 | 2.451855 .1825628 2.094039 2.809672 ------------------------------------------------------------------------------ Note: Estimates are transformed only in the first equation.
In our example,
Since
From the output,
Another way to look at the odds ratio
Double negation can be logically confusing. Suppose we wanted to interpret the odds of being more likely to apply to college. We can perform a slight manipulation of our original odds ratio:
Since
Instead of interpreting the odds of being in the
Verifying both interpretations of the odds ratio using predicted probabilities
To verify that indeed the odds ratio of 3.08 can be interpreted in two ways, let’s derive them from the predicted probabilities in Stata.
Following the ologit
command, run margins with a categorical predictor to obtain predicted probabilities for each level of the predictor for each level of the outcome (
margins pared Adjusted predictions Number of obs = 400 Model VCE : OIM 1._predict : Pr(apply==0), predict(pr outcome(0)) 2._predict : Pr(apply==1), predict(pr outcome(1)) 3._predict : Pr(apply==2), predict(pr outcome(2)) -------------------------------------------------------------------------------- | Delta-method | Margin Std. Err. z P>|z| [95% Conf. Interval] ---------------+---------------------------------------------------------------- _predict#pared | 1 0 | .5931113 .0266289 22.27 0.000 .5409196 .645303 1 1 | .32068 .0532744 6.02 0.000 .2162641 .4250959 2 0 | .3275857 .0239325 13.69 0.000 .2806789 .3744926 2 1 | .4692269 .0333495 14.07 0.000 .4038631 .5345907 3 0 | .079303 .0133296 5.95 0.000 .0531774 .1054286 3 1 | .2100931 .0424965 4.94 0.000 .1268015 .2933847 --------------------------------------------------------------------------------
The number on the first column represents apply
and the second column represents pared
.
Interpretation 1
The first interpretation is for students whose parents did not attend college, the odds of being unlikely versus somewhat or very likely (i.e., less likely) to apply is 3.08 times that of students whose parents did go to college.
To verify this interpretation, we arbitrarily calculate the odds ratio for the first level of apply
which we know by the proportional odds assumption is equivalent to the odds ratio for the second level of apply
. Since we are looking at pared = 0
vs. pared = 1
for
Interpretation 2
The second interpretation is for students whose parents did attend college, the odds of being very or somewhat likely versus unlikely (i.e., more likely) to apply is 3.08 times that of students whose parents did not go to college.
Here we are looking at pared = 1
vs. pared = 0
for apply
The odds ratio for both interpretations matches the output of Stata.
Summary
In general, to obtain the odds ratio it is easier to exponentiate the coefficient itself rather than its negative because this is what is output directly from Stata. The researcher must then decide which of the two interpretations to use:
- For students whose parents did not attend college, the odds of being less likely to apply is 3.08 times that of students whose parents did go to college.
- For students whose parents did attend college, the odds of being more likely to apply is 3.08 times that of students whose parents did not go to college.
The second interpretation is easier because it avoids double negation.
References
Bilder, C. R., & Loughin, T. M. (2014). Analysis of categorical data with R. Chapman and Hall/CRC.