This page was created using Mplus version 5.2, the output and/or syntax may be different for other versions of Mplus.
This page shows an example exploratory factor analysis with footnotes explaining the output. The data used in this example were collected on 1428 college students (complete data on 1365 observations) and are responses to items on a survey. You can obtain the data set by clicking here. The analysis includes 12 variables, item13 to item24.
Data: File is m255.dat ; Variable: Names are item13 item14 item15 item16 item17 item18 item19 item20 item21 item22 item23 item24; Missing are all (-9999) ; Analysis: type = efa 3 3;
Comments on syntax
- Some variables in the data set have missing values for some of the cases. These are indicated in the input data file with a -9999. We use the missing statement to indicate that for all variables in the model, missing values are indicated with a -9999. Note that starting with Mplus v.5 cases with missing values on some of the variables in the model are included in the analysis by default. If we wanted to include only cases with complete data, we could use the listwise = on; option of the data command.
- We indicate the type of analysis that we would like to do, that is, exploratory factor analysis (efa), using the type option of the analysis command. The numbers at the end of the statement indicate the minimum and maximum number of factors to be extracted. By using 3 3, we are saying that we only want a three-factor solution. We have done this to save space. We suggest that you use a reasonable range here, and each solution will be shown in the output. For example, if we had 2 4 at the end of the option, we would see the two-factor, three-factor and four-factor solution in the output.
- We have used the default geomin rotation. A number of other rotation methods, including the more traditional promax and varimax are available using the rotation option of the analysis command, for example: rotation=promax;.
- When all of the variables are continuous, as in this example, Mplus uses maximum likelihood (ML) as its method of deriving the factors by default. You may request other methods, such as unweighted least squares (ULS), using the estimator option. Note that not all methods are available for all types of variables.
- If you would like to get a scree plot, you can use the plot command
and indicate plot2. For example:
plot: type = plot2;
To see the graph, you need to click on "Graph" at the top of Mplus, and select "View Graphs". You then select "Eigenvalues for exploratory factor analysis" and click on "View" to see the screen plot.
Notes and summary information
The information below printed near the top of the output, it is useful because it lets you know what Mplus did. You want to check this part of the output to make sure Mplus ran the analysis that you intended.
*** WARNINGa Data set contains cases with missing on all variables. These cases were not included in the analysis. Number of cases with missing on all variables: 1 1 WARNING(S) FOUND IN THE INPUT INSTRUCTIONS SUMMARY OF ANALYSIS Number of groups 1 Number of observationsb 1427 Number of dependent variablec 12 Number of independent variables 0 Number of continuous latent variables 0 Observed dependent variablesd Continuous ITEM13 ITEM14 ITEM15 ITEM16 ITEM17 ITEM18 ITEM19 ITEM20 ITEM21 ITEM22 ITEM23 ITEM24 Estimatore ML Rotationf GEOMIN Row standardization CORRELATION Type of rotationg OBLIQUE Epsilon value Varies Information matrix OBSERVED Maximum number of iterations 1000 Convergence criterion 0.500D-04 Maximum number of steepest descent iterations 20 Maximum number of iterations for H1 2000 Convergence criterion for H1 0.100D-03 Optimization Specifications for the Exploratory Factor Analysis Rotation Algorithm Number of random starts 30 Maximum number of iterations 10000 Derivative convergence criterion 0.100D-04 Input data file(s) m255.dat Input data format FREE
a. Warning. In this case, the output includes a warning that 1 case had missing values on all of the variables in the analysis, and hence was excluded from the model.
b. Number of observations. The number of observations used in the analysis. As mentioned above, by default, Mplus will include all cases that have at least partial data on the variables in the analysis.
c. Number of dependent variables. Gives the number of dependent (outcome) variables in the model. Note that Mplus classifies the factor indicators as dependent variables.
d. Observed dependent variables. The list of variables included in this analysis. All of the variables in our model are listed under Continuous. If the model included categorical variables, they would be listed here under the heading Categorical. If this section includes variables you did not intend to include in your analysis, you may need to use the usevariables option of the data command.
e. Estimator. The method used to estimate the model, in this case, maximum likelihood (ML).
f. Rotation. The specific rotation method used in the model.
g. Type of rotation. Rotations that allow the factors to be correlated are oblique, while rotations that force the factors to be uncorrelated are known as orthogonal. The default geomin rotation is oblique.
SUMMARY OF DATA Number of missing data patternsh 24 COVARIANCE COVERAGE OF DATA Minimum covariance coverage value 0.100 PROPORTION OF DATA PRESENT Covariance Coveragei ITEM13 ITEM14 ITEM15 ITEM16 ITEM17 ________ ________ ________ ________ ________ ITEM13 0.994 ITEM14 0.994 0.998 ITEM15 0.993 0.996 0.998 ITEM16 0.991 0.994 0.994 0.995 ITEM17 0.992 0.995 0.995 0.994 0.997 ITEM18 0.992 0.996 0.996 0.994 0.996 ITEM19 0.989 0.993 0.994 0.992 0.994 ITEM20 0.974 0.977 0.977 0.975 0.976 ITEM21 0.992 0.994 0.994 0.992 0.994 ITEM22 0.986 0.989 0.989 0.987 0.989 ITEM23 0.992 0.995 0.995 0.992 0.994 ITEM24 0.989 0.992 0.992 0.989 0.991 Covariance Coverage ITEM18 ITEM19 ITEM20 ITEM21 ITEM22 ________ ________ ________ ________ ________ ITEM18 0.998 ITEM19 0.995 0.995 ITEM20 0.977 0.975 0.978 ITEM21 0.994 0.992 0.977 0.996 ITEM22 0.989 0.987 0.973 0.989 0.991 ITEM23 0.995 0.992 0.976 0.995 0.989 ITEM24 0.991 0.988 0.973 0.991 0.986 Covariance Coverage ITEM23 ITEM24 ________ ________ ITEM23 0.997 ITEM24 0.992 0.993
h. Number of missing data patterns. This gives the number of different patterns of missingness present in the variables included in the model. Large numbers of missing data patterns can result in difficulty estimating the model.
i. Covariance Coverage. If any of the variables in the model have missing values, Mplus provides information on the number and distribution of missing values. The covariance coverage matrix gives the proportion of values present for each variable individually (on the diagonal) and pairwise combinations of variables (below the diagonal). For example, 99.4% of cases have non-missing values for item13 and 99.3% of cases have valid values for item13 and item15.
RESULTS FOR EXPLORATORY FACTOR ANALYSIS EIGENVALUES FOR SAMPLE CORRELATION MATRIXj 1 2 3 4 5 ________ ________ ________ ________ ________ 1 6.289 1.228 0.709 0.606 0.561 EIGENVALUES FOR SAMPLE CORRELATION MATRIX 6 7 8 9 10 ________ ________ ________ ________ ________ 1 0.499 0.470 0.384 0.366 0.329 EIGENVALUES FOR SAMPLE CORRELATION MATRIX 11 12 ________ ________ 1 0.309 0.251 EXPLORATORY FACTOR ANALYSIS WITH 3 FACTOR(S): TESTS OF MODEL FIT Chi-Square Test of Model Fitk Value 137.865 Degrees of Freedom 33 P-Value 0.0000 Chi-Square Test of Model Fit for the Baseline Model Value 9132.568 Degrees of Freedom 66 P-Value 0.0000 CFI/TLIl CFI 0.988 TLI 0.977 Loglikelihood H0 Value -17776.793 H1 Value -17707.861 Information Criteriam Number of Free Parameters 57 Akaike (AIC) 35667.586 Bayesian (BIC) 35967.596 Sample-Size Adjusted BIC 35786.527 (n* = (n + 2) / 24) RMSEA (Root Mean Square Error Of Approximation)n Estimate 0.047 90 Percent C.I. 0.039 0.055 Probability RMSEA <= .05 0.701 SRMR (Standardized Root Mean Square Residual) Value 0.014 MINIMUM ROTATION FUNCTION VALUE 0.31440
j. Eigenvalues for sample correlation matrix. An eigenvalue is the variance of the factor. In the initial factor solution, the first factor will account for the most variance, the second will account for the next highest amount of variance, and so on.
k. Chi-square test of model fit. Compares the fit of the model to a model with no restrictions (i.e. all variables correlated freely). Chi-square values can be used to test the difference in fit between nested models.
l. Fit indices. The Comparative Fit Index (CFI) and the Tucker Lewis Index (TLI) are measures of model fit. They have a range from 0 to 1 with higher values indicating better fit.
m. Information Criteria. The Akaike information criterion (AIC) and the Bayesian information criterion (BIC, sometimes also called the Schwarz criterion), can also be used to compare models, including non-nested models.
n. RMSEA. The root mean square error of approximation is another measure of model fit. Smaller values indicate better model fit.
GEOMIN ROTATED LOADINGSo 1 2 3 ________ ________ ________ ITEM13 0.858 -0.087 0.010 ITEM14 0.832 -0.022 -0.021 ITEM15 0.724 0.085 0.007 ITEM16 0.645 0.129 -0.069 ITEM17 0.515 0.276 0.084 ITEM18 0.091 0.755 0.012 ITEM19 -0.015 0.842 -0.095 ITEM20 0.099 0.559 -0.011 ITEM21 0.221 0.408 0.199 ITEM22 0.000 0.508 0.205 ITEM23 0.123 0.011 0.795 ITEM24 -0.009 -0.008 0.804 GEOMIN FACTOR CORRELATIONSp 1 2 3 ________ ________ ________ 1 1.000 2 0.591 1.000 3 0.743 0.704 1.000
o. Geomin rotated loadings. The rotated loadings are the linear combination of variables that make up the factor. In addition to the factor loadings, to completely interpret an oblique rotation one needs to take into account both the factor pattern and the factor structure matrices (shown bellow) and the correlations among the factors. Note that orthogonal rotations produce only a single matrix, which gives the correlations between the variable and the factor.
p. Geomin factor correlations. The factor correlations matrix gives the correlations between the factors. For example, the correlation between factor 1 and factor 2 is 0.591.
ESTIMATED RESIDUAL VARIANCESq ITEM13 ITEM14 ITEM15 ITEM16 ITEM17 ________ ________ ________ ________ ________ 1 0.332 0.354 0.388 0.543 0.388 ESTIMATED RESIDUAL VARIANCES ITEM18 ITEM19 ITEM20 ITEM21 ITEM22 ________ ________ ________ ________ ________ 1 0.325 0.408 0.623 0.459 0.555 ESTIMATED RESIDUAL VARIANCES ITEM23 ITEM24 ________ ________ 1 0.194 0.373
q. Estimated residual variances. These are the variances of the observed variables after accounting for all of the variance in the efa model.
Below are the standard errors for the geomin rotated loadings, factor correlations, and estimated residual variances. These values can be used to perform hypothesis tests and estimate confidence intervals.
S.E. GEOMIN ROTATED LOADINGS 1 2 3 ________ ________ ________ ITEM13 0.043 0.051 0.031 ITEM14 0.030 0.035 0.040 ITEM15 0.040 0.053 0.039 ITEM16 0.073 0.060 0.099 ITEM17 0.044 0.052 0.061 ITEM18 0.043 0.033 0.029 ITEM19 0.027 0.044 0.059 ITEM20 0.045 0.039 0.040 ITEM21 0.045 0.044 0.051 ITEM22 0.022 0.048 0.057 ITEM23 0.152 0.034 0.180 ITEM24 0.024 0.090 0.104 S.E. GEOMIN FACTOR CORRELATIONS 1 2 3 ________ ________ ________ 1 0.000 2 0.042 0.000 3 0.054 0.030 0.000 S.E. ESTIMATED RESIDUAL VARIANCES ITEM13 ITEM14 ITEM15 ITEM16 ITEM17 ________ ________ ________ ________ ________ 1 0.021 0.021 0.020 0.025 0.018 S.E. ESTIMATED RESIDUAL VARIANCES ITEM18 ITEM19 ITEM20 ITEM21 ITEM22 ________ ________ ________ ________ ________ 1 0.021 0.026 0.024 0.020 0.023 S.E. ESTIMATED RESIDUAL VARIANCES ITEM23 ITEM24 ________ ________ 1 0.074 0.076
Below are the z-statistics (i.e. estimate/standard error) for the geomin rotated loadings, factor correlations, and estimated residual variances. These values can be compared to a normal distribution to perform hypothesis tests.
Est./S.E. GEOMIN ROTATED LOADINGS 1 2 3 ________ ________ ________ ITEM13 20.050 -1.681 0.317 ITEM14 27.757 -0.623 -0.516 ITEM15 18.307 1.618 0.165 ITEM16 8.793 2.157 -0.700 ITEM17 11.680 5.280 1.377 ITEM18 2.136 23.045 0.416 ITEM19 -0.564 19.088 -1.602 ITEM20 2.190 14.343 -0.280 ITEM21 4.966 9.225 3.884 ITEM22 -0.017 10.650 3.567 ITEM23 0.812 0.314 4.426 ITEM24 -0.357 -0.088 7.694 Est./S.E. GEOMIN FACTOR CORRELATIONS 1 2 3 ________ ________ ________ 1 0.000 2 14.212 0.000 3 13.724 23.775 0.000 Est./S.E. ESTIMATED RESIDUAL VARIANCES ITEM13 ITEM14 ITEM15 ITEM16 ITEM17 ________ ________ ________ ________ ________ 1 16.029 16.792 19.200 21.564 21.324 Est./S.E. ESTIMATED RESIDUAL VARIANCES ITEM18 ITEM19 ITEM20 ITEM21 ITEM22 ________ ________ ________ ________ ________ 1 15.574 15.812 26.444 23.392 24.618 Est./S.E. ESTIMATED RESIDUAL VARIANCES ITEM23 ITEM24 ________ ________ 1 2.610 4.894
FACTOR STRUCTUREr 1 2 3 ________ ________ ________ ITEM13 0.815 0.428 0.587 ITEM14 0.803 0.456 0.582 ITEM15 0.779 0.518 0.605 ITEM16 0.670 0.461 0.501 ITEM17 0.740 0.639 0.660 ITEM18 0.546 0.818 0.611 ITEM19 0.412 0.766 0.486 ITEM20 0.421 0.610 0.456 ITEM21 0.610 0.678 0.650 ITEM22 0.452 0.651 0.562 ITEM23 0.720 0.643 0.894 ITEM24 0.584 0.553 0.792 FACTOR DETERMINACIESs 1 2 3 ________ ________ ________ 1 0.945 0.926 0.935
r. Factor Structure. With an oblique rotation, the factor structure matrix presents the correlations between the variables and the factors. For example, the correlation between item13 and factor 1 is 0.815. As noted above, the factor structure matrix is used along with the factor loadings and factor correlations to interpret the model.
s. Factor Determinacies are the correlations between the estimated factor score and the factor.