Principal Components (PCA) and Exploratory Factor Analysis (EFA) with SPSS

Overview

This seminar will give a practical overview of both principal components analysis (PCA) and exploratory factor analysis (EFA) using SPSS. We will begin with variance partitioning and explain how it determines the use of a PCA or EFA model. For the PCA portion of the seminar, we will introduce topics such as eigenvalues and eigenvectors, communalities, sum of squared loadings, total variance explained, and choosing the number of components to extract. For the EFA portion, we will discuss factor extraction, estimation methods, factor rotation, and generating factor scores for subsequent analyses. The seminar will focus on how to run a PCA and EFA in SPSS and thoroughly interpret output, using the hypothetical SPSS Anxiety Questionnaire as a motivating example.

Download links

SPSS Dataset: SAQ-8.sav
Powerpoint Slides: Slides for EFA and PCA in SPSS
SPSS Syntax: SPSS Syntax File for EFA and PCA Seminar

Outline

Introduction
1. Motivating example: The SAQ
2. Pearson correlation formula
3. Partitioning the variance in factor analysis
Extracting factors
1. Principal components analysis
  - Running a PCA with 8 components in SPSS
  - Running a PCA with 2 components in SPSS
2. Common factor analysis
  - Principal axis factoring (2-factor PAF)
  - Maximum likelihood (2-factor ML)
Rotation methods
1. Simple Structure
2. Orthogonal rotation (Varimax)
3. Oblique (Direct Oblimin)
Generating factor scores

Introduction

Suppose you are conducting a survey and you want to know whether the items in the survey have similar patterns of responses, do these items “hang together” to create a construct? The basic assumption of factor analysis is that for a collection of observed variables there are a set of underlying or latent variables called factors (smaller than the number of observed variables), that can explain the interrelationships among those variables. Let’s say you conduct a survey and collect responses about people’s anxiety about using SPSS. Do all these items actually measure what we call “SPSS Anxiety”?

Correlations
	Statistics makes me cry	My friends will think I’m stupid for not being able to cope with SPSS	Standard deviations excite me	I dream that Pearson is attacking me with correlation coefficients	I don’t understand statistics	I have little experience with computers	All computers hate me	I have never been good at mathematics
Statistics makes me cry	1
My friends will think I’m stupid for not being able to cope with SPSS	-.099	1
Standard deviations excite me	-.337	.318	1
I dream that Pearson is attacking me with correlation coefficients	.436	-.112	-.380	1
I don’t understand statistics	.402	-.119	-.310	.401	1
I have little experience with computers	.217	-.074	-.227	.278	.257	1
All computers hate me	.305	-.159	-.382	.409	.339	.514	1
I have never been good at mathematics	.331	-.050	-.259	.349	.269	.223	.297	1

Component Matrix^aComponent Matrix, table, 2 levels of column headers and 1 levels of row headers, table with 9 columns and 13 rows
	Component
	1	2	3	4	5	6	7	8
Statistics makes me cry	.659	.136	-.398	.160	-.064	.568	-.177	.068
My friends will think I’m stupid for not being able to cope with SPSS	-.300	.866	-.025	.092	-.290	-.170	-.193	-.001
Standard deviations excite me	-.653	.409	.081	.064	.410	.254	.378	.142
I dream that Pearson is attacking me with correlation coefficients	.720	.119	-.192	.064	-.288	-.089	.563	-.137
I don’t understand statistics	.650	.096	-.215	.460	.443	-.326	-.092	-.010
I have little experience of computers	.572	.185	.675	.031	.107	.176	-.058	-.369
All computers hate me	.718	.044	.453	-.006	-.090	-.051	.025	.516
I have never been good at mathematics	.568	.267	-.221	-.694	.258	-.084	-.043	-.012
Extraction Method: Principal Component Analysis.
a. 8 components extracted.

Total Variance ExplainedTotal Variance Explained, table, 2 levels of column headers and 1 levels of row headers, table with 7 columns and 12 rows
Component	Initial Eigenvalues			Extraction Sums of Squared Loadings
Component	Total	% of Variance	Cumulative %	Total	% of Variance	Cumulative %
1	3.057	38.206	38.206	3.057	38.206	38.206
2	1.067	13.336	51.543	1.067	13.336	51.543
3	.958	11.980	63.523	.958	11.980	63.523
4	.736	9.205	72.728	.736	9.205	72.728
5	.622	7.770	80.498	.622	7.770	80.498
6	.571	7.135	87.632	.571	7.135	87.632
7	.543	6.788	94.420	.543	6.788	94.420
8	.446	5.580	100.000	.446	5.580	100.000
Extraction Method: Principal Component Analysis.

Total Variance ExplainedTotal Variance Explained, table, 2 levels of column headers and 1 levels of row headers, table with 7 columns and 12 rows
Component	Initial Eigenvalues			Extraction Sums of Squared Loadings
Component	Total	% of Variance	Cumulative %	Total	% of Variance	Cumulative %
1	3.057	38.206	38.206	3.057	38.206	38.206
2	1.067	13.336	51.543	1.067	13.336	51.543
3	.958	11.980	63.523
4	.736	9.205	72.728
5	.622	7.770	80.498
6	.571	7.135	87.632
7	.543	6.788	94.420
8	.446	5.580	100.000
Extraction Method: Principal Component Analysis.

Overview

Download links

Outline

Introduction

Motivating Example: The SAQ (SPSS Anxiety Questionnaire)

Pearson Correlation of the SAQ-8

Partitioning the variance in factor analysis

Performing Factor Analysis

Extracting Factors

Principal Components Analysis

Running a PCA with 8 components in SPSS

Eigenvalues and Eigenvectors

Component Matrix of the 8-component PCA

Choosing the number of components to extract

Running a PCA with 2 components in SPSS

Quick check:

Communalities of the 2-component PCA

Quiz

Common Factor Analysis

Running a Common Factor Analysis with 2 factors in SPSS

Communalities of the 2-factor PAF

Total Variance Explained (2-factor PAF)

Quick Quiz

Factor Matrix (2-factor PAF)

Practical Interpretation

The relationship between the three tables

Quiz

Maximum Likelihood Estimation (2-factor ML)

Quiz

Comparing Common Factor Analysis versus Principal Components

Quiz

Rotation Methods

Simple structure

Quiz

Orthogonal Rotation (2 factor PAF)

Running a two-factor solution (PAF) with Varimax rotation in SPSS

Rotated Factor Matrix (2-factor PAF Varimax)

Interpreting the factor loadings (2-factor PAF Varimax)

Factor Transformation Matrix and Factor Loading Plot (2-factor PAF Varimax)

Total Variance Explained (2-factor PAF Varimax)

Other Orthogonal Rotations

Oblique Rotation

Running a two-factor solution (PAF) with Direct Quartimin rotation in SPSS

Quiz

Factor Pattern Matrix (2-factor PAF Direct Quartimin)

Factor Structure Matrix (2-factor PAF Direct Quartimin)

Factor Correlation Matrix (2-factor PAF Direct Quartimin)

Factor plot

Relationship between the Pattern and Structure Matrix

Total Variance Explained (2-factor PAF Direct Quartimin)

Interpreting the factor loadings (2-factor PAF Direct Quartimin)

Quiz

Evaluating Simple Structure

Promax Rotation

Quiz

Generating Factor Scores

Generating factor scores using the Regression Method in SPSS

Regression, Bartlett and Anderson-Rubin compared

Quiz