This page shows an example of a canonical correlation analysis in SAS with footnotes explaining the output. A researcher has collected data on three psychological variables, four academic variables (standardized test scores) and gender for 600 college freshman. She is interested in how the set of psychological variables relates to the academic variables and gender. In particular, the researcher is interested in how many dimensions are necessary to understand the association between the two sets of variables.
We have a data file, https://stats.idre.ucla.edu/wp-content/uploads/2016/02/mmr.sas7bdat, with 600 observations on eight variables. The psychological variables are locus of control, self-concept and motivation. The academic variables are standardized test scores in reading, writing, math and science. Additionally, the variable female is a zero-one indicator variable with the one indicating a female student. The researcher is interested in the relationship between the psychological variables and the academic variables, with gender considered as well. Canonical correlation analysis aims to find pairs of linear combinations of each group of variables that are highly correlated. These linear combinations are called canonical variates. Each canonical variate is orthogonal to the other canonical variates except for the one with which its correlation has been maximized. The possible number of such pairs is limited to the number of variables in the smallest group. In our example, there are three psychological variables and more than three academic variables. Thus, a canonical correlation analysis on these sets of variables will generate three pairs of canonical variates.
To begin, let’s read in and explore the dataset.
proc means data = mmr; run;
The SAS System The MEANS Procedure Variable Label N Mean Std Dev Minimum Maximum ID 600 300.5000000 173.3493582 1.0000000 600.0000000 LOCUS_OF_CONTROL locus of control 600 0.0965333 0.6702799 -2.2300000 1.3600000 SELF_CONCEPT self-concept 600 0.0049167 0.7055125 -2.6199999 1.1900001 MOTIVATION motivation 600 0.6608333 0.3427294 0 1.0000000 READ reading score 600 51.9018334 10.1029830 28.2999992 76.0000000 WRITE writing score 600 52.3848333 9.7264550 25.5000000 67.0999985 MATH math score 600 51.8490000 9.4147363 31.7999992 75.5000000 SCIENCE science score 600 51.7633332 9.7061789 26.0000000 74.1999969 FEMALE 600 0.5450000 0.4983864 0 1.0000000
To run our canonical correlation, we will use the cancorr procedure in SAS. We list the set of variables in our first group in the var statement and the set of variables in our second group in the with statement. We include the optional commands vprefix, wprefix, vname and wname in the proc cancor statement to give understandable prefixes to our sets of variables and make the output easier to interpret.
proc cancorr data=mmr vprefix=Psych vname='Psychological Measurements' wprefix=Academic wname='Academic Measurements'; var locus_of_control self_concept motivation; with read write math science female; run;
...[additional output omitted]...
Correlations Among the Original Variables
Correlations Among the Psychological Measurements
LOCUS_OF_
CONTROL SELF_CONCEPT MOTIVATION
LOCUS_OF_CONTROL 1.0000 0.1712 0.2451
SELF_CONCEPT 0.1712 1.0000 0.2886
MOTIVATION 0.2451 0.2886 1.0000
Correlations Among the Academic Measurements
READ WRITE MATH SCIENCE FEMALE
READ 1.0000 0.6286 0.6793 0.6907 -0.0417
WRITE 0.6286 1.0000 0.6327 0.5691 0.2443
MATH 0.6793 0.6327 1.0000 0.6495 -0.0482
SCIENCE 0.6907 0.5691 0.6495 1.0000 -0.1382
FEMALE -0.0417 0.2443 -0.0482 -0.1382 1.0000
Correlations Between the Psychological Measurements
and the Academic Measurements
READ WRITE MATH
LOCUS_OF_CONTROL 0.3736 0.3589 0.3373
SELF_CONCEPT 0.0607 0.0194 0.0536
MOTIVATION 0.2106 0.2542 0.1950
Correlations Between the Psychological Measurements
and the Academic Measurements
SCIENCE FEMALE
LOCUS_OF_CONTROL 0.3246 0.1134
SELF_CONCEPT 0.0698 -0.1260
MOTIVATION 0.1157 0.0981
--------------------------------------------------------------------------------
Canonical Correlation Analysis
Adjusted Approximate Squared
Canonical Canonical Standard Canonical
Correlation Correlation Error Correlation
1 0.464086 0.455474 0.032059 0.215376
2 0.167509 . 0.039712 0.028059
3 0.103991 . 0.040417 0.010814
Eigenvalues of Inv(E)*H
= CanRsq/(1-CanRsq)
Eigenvalue Difference Proportion Cumulative
1 0.2745 0.2456 0.8734 0.8734
2 0.0289 0.0179 0.0919 0.9652
3 0.0109 0.0348 1.0000
Test of H0: The canonical correlations in the
current row and all that follow are zero
Likelihood Approximate
Ratio F Value Num DF Den DF Pr > F
1 0.75436113 11.72 15 1634.7 <.0001
2 0.96142996 2.94 8 1186 0.0029
3 0.98918584 2.16 3 594 0.0911
Multivariate Statistics and F Approximations
S=3 M=0.5 N=295
Statistic Value F Value Num DF Den DF Pr > F
Wilks' Lambda 0.75436113 11.72 15 1634.7 <.0001
Pillai's Trace 0.25424936 11.00 15 1782 <.0001
Hotelling-Lawley Trace 0.31429738 12.38 15 1113 <.0001
Roy's Greatest Root 0.27449563 32.61 5 594 <.0001
NOTE: F Statistic for Roy's Greatest Root is an upper bound.
--------------------------------------------------------------------------------
Canonical Correlation Analysis
Raw Canonical Coefficients for the Psychological Measurements
Psych1 Psych2 Psych3
LOCUS_OF_CONTROL locus of control 1.2538339076 0.6214775237 -0.661689607
SELF_CONCEPT self-concept -0.35134993 1.1876866562 0.8267209411
MOTIVATION motivation 1.2624203286 -2.027264053 2.0002284379
Raw Canonical Coefficients for the Academic Measurements
Academic1 Academic2 Academic3
READ reading score 0.0446205959 0.0049100176 0.0213805581
WRITE writing score 0.0358771125 -0.042071471 0.0913073288
MATH math score 0.0234171847 -0.004229472 0.0093982096
SCIENCE science score 0.0050251567 0.0851621751 -0.109835018
FEMALE 0.6321192387 -1.084642482 -1.794646917
--------------------------------------------------------------------------------
Canonical Correlation Analysis
Standardized Canonical Coefficients for the Psychological Measurements
Psych1 Psych2 Psych3
LOCUS_OF_CONTROL locus of control 0.8404 0.4166 -0.4435
SELF_CONCEPT self-concept -0.2479 0.8379 0.5833
MOTIVATION motivation 0.4327 -0.6948 0.6855
Standardized Canonical Coefficients for the Academic Measurements
Academic1 Academic2 Academic3
READ reading score 0.4508 0.0496 0.2160
WRITE writing score 0.3490 -0.4092 0.8881
MATH math score 0.2205 -0.0398 0.0885
SCIENCE science score 0.0488 0.8266 -1.0661
FEMALE 0.3150 -0.5406 -0.8944
--------------------------------------------------------------------------------
Canonical Structure
Correlations Between the Psychological Measurements
and Their Canonical Variables
Psych1 Psych2 Psych3
LOCUS_OF_CONTROL locus of control 0.9040 0.3897 -0.1756
SELF_CONCEPT self-concept 0.0208 0.7087 0.7052
MOTIVATION motivation 0.5672 -0.3509 0.7451
Correlations Between the Academic Measurements and Their Canonical Variables
Academic1 Academic2 Academic3
READ reading score 0.8404 0.3588 0.1354
WRITE writing score 0.8765 -0.0648 0.2546
MATH math score 0.7639 0.2979 0.1478
SCIENCE science score 0.6584 0.6768 -0.2304
FEMALE 0.3641 -0.7549 -0.5434
Correlations Between the Psychological Measurements and
the Canonical Variables of the Academic Measurements
Academic1 Academic2 Academic3
LOCUS_OF_CONTROL locus of control 0.4196 0.0653 -0.0183
SELF_CONCEPT self-concept 0.0097 0.1187 0.0733
MOTIVATION motivation 0.2632 -0.0588 0.0775
Correlations Between the Academic Measurements and the
Canonical Variables of the Psychological Measurements
Psych1 Psych2 Psych3
READ reading score 0.3900 0.0601 0.0141
WRITE writing score 0.4068 -0.0109 0.0265
MATH math score 0.3545 0.0499 0.0154
SCIENCE science score 0.3056 0.1134 -0.0240
FEMALE 0.1690 -0.1265 -0.0565
--------------------------------------------------------------------------------
Canonical Redundancy Analysis
Raw Variance of the Psychological Measurements Explained by
Their Own The Opposite
Canonical Variables Canonical Variables
Canonical
Variable Cumulative Canonical Cumulative
Number Proportion Proportion R-Square Proportion Proportion
1 0.3806 0.3806 0.2154 0.0820 0.0820
2 0.3126 0.6932 0.0281 0.0088 0.0908
3 0.3068 1.0000 0.0108 0.0033 0.0941
Raw Variance of the Academic Measurements Explained by
Their Own The Opposite
Canonical Variables Canonical Variables
Canonical
Variable Cumulative Canonical Cumulative
Number Proportion Proportion R-Square Proportion Proportion
1 0.6251 0.6251 0.2154 0.1346 0.1346
2 0.1704 0.7955 0.0281 0.0048 0.1394
3 0.0395 0.8350 0.0108 0.0004 0.1398
--------------------------------------------------------------------------------
Canonical Redundancy Analysis
Standardized Variance of the Psychological Measurements Explained by
Their Own The Opposite
Canonical Variables Canonical Variables
Canonical
Variable Cumulative Canonical Cumulative
Number Proportion Proportion R-Square Proportion Proportion
1 0.3798 0.3798 0.2154 0.0818 0.0818
2 0.2591 0.6389 0.0281 0.0073 0.0891
3 0.3611 1.0000 0.0108 0.0039 0.0930
Standardized Variance of the Academic Measurements Explained by
Their Own The Opposite
Canonical Variables Canonical Variables
Canonical
Variable Cumulative Canonical Cumulative
Number Proportion Proportion R-Square Proportion Proportion
1 0.5249 0.5249 0.2154 0.1130 0.1130
2 0.2499 0.7748 0.0281 0.0070 0.1201
3 0.0907 0.8655 0.0108 0.0010 0.1210
--------------------------------------------------------------------------------
Canonical Redundancy Analysis
Squared Multiple Correlations Between the Psychological Measurements
and the First M Canonical Variables of the Academic Measurements
M 1 2 3
LOCUS_OF_CONTROL locus of control 0.1760 0.1803 0.1806
SELF_CONCEPT self-concept 0.0001 0.0142 0.0196
MOTIVATION motivation 0.0693 0.0727 0.0787
Squared Multiple Correlations Between the Academic Measurements and
the First M Canonical Variables of the Psychological Measurements
M 1 2 3
READ reading score 0.1521 0.1557 0.1559
WRITE writing score 0.1655 0.1656 0.1663
MATH math score 0.1257 0.1282 0.1284
SCIENCE science score 0.0934 0.1062 0.1068
FEMALE 0.0286 0.0445 0.0477
Correlations
Among the Original Variables
The SAS System
The CANCORR Procedure
Correlations Among the Original Variables
Correlations Among the Psychological Measurementsa
LOCUS_OF_
CONTROL SELF_CONCEPT MOTIVATION
LOCUS_OF_CONTROL 1.0000 0.1712 0.2451
SELF_CONCEPT 0.1712 1.0000 0.2886
MOTIVATION 0.2451 0.2886 1.0000
Correlations Among the Academic Measurementsb
READ WRITE MATH SCIENCE FEMALE
READ 1.0000 0.6286 0.6793 0.6907 -0.0417
WRITE 0.6286 1.0000 0.6327 0.5691 0.2443
MATH 0.6793 0.6327 1.0000 0.6495 -0.0482
SCIENCE 0.6907 0.5691 0.6495 1.0000 -0.1382
FEMALE -0.0417 0.2443 -0.0482 -0.1382 1.0000
Correlations Between the Psychological Measurements
and the Academic Measurementsc
READ WRITE MATH
LOCUS_OF_CONTROL 0.3736 0.3589 0.3373
SELF_CONCEPT 0.0607 0.0194 0.0536
MOTIVATION 0.2106 0.2542 0.1950
Correlations Between the Psychological Measurements
and the Academic Measurements
SCIENCE FEMALE
LOCUS_OF_CONTROL 0.3246 0.1134
SELF_CONCEPT 0.0698 -0.1260
MOTIVATION 0.1157 0.0981
a. Correlations Among the Psychological Measurements – This is the Pearson correlation matrix for the three psychological variables. This gives us a sense of the relationships between the variables within this group. Because there are three variables in this group, the correlation matrix is 3×3. The psychological variables are not highly correlated. This suggests that knowing the values in one of the psychological variables does not provide much information about the other psychological variables. These relationships between the variables will effect the way in which the group is summarized as a linear combination of these variables.
b. Correlations Among the Academic Measurements – This is the Pearson correlation matrix for the four academic variables and female. This gives us a sense of the relationships between the variables within this group. Because there are three variables in this group, the correlation matrix is 5×5. We can see that the four standardized test variables (read, write, math, and science) are much more highly correlated than the psychological variables.
c. Correlations Between the Psychological Measurements and the Academic Measurements – This matrix presents the psychological variables in rows and the academic variables in columns. The correlations in the matrix are between all combinations of variables in different groups. Because we have 3 variables in one group and 5 in the other, a total of 15 such correlations exist. In this table, we can see that all of the correlations are less than 0.4.
Canonical Correlations
Adjusted Approximate Squared
Canonical Canonical Standard Canonical
Correlationd Correlatione Errorf Correlationg
1 0.464086 0.455474 0.032059 0.215376
2 0.167509 . 0.039712 0.028059
3 0.103991 . 0.040417 0.010814
Eigenvalues of Inv(E)*H
= CanRsq/(1-CanRsq)
Eigenvalueh Differencei Proportionj Cumulativek
1 0.2745 0.2456 0.8734 0.8734
2 0.0289 0.0179 0.0919 0.9652
3 0.0109 0.0348 1.0000
Test of H0: The canonical correlations in the
current row and all that follow are zero
Likelihood Approximate
Ratiol F Valuem Num DF Den DFn Pr > Fo
1 0.75436113 11.72 15 1634.7 <.0001
2 0.96142996 2.94 8 1186 0.0029
3 0.98918584 2.16 3 594 0.0911
Multivariate Statistics and F Approximations
S=3 M=0.5 N=295
Statistic Value F Valuem Num DF Den DFn Pr > Fo
Wilks' Lambdap 0.75436113 11.72 15 1634.7 <.0001
Pillai's Traceq 0.25424936 11.00 15 1782 <.0001
Hotelling-Lawley Tracer 0.31429738 12.38 15 1113 <.0001
Roy's Greatest Roots 0.27449563 32.61 5 594 <.0001
NOTE: F Statistic for Roy's Greatest Root is an upper bound.
d. Canonical Correlation – These are the Pearson correlations of the pairs of canonical variates. The first pair of variates, a linear combination of the psychological measurements and a linear combination of the academic measurements, has a correlation coefficient of 0.464086. The second pair has a correlation coefficient of 0.167509, and the third pair 0.103991.
e. Adjusted Canonical Correlation – These are adjusted canonical correlations which are less biased than the raw correlations. These adjusted values may be negative. If an adjusted canonical correlation is close to zero or if it is greater than the previous adjusted canonical correlation, then it is reported as missing.
f. Approximate Standard Error – These are the approximate standard errors for the canonical correlations.
g. Squared Canonical Correlation – These are the squares of the canonical correlations. For example, (0.464086*0.464086) = 0.215376. These values can be interpreted similarly to R-squared values in OLS regression: they are the proportion of the variance in the canonical variate of one set of variables explained by the canonical variate of the other set of variables.
h. Eigenvalue – These are the eigenvalues of the product of the model matrix and the inverse of the error matrix. These eigenvalues can also be calculated using the squared canonical correlations. The largest eigenvalue is equal to largest squared correlation /(1- largest squared correlation). So 0.215376/(1-0.215376) = 0.2745. These calculations can be completed for each correlation to find the corresponding eigenvalue. The magnitudes of the eigenvalues are related to the tests of the correlations. The larger the eigenvalues are associated with lower p-values. If we think about the relationship between the canonical correlations and the eigenvalues, it makes sense that the larger correlations are more likely to be significantly different from zero.
i. Difference – This is the difference between the given eigenvalue and the next-largest eigenvalue: 0.2745-0.0289 = 0.2456 and 0.0289-0.0109 = 0.0179 (with rounding).
j. Proportion – This is the proportion of the sum of the eigenvalues represented by a given eigenvalue. The sum of the three eigenvalues is (0.2745+0.0289+0.0109) = 0.3143. Then, the proportions can be calculated: 0.2745/0.3143 = 0.8734, 0.0289/0.3143 = 0.0919, and 0.0109/0.3143 = 0.0348.
k. Cumulative – This is the cumulative sum of the proportions.
l. Likelihood Ratio – This is the likelihood ratio for testing the hypothesis that the given canonical correlation and all smaller ones are equal to zero in the population. It is equivalent to Wilks’ lambda (see superscript p) and can be calculated as the product of the values of (1-canonical correlation2). In this example, our canonical correlations are 0.4641, 0.1675, and 0.1040. Hence the likelihood ratio for testing that all three of the correlations are zero is (1- 0.46412)*(1-0.16752)*(1-0.10402) = 0.754361. To test that the two smaller canonical correlations, 0.1675 and 0.1040, are zero in the population, the likelihood is (1-0.16752)*(1-0.10402) = 0.96143. The likelihood that the smallest canonical correlation is zero is (1-0.10402) = 0.989186.
m. (Approximate) F Value – These are the F values associated with the various tests (likelihood ratio or one of the four multivariate tests) that are included in SAS’s cancorr procedure. For the likelihood ratio tests, the F values are approximate. For Roy’s Greatest Root, the F value is an upper bound. For the likelihood tests, the F values are testing the hypotheses that the given canonical correlation and all smaller ones are equal to zero in the population. For the multivariate tests, the F values are testing the hypothesis that all three canonical correlations are equal to zero in the population.
n. Num DF, Den DF – These are the degrees of freedom used in determining the F values. Note that there are instances in which the degrees of freedom may be a non-integer (here, the Den DF associated with Wilks’ lambda is a non-integer) because these degrees of freedom are calculated using the mean squared errors, which are often non-integers.
o. Pr > F – This is the p-value associated with the F value of a given test statistic. The null hypothesis that our two sets of variables are not linearly related is evaluated with regard to this p-value. The null hypothesis is rejected if the p-value is less than our specified alpha level (often 0.05). If not, then we fail to reject the null hypothesis. In this example, we reject the null hypothesis that all three canonical correlations are equal to zero at alpha level 0.05 because the p-values for all tests of this hypothesis are less than 0.05 (Wilks’ Lambda, Pillai’s Trace, Hotelling-Lawley Trace, Roy’s Greatest Root and the first Likelihood Ratio). The p-value associated with the likelihood ratio test of the second and third canonical correlations suggest that they we can also reject the hypothesis that both the second and third canonical correlations are zero, but the p-value associated with the likelihood ratio test of the third canonical correlation alone is 0.0911. Because this is greater than 0.05, we fail to reject the hypothesis that the third canonical correlation is zero.
p. Wilks’ Lambda – This is one of the four multivariate statistics calculated by SAS to test the null hypothesis that the canonical correlations are zero (which, in turn, means that there is no linear relationship between the two specified groups of variables). Wilks’ lambda is the product of the values of (1-canonical correlation2). In this example, our canonical correlations are 0.4641, 0.1675, and 0.1040 so the Wilks’ Lambda testing all three of the correlations is (1- 0.46412)*(1-0.16752)*(1-0.10402) = 0.75436113. This test statistic is equal to the likelihood ratio (see superscript l).
q. Pillai’s Trace – Pillai’s trace is another of the four multivariate statistics calculated by SAS. Pillai’s trace is the sum of the squared canonical correlations: 0.46412 + 0.16752 + 0.10402 = 0.25424936.
r. Hotelling-Lawley Trace – This is very similar to Pillai’s trace. It is the sum of the values of (canonical correlation2/(1-canonical correlation2)). We can calculate 0.46412 /(1- 0.46412) + 0.16752/(1-0.16752) + 0.10402/(1-0.10402) = 0.31429738.
s. Roy’s Greatest Root – This is the largest eigenvalue. Because it is based on a maximum, it can behave differently from the other three test statistics. In instances where the other three are not significant and Roy’s is significant, the effect should be considered not significant.
Canonical Coefficients
Raw Canonical Coefficients for the Psychological Measurementst
Psych1 Psych2 Psych3
LOCUS_OF_CONTROL locus of control 1.2538339076 0.6214775237 -0.661689607
SELF_CONCEPT self-concept -0.35134993 1.1876866562 0.8267209411
MOTIVATION motivation 1.2624203286 -2.027264053 2.0002284379
Raw Canonical Coefficients for the Academic Measurementst
Academic1 Academic2 Academic3
READ reading score 0.0446205959 0.0049100176 0.0213805581
WRITE writing score 0.0358771125 -0.042071471 0.0913073288
MATH math score 0.0234171847 -0.004229472 0.0093982096
SCIENCE science score 0.0050251567 0.0851621751 -0.109835018
FEMALE 0.6321192387 -1.084642482 -1.794646917
Standardized Canonical Coefficients for the Psychological Measurementsu
Psych1 Psych2 Psych3
LOCUS_OF_CONTROL locus of control 0.8404 0.4166 -0.4435
SELF_CONCEPT self-concept -0.2479 0.8379 0.5833
MOTIVATION motivation 0.4327 -0.6948 0.6855
Standardized Canonical Coefficients for the Academic Measurementsu
Academic1 Academic2 Academic3
READ reading score 0.4508 0.0496 0.2160
WRITE writing score 0.3490 -0.4092 0.8881
MATH math score 0.2205 -0.0398 0.0885
SCIENCE science score 0.0488 0.8266 -1.0661
FEMALE 0.3150 -0.5406 -0.8944
t. Raw Canonical Coefficients for the Psychological/Academic Measurements – These are the raw canonical coefficients. They define the linear relationship between the variables in a given group and the canonical variates. They can be interpreted in the same way you would interpret regression coefficients, assuming the canonical variate as the outcome variable. For example, a one unit increase in locus_of_control leads to a 1.253834 unit increase in the first variate of the psychological measurements ("Psych1"), and a one unit increase in read score leads to a 0.0446206 unit increase in the first variate of the academic measurements ("Academic1").
u. Standardized Canonical Coefficients for the Psychological/Academic Measurements – These are the standardized canonical coefficients. This means that, if all of the variables in the analysis are rescaled to have a mean of zero and a standard deviation of 1, the coefficients generating the canonical variates would indicate how a one standard deviation increase in the variable would change the variate. For example, an increase of one standard deviation in locus_of_control would lead to a 0.8404 unit increase in the first variate of the psychological measurements ("Psych1"), and an increase of one standard deviation in read would lead to a 0.4508 unit increase in the first variate of the academic measurements ("Academic1").
Correlations Among Original Variables and Canonical Variates
Correlations Between the Psychological Measurements and Their Canonical Variablesv
Psych1 Psych2 Psych3
LOCUS_OF_CONTROL locus of control 0.9040 0.3897 -0.1756
SELF_CONCEPT self-concept 0.0208 0.7087 0.7052
MOTIVATION motivation 0.5672 -0.3509 0.7451
Correlations Between the Academic Measurements and Their Canonical Variablesv
Academic1 Academic2 Academic3
READ reading score 0.8404 0.3588 0.1354
WRITE writing score 0.8765 -0.0648 0.2546
MATH math score 0.7639 0.2979 0.1478
SCIENCE science score 0.6584 0.6768 -0.2304
FEMALE 0.3641 -0.7549 -0.5434
Correlations Between the Psychological Measurements and
the Canonical Variables of the Academic Measurementsw
Academic1 Academic2 Academic3
LOCUS_OF_CONTROL locus of control 0.4196 0.0653 -0.0183
SELF_CONCEPT self-concept 0.0097 0.1187 0.0733
MOTIVATION motivation 0.2632 -0.0588 0.0775
Correlations Between the Academic Measurements and the
Canonical Variables of the Psychological Measurementsx
Psych1 Psych2 Psych3
READ reading score 0.3900 0.0601 0.0141
WRITE writing score 0.4068 -0.0109 0.0265
MATH math score 0.3545 0.0499 0.0154
SCIENCE science score 0.3056 0.1134 -0.0240
FEMALE 0.1690 -0.1265 -0.0565
v. Correlations Between the Psychological/Academic Measurements and Their Canonical Variables – Here, SAS presents the correlations between each variable in a group and the group’s canonical variates. These can allow us to see if the variates are combining the variables in such a way that might represent a particular idea. For example, we can see that the first variate for the psychological variables, Psych1, is highly correlated with locus_of_control and motivation, but uncorrelated with self-concept. Thus, this variate arguably captures much of the shared variance of locus_of_control and motivation. If we look at the academic variables, we can see that the first variate is highly correlated with all four of the subject variables. Those four variables were very highly correlated with each other (see superscript b), so it is not surprising that they should all be highly correlated with a variate that captures their shared variance. The second variate is highly correlated with science and negatively correlated with female. Thus, the first variate might represent overall academic performance with an emphasis on reading and writing, while the second variate emphasizes performance in science and is possibly indicative of male students.
w. Correlations Between the Psychological Measurements and the Canonical Variables of the Academic Measurements – In addition to the correlations between the variables in a group and the group’s canonical variates, SAS also presents the correlations between each variable in one group and the canonical variates of the other. We see that the psychological variables locus_of_control, self_concept and motivation are correlated with Academic1, Academic2 and Academic3 (a total of 3×3=9 correlations). Here, we can see that locus_of_control and motivation are correlated with the first academic variate, while self_concept is uncorrelated with the first variate but slightly correlated with the second variate. Based on our observations about these two variates in superscript v, we might interpret these correlations to mean that overall academic performance, especially reading and writing, are related to locus_of_control and motivation, while performance in science and gender may be related to self_concept.
x. Correlations Between the Academic Measurements and the Canonical Variables of the Psychological Measurements – Here, we see how the academic variables read, write, math, science and female are correlated with Psych1, Psych2 and Psych3 (a total of 5×3=15 correlations). We see that the academic variables read, write, math and science are all correlated with Psych1, the first psychological variate strongly correlated with locus_of_control and motivation. This supports what we noted in superscript w about the possible relationship between overall academic performance and these two psychological variables.
Canonical Redundancy Analysis
Canonical Redundancy Analysis
Raw Variance of the Psychological Measurements Explained by
Their Own The Opposite
Canonical Variablesy Canonical Variablesz
Canonical
Variable Cumulative Canonical Cumulative
Number Proportion Proportion R-Square Proportion Proportion
1 0.3806 0.3806 0.2154 0.0820 0.0820
2 0.3126 0.6932 0.0281 0.0088 0.0908
3 0.3068 1.0000 0.0108 0.0033 0.0941
Raw Variance of the Academic Measurements Explained by
Their Own The Opposite
Canonical Variablesy Canonical Variablesz
Canonical
Variable Cumulative Canonical Cumulative
Number Proportion Proportion R-Square Proportion Proportion
1 0.6251 0.6251 0.2154 0.1346 0.1346
2 0.1704 0.7955 0.0281 0.0048 0.1394
3 0.0395 0.8350 0.0108 0.0004 0.1398
Standardized Variance of the Psychological Measurements Explained by
Their Own The Opposite
Canonical Variablesaa Canonical Variablesbb
Canonical
Variable Cumulative Canonical Cumulative
Number Proportion Proportion R-Square Proportion Proportion
1 0.3798 0.3798 0.2154 0.0818 0.0818
2 0.2591 0.6389 0.0281 0.0073 0.0891
3 0.3611 1.0000 0.0108 0.0039 0.0930
Standardized Variance of the Academic Measurements Explained by
Their Own The Opposite
Canonical Variablesaa Canonical Variablesbb
Canonical
Variable Cumulative Canonical Cumulative
Number Proportion Proportion R-Square Proportion Proportion
1 0.5249 0.5249 0.2154 0.1130 0.1130
2 0.2499 0.7748 0.0281 0.0070 0.1201
3 0.0907 0.8655 0.0108 0.0010 0.1210
y. Raw Variance of the Psychological/Academic Measurements Explained by Their Own Canonical Variables – This is the degree to which the canonical variates of a group can explain the variability in the group’s variables. For example, we see here that the first canonical variate for the academic group explains 62.5% of the variability in the academic variables and the first canonical variate for the psychological group explains 38% of the variability in the psychological variables.
z. Raw Variance of the Psychological/Academic Measurements Explained by The Opposite Canonical Variables – This is the degree to which the canonical variates of a group can explain the variability in the other group’s variables. For example, we see here that the first canonical variate for the academic group explains 8.2% of the variability in the psychological variables and the first canonical variate for the psychological group explains 13.5% of the variability in the academic variables.
aa. Standardized Variance of the Psychological/Academic Measurements Explained by Their Own Canonical Variables – This is similar to superscript y, but performed on standardized data variables.
bb. Standardized Variance of the Psychological/Academic Measurements Explained by The Opposite Canonical Variables -This is similar to superscript z, but performed on standardized data variables.
Squared Multiple Correlations
Squared Multiple Correlations Between the Psychological Measurements
and the First M Canonical Variables of the Academic Measurementscc
M 1 2 3
LOCUS_OF_CONTROL locus of control 0.1760 0.1803 0.1806
SELF_CONCEPT self-concept 0.0001 0.0142 0.0196
MOTIVATION motivation 0.0693 0.0727 0.0787
Squared Multiple Correlations Between the Academic Measurements and
the First M Canonical Variables of the Psychological Measurementscc
M 1 2 3
READ reading score 0.1521 0.1557 0.1559
WRITE writing score 0.1655 0.1656 0.1663
MATH math score 0.1257 0.1282 0.1284
SCIENCE science score 0.0934 0.1062 0.1068
FEMALE 0.0286 0.0445 0.047
cc. Squared Multiple Correlations Between the Psychological/Academic Measurements and the First M Canonical Variables of the Psychological Measurements – Here, the correlations that were presented earlier between each variable in a given group and the canonical variates of the other group, are squared. Each value is equivalent to the R-squared value in an OLS regression where we are predicting a single variable with a single variate or vice versa. For example, we saw earlier in the output that locus_of_control and Academic1 have a correlation of 0.4196. We can calculate (0.4196*0.4196) = 0.1760, the squared correlation presented in this portion of the output. This means that 17.6% of the variability in locus_of_control can be explained by Academic1.
For more on the options available in cancorr and details on the underlying calculations, see the corresponding SAS documentation page.
