The data for this chapter can be downloaded from: https://stats.idre.ucla.edu/wp-content/uploads/2016/02/dee.sas7bdat
(a) Univariate Statistics from Table 10.1 on page 207.
proc univariate data=dee;
  var register college;
run;

The UNIVARIATE Procedure
Variable: register (Is respondent currently registered to vote?)

Moments
N                       9227    Sum Weights             9227
Mean              0.67085727    Sum Observations        6190
Std Deviation     0.46992736    Variance          0.22083173
Skewness          -0.7273222    Kurtosis          -1.4713213
Uncorrected SS          6190    Corrected SS      2037.39352
Coeff Variation    70.048785    Std Error Mean    0.00489216

Basic Statistical Measures
Location               Variability
Mean      0.670857     Std Deviation          0.46993
Median    1.000000     Variance               0.22083
Mode      1.000000     Range                  1.00000
                       Interquartile Range    1.00000

Tests for Location: Mu0=0
Test           -Statistic-       -----p Value------
Student's t    t    137.1291     Pr > |t|     <.0001
Sign           M        3095     Pr >= |M|    <.0001
Signed Rank    S     9580573     Pr >= |S|    <.0001

Quantiles (Definition 5)
Quantile      Estimate
100% Max      1
99%           1
95%           1
90%           1
75% Q3        1
50% Median    1
25% Q1        0
10%           0
5%            0
1%            0
0% Min        0

Extreme Observations
----Lowest----     ----Highest---
Value    Obs       Value    Obs
0        9227      1        9221
0        9222      1        9223
0        9220      1        9224
0        9218      1        9225
0        9217      1        9226

The UNIVARIATE Procedure
Variable: college (Attended junior, community or 4year college by 1984?)

Moments
N                       9227    Sum Weights             9227
Mean              0.54709006    Sum Observations        5048
Std Deviation     0.49780456    Variance          0.24780938
Skewness           -0.189232    Kurtosis          -1.9646171
Uncorrected SS          5048    Corrected SS      2286.28937
Coeff Variation   90.9913372    Std Error Mean    0.00518237

Basic Statistical Measures
Location               Variability
Mean      0.547090     Std Deviation          0.49780
Median    1.000000     Variance               0.24781
Mode      1.000000     Range                  1.00000
                       Interquartile Range    1.00000

Tests for Location: Mu0=0
Test           -Statistic-       -----p Value------
Student's t    t    105.5675     Pr > |t|     <.0001
Sign           M        2524     Pr >= |M|    <.0001
Signed Rank    S     6371838     Pr >= |S|    <.0001

Quantiles (Definition 5)
Quantile      Estimate
100% Max      1
99%           1
95%           1
90%           1
75% Q3        1
50% Median    1
25% Q1        0
10%           0
5%            0
1%            0
0% Min        0

Extreme Observations
----Lowest----     ----Highest---
Value    Obs       Value    Obs
0        9226      1        9219
0        9224      1        9221
0        9222      1        9223
0        9220      1        9225
0        9218      1        9227
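As a cross-check on the output above: because register is coded 0/1, its mean is the proportion registered, and the variance, standard error, and t statistic follow from that proportion. The following is a worked sketch using the printed values for register; the symbols (\bar{y}, s, N) are notation introduced here for illustration and do not come from the text.

```latex
\begin{align*}
\bar{y} &= \frac{6190}{9227} \approx 0.670857, \\
s^2 &= \frac{N}{N-1}\,\bar{y}\,(1-\bar{y})
     = \frac{9227}{9226}\,(0.670857)(0.329143) \approx 0.220832, \\
\mathrm{SE}(\bar{y}) &= \frac{s}{\sqrt{N}}
     = \frac{0.469927}{\sqrt{9227}} \approx 0.004892, \\
t &= \frac{\bar{y}}{\mathrm{SE}(\bar{y})}
     = \frac{0.670857}{0.004892} \approx 137.13.
\end{align*}
```

The same steps applied to college (5048 of 9227) should reproduce its printed variance and standard error.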
Cross-tabulation of register and college, with a chi-squared test. (Not shown in the text.)
proc freq data=dee;
  table register*college / chisq cellchi2 expected;
run;

The FREQ Procedure
Table of register by college
register: Is respondent currently registered to vote?
college: Attended junior, community or 4year college by 1984?

register    Statistic          college=0    college=1    Total
0           Frequency          1780         1257         3037
            Expected           1375.5       1661.5
            Cell Chi-Square    118.96       98.483
            Percent            19.29        13.62        32.91
            Row Pct            58.61        41.39
            Col Pct            42.59        24.90
1           Frequency          2399         3791         6190
            Expected           2803.5       3386.5
            Cell Chi-Square    58.366       48.319
            Percent            26.00        41.09        67.09
            Row Pct            38.76        61.24
            Col Pct            57.41        75.10
Total       Frequency          4179         5048         9227
            Percent            45.29        54.71        100.00

Statistics for Table of register by college
Statistic                      DF    Value       Prob
Chi-Square                     1     324.1293    <.0001
Likelihood Ratio Chi-Square    1     324.2767    <.0001
Continuity Adj. Chi-Square     1     323.3285    <.0001
Mantel-Haenszel Chi-Square     1     324.0942    <.0001
Phi Coefficient                      0.1874
Contingency Coefficient              0.1842
Cramer's V                           0.1874

Fisher's Exact Test
Cell (1,1) Frequency (F)    1780
Left-sided Pr <= F          1.0000
Right-sided Pr >= F         1.270E-72
Table Probability (P)       1.573E-72
Two-sided Pr <= P           2.092E-72
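The expected counts and cell chi-square values in this table can be checked by hand. The sketch below works through the register=0, college=0 cell and then sums the four printed cell contributions; the subscripted symbols (O, E) are notation assumed here, not taken from the text.

```latex
\begin{align*}
E_{00} &= \frac{(\text{row total})(\text{column total})}{N}
        = \frac{3037 \times 4179}{9227} \approx 1375.49, \\
\frac{(O_{00}-E_{00})^2}{E_{00}} &= \frac{(1780 - 1375.49)^2}{1375.49} \approx 118.96, \\
\chi^2 &= 118.96 + 98.483 + 58.366 + 48.319 \approx 324.13, \\
\phi &= \sqrt{\chi^2 / N} = \sqrt{324.1293 / 9227} \approx 0.1874.
\end{align*}
```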
(b) Sample Bivariate Correlations and Covariances from Table 10.1 on page 207.
proc corr data=dee cov;
  var register college;
run;

The CORR Procedure
2 Variables: register college

Covariance Matrix, DF = 9226
            register        college
register    0.2208317276    0.0438448426
college     0.0438448426    0.2478093831

Simple Statistics
Variable    N       Mean       Std Dev    Sum     Minimum    Maximum    Label
register    9227    0.67086    0.46993    6190    0          1.00000    Is respondent currently registered to vote?
college     9227    0.54709    0.49780    5048    0          1.00000    Attended junior, community or 4year college by 1984?

Pearson Correlation Coefficients, N = 9227
Prob > |r| under H0: Rho=0
            register    college
register    1.00000     0.18743
                        <.0001
college     0.18743     1.00000
            <.0001
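The Pearson correlation reported above is the covariance rescaled by the two standard deviations. A quick hand check from the printed covariance matrix, with Y standing for register and X for college (labels assumed here for illustration):

```latex
\begin{align*}
r_{YX} &= \frac{s_{YX}}{\sqrt{s_Y^2\, s_X^2}}
        = \frac{0.0438448426}{\sqrt{0.2208317276 \times 0.2478093831}} \\
       &= \frac{0.0438448}{0.469927 \times 0.497805} \approx 0.18743.
\end{align*}
```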
(c) OLS Regression Analysis: Outcome=register from Table 10.1 on page 207.
proc glm data=dee;
  model register = college;
run;

The GLM Procedure
Number of Observations Read    9227
Number of Observations Used    9227

Dependent Variable: register (Is respondent currently registered to vote?)

Source             DF      Sum of Squares    Mean Square    F Value    Pr > F
Model              1       71.570283         71.570283      335.86     <.0001
Error              9225    1965.823236       0.213097
Corrected Total    9226    2037.393519

R-Square    Coeff Var    Root MSE    register Mean
0.035128    68.81117     0.461625    0.670857

Source     DF    Type I SS      Mean Square    F Value    Pr > F
college    1     71.57028291    71.57028291    335.86     <.0001

Source     DF    Type III SS    Mean Square    F Value    Pr > F
college    1     71.57028291    71.57028291    335.86     <.0001

Parameter    Estimate        Standard Error    t Value    Pr > |t|
Intercept    0.5740607801    0.00714090        80.39      <.0001
college      0.1769297112    0.00965436        18.33      <.0001
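With a single predictor, the OLS estimates can be recovered from the covariance matrix and the means printed earlier. Because college is binary, they also equal the registration rate among non-attendees and the difference in rates taken from the cross-tabulation. A sketch of that arithmetic, using \hat{\beta}_0 and \hat{\beta}_1 as assumed notation for the intercept and slope:

```latex
\begin{align*}
\hat{\beta}_1 &= \frac{s_{YX}}{s_X^2}
   = \frac{0.0438448426}{0.2478093831} \approx 0.176930, \\
\hat{\beta}_0 &= \bar{y} - \hat{\beta}_1\,\bar{x}
   = 0.670857 - (0.176930)(0.547090) \approx 0.574061, \\
\hat{\beta}_0 &= \frac{2399}{4179} \approx 0.574061, \qquad
\hat{\beta}_1 = \frac{3791}{5048} - \frac{2399}{4179}
   \approx 0.750990 - 0.574061 = 0.176929.
\end{align*}
```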
(a) Univariate Statistics from Table 10.2 on page 219. Only the variable distance is new here; the statistics for register and college are the same as in panel (a) of Table 10.1.
proc univariate data=dee;
  var distance;
run;

The UNIVARIATE Procedure
Variable: distance (Miles from respondents HS to nearest 2yr college)

Moments
N                       9227    Sum Weights             9227
Mean              9.73599232    Sum Observations  89834.0011
Std Deviation     8.70228565    Variance          75.7297756
Skewness          1.18127321    Kurtosis          0.60086158
Uncorrected SS    1573306.05    Corrected SS      698682.909
Coeff Variation   89.3826265    Std Error Mean    0.09059476

Basic Statistical Measures
Location               Variability
Mean      9.735992     Std Deviation          8.70229
Median    7.000000     Variance               75.72978
Mode      5.000000     Range                  35.00000
                       Interquartile Range    12.00000

Tests for Location: Mu0=0
Test           -Statistic-       -----p Value------
Student's t    t    107.4675     Pr > |t|     <.0001
Sign           M      4443.5     Pr >= |M|    <.0001
Signed Rank    S    19746914     Pr >= |S|    <.0001

Quantiles (Definition 5)
Quantile      Estimate
100% Max      35
99%           35
95%           30
90%           25
75% Q3        15
50% Median    7
25% Q1        3
10%           1
5%            1
1%            0
0% Min        0

Extreme Observations
----Lowest----     ----Highest---
Value    Obs       Value    Obs
0        9048      35       9081
0        9047      35       9082
0        9046      35       9083
0        9045      35       9084
0        9044      35       9085
(b) Sample Bivariate Correlations and Covariances from Table 10.2 on page 219.
proc corr data=dee cov;
  var register college distance;
run;

The CORR Procedure
3 Variables: register college distance

Covariance Matrix, DF = 9226
            register       college        distance
register    0.22083173     0.04384484     -0.13687315
college     0.04384484     0.24780938     -0.48247222
distance    -0.13687315    -0.48247222    75.72977557

Simple Statistics
Variable    N       Mean       Std Dev    Sum      Minimum    Maximum     Label
register    9227    0.67086    0.46993    6190     0          1.00000     Is respondent currently registered to vote?
college     9227    0.54709    0.49780    5048     0          1.00000     Attended junior, community or 4year college by 1984?
distance    9227    9.73599    8.70229    89834    0          35.00000    Miles from respondents HS to nearest 2yr college

Pearson Correlation Coefficients, N = 9227
Prob > |r| under H0: Rho=0
            register    college     distance
register    1.00000     0.18743     -0.03347
                        <.0001      0.0013
college     0.18743     1.00000     -0.11137
            <.0001                  <.0001
distance    -0.03347    -0.11137    1.00000
            0.0013      <.0001
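(c) Method-of-Moments IVE Estimate from Table 10.2 on page 219. This can be done as a hand calculation from the covariance matrix above, dividing the covariance of register with distance by the covariance of college with distance: -.136873/-.482472 = .28369107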
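Written out, the hand calculation is the usual method-of-moments ratio with one instrument. The same number falls out as the ratio of two slopes on distance. In the sketch below, Y is register, X is college, and Z is distance; this notation is assumed here, and the values come from the printed covariance matrix.

```latex
\begin{align*}
\hat{\beta}_{IV} &= \frac{s_{YZ}}{s_{XZ}}
   = \frac{-0.13687315}{-0.48247222} \approx 0.283691, \\
\hat{\beta}_{IV} &= \frac{s_{YZ}/s_Z^2}{s_{XZ}/s_Z^2}
   = \frac{-0.13687315/75.72977557}{-0.48247222/75.72977557}
   \approx \frac{-0.0018074}{-0.0063710} \approx 0.28369.
\end{align*}
```

The second line reads as the slope of register on distance divided by the slope of college on distance.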
Table 10.3 on page 228.
proc syslin data=dee 2sls;
  endogenous college;
  instruments distance;
  model college = distance;
  model register = college;
run;

The SYSLIN Procedure
Two-Stage Least Squares Estimation

Model: college
Dependent Variable: college
Label: Attended junior, community or 4year college by 1984?

Analysis of Variance
Source             DF      Sum of Squares    Mean Square    F Value    Pr > F
Model              1       28.35903          28.35903       115.86     <.0001
Error              9225    2257.930          0.244762
Corrected Total    9226    2286.289

Root MSE          0.49473     R-Square    0.01240
Dependent Mean    0.54709     Adj R-Sq    0.01230
Coeff Var         90.43015

Parameter Estimates
Variable     DF    Estimate    Standard Error    t Value    Pr > |t|    Label
Intercept    1     0.609118    0.007729          78.81      <.0001      Intercept
distance     1     -0.00637    0.000592          -10.76     <.0001      Miles from respondents HS to nearest 2yr college

Model: register
Dependent Variable: register
Label: Is respondent currently registered to vote?

Analysis of Variance
Source             DF      Sum of Squares    Mean Square    F Value    Pr > F
Model              1       2.282356          2.282356       10.57      0.0012
Error              9225    1991.882          0.215922
Corrected Total    9226    2037.394

Root MSE          0.46467     R-Square    0.00114
Dependent Mean    0.67086     Adj R-Sq    0.00104
Coeff Var         69.26575

Parameter Estimates
Variable     DF    Estimate    Standard Error    t Value    Pr > |t|    Label
Intercept    1     0.515653    0.047982          10.75      <.0001      Intercept
college      1     0.283691    0.087258          3.25       0.0012      Attended junior, community or 4year college by 1984?
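The two intercepts in this output can be recovered from the sample means, since each fitted line passes through the point of means. A sketch using the printed means and the unrounded first-stage slope; the symbols \hat{\pi} (first stage) and \hat{\beta} (second stage) are assumed notation, not taken from the text.

```latex
\begin{align*}
\hat{\pi}_1 &= \frac{s_{XZ}}{s_Z^2} = \frac{-0.48247222}{75.72977557} \approx -0.006371, \\
\hat{\pi}_0 &= \bar{x} - \hat{\pi}_1\,\bar{z}
   = 0.547090 + (0.006371)(9.735992) \approx 0.609118, \\
\hat{\beta}_0 &= \bar{y} - \hat{\beta}_1\,\bar{x}
   = 0.670857 - (0.283691)(0.547090) \approx 0.515653.
\end{align*}
```

The second-stage slope, 0.283691, matches the method-of-moments hand calculation for Table 10.2.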
Table 10.4 on page 235.
proc syslin data=dee 3sls;
  endogenous college;
  instruments distance;
  model college = distance;
  model register = college;
run;

The SYSLIN Procedure
Two-Stage Least Squares Estimation

Model: college
Dependent Variable: college
Label: Attended junior, community or 4year college by 1984?

Analysis of Variance
Source             DF      Sum of Squares    Mean Square    F Value    Pr > F
Model              1       28.35903          28.35903       115.86     <.0001
Error              9225    2257.930          0.244762
Corrected Total    9226    2286.289

Root MSE          0.49473     R-Square    0.01240
Dependent Mean    0.54709     Adj R-Sq    0.01230
Coeff Var         90.43015

Parameter Estimates
Variable     DF    Estimate    Standard Error    t Value    Pr > |t|    Label
Intercept    1     0.609118    0.007729          78.81      <.0001      Intercept
distance     1     -0.00637    0.000592          -10.76     <.0001      Miles from respondents HS to nearest 2yr college

Model: register
Dependent Variable: register
Label: Is respondent currently registered to vote?

Analysis of Variance
Source             DF      Sum of Squares    Mean Square    F Value    Pr > F
Model              1       2.282356          2.282356       10.57      0.0012
Error              9225    1991.882          0.215922
Corrected Total    9226    2037.394

Root MSE          0.46467     R-Square    0.00114
Dependent Mean    0.67086     Adj R-Sq    0.00104
Coeff Var         69.26575

Parameter Estimates
Variable     DF    Estimate    Standard Error    t Value    Pr > |t|    Label
Intercept    1     0.515653    0.047982          10.75      <.0001      Intercept
college      1     0.283691    0.087258          3.25       0.0012      Attended junior, community or 4year college by 1984?

The SYSLIN Procedure
Three-Stage Least Squares Estimation

Cross Model Covariance
            college     register
college     0.244762    -.026459
register    -.026459    0.215922

Cross Model Correlation
            college     register
college     1.00000     -0.11510
register    -0.11510    1.00000

Cross Model Inverse Correlation
            college    register
college     1.01342    0.11664
register    0.11664    1.01342

Cross Model Inverse Covariance
            college    register
college     4.14045    0.50738
register    0.50738    4.69347

System Weighted MSE         1.0462
Degrees of freedom          18450
System Weighted R-Square    0.007011

Model: college
Dependent Variable: college
Label: Attended junior, community or 4year college by 1984?

Parameter Estimates
Variable     DF    Estimate    Standard Error    t Value    Pr > |t|    Label
Intercept    1     0.609118    0.007729          78.81      <.0001      Intercept
distance     1     -0.00637    0.000592          -10.76     <.0001      Miles from respondents HS to nearest 2yr college

Model: register
Dependent Variable: register
Label: Is respondent currently registered to vote?

Parameter Estimates
Variable     DF    Estimate    Standard Error    t Value    Pr > |t|    Label
Intercept    1     0.515653    0.047982          10.75      <.0001      Intercept
college      1     0.283691    0.087258          3.25       0.0012      Attended junior, community or 4year college by 1984?
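The cross-model matrices in the three-stage output are all functions of the 2x2 residual covariance matrix, so they can be verified by hand. A sketch, with \sigma_{11}, \sigma_{22}, \sigma_{12} as assumed notation for the college variance, register variance, and their covariance:

```latex
\begin{align*}
r &= \frac{\sigma_{12}}{\sqrt{\sigma_{11}\sigma_{22}}}
   = \frac{-0.026459}{\sqrt{0.244762 \times 0.215922}} \approx -0.11510, \\
R^{-1} &= \frac{1}{1-r^2}
   \begin{pmatrix} 1 & -r \\ -r & 1 \end{pmatrix}
   \approx \begin{pmatrix} 1.01342 & 0.11664 \\ 0.11664 & 1.01342 \end{pmatrix}, \\
\Sigma^{-1} &= \frac{1}{\sigma_{11}\sigma_{22}-\sigma_{12}^2}
   \begin{pmatrix} \sigma_{22} & -\sigma_{12} \\ -\sigma_{12} & \sigma_{11} \end{pmatrix}
   \approx \begin{pmatrix} 4.14045 & 0.50738 \\ 0.50738 & 4.69347 \end{pmatrix}.
\end{align*}
```

In this output the three-stage parameter estimates and standard errors are the same as the two-stage ones.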
Table 10.5 on page 240.
proc syslin data=dee 2sls;
  endogenous college;
  instruments distance black hispanic otherrace;
  model college = distance black hispanic otherrace;
  model register = college black hispanic otherrace;
run;

The SYSLIN Procedure
Two-Stage Least Squares Estimation

Model: college
Dependent Variable: college
Label: Attended junior, community or 4year college by 1984?

Analysis of Variance
Source             DF      Sum of Squares    Mean Square    F Value    Pr > F
Model              4       49.70516          12.42629       51.24      <.0001
Error              9222    2236.584          0.242527
Corrected Total    9226    2286.289

Root MSE          0.49247     R-Square    0.02174
Dependent Mean    0.54709     Adj R-Sq    0.02132
Coeff Var         90.01632

Parameter Estimates
Variable     DF    Estimate    Standard Error    t Value    Pr > |t|    Label
Intercept    1     0.643146    0.009053          71.04      <.0001      Intercept
distance     1     -0.00692    0.000595          -11.64     <.0001      Miles from respondents HS to nearest 2yr college
black        1     -0.05766    0.015959          -3.61      0.0003      Is respondent black?
hispanic     1     -0.11621    0.013257          -8.77      <.0001      Is respondent hispanic?
otherrace    1     0.033708    0.024010          1.40       0.1604      Is respondent of another race?

Model: register
Dependent Variable: register
Label: Is respondent currently registered to vote?

Analysis of Variance
Source             DF      Sum of Squares    Mean Square    F Value    Pr > F
Model              4       9.261561          2.315390       10.85      <.0001
Error              9222    1967.204          0.213316
Corrected Total    9226    2037.394

Root MSE          0.46186     R-Square    0.00469
Dependent Mean    0.67086     Adj R-Sq    0.00425
Coeff Var         68.84653

Parameter Estimates
Variable     DF    Estimate    Standard Error    t Value    Pr > |t|    Label
Intercept    1     0.526600    0.046306          11.37      <.0001      Intercept
college      1     0.248862    0.080601          3.09       0.0020      Attended junior, community or 4year college by 1984?
black        1     0.061733    0.015174          4.07       <.0001      Is respondent black?
hispanic     1     0.028293    0.014832          1.91       0.0565      Is respondent hispanic?
otherrace    1     -0.10667    0.022819          -4.67      <.0001      Is respondent of another race?
The analyses in Tables 10.6 and 10.7 include a variable that is restricted, so these analyses cannot be reproduced using the available dataset.
Table 10.8 from page 249. Note that many of the coefficients in the first-stage models are equal to 0 up to floating-point error (SAS prints them as values such as 1.94E-17).
* Form interaction terms;
* Then form interactions between instruments and exog race variables as instruments;
data dee;
  set dee;
  distxblack = distance*black;
  distxhispanic = distance*hispanic;
  distxotherrace = distance*otherrace;
  collegexblack = college*black;
  collegexhispanic = college*hispanic;
  collegexotherrace = college*otherrace;
run;

proc syslin data=dee 2sls;
  endogenous college collegexblack collegexhispanic collegexotherrace;
  instruments distance black hispanic otherrace distxblack distxhispanic distxotherrace;
  model college = distance black hispanic otherrace distxblack distxhispanic distxotherrace;
  model collegexblack = distance black hispanic otherrace distxblack distxhispanic distxotherrace;
  model collegexhispanic = distance black hispanic otherrace distxblack distxhispanic distxotherrace;
  model collegexotherrace = distance black hispanic otherrace distxblack distxhispanic distxotherrace;
  model register = college black hispanic otherrace collegexblack collegexhispanic collegexotherrace;
run;

The SYSLIN Procedure
Two-Stage Least Squares Estimation

Model: college
Dependent Variable: college
Label: Attended junior, community or 4year college by 1984?

Analysis of Variance
Source             DF      Sum of Squares    Mean Square    F Value    Pr > F
Model              7       50.20795          7.172564       29.57      <.0001
Error              9219    2236.081          0.242551
Corrected Total    9226    2286.289

Root MSE          0.49250     R-Square    0.02196
Dependent Mean    0.54709     Adj R-Sq    0.02122
Coeff Var         90.02084

Parameter Estimates
Variable          DF    Estimate    Standard Error    t Value    Pr > |t|    Label
Intercept         1     0.645183    0.010058          64.15      <.0001      Intercept
distance          1     -0.00711    0.000723          -9.83      <.0001      Miles from respondents HS to nearest 2yr college
black             1     -0.06506    0.022762          -2.86      0.0043      Is respondent black?
hispanic          1     -0.12759    0.019368          -6.59      <.0001      Is respondent hispanic?
otherrace         1     0.058954    0.035074          1.68       0.0928      Is respondent of another race?
distxblack        1     0.000890    0.002012          0.44       0.6583
distxhispanic     1     0.001291    0.001577          0.82       0.4131
distxotherrace    1     -0.00299    0.002941          -1.02      0.3086

Model: collegexblack
Dependent Variable: collegexblack

Analysis of Variance
Source             DF      Sum of Squares    Mean Square    F Value    Pr > F
Model              7       289.5044          41.35777       1336.08    <.0001
Error              9219    285.3711          0.030955
Corrected Total    9226    574.8755

Root MSE          0.17594      R-Square    0.50359
Dependent Mean    0.06676      Adj R-Sq    0.50322
Coeff Var         263.53779

Parameter Estimates
Variable          DF    Estimate    Standard Error    t Value    Pr > |t|    Label
Intercept         1     1.94E-17    0.003593          0.00       1.0000      Intercept
distance          1     -202E-20    0.000258          -0.00      1.0000      Miles from respondents HS to nearest 2yr college
black             1     0.580126    0.008132          71.34      <.0001      Is respondent black?
hispanic          1     -216E-19    0.006919          -0.00      1.0000      Is respondent hispanic?
otherrace         1     -328E-19    0.012530          -0.00      1.0000      Is respondent of another race?
distxblack        1     -0.00622    0.000719          -8.66      <.0001
distxhispanic     1     1.43E-18    0.000563          0.00       1.0000
distxotherrace    1     2.51E-18    0.001051          0.00       1.0000

Model: collegexhispanic
Dependent Variable: collegexhispanic

Analysis of Variance
Source             DF      Sum of Squares    Mean Square    F Value    Pr > F
Model              7       326.3327          46.61895       949.38     <.0001
Error              9219    452.6976          0.049105
Corrected Total    9226    779.0302

Root MSE          0.22160      R-Square    0.41890
Dependent Mean    0.09310      Adj R-Sq    0.41845
Coeff Var         238.02882

Parameter Estimates
Variable          DF    Estimate    Standard Error    t Value    Pr > |t|    Label
Intercept         1     1.88E-17    0.004525          0.00       1.0000      Intercept
distance          1     -406E-21    0.000326          -0.00      1.0000      Miles from respondents HS to nearest 2yr college
black             1     -266E-19    0.010242          -0.00      1.0000      Is respondent black?
hispanic          1     0.517593    0.008714          59.39      <.0001      Is respondent hispanic?
otherrace         1     -297E-20    0.015781          -0.00      1.0000      Is respondent of another race?
distxblack        1     1.31E-18    0.000905          0.00       1.0000
distxhispanic     1     -0.00582    0.000709          -8.21      <.0001
distxotherrace    1     -241E-21    0.001323          -0.00      1.0000

Model: collegexotherrace
Dependent Variable: collegexotherrace

Analysis of Variance
Source             DF      Sum of Squares    Mean Square    F Value    Pr > F
Model              7       168.0322          24.00460       2119.51    <.0001
Error              9219    104.4102          0.011326
Corrected Total    9226    272.4424

Root MSE          0.10642      R-Square    0.61676
Dependent Mean    0.03045      Adj R-Sq    0.61647
Coeff Var         349.44899

Parameter Estimates
Variable          DF    Estimate    Standard Error    t Value    Pr > |t|    Label
Intercept         1     1.22E-17    0.002173          0.00       1.0000      Intercept
distance          1     -713E-21    0.000156          -0.00      1.0000      Miles from respondents HS to nearest 2yr college
black             1     -109E-19    0.004919          -0.00      1.0000      Is respondent black?
hispanic          1     -809E-20    0.004185          -0.00      1.0000      Is respondent hispanic?
otherrace         1     0.704137    0.007579          92.91      <.0001      Is respondent of another race?
distxblack        1     6.34E-19    0.000435          0.00       1.0000
distxhispanic     1     6.34E-19    0.000341          0.00       1.0000
distxotherrace    1     -0.01011    0.000635          -15.90     <.0001

Model: register
Dependent Variable: register
Label: Is respondent currently registered to vote?

Analysis of Variance
Source             DF      Sum of Squares    Mean Square    F Value    Pr > F
Model              7       10.29933          1.471333       6.74       <.0001
Error              9219    2011.839          0.218228
Corrected Total    9226    2037.394

Root MSE          0.46715     R-Square    0.00509
Dependent Mean    0.67086     Adj R-Sq    0.00434
Coeff Var         69.63453

Parameter Estimates
Variable             DF    Estimate    Standard Error    t Value    Pr > |t|    Label
Intercept            1     0.464046    0.055299          8.39       <.0001      Intercept
college              1     0.358693    0.096492          3.72       0.0002      Attended junior, community or 4year college by 1984?
black                1     0.278024    0.162701          1.71       0.0875      Is respondent black?
hispanic             1     0.176548    0.120819          1.46       0.1440      Is respondent hispanic?
otherrace            1     0.174217    0.175600          0.99       0.3212      Is respondent of another race?
collegexblack        1     -0.39859    0.302067          -1.32      0.1870
collegexhispanic     1     -0.29291    0.247842          -1.18      0.2373
collegexotherrace    1     -0.46335    0.284396          -1.63      0.1033
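One way to read the zero coefficients: collegexblack is 0 for every respondent with black = 0, so its first-stage equation only has to describe college attendance among black respondents. The nonzero coefficients should therefore match the college first stage evaluated at black = 1. A sketch of that check from the printed estimates (small differences reflect rounding in the printed output):

```latex
\begin{align*}
\widehat{\text{college}} \,\big|\, \text{black}=1
   &= (0.645183 - 0.06506) + (-0.00711 + 0.000890)\,\text{distance} \\
   &\approx 0.580123 - 0.00622\,\text{distance}, \\
\widehat{\text{college}\times\text{black}} \,\big|\, \text{black}=1
   &= 0.580126 - 0.00622\,\text{distance}, \\[4pt]
\text{hispanic}=1: \quad & 0.645183 - 0.12759 = 0.517593, \quad -0.00711 + 0.001291 \approx -0.00582, \\
\text{otherrace}=1: \quad & 0.645183 + 0.058954 = 0.704137, \quad -0.00711 - 0.00299 \approx -0.0101.
\end{align*}
```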
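Reproduce the last model (Table 10.8, page 249), dropping the irrelevant (zero-coefficient) terms from the first-stage models for the interaction terms. Note that the two-stage results for the register model are identical. Because this run requests 3sls, three-stage estimates are also printed; their point estimates are the same and their standard errors differ slightly. (Not shown in the text.)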
proc syslin data=dee 3sls;
  endogenous college collegexblack collegexhispanic collegexotherrace;
  instruments distance black hispanic otherrace distxblack distxhispanic distxotherrace;
  model college = distance black hispanic otherrace distxblack distxhispanic distxotherrace;
  model collegexblack = black distxblack;
  model collegexhispanic = hispanic distxhispanic;
  model collegexotherrace = otherrace distxotherrace;
  model register = college black hispanic otherrace collegexblack collegexhispanic collegexotherrace;
run;

The SYSLIN Procedure
Two-Stage Least Squares Estimation

Model: college
Dependent Variable: college
Label: Attended junior, community or 4year college by 1984?

Analysis of Variance
Source             DF      Sum of Squares    Mean Square    F Value    Pr > F
Model              7       50.20795          7.172564       29.57      <.0001
Error              9219    2236.081          0.242551
Corrected Total    9226    2286.289

Root MSE          0.49250     R-Square    0.02196
Dependent Mean    0.54709     Adj R-Sq    0.02122
Coeff Var         90.02084

Parameter Estimates
Variable          DF    Estimate    Standard Error    t Value    Pr > |t|    Label
Intercept         1     0.645183    0.010058          64.15      <.0001      Intercept
distance          1     -0.00711    0.000723          -9.83      <.0001      Miles from respondents HS to nearest 2yr college
black             1     -0.06506    0.022762          -2.86      0.0043      Is respondent black?
hispanic          1     -0.12759    0.019368          -6.59      <.0001      Is respondent hispanic?
otherrace         1     0.058954    0.035074          1.68       0.0928      Is respondent of another race?
distxblack        1     0.000890    0.002012          0.44       0.6583
distxhispanic     1     0.001291    0.001577          0.82       0.4131
distxotherrace    1     -0.00299    0.002941          -1.02      0.3086

Model: collegexblack
Dependent Variable: collegexblack

Analysis of Variance
Source             DF      Sum of Squares    Mean Square    F Value    Pr > F
Model              2       289.5044          144.7522       4678.80    <.0001
Error              9224    285.3711          0.030938
Corrected Total    9226    574.8755

Root MSE          0.17589      R-Square    0.50359
Dependent Mean    0.06676      Adj R-Sq    0.50349
Coeff Var         263.46635

Parameter Estimates
Variable      DF    Estimate    Standard Error    t Value    Pr > |t|    Label
Intercept     1     -916E-20    0.001958          -0.00      1.0000      Intercept
black         1     0.580126    0.007551          76.83      <.0001      Is respondent black?
distxblack    1     -0.00622    0.000671          -9.28      <.0001

Model: collegexhispanic
Dependent Variable: collegexhispanic

Analysis of Variance
Source             DF      Sum of Squares    Mean Square    F Value    Pr > F
Model              2       326.3327          163.1663       3324.62    <.0001
Error              9224    452.6976          0.049078
Corrected Total    9226    779.0302

Root MSE          0.22154      R-Square    0.41890
Dependent Mean    0.09310      Adj R-Sq    0.41877
Coeff Var         237.96429

Parameter Estimates
Variable         DF    Estimate    Standard Error    t Value    Pr > |t|    Label
Intercept        1     -643E-20    0.002577          -0.00      1.0000      Intercept
hispanic         1     0.517593    0.007879          65.70      <.0001      Is respondent hispanic?
distxhispanic    1     -0.00582    0.000630          -9.24      <.0001

Model: collegexotherrace
Dependent Variable: collegexotherrace

Analysis of Variance
Source             DF      Sum of Squares    Mean Square    F Value    Pr > F
Model              2       168.0322          84.01609       7422.30    <.0001
Error              9224    104.4102          0.011319
Corrected Total    9226    272.4424

Root MSE          0.10639      R-Square    0.61676
Dependent Mean    0.03045      Adj R-Sq    0.61668
Coeff Var         349.35427

Parameter Estimates
Variable          DF    Estimate    Standard Error    t Value    Pr > |t|    Label
Intercept         1     -123E-20    0.001136          -0.00      1.0000      Intercept
otherrace         1     0.704137    0.007347          95.84      <.0001      Is respondent of another race?
distxotherrace    1     -0.01011    0.000616          -16.41     <.0001

Model: register
Dependent Variable: register
Label: Is respondent currently registered to vote?

Analysis of Variance
Source             DF      Sum of Squares    Mean Square    F Value    Pr > F
Model              7       10.29933          1.471333       6.74       <.0001
Error              9219    2011.839          0.218228
Corrected Total    9226    2037.394

Root MSE          0.46715     R-Square    0.00509
Dependent Mean    0.67086     Adj R-Sq    0.00434
Coeff Var         69.63453

Parameter Estimates
Variable             DF    Estimate    Standard Error    t Value    Pr > |t|    Label
Intercept            1     0.464046    0.055299          8.39       <.0001      Intercept
college              1     0.358693    0.096492          3.72       0.0002      Attended junior, community or 4year college by 1984?
black                1     0.278024    0.162701          1.71       0.0875      Is respondent black?
hispanic             1     0.176548    0.120819          1.46       0.1440      Is respondent hispanic?
otherrace            1     0.174217    0.175600          0.99       0.3212      Is respondent of another race?
collegexblack        1     -0.39859    0.302067          -1.32      0.1870
collegexhispanic     1     -0.29291    0.247842          -1.18      0.2373
collegexotherrace    1     -0.46335    0.284396          -1.63      0.1033

The SYSLIN Procedure
Three-Stage Least Squares Estimation

Cross Model Covariance
                     college     collegexblack    collegexhispanic    collegexotherrace    register
college              0.242551    0.030946         0.049092            0.011322             -.011576
collegexblack        0.030946    0.030938         0.000000            -.000000             0.006014
collegexhispanic     0.049092    0.000000         0.049078            0.000000             0.005229
collegexotherrace    0.011322    -.000000         0.000000            0.011319             0.002175
register             -.011576    0.006014         0.005229            0.002175             0.218228

Cross Model Correlation
                     college     collegexblack    collegexhispanic    collegexotherrace    register
college              1.00000     0.35724          0.44995             0.21609              -0.05031
collegexblack        0.35724     1.00000          0.00000             -0.00000             0.07319
collegexhispanic     0.44995     0.00000          1.00000             0.00000              0.05053
collegexotherrace    0.21609     -0.00000         0.00000             1.00000              0.04376
register             -0.05031    0.07319          0.05053             0.04376              1.00000

Cross Model Inverse Correlation
                     college     collegexblack    collegexhispanic    collegexotherrace    register
college              1.63583     -0.59752         -0.74510            -0.36134             0.17950
collegexblack        -0.59752    1.22367          0.27590             0.13522              -0.13949
collegexhispanic     -0.74510    0.27590          1.34197             0.16682              -0.13279
collegexotherrace    -0.36134    0.13522          0.16682             1.08175              -0.08385
register             0.17950     -0.13949         -0.13279            -0.08385             1.02962

Cross Model Inverse Covariance
                     college     collegexblack    collegexhispanic    collegexotherrace    register
college              6.74425     -6.8977          -6.8292             -6.8960              0.78021
collegexblack        -6.89775    39.5524          7.0805              7.2258               -1.69760
collegexhispanic     -6.82921    7.0805           27.3434             7.0776               -1.28312
collegexotherrace    -6.89600    7.2258           7.0776              95.5659              -1.68704
register             0.78021     -1.6976          -1.2831             -1.6870              4.71810

System Weighted MSE         1.0187
Degrees of freedom          46110
System Weighted R-Square    0.4339

Model: college
Dependent Variable: college
Label: Attended junior, community or 4year college by 1984?

Parameter Estimates
Variable          DF    Estimate    Standard Error    t Value    Pr > |t|    Label
Intercept         1     0.645183    0.008650          74.59      <.0001      Intercept
distance          1     -0.00711    0.000571          -12.45     <.0001      Miles from respondents HS to nearest 2yr college
black             1     -0.06506    0.019493          -3.34      0.0008      Is respondent black?
hispanic          1     -0.12759    0.017201          -7.42      <.0001      Is respondent hispanic?
otherrace         1     0.058954    0.028648          2.06       0.0396      Is respondent of another race?
distxblack        1     0.000890    0.001724          0.52       0.6057
distxhispanic     1     0.001291    0.001395          0.92       0.3550
distxotherrace    1     -0.00299    0.002402          -1.25      0.2126

Model: collegexblack
Dependent Variable: collegexblack

Parameter Estimates
Variable      DF    Estimate    Standard Error    t Value    Pr > |t|    Label
Intercept     1     2.4E-17     0.001958          0.00       1.0000      Intercept
black         1     0.580126    0.007551          76.83      <.0001      Is respondent black?
distxblack    1     -0.00622    0.000671          -9.28      <.0001

Model: collegexhispanic
Dependent Variable: collegexhispanic

Parameter Estimates
Variable         DF    Estimate    Standard Error    t Value    Pr > |t|    Label
Intercept        1     2.64E-17    0.002577          0.00       1.0000      Intercept
hispanic         1     0.517593    0.007879          65.70      <.0001      Is respondent hispanic?
distxhispanic    1     -0.00582    0.000630          -9.24      <.0001

Model: collegexotherrace
Dependent Variable: collegexotherrace

Parameter Estimates
Variable          DF    Estimate    Standard Error    t Value    Pr > |t|    Label
Intercept         1     1.01E-17    0.001136          0.00       1.0000      Intercept
otherrace         1     0.704137    0.007347          95.84      <.0001      Is respondent of another race?
distxotherrace    1     -0.01011    0.000616          -16.41     <.0001

Model: register
Dependent Variable: register
Label: Is respondent currently registered to vote?

Parameter Estimates
Variable             DF    Estimate    Standard Error    t Value    Pr > |t|    Label
Intercept            1     0.464046    0.055030          8.43       <.0001      Intercept
college              1     0.358693    0.096017          3.74       0.0002      Attended junior, community or 4year college by 1984?
black                1     0.278024    0.162287          1.71       0.0867      Is respondent black?
hispanic             1     0.176548    0.120347          1.47       0.1424      Is respondent hispanic?
otherrace            1     0.174217    0.174887          1.00       0.3192      Is respondent of another race?
collegexblack        1     -0.39859    0.301308          -1.32      0.1859
collegexhispanic     1     -0.29291    0.246891          -1.19      0.2355
collegexotherrace    1     -0.46335    0.283238          -1.64      0.1019
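In a model with interactions, the estimated effect of college for a given group is the college coefficient plus that group's interaction coefficient. The sketch below simply adds the printed point estimates for the register model; it is an illustrative reading of the output, not a result reported in the text, and the reference group is taken to be respondents for whom black, hispanic, and otherrace are all 0.

```latex
\begin{align*}
\text{reference group:} \quad & 0.358693, \\
\text{black}=1: \quad & 0.358693 - 0.39859 \approx -0.0399, \\
\text{hispanic}=1: \quad & 0.358693 - 0.29291 \approx 0.0658, \\
\text{otherrace}=1: \quad & 0.358693 - 0.46335 \approx -0.1047.
\end{align*}
```

None of the three interaction coefficients is individually distinguishable from zero at conventional levels in this output (p = 0.1859, 0.2355, and 0.1019), so these group-specific values should be read with that uncertainty in mind.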
A model similar to the last two (both of which replicate Table 10.8 page 249) estimated using simultaneous equations, dropping the irrelevant terms and non-significant variables from the model. Note that the results are very similar, but not identical. (Not shown in the text.)
proc syslin data=dee 3sls;
  endogenous college collegexblack collegexhispanic collegexotherrace;
  instruments distance black hispanic otherrace distxblack distxhispanic distxotherrace;
  model college = distance black hispanic otherrace;
  model collegexblack = black distxblack;
  model collegexhispanic = hispanic distxhispanic;
  model collegexotherrace = otherrace distxotherrace;
  model register = college black hispanic otherrace collegexblack collegexhispanic collegexotherrace;
run;

The SYSLIN Procedure
Two-Stage Least Squares Estimation

Model: college
Dependent Variable: college
Label: Attended junior, community or 4year college by 1984?

Analysis of Variance
Source             DF      Sum of Squares    Mean Square    F Value    Pr > F
Model              4       49.70516          12.42629       51.24      <.0001
Error              9222    2236.584          0.242527
Corrected Total    9226    2286.289

Root MSE          0.49247     R-Square    0.02174
Dependent Mean    0.54709     Adj R-Sq    0.02132
Coeff Var         90.01632

Parameter Estimates
Variable     DF    Estimate    Standard Error    t Value    Pr > |t|    Label
Intercept    1     0.643146    0.009053          71.04      <.0001      Intercept
distance     1     -0.00692    0.000595          -11.64     <.0001      Miles from respondents HS to nearest 2yr college
black        1     -0.05766    0.015959          -3.61      0.0003      Is respondent black?
hispanic     1     -0.11621    0.013257          -8.77      <.0001      Is respondent hispanic?
otherrace    1     0.033708    0.024010          1.40       0.1604      Is respondent of another race?

Model: collegexblack
Dependent Variable: collegexblack

Analysis of Variance
Source             DF      Sum of Squares    Mean Square    F Value    Pr > F
Model              2       289.5044          144.7522       4678.80    <.0001
Error              9224    285.3711          0.030938
Corrected Total    9226    574.8755

Root MSE          0.17589      R-Square    0.50359
Dependent Mean    0.06676      Adj R-Sq    0.50349
Coeff Var         263.46635

Parameter Estimates
Variable      DF    Estimate    Standard Error    t Value    Pr > |t|    Label
Intercept     1     -916E-20    0.001958          -0.00      1.0000      Intercept
black         1     0.580126    0.007551          76.83      <.0001      Is respondent black?
distxblack    1     -0.00622    0.000671          -9.28      <.0001

Model: collegexhispanic
Dependent Variable: collegexhispanic

Analysis of Variance
Source             DF      Sum of Squares    Mean Square    F Value    Pr > F
Model              2       326.3327          163.1663       3324.62    <.0001
Error              9224    452.6976          0.049078
Corrected Total    9226    779.0302

Root MSE          0.22154      R-Square    0.41890
Dependent Mean    0.09310      Adj R-Sq    0.41877
Coeff Var         237.96429

Parameter Estimates
Variable         DF    Estimate    Standard Error    t Value    Pr > |t|    Label
Intercept        1     -643E-20    0.002577          -0.00      1.0000      Intercept
hispanic         1     0.517593    0.007879          65.70      <.0001      Is respondent hispanic?
distxhispanic    1     -0.00582    0.000630          -9.24      <.0001

Model: collegexotherrace
Dependent Variable: collegexotherrace

Analysis of Variance
Source             DF      Sum of Squares    Mean Square    F Value    Pr > F
Model              2       168.0322          84.01609       7422.30    <.0001
Error              9224    104.4102          0.011319
Corrected Total    9226    272.4424

Root MSE          0.10639      R-Square    0.61676
Dependent Mean    0.03045      Adj R-Sq    0.61668
Coeff Var         349.35427

Parameter Estimates
Variable          DF    Estimate    Standard Error    t Value    Pr > |t|    Label
Intercept         1     -123E-20    0.001136          -0.00      1.0000      Intercept
otherrace         1     0.704137    0.007347          95.84      <.0001      Is respondent of another race?
distxotherrace    1     -0.01011    0.000616          -16.41     <.0001

Model: register
Dependent Variable: register
Label: Is respondent currently registered to vote?

Analysis of Variance
Source             DF      Sum of Squares    Mean Square    F Value    Pr > F
Model              7       10.29933          1.471333       6.74       <.0001
Error              9219    2011.839          0.218228
Corrected Total    9226    2037.394

Root MSE          0.46715     R-Square    0.00509
Dependent Mean    0.67086     Adj R-Sq    0.00434
Coeff Var         69.63453

Parameter Estimates
Variable             DF    Estimate    Standard Error    t Value    Pr > |t|    Label
Intercept            1     0.464046    0.055299          8.39       <.0001      Intercept
college              1     0.358693    0.096492          3.72       0.0002      Attended junior, community or 4year college by 1984?
black                1     0.278024    0.162701          1.71       0.0875      Is respondent black?
hispanic             1     0.176548    0.120819          1.46       0.1440      Is respondent hispanic?
otherrace            1     0.174217    0.175600          0.99       0.3212      Is respondent of another race?
collegexblack        1     -0.39859    0.302067          -1.32      0.1870
collegexhispanic     1     -0.29291    0.247842          -1.18      0.2373
collegexotherrace    1     -0.46335    0.284396          -1.63      0.1033

The SYSLIN Procedure
Three-Stage Least Squares Estimation

Cross Model Covariance
                     college     collegexblack    collegexhispanic    collegexotherrace    register
college              0.242527    0.030941         0.049084            0.011321             -.011574
collegexblack        0.030941    0.030938         0.000000            -.000000             0.006014
collegexhispanic     0.049084    0.000000         0.049078            0.000000             0.005229
collegexotherrace    0.011321    -.000000         0.000000            0.011319             0.002175
register             -.011574    0.006014         0.005229            0.002175             0.218228

Cross Model Correlation
                     college     collegexblack    collegexhispanic    collegexotherrace    register
college              1.00000     0.35720          0.44990             0.21606              -0.05031
collegexblack        0.35720     1.00000          0.00000             -0.00000             0.07319
collegexhispanic     0.44990     0.00000          1.00000             0.00000              0.05053
collegexotherrace    0.21606     -0.00000         0.00000             1.00000              0.04376
register             -0.05031    0.07319          0.05053             0.04376              1.00000

Cross Model Inverse Correlation
                     college     collegexblack    collegexhispanic    collegexotherrace    register
college              1.63559     -0.59737         -0.74491            -0.36124             0.17946
collegexblack        -0.59737    1.22359          0.27580             0.13517              -0.13946
collegexhispanic     -0.74491    0.27580          1.34184             0.16676              -0.13276
collegexotherrace    -0.36124    0.13517          0.16676             1.08172              -0.08383
register             0.17946     -0.13946
-0.13276 -0.08383 1.02961 Cross Model Inverse Covariance college collegexblack collegexhispanic collegexotherrace register college 6.74396 -6.8963 -6.8278 -6.8946 0.78005 collegexblack -6.89633 39.5499 7.0779 7.2232 -1.69731 collegexhispanic -6.82781 7.0779 27.3409 7.0751 -1.28283 collegexotherrace -6.89459 7.2232 7.0751 95.5633 -1.68675 register 0.78005 -1.6973 -1.2828 -1.6868 4.71807 System Weighted MSE 1.0187 Degrees of freedom 46113 System Weighted R-Square 0.4338 The SYSLIN Procedure Three-Stage Least Squares Estimation Model college Dependent Variable college Label Attended junior, community or 4year college by 1984? Parameter Estimates Parameter Standard Variable Variable DF Estimate Error t Value Pr > |t| Label Intercept 1 0.643723 0.008035 80.11 <.0001 Intercept distance 1 -0.00697 0.000485 -14.37 <.0001 Miles from respondents HS to nearest 2yr college black 1 -0.05782 0.013765 -4.20 <.0001 Is respondent black? hispanic 1 -0.11633 0.011958 -9.73 <.0001 Is respondent hispanic? otherrace 1 0.033595 0.019636 1.71 0.0871 Is respondent of another race? The SYSLIN Procedure Three-Stage Least Squares Estimation Model collegexblack Dependent Variable collegexblack Parameter Estimates Parameter Standard Variable Variable DF Estimate Error t Value Pr > |t| Label Intercept 1 -154E-19 0.001958 -0.00 1.0000 Intercept black 1 0.581107 0.007275 79.88 <.0001 Is respondent black? distxblack 1 -0.00635 0.000616 -10.30 <.0001 The SYSLIN Procedure Three-Stage Least Squares Estimation Model collegexhispanic Dependent Variable collegexhispanic Parameter Estimates Parameter Standard Variable Variable DF Estimate Error t Value Pr > |t| Label Intercept 1 1.34E-17 0.002577 0.00 1.0000 Intercept hispanic 1 0.519995 0.007487 69.45 <.0001 Is respondent hispanic? 
distxhispanic 1 -0.00610 0.000560 -10.89 <.0001 The SYSLIN Procedure Three-Stage Least Squares Estimation Model collegexotherrace Dependent Variable collegexotherrace Parameter Estimates Parameter Standard Variable Variable DF Estimate Error t Value Pr > |t| Label Intercept 1 6.98E-18 0.001136 0.00 1.0000 Intercept otherrace 1 0.702269 0.007220 97.27 <.0001 Is respondent of another race? distxotherrace 1 -0.00989 0.000595 -16.62 <.0001 The SYSLIN Procedure Three-Stage Least Squares Estimation Model register Dependent Variable register Label Is respondent currently registered to vote? Parameter Estimates Parameter Standard Variable Variable DF Estimate Error t Value Pr > |t| Label Intercept 1 0.462228 0.054885 8.42 <.0001 Intercept college 1 0.361883 0.095762 3.78 0.0002 Attended junior, community or 4year college by 1984? black 1 0.286554 0.161474 1.77 0.0760 Is respondent black? hispanic 1 0.187519 0.119711 1.57 0.1173 Is respondent hispanic? otherrace 1 0.149163 0.173587 0.86 0.3902 Is respondent of another race? collegexblack 1 -0.41439 0.299796 -1.38 0.1669 collegexhispanic 1 -0.31565 0.245606 -1.29 0.1988 collegexotherrace 1 -0.42303 0.281100 -1.50 0.1324
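The interaction terms in the PROC SYSLIN call (collegexblack, distxblack, and so on) are used here without a definition. The sketch below assumes, without confirmation from the source, that each one is the simple product of its two component variables; the essentially zero intercepts in the first-stage models for the interaction terms are at least consistent with that reading. The dataset name dee_chk and the _chk variable names are hypothetical, introduced only for this check.

```sas
/* Sketch only: rebuild each interaction as a product (an assumption,
   not something this section states) under hypothetical _chk names. */
data dee_chk;
  set dee;
  collegexblack_chk     = college  * black;
  collegexhispanic_chk  = college  * hispanic;
  collegexotherrace_chk = college  * otherrace;
  distxblack_chk        = distance * black;
  distxhispanic_chk     = distance * hispanic;
  distxotherrace_chk    = distance * otherrace;
run;

/* Compare the stored variables with the rebuilt ones. With no COMPARE=
   dataset, VAR and WITH pair up variables inside the BASE= dataset. */
proc compare base=dee_chk;
  var  collegexblack     collegexhispanic     collegexotherrace
       distxblack        distxhispanic        distxotherrace;
  with collegexblack_chk collegexhispanic_chk collegexotherrace_chk
       distxblack_chk    distxhispanic_chk    distxotherrace_chk;
run;
```

If PROC COMPARE reports no unequal values, the product definition matches the stored variables; any differences would mean the interactions were built some other way.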
The analyses shown in Table 10.9 include a restricted variable, so they cannot be reproduced using the available dataset.