Download the data for this chapter: https://stats.idre.ucla.edu/wp-content/uploads/2016/02/colvoucher.sas7bdat
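The procedures on this page refer to the dataset simply as colvoucher, so the downloaded file has to be made available under that name first. A minimal sketch of one way to do this, assuming the file was saved in a local folder (the path and the libref mm below are hypothetical, not part of the original example):

```sas
/* Hypothetical folder: replace with wherever colvoucher.sas7bdat was saved. */
libname mm "C:\methods_matter";

/* Copy the permanent dataset into WORK so it can be referenced as colvoucher. */
data colvoucher;
  set mm.colvoucher;
run;
```

Alternatively, each procedure could read the file directly with data = mm.colvoucher.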
Descriptive statistics for Table 11.1 on page 270.
proc means data = colvoucher;
var finish8th use_fin_aid won_lottry base_age male;
run;
The MEANS Procedure
Variable     Label                                     N        Mean     Std Dev     Minimum     Maximum
finish8th    finish8th                              1171   0.6814688   0.4661058           0   1.0000000
use_fin_aid  use_fin_aid                            1171   0.5815542   0.4935148           0   1.0000000
won_lottry   student won voucher                    1171   0.5055508   0.5001828           0   1.0000000
base_age     base_age                               1171  12.0042699   1.3470381   7.0000000  17.0000000
male         gender: equals 1 if applicant is male  1171   0.5046968   0.5001916           0   1.0000000
proc means data = colvoucher;
class won_lottry;
var finish8th use_fin_aid base_age male;
run;
The MEANS Procedure
student won    N
    voucher  Obs  Variable     Label                                     N        Mean     Std Dev     Minimum     Maximum
          0  579  finish8th    finish8th                               579   0.6252159   0.4844857           0   1.0000000
                  use_fin_aid  use_fin_aid                             579   0.2400691   0.4274945           0   1.0000000
                  base_age     base_age                                579  12.0362694   1.3518143   7.0000000  16.0000000
                  male         gender: equals 1 if applicant is male   579   0.5043178   0.5004137           0   1.0000000
          1  592  finish8th    finish8th                               592   0.7364865   0.4409110           0   1.0000000
                  use_fin_aid  use_fin_aid                             592   0.9155405   0.2783108           0   1.0000000
                  base_age     base_age                                592  11.9729730   1.3427549   9.0000000  17.0000000
                  male         gender: equals 1 if applicant is male   592   0.5050676   0.5003971           0   1.0000000
Significance tests for Table 11.1 on page 270.
proc glm data = colvoucher;
model finish8th = won_lottry;
run;
The GLM Procedure
Number of Observations Read 1171
Number of Observations Used 1171
The GLM Procedure
Dependent Variable: finish8th finish8th
Sum of
Source DF Squares Mean Square F Value Pr > F
Model 1 3.6241337 3.6241337 16.91 <.0001
Error 1169 250.5637399 0.2143402
Corrected Total 1170 254.1878736
R-Square Coeff Var Root MSE finish8th Mean
0.014258 67.93692 0.462969 0.681469
Source DF Type I SS Mean Square F Value Pr > F
won_lottry 1 3.62413371 3.62413371 16.91 <.0001
Source DF Type III SS Mean Square F Value Pr > F
won_lottry 1 3.62413371 3.62413371 16.91 <.0001
Standard
Parameter Estimate Error t Value Pr > |t|
Intercept 0.6252158895 0.01924033 32.50 <.0001
won_lottry 0.1112705970 0.02706015 4.11 <.0001
proc glm data = colvoucher;
model use_fin_aid = won_lottry;
run;
The GLM Procedure
Number of Observations Read 1171
Number of Observations Used 1171
The GLM Procedure
Dependent Variable: use_fin_aid use_fin_aid
Sum of
Source DF Squares Mean Square F Value Pr > F
Model 1 133.5541470 133.5541470 1031.16 <.0001
Error 1169 151.4074243 0.1295188
Corrected Total 1170 284.9615713
R-Square Coeff Var Root MSE use_fin_aid Mean
0.468674 61.88368 0.359887 0.581554
Source DF Type I SS Mean Square F Value Pr > F
won_lottry 1 133.5541470 133.5541470 1031.16 <.0001
Source DF Type III SS Mean Square F Value Pr > F
won_lottry 1 133.5541470 133.5541470 1031.16 <.0001
Standard
Parameter Estimate Error t Value Pr > |t|
Intercept 0.2400690846 0.01495640 16.05 <.0001
won_lottry 0.6754714559 0.02103510 32.11 <.0001
proc glm data = colvoucher;
model base_age = won_lottry;
run;
The GLM Procedure
Number of Observations Read 1171
Number of Observations Used 1171
The GLM Procedure
Dependent Variable: base_age base_age
Sum of
Source DF Squares Mean Square F Value Pr > F
Model 1 1.172741 1.172741 0.65 0.4217
Error 1169 2121.805910 1.815061
Corrected Total 1170 2122.978651
R-Square Coeff Var Root MSE base_age Mean
0.000552 11.22302 1.347242 12.00427
Source DF Type I SS Mean Square F Value Pr > F
won_lottry 1 1.17274119 1.17274119 0.65 0.4217
Source DF Type III SS Mean Square F Value Pr > F
won_lottry 1 1.17274119 1.17274119 0.65 0.4217
Standard
Parameter Estimate Error t Value Pr > |t|
Intercept 12.03626943 0.05598946 214.97 <.0001
won_lottry -0.06329646 0.07874516 -0.80 0.4217
proc glm data = colvoucher;
model male = won_lottry;
run;
The GLM Procedure
Number of Observations Read 1171
Number of Observations Used 1171
The GLM Procedure
Dependent Variable: male gender: equals 1 if applicant is male
Sum of
Source DF Squares Mean Square F Value Pr > F
Model 1 0.0001646 0.0001646 0.00 0.9796
Error 1169 292.7240028 0.2504055
Corrected Total 1170 292.7241674
R-Square Coeff Var Root MSE male Mean
0.000001 99.14968 0.500405 0.504697
Source DF Type I SS Mean Square F Value Pr > F
won_lottry 1 0.00016455 0.00016455 0.00 0.9796
Source DF Type III SS Mean Square F Value Pr > F
won_lottry 1 0.00016455 0.00016455 0.00 0.9796
Standard
Parameter Estimate Error t Value Pr > |t|
Intercept 0.5043177893 0.02079614 24.25 <.0001
won_lottry 0.0007497783 0.02924827 0.03 0.9796
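Each PROC GLM run above regresses one variable on the 0/1 lottery indicator, so the intercept is the mean among lottery losers and the won_lottry coefficient is the difference in group means (for finish8th, 0.7364865 - 0.6252159 = 0.1112706). A regression on a single binary predictor is equivalent to a pooled-variance two-sample t-test, so the same comparisons could be sketched in one step with PROC TTEST. This is an alternative offered for illustration, not the approach used on this page:

```sas
/* Two-sample t-tests comparing lottery winners and losers.
   The "Pooled" row should reproduce the GLM p-values for won_lottry;
   the sign of t may be reversed because PROC TTEST takes the
   difference in the opposite direction (group 0 minus group 1). */
proc ttest data = colvoucher;
  class won_lottry;
  var finish8th use_fin_aid base_age male;
run;
```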
Naive OLS estimate of the model shown in Table 11.2 on page 273.
proc glm data = colvoucher;
model finish8th = use_fin_aid base_age male;
run;
The GLM Procedure
Number of Observations Read 1171
Number of Observations Used 1171
The GLM Procedure
Dependent Variable: finish8th finish8th
Sum of
Source DF Squares Mean Square F Value Pr > F
Model 3 16.2307260 5.4102420 26.53 <.0001
Error 1167 237.9571476 0.2039050
Corrected Total 1170 254.1878736
R-Square Coeff Var Root MSE finish8th Mean
0.063853 66.26252 0.451558 0.681469
Source DF Type I SS Mean Square F Value Pr > F
use_fin_aid 1 5.04596346 5.04596346 24.75 <.0001
base_age 1 9.04074811 9.04074811 44.34 <.0001
male 1 2.14401441 2.14401441 10.51 0.0012
Source DF Type III SS Mean Square F Value Pr > F
use_fin_aid 1 4.14983764 4.14983764 20.35 <.0001
base_age 1 8.33854172 8.33854172 40.89 <.0001
male 1 2.14401441 2.14401441 10.51 0.0012
Standard
Parameter Estimate Error t Value Pr > |t|
Intercept 1.410282818 0.12060311 11.69 <.0001
use_fin_aid 0.120908429 0.02680125 4.51 <.0001
base_age -0.062960958 0.00984556 -6.39 <.0001
male -0.085850473 0.02647542 -3.24 0.0012
Instrumental-variables (2SLS) estimate of the model shown in Table 11.2 on page 273.
proc syslin data = colvoucher 2sls first;
endogenous use_fin_aid;
instruments won_lottry base_age male;
model finish8th = use_fin_aid base_age male;
run;
The SYSLIN Procedure
First Stage Regression Statistics
Model First St
Dependent Variable use_fin_aid
Label use_fin_aid
Analysis of Variance
Sum of Mean
Source DF Squares Square F Value Pr > F
Model 3 134.1991 44.73303 346.26 <.0001
Error 1167 150.7625 0.129188
Corrected Total 1170 284.9616
Root MSE 0.35943 R-Square 0.47094
Dependent Mean 0.58155 Adj R-Sq 0.46958
Coeff Var 61.80463
Parameter Estimates
                  Parameter    Standard
Variable     DF    Estimate       Error   t Value   Pr > |t|   Variable Label
Intercept     1    0.432760    0.095159      4.55     <.0001   Intercept
won_lottry    1    0.674527    0.021014     32.10     <.0001   student won voucher
base_age      1    -0.01516    0.007826     -1.94     0.0530   base_age
male          1    -0.02026    0.021070     -0.96     0.3365   gender: equals 1 if applicant is male
Model First St
Dependent Variable finish8th
Label finish8th
Analysis of Variance
Sum of Mean
Source DF Squares Square F Value Pr > F
Model 3 15.44596 5.148654 25.17 <.0001
Error 1167 238.7419 0.204577
Corrected Total 1170 254.1879
Root MSE 0.45230 R-Square 0.06077
Dependent Mean 0.68147 Adj R-Sq 0.05835
Coeff Var 66.37170
Parameter Estimates
                  Parameter    Standard
Variable     DF    Estimate       Error   t Value   Pr > |t|   Variable Label
Intercept     1    1.446936    0.119748     12.08     <.0001   Intercept
won_lottry    1    0.107250    0.026444      4.06     <.0001   student won voucher
base_age      1    -0.06457    0.009848     -6.56     <.0001   base_age
male          1    -0.08837    0.026514     -3.33     0.0009   gender: equals 1 if applicant is male
The SYSLIN Procedure
Two-Stage Least Squares Estimation
Model finish8th
Dependent Variable finish8th
Label finish8th
Analysis of Variance
Sum of Mean
Source DF Squares Square F Value Pr > F
Model 3 15.44596 5.148654 25.21 <.0001
Error 1167 238.3690 0.204258
Corrected Total 1170 254.1879
Root MSE 0.45195 R-Square 0.06086
Dependent Mean 0.68147 Adj R-Sq 0.05844
Coeff Var 66.31984
Parameter Estimates
                  Parameter    Standard
Variable     DF    Estimate       Error   t Value   Pr > |t|   Variable Label
Intercept     1    1.378128    0.123090     11.20     <.0001   Intercept
use_fin_aid   1    0.159000    0.039173      4.06     <.0001   use_fin_aid
base_age      1    -0.06216    0.009872     -6.30     <.0001   base_age
male          1    -0.08514    0.026504     -3.21     0.0014   gender: equals 1 if applicant is male
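To see where the 2SLS coefficient on use_fin_aid comes from, the two stages can be sketched by hand. With a single instrument and the same covariates in both equations, the estimate is also the ratio of the won_lottry coefficient in the finish8th regression (0.107250) to the won_lottry coefficient in the use_fin_aid regression (0.674527), which comes to about 0.159. The code below is an illustrative sketch only: the dataset name stage1 and the variable name use_fin_aid_hat are made up for this example, and the standard errors from the hand-run second stage are not corrected for the first-stage estimation, so they will differ from the PROC SYSLIN results.

```sas
/* Stage 1: regress the endogenous predictor on the instrument and the
   covariates, saving the fitted values (illustrative names). */
proc reg data = colvoucher noprint;
  model use_fin_aid = won_lottry base_age male;
  output out = stage1 p = use_fin_aid_hat;
run;
quit;

/* Stage 2: replace use_fin_aid with its fitted values. The coefficients
   should match the 2SLS estimates; the standard errors will not. */
proc reg data = stage1;
  model finish8th = use_fin_aid_hat base_age male;
run;
quit;

/* Ratio check using the two won_lottry coefficients from the
   first-stage output printed by PROC SYSLIN. */
data _null_;
  first_stage  = 0.674527;   /* won_lottry in the use_fin_aid equation */
  reduced_form = 0.107250;   /* won_lottry in the finish8th equation   */
  iv_ratio = reduced_form / first_stage;
  put iv_ratio= 8.6;         /* written to the SAS log */
run;
```

In practice PROC SYSLIN remains the better choice, since it reports standard errors appropriate for the 2SLS estimator.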
