You can download the dataset for this chapter by clicking here: https://stats.idre.ucla.edu/wp-content/uploads/2016/02/dynarski.sas7bdat.
Descriptive statistics and cross-tabulations for key variables. (Not shown in text.)
proc surveymeans data=dynarski;
cluster hhid;
var coll;
weight wt88;
run;
The SURVEYMEANS Procedure
Data Summary
Number of Clusters 3123
Number of Observations 3986
Sum of Weights 1302933368
Statistics
Std Error
Variable Label N Mean of Mean
coll Full-time college by age 23 3986 0.494350 0.010515
Statistics
Variable 95% CL for Mean
coll 0.47373264 0.51496809
proc freq data=dynarski;
table fatherdec*yearsr;
run;
The FREQ Procedure
Table of fatherdec by yearsr
fatherdec(Father deceased by age 18)
yearsr(Year in which a senior)
Frequency
Percent
Row Pct
Col Pct 79 80 81 82 83 Total
Father not decea 892 986 867 828 222 3795
sed 22.38 24.74 21.75 20.77 5.57 95.21
23.50 25.98 22.85 21.82 5.85
95.61 95.73 94.34 95.28 94.47
Father deceased 41 44 52 41 13 191
1.03 1.10 1.30 1.03 0.33 4.79
21.47 23.04 27.23 21.47 6.81
4.39 4.27 5.66 4.72 5.53
Total 933 1030 919 869 235 3986
23.41 25.84 23.06 21.80 5.90 100.00
(a) Direct Estimate shown in Table 8.1 on page 143. The means of coll are shown in the rows labeled _subpop_3 and _subpop_4. This output also replicates part of the table of variable means and differences from the Dynarski article referenced in the chapter (not shown in the text).
proc sort data=dynarski;
by fatherdec offer;
run;
proc surveymeans data=dynarski;
domain fatherdec*offer;
cluster hhid;
var coll;
weight wt88;
run;
The SURVEYMEANS Procedure
Data Summary
Number of Clusters 3123
Number of Observations 3986
Sum of Weights 1302933368
Statistics
Std Error
Variable Label N Mean of Mean
coll Full-time college by age 23 3986 0.494350 0.010515
Statistics
Variable 95% CL for Mean
coll 0.47373264 0.51496809
Domain Analysis: Father deceased by age 18*Senior in year SSSB support available
Senior in
year SSSB
Father deceased support
by age 18 available Variable Label N Mean
Father not deceased 0 coll Full-time college by age 23 1050 0.475694
1 coll Full-time college by age 23 2745 0.501702
Father deceased 0 coll Full-time college by age 23 54 0.352218
1 coll Full-time college by age 23 137 0.560456
Domain Analysis: Father deceased by age 18*Senior in year SSSB support available
Senior in
year SSSB
Father deceased support Std Error
by age 18 available Variable of Mean 95% CL for Mean
Father not deceased 0 coll 0.018865 0.43870461 0.51268248
1 coll 0.012174 0.47783269 0.52557057
Father deceased 0 coll 0.081245 0.19291968 0.51151598
1 coll 0.052744 0.45703939 0.66387184
(b) Linear-Probability Model (OLS) Estimate shown in Table 8.1 on page 143.
proc surveyreg data=dynarski;
model coll = offer;
cluster hhid;
weight wt88;
where fatherdec=1;
run;
The SURVEYREG Procedure
Regression Analysis for Dependent Variable coll
Data Summary
Number of Observations 191
Sum of Weights 51656801
Weighted Mean of coll 0.49986
Weighted Sum of coll 25821369
Design Summary
Number of Clusters 172
Fit Statistics
R-square 0.03578
Root MSE 0.4936
Denominator DF 171
Tests of Model Effects
Effect Num DF F Value Pr > F
Model 1 4.93 0.0277
Intercept 1 18.59 <.0001
offer 1 4.93 0.0277
NOTE: The denominator degrees of freedom for the F tests is 171.
Estimated Regression Coefficients
Standard
Parameter Estimate Error t Value Pr > |t|
Intercept 0.35221783 0.08168396 4.31 <.0001
offer 0.20823778 0.09375011 2.22 0.0277
Figure 8.1 on page 155. Note that some of the output from the following section of code is not shown.
data dynarski;
set dynarski;
if offer=1 then post = 0;
if offer=0 then post = 1;
run;
proc sort data=dynarski;
by fatherdec post;
run;
proc surveymeans data=dynarski;
by fatherdec post;
cluster hhid;
var coll;
weight wt88;
ods output statistics = table81;
run;
proc print data=table81;
run;
proc format;
value post 0="pre-1981"
1="post-1981";
run;
proc sgplot data=table81;
by fatherdec;
series x=post y=mean ;
yaxis min=.3 max=.6;
xaxis tickvalueformat=post. values=(0 1);
run;
Table 8.2 on page 157, labeled "(First Diff)". (Note this replicates the last regression model.)
proc surveyreg data=dynarski;
model coll = offer;
cluster hhid;
weight wt88;
where fatherdec = 1 ;
run;
The SURVEYREG Procedure
Regression Analysis for Dependent Variable coll
Data Summary
Number of Observations 191
Sum of Weights 51656801
Weighted Mean of coll 0.49986
Weighted Sum of coll 25821369
Design Summary
Number of Clusters 172
Fit Statistics
R-square 0.03578
Root MSE 0.4936
Denominator DF 171
Tests of Model Effects
Effect Num DF F Value Pr > F
Model 1 4.93 0.0277
Intercept 1 18.59 <.0001
offer 1 4.93 0.0277
NOTE: The denominator degrees of freedom for the F tests is 171.
Estimated Regression Coefficients
Standard
Parameter Estimate Error t Value Pr > |t|
Intercept 0.35221783 0.08168396 4.31 <.0001
offer 0.20823778 0.09375011 2.22 0.0277
Table 8.2 on page 157, labeled "(Second Diff)".
proc surveyreg data=dynarski;
model coll = offer;
cluster hhid;
weight wt88;
where fatherdec = 0;
run;
The SURVEYREG Procedure
Regression Analysis for Dependent Variable coll
Data Summary
Number of Observations 3795
Sum of Weights 1251276567
Weighted Mean of coll 0.49412
Weighted Sum of coll 618284217
Design Summary
Number of Clusters 2984
Fit Statistics
R-square 0.000559
Root MSE 0.5000
Denominator DF 2983
Tests of Model Effects
Effect Num DF F Value Pr > F
Model 1 1.50 0.2215
Intercept 1 635.66 <.0001
offer 1 1.50 0.2215
NOTE: The denominator degrees of freedom for the F tests is 2983.
Estimated Regression Coefficients
Standard
Parameter Estimate Error t Value Pr > |t|
Intercept 0.47569354 0.01886756 25.21 <.0001
offer 0.02600809 0.02126729 1.22 0.2215
Table 8.4 on page 161.
proc surveyreg data=dynarski;
model coll = offer fatherdec offer*fatherdec;
cluster hhid;
weight wt88;
estimate offer 1 offer*fatherdec 1 ;
run;
The SURVEYREG Procedure
Regression Analysis for Dependent Variable coll
Data Summary
Number of Observations 3986
Sum of Weights 1302933368
Weighted Mean of coll 0.49435
Weighted Sum of coll 644105586
Design Summary
Number of Clusters 3123
Fit Statistics
R-square 0.001961
Root MSE 0.4997
Denominator DF 3122
Tests of Model Effects
Effect Num DF F Value Pr > F
Model 3 2.19 0.0875
Intercept 1 635.36 <.0001
offer 1 1.49 0.2216
fatherdec 1 2.19 0.1391
offer*fatherdec 1 3.61 0.0574
NOTE: The denominator degrees of freedom for the F tests is 3122.
Estimated Regression Coefficients
Standard
Parameter Estimate Error t Value Pr > |t|
Intercept 0.4756935 0.01887204 25.21 <.0001
offer 0.0260081 0.02127234 1.22 0.2216
fatherdec -0.1234757 0.08345651 -1.48 0.1391
offer*fatherdec 0.1822297 0.09587712 1.90 0.0574
The SURVEYREG Procedure
Regression Analysis for Dependent Variable coll
Estimate
Standard
Label Estimate Error DF t Value Pr > |t|
Row 1 0.2082 0.09328 3122 2.23 0.0257
