

Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Anscombe's quartet is a collection of four datasets that have identical simple statistical properties but distinct graphs. Discovered by statistician f j anscombe in 1973, these datasets demonstrate the importance of graphing data before analyzing it and the effect of outliers on statistical properties. Sas code to read and analyze the datasets, as well as generate their graphs.
Typology: Study Guides, Projects, Research
1 / 2
This page cannot be seen from the preview
Don't miss anything!


DATA anscombe; INFILE DATALINES DLM=',' DSD TRUNCOVER; LENGTH dset $3; INPUT x y @; dset='I'; OUTPUT; INPUT x y @; dset='II'; OUTPUT; INPUT x y @; dset='III'; OUTPUT; INPUT x y; dset='IV'; OUTPUT; DATALINES; 10.0,8.04,10.0,9.14,10.0,7.46,8.0,6. 8.0,6.95,8.0,8.14,8.0,6.77,8.0,5. 13.0,7.58,13.0,8.74,13.0,12.74,8.0,7. 9.0,8.81,9.0,8.77,9.0,7.11,8.0,8. 11.0,8.33,11.0,9.26,11.0,7.81,8.0,8. 14.0,9.96,14.0,8.10,14.0,8.84,8.0,7. 6.0,7.24,6.0,6.13,6.0,6.08,8.0,5. 4.0,4.26,4.0,3.10,4.0,5.39,19.0,12. 12.0,10.84,12.0,9.13,12.0,8.15,8.0,5. 7.0,4.82,7.0,7.26,7.0,6.42,8.0,7. 5.0,5.68,5.0,4.74,5.0,5.73,8.0,6. RUN; PROC SORT DATA=anscombe; BY dset x; RUN;
dset=I Number of Observations Read 11 Sum of Mean Source DF Squares Square F Value Pr > F Model 1 27.51000 27.51000 17.99 0. Error 9 13.76269 1. Corrected Total 10 41. Root MSE=1.23660 R-Square=0. Dependent Mean=7.50091 Adj R-Sq=0. Coeff Var=16. Parameter Standard t Variable DF Estimate Error Value Pr > |t| Intercept 1 3.00009 1.12475 2.67 0. x 1 0.50009 0.11791 4.24 0.
PROC REG DATA=anscombe RSQUARE; BY dset; MODEL y=x; RUN;
PROC SGPANEL DATA=anscombe; PANELBY dset; REG X=x Y=y / LEGENDLABEL="Regression"; SCATTER X=x Y=y / LEGENDLABEL="Point" MARKERATTRS=(COLOR=RED SIZE=10PT); RUN;