












Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
This lecture is from Statistics. Key important points are: Quantitative Representation, Crosstabulation and Scatter Diagram, Exploratory Data Analysis, Crosstabulations and Scatter Diagrams, One Variable, Crosstabulation, Finger Lakes Homes, Greatest Number of Homes, Row Or Column Percentages, Column Percentages
Typology: Slides
Uploaded on 01/29/2013
1 / 20
This page cannot be seen from the preview
Don't miss anything!













n Crosstabulation and Scatter Diagram
The techniques of exploratory data analysis consist of simple arithmetic and easy-to-draw pictures that can be used to summarize data quickly.
The left and top margin labels define the classes for the two variables.
Crosstabulation can be used when:
A crosstabulation is a tabular summary of data for two variables.
Price Range Colonial Log Split A-Frame Total
< $200,
$200,
45
Total 30 20 35 15 100
Home Style
Example: Finger Lakes Homes
quantitative variable
categorical variable
The number of Finger Lakes homes sold for each style and price for the past two years is shown below.
Price Range Colonial Log Split A-Frame Total
< $200,
$200,
45
Total 30 20 35 15 100
Home Style
Frequency distribution for the price range variable
Frequency distribution for the home style variable
Example: Finger Lakes Homes
Price Range (^) Colonial Log Split A-Frame
< $200,
$200,
Home Style
Total 100 100 100 100
(Colonial and > $200K)/(All Colonial) x 100 = (12/30) x 100
Example: Finger Lakes Homes
In some cases the conclusions based upon an aggregated crosstabulation can be completely reversed if we look at the unaggregated data. The reversal of conclusions based on aggregate and unaggregated data is called Simpson’s paradox.
We must be careful in drawing conclusions about the relationship between the two variables in the aggregated crosstabulation.
Data in two or more crosstabulations are often aggregated to produce a summary crosstabulation.
x = Number of Interceptions
y = Number of Points Scored
The Panthers football team is interested in investigating the relationship, if any, between interceptions made and points scored.
y
x
Number of Interceptions
Number of Points Scored
Scatter Diagram for the Panthers
0
5
10
15
20
25
30
35
0 1 2 3 4 Number of Interceptions
Number of Points Scored.
Categorical Data Quantitative Data
Tabular Methods
Tabular Methods
Graphical Methods
Graphical Methods
Data