

Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
This is a description of my statistic notes discussing the Z table
Typology: Study notes
1 / 3
This page cannot be seen from the preview
Don't miss anything!


Individuals and Variables Individuals are objects described by a set of data. Individuals may be people, but they may also be animals or things. A variable is any characteristic of an individual. A variable can take different values for different individuals.
Categorical and Quantitative Variables A categorical variable places an individual into one of several groups or categories. A quantitative variable takes numerical values for which arithmetic operations such as adding and averaging make sense.
Distribution The distribution of a variable tells us what values the variable takes and how often it takes these variables.
Describing the Overall Pattern of a Distribution – Remember your SOCS To describe the overall pattern of a distribution, address all of the following: S pread – give the lowest and highest value in the data set O utliers – are there any values that stand out as unusual? C enter – what is the approximate average value of the data (only an estimation) S hape – does the graph show symmetry, or is it skewed in one direction (see below)
Outliers An outlier in any graph of data is an individual observation that falls outside the overall pattern of the graph.
Describing the SHAPE of a distribution – Symmetric and Skewed Distributions
Symmetric Skewed Left Skewed Right
Mean = Median Mean < Median Mean > Median
Time Plot A time plot of a variable plots each observation against the time at which it was measured. Always mark the time scale on the horizontal axis and the variable of interest on the vertical axis. If there are not too many points, connecting the points by lines helps show the pattern of changes over time.
To find the mean of a set of observations, add their values and divide by the number of observations. If the n observations are x1, x2, …, xn, their mean is:
1
n i i
The Median (M) The median M is the midpoint of distribution, the number such that half the observations are smaller and the other half are larger. To find the median of distribution: Arrange all observation in order of size, from smallest to largest. If the number of observations n is odd, the median M is the center observation in the ordered list. The position of the center observation can be found at ( n + 1) / 2 If the number of observations n is even, the median M is the mean of the two center observations in the ordered list. The position of the two middle values are n /2 and n /2 + 1
The Five-Number Summary The five-number summary of a data set consists of the smallest observation, the first quartile, the median, the third quartile, and the largest observation, written in order from smallest to largest. In symbols, the five-number summary is:
Minimum – Q 1 – M – Q 3 – Maximum
The Quartiles (Q 1 and Q 3 ) To calculate the quartiles, arrange the observations in increasing order and locate the median M in the ordered list of observations. The 1st quartile (Q1) is middle number of the values that are less than the median. The 3rd quartile (Q3) is the middle number of the values that are greater than the median. Example 2 14 28 29 30 32 33 34 40 42 52 Min Q1 Med Q3 Max
The Interquartile Range (IQR) The IQR is the distance between the first and third quartiles, IQR = Q3 - Q
Outliers: The 1.5 x IQR Criterion Call an observation an outlier if it falls more than 1.5 x IQR below the first quartile or above the third quartile. Using the 5-number summary from above as an example (IQR = 40-28=12)
the 2 is an outlier.