
Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
The steps involved in hypothesis testing, including choosing h0 and ha, determining the significance level, and selecting the appropriate test based on the type and distribution of data. It covers tests for one and two means, proportions, and homogeneity or independence of proportions. Tests include z-tests, t-tests, anova f-test, and χ² tests.
Typology: Study notes
1 / 1
This page cannot be seen from the preview
Don't miss anything!

If n <30, we can’t use a z or a t , we would have to use a non-level by determining which is more critical, a Type I error (then use parametric test. Two means: H 0 : 1 = 2 ( 1 2 = 0) HA is again >, < or . (Matched) paired t -test ( 10 on flowchart): IF the samples are dependent (there is some link between the 2 samples by units), we create a sample of differences, which must be normal if n >30, and use a 1 sample t -level by determining which is more critical, a Type I error (then use test using the mean, standard deviation and number of differences. This is the most powerful of the 3 tests here because we eliminate a source of variability. df = n diff 1, the number of differences minus 1. 2 sample t -test ( 9 on the flowchart): we must have normal data or n >30 for both samples and independent samples. We use s 12 and s 22 for 12 and 22. df = min( n 1 1 and n 2 1) pooled t -test ( 8 on flowchart): we must have normal data or n >30 for both samples, independent samples and the variances must be equal. We pool s 12 and s 22 to get a better estimate of ^2 , s p^2. df = ( n 1 1) + ( n 2 1) larger degrees of freedom, so more power than 2 sample t. if n 1 or n 2 are NOT > 30, we must use a non-level by determining which is more critical, a Type I error (then use parametric procedure Multiple means: H 0 : 1 = 2 =... = k (k different populations) HA: not all the means are equal ANOVA F -test : normal data, independent samples and equal ^2 ’s (same as pooled t -level by determining which is more critical, a Type I error (then use test). We compare the
F = s means^2 / s p^2 , the larger it is the further apart the means are. df (^) num = # of groups 1, df denom = total # of groups. Categorical data: proportions, percents and fractions One proportion: H 0 : = # HA can be >, < or depending on what we want to prove. 1 sample z -test ( 6 on flowchart): must have n and n (1) 10. exact binomial test ( 5 on flowchart): if n OR n (1) < 10, we have to do a binomial test. Two proportions: H 0 : 1 = 2 ( 1 2 = 0) HA is again >, < or . 2 sample z -test ( 11 on flowchart): must have n 1 1 , n 1 (1 1 ), n 2 2 , n 2 (1 2 ) 10, although it’s often relaxed to 5. Multiple proportions: H 0 : 1 = 2 = … = k HA: not all proportions are equal } test for homogeneity OR H 0 : row and column variables are independent HA: row and column variables are related } test for independence ^2 test : all cells (row/column combinations) must have a count of at least 5 (For tables larger than 2 2, we can use the approximation whenever the average of the expected counts is 5 or more and the smallest is at least 1, IPS p.626). The expected count within a cell, Eij = P(rowi)P(columnj) n , is based on the rows and columns being independent, so the further the expected is from the observed count, the larger the ^2 test statistic is, the less we believe the null. df = ( r 1)*( c 1), where r is the number of rows and c is the number of columns.