Prepare for your exams
Get points
Guidelines and tips
Sell on Docsity
Docsity AI

Prepare for your exams

Study with the several resources on Docsity

Earn points to download

Earn points by helping other students or get them with a premium plan

Guidelines and tips

Sell on Docsity

Docsity AI

Log in Sign up

Prepare for your exams

Study with the several resources on Docsity

Find documents

Prepare for your exams with the study notes shared by other students like you on Docsity

Search for your university

Find the specific documents for your university's exams

Docsity AINEW

Summarize your documents, ask them questions, convert them into quizzes and concept maps

Explore questions

Clear up your doubts by reading the answers to questions asked by your fellow students

Earn points to download

Earn points by helping other students or get them with a premium plan

Share documents

20 Points

For each uploaded document

Answer questions

5 Points

For each given answer (max 1 per day)

All the ways to get free points

Get points immediately

Choose a premium plan with all the points you need

Study Opportunities

Choose your next study program

Get in touch with the best universities in the world. Search through thousands of universities and official partners

Community

Ask the community

Ask the community for help and clear up your study doubts

Free resources

Our save-the-student-ebooks!

Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors

Statistics for AI and CS: Hypothesis Testing Assignment, Exercises of Statistics

Rijksuniversiteit Groningen Statistics

A statistics assignment focusing on hypothesis testing for two samples. The assignment covers concepts such as left-skewed distributions, p-values, type i and ii errors, and hypothesis testing procedures for means using the two-sample independent t-test. The document also includes examples of applying these concepts to real data sets, including one on body balance and another on the nhanes cholesterol study.

Typology: Exercises

2018/2019

Uploaded on 04/03/2019

unknown user 🇳🇱

1 / 4

This page cannot be seen from the preview

Don't miss anything!

Statistics For AI and CS

Hypothesis Testing

Assignment 1

Awin Gray (s3073521)

September 29, 2017

1 General Questions

(a) A left-skewed distribution will have its median being greater than the mean because extreme observa-

tions on the lower side will pull the average towards the low values.

(b) P-value is the probability of observing the results as specified in the null hypothesis given that the null

hypothesis is true.

Type I error occurs when a researcher rejects the null hypothesis when the null hypothesis is true, thereby

accepting a false positive.

Type II error occurs when the researcher fails to reject a false null hypothesis, thereby accepting a false

negative.

These two conceptual types help distinguish between a null hypothesis and an alternative hypothesis.

(c)

1. First, the research states the null and the alternative hypothesis

2. Decides on the level of significance

3. Collection of the data

4. Then chooses the test statistic to use and calculate the statistic

5. Construction of the rejection region

6. Based on the rejection region and the test statistic, one decides whether to reject the null hypothesis

2 Keeping body balance

1

Discover Exercises of Statistics Rijksuniversiteit Groningen

Partial preview of the text

Download Statistics for AI and CS: Hypothesis Testing Assignment and more Exercises Statistics in PDF only on Docsity!

Statistics For AI and CS

Hypothesis Testing

Assignment 1

Awin Gray (s3073521)

September 29, 2017

1 General Questions

(a) A left-skewed distribution will have its median being greater than the mean because extreme observa- tions on the lower side will pull the average towards the low values.

(b) P-value is the probability of observing the results as specified in the null hypothesis given that the null hypothesis is true. Type I error occurs when a researcher rejects the null hypothesis when the null hypothesis is true, thereby accepting a false positive. Type II error occurs when the researcher fails to reject a false null hypothesis, thereby accepting a false negative.

These two conceptual types help distinguish between a null hypothesis and an alternative hypothesis.

(c)

First, the research states the null and the alternative hypothesis
Decides on the level of significance
Collection of the data
Then chooses the test statistic to use and calculate the statistic
Construction of the rejection region
Based on the rejection region and the test statistic, one decides whether to reject the null hypothesis

2 Keeping body balance

Figure 1: box-and-whiskers plot

(a) There is an outlier observed for the Elderly category while there is no outlier in the Young category as seen in Figure 1. The lower fence, the first quartile, the median, the third quartile and the upper fence for the elderly are all greater than the corresponding percentiles for the young individuals.

(b) It is assumed that the data are normally distributed when constructing the confidence interval and that there are no extreme outliers. These assumptions seem to be tenable. The two populations from which the data is drawn have the same variance. It is also assumed that the observations are independent of each other.

R-output:

Welch Two Sample t-test

data: sway$FBSway by sway$Age t = 2.3035, df = 10.971, p-value = 0. alternative hypothesis: true difference in means is not equal to 0 95 percent confidence interval: 0.3627401 16. sample estimates: mean in group Elderly mean in group Young 26.33333 18.

The 95% confidence interval for the difference in means is (0.3627, 16.0539). Observe that 0 is not included in the confidence interval.

(e) The sample data is inconsistent with the null hypothesis, therefore we reject the null hypothesis and instead conclude that the alternative hypothesis is true. There is a difference in proportions of high cholesterol level across the age groups. Therefore, it can be concluded that age is a factor that influences having high cholesterol.

(f) If we altered the sampling method in the design and for example - use simple random sampling or systematic sampling instead, a significant difference might occur with respect to the measures of central tendency and dispersion to those yielded by the sampling technique used by the study’s sampling method. The difference could have an impact of altering the degree of association between age and the level of high cholesterol.

Using independent sample t-test to test for difference in means of cholesterol level between the old and the young might present contradicting results. There would be no recording of the cholesterol variable in that case.

Statistics for AI and CS: Hypothesis Testing Assignment, Exercises of Statistics

Related documents

Partial preview of the text

Download Statistics for AI and CS: Hypothesis Testing Assignment and more Exercises Statistics in PDF only on Docsity!

Statistics For AI and CS

Hypothesis Testing

Assignment 1

1 General Questions

2 Keeping body balance