


























Instructions on how to use R packages and techniques to assess the normality and homogeneity of variances for parametric statistics. It covers graphical displays, skewness and kurtosis, and statistical tests such as Shapiro-Wilk and Levene's test. The document also includes examples using the Festival dataset.



























http://www.pelagicos.net/classes_biometry_fa18.htm
Characteristics of Normal Distributions: unimodal, symmetrical, bell-shaped
A biologist was worried about the potential health effects of music festivals and measured the hygiene of 810 concert-goers over the three days of a music festival.
Hygiene was measured using a standardized index (from 0 to 4):
0 = you smell terribly
4 = you smell beautifully
Import the Download Festival data (MusicFestival.xlsx). For ease of use, rename the data set "Festival":
Festival <- DownloadFestival
[Figure: histograms with density curves for day1, day2, and day3]
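A histogram with a density overlay of this kind can be sketched in base R. This is a minimal illustration and not necessarily how the original figure was produced: the `scores` vector is simulated stand-in data (an assumption, not the Festival values), and the dashed normal curve is an extra reference line using the sample's own mean and SD.

```r
# Simulated stand-in for one hygiene column (hypothetical data, not the Festival values)
set.seed(1)
scores <- rnorm(200, mean = 1.8, sd = 0.7)

# Histogram on the density scale (bar areas sum to 1) so curves can be overlaid
h <- hist(scores, freq = FALSE,
          main = "Hygiene scores (simulated)", xlab = "Hygiene index")

# Kernel density estimate of the sample
lines(density(scores), lwd = 2)

# Normal curve with the same mean and SD as the sample, for comparison
curve(dnorm(x, mean = mean(scores), sd = sd(scores)), add = TRUE, lty = 2)
```

For a real column with missing values, such as day2 or day3, `density()`, `mean()`, and `sd()` would each need `na.rm = TRUE`.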
The solid red line is the expected pattern for a normal distribution with the same mean and SD as the sampled data. Points outside the dashed-line envelope suggest significant deviations from normality.
[Figure: plot panels for day 1, day 2, and day 3]
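Assuming the panels are normal quantile-comparison (Q-Q) plots, a rough base-R equivalent is sketched here. Both vectors are simulated (hypothetical data), chosen so that one sample tracks the reference line and the other bends away from it. Base `qqnorm()` does not draw a dashed envelope; `qqPlot()` in the car package should add one.

```r
set.seed(2)
symmetric <- rnorm(200, mean = 1.8, sd = 0.7)  # roughly normal sample (simulated)
skewed    <- rexp(200, rate = 1)               # right-skewed sample (simulated)

op <- par(mfrow = c(1, 2))                     # two panels side by side
qq_sym <- qqnorm(symmetric, main = "Roughly normal")
qqline(symmetric, col = "red")                 # reference line through the quartiles
qq_skw <- qqnorm(skewed, main = "Right-skewed")
qqline(skewed, col = "red")
par(op)                                        # restore the previous layout

# Correlation of theoretical vs. sample quantiles: closer to 1 means a straighter plot
r_sym <- cor(qq_sym$x, qq_sym$y)
r_skw <- cor(qq_skw$x, qq_skw$y)
```

The straighter the points follow the line, the closer the sample is to normal; the skewed sample curves away from the line in the upper tail.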
We can also explore the summary statistics describing the three datasets (day1, day2, day3) using Rcmdr. What statistics would you use to assess data normality?
NOTE: multiple datasets can be analyzed at once:
numSummary(Festival[, c("day1", "day2", "day3"), drop = FALSE],
           statistics = c("mean", "quantiles", "skewness", "kurtosis"),
           quantiles = c(.5), type = "2")

          mean skewness    kurtosis  50%   n  NA
day1 1.7933580 8.865312 170.4502658 1.79 810   0
day2 0.9609091 1.095226   0.8222057 0.79 264 546
day3 0.9765041 1.032868   0.7315003 0.76 123 687
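To make the skewness and kurtosis columns less of a black box, here is a hand-rolled sketch of the bias-adjusted sample formulas. It assumes that type = "2" refers to the SAS/SPSS-style estimators; the helper name `skew_kurt_type2` is hypothetical and the input vector is made up for illustration.

```r
# Hypothetical helper: bias-adjusted ("type 2") skewness and excess kurtosis
skew_kurt_type2 <- function(x) {
  x <- x[!is.na(x)]                 # drop missing values, as the n / NA columns imply
  n <- length(x)
  d <- x - mean(x)
  m2 <- mean(d^2)                   # central moments with divisor n
  m3 <- mean(d^3)
  m4 <- mean(d^4)
  g1 <- m3 / m2^1.5                 # moment-based skewness
  g2 <- m4 / m2^2 - 3               # moment-based excess kurtosis
  c(n = n,
    skewness = g1 * sqrt(n * (n - 1)) / (n - 2),
    kurtosis = ((n + 1) * g2 + 6) * (n - 1) / ((n - 2) * (n - 3)))
}

# Made-up example: one large value pulls the right tail out
res <- skew_kurt_type2(c(1, 2, 3, 4, 10, NA))
res
```

Values near 0 for both statistics are what a normal distribution would give; large positive skewness indicates a long right tail, and large positive kurtosis indicates heavy tails.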
Exploring additional datasets using other functions: the stat.desc() function in the pastecs package
stat.desc(Festival$day1, basic = FALSE, norm = TRUE)
basic argument: basic statistics included if TRUE (Note: TRUE is the default)
norm argument: statistics relating to the normal distribution included if TRUE (Note: FALSE is the default)
stat.desc(Festival$day1, basic = FALSE, norm = TRUE)

      median         mean      SE.mean  C.I.mean.0.          var      std.dev     coef.var
1.790000e+00 1.793358e+00 3.318617e-02 6.514115e-02 8.920705e-01 9.444949e-01 5.266627e-01
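The derived entries in this output can be checked by hand from the reported mean, SD, and sample size. The sketch below assumes the confidence-interval entry is the half-width of a 95% t-based interval (the level is truncated in the output above, so 0.95 is an assumption) and uses n = 810 for day1.

```r
# Values reported for day1 in the output above
n         <- 810
mean_day1 <- 1.793358
sd_day1   <- 0.9444949

se_mean  <- sd_day1 / sqrt(n)                  # standard error of the mean
ci_half  <- qt(0.975, df = n - 1) * se_mean    # CI half-width, ASSUMING a 95% level
variance <- sd_day1^2                          # var is the squared SD
coef_var <- sd_day1 / mean_day1                # coefficient of variation

round(c(SE.mean = se_mean, CI.half = ci_half,
        var = variance, coef.var = coef_var), 5)
```

Under that assumption the computed values agree with the SE.mean, confidence-interval, var, and coef.var entries to the precision shown.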