


















Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
MATH533: EXPLORATORY DATA ANALYSIS
Typology: Exams
1 / 26
This page cannot be seen from the preview
Don't miss anything!



















2 | P a g e
3 | P a g e
5 | P a g e
6 | P a g e B. Discuss the first individual variable, using graphical, numerical summary and interpretation. For the first individual variable, I decided to analyze the type of training, provided in the data set. There are three different types of training, which includes: ➢ Online training ➢ None or no training ➢ Group training This variable is considered to be a qualitative variable; it cannot be measured numerically. I decided to tally up the sums and create a table and pie-doughnut chart to represent my findings. The results are:
8 | P a g e what group of employees had the most experience according to the data set. According to my findings based on the table and bar graph, I noticed that employees with two-years of experience are predominate compared to the other
9 | P a g e employees with different years of experience. The data set only provides information from 0 – 5 years of experience. One can only assume that this can be a fairly new company competing in the market. If my assumption is correct, I created a simple logic chart and line graph to support my findings. BAR CHART EXPERIENCE 35 30 29 25 20 22 20 COUN T 15 10 5 0 9 Frequency (^10 ) 012345 M 0 ore NUMBER OF YEARS
LINE GRAPH YEARS OF EXPERIENCE & AMOUNT OF EMPLOYEES 35 30 29 25 22 20 20 15 10 10 10 9 5 0 1ST 2ND^ 3RD^ 4TH^ 5TH^ 6TH 11 | P a g e The line graph also shows a peak in the fourth year, which also corresponds with the employees with two-years’ experience. D. Discuss the third individual variable, using graphical, numerical summary and interpretation. For the third variable, I decided to analyze the average time on calls per week. I wanted to analyze the average time the company
12 | P a g e spent on conducting sales calls. Using the sample of 100, the company spent an average of 15.338 minutes per call. With the assistance of minitab, I was able to create a histogram to show my findings. ➢ Descriptive Statistics: Time (X2) VARIAB LE
14 | P a g e relationship is. According to Investopedia, a correlation coefficient with an absolute value of 0.9 or greater would represent a very strong relationship (Investopedia, 2018). In my findings, the result was 0.318, not close to 1. The result is not considered to be a strong relationship between the amount of calls vs sales. Another factor to consider is the coefficient of determination. According to Investopedia, the coefficient of determination, also commonly known as R 2 , is used as a guideline to measure
15 | P a g e the accuracy and is represented as a value between 0 and 1; indicates the extent that the dependent variable is predicted (Investopedia, 2018). For this analysis, amount of sales (y) is the dependent variable, and the amount sales is independent (X1) calls. Therefore, R 2 of 0. means that only 10% of the variance in (Y) Sales is predictable from (X) the calls made. Some important key details regarding particular test that have statistical significance: ➢ Value of P- does not exceed 50% ➢ P-value of .0001, statistical significance having enough evidence to suggest that the amount of calls has some sort of relationship with sales ➢ P- value =. SCATTER GRAM GRAPH
17 | P a g e employees’ years of experience and the amount of sales. The pie- doughnut and bar graph provides the amount of sales in comparison to the amount of employees’ years of experience within the company. Again, the employees with two-years of experience has the highest total of sales generated in a week. ➢ Descriptive Statistics: SALES (Y) SALES (Y)
VARIABLE YEARS MEAN SUM MINIMU MEDIA MAXIMUM (X3) M N
18 | P a g e