Docsity
Docsity

Prepara tus exámenes
Prepara tus exámenes

Prepara tus exámenes y mejora tus resultados gracias a la gran cantidad de recursos disponibles en Docsity


Consigue puntos base para descargar
Consigue puntos base para descargar

Gana puntos ayudando a otros estudiantes o consíguelos activando un Plan Premium


Orientación Universidad
Orientación Universidad


Estadística 06 2014, Exámenes de Estadística

Examen final statistics

Tipo: Exámenes

2013/2014

Subido el 31/05/2014

iranzu99
iranzu99 🇪🇸

5

(1)

3 documentos

1 / 5

Toggle sidebar

Esta página no es visible en la vista previa

¡No te pierdas las partes importantes!

bg1
Statistics I Final Exam, 24 June 2014.
Degrees in ADE, DER-ADE, ADE-INF, FICO, ECO, ECO-DER.
EXAM RULES: 1) Use separate booklets for each problem. 2) Perform the calculations with at
least two significant decimal places. 3) You may not leave the exam during the first 30 minutes.
4) You are not allowed to leave the classroom without handing in the exam.
1. The following data set represents the number of sold cars in a certain month in 23 car concessionaires:
9 8 14 23 23 27 18 18 8 21 20 18 14 20 29 18 33 19 23 24 24 14 19
(a) (0.5 points) Is the monthly sales mean of this data set larger than 20? Is the monthly sales median
of this data set larger than 20? Is it true that the most frequent monthly sale is 20? Justify your
answers.
(b) (0.5 points) Compute the sample quasi-variance and coefficient of variation of the data set.
(c) (1 point) Draw a boxplot for the data set, obtaining the appropriate numerical measures for that. Is
it true that the distribution shape is heavily positive skewed? Justify your answer.
(d) (0.5 points) Based on the boxplot drawn in the previous item, are there outliers in the data set?
Justify your answer.
Solution.
(a) The sample mean is x= 19.30. The sample median is M= 19. The sample mode is 18. Then, the
answer is NO for all the questions.
(b) The sample quasi-variance is bσ2= 40.13043. The sample coefficient of variation is 0.3281.
(c) The quartiles are Q1= 14, Q2= 19 and Q3= 23, respectively. Then, the sample IQR is I QR =
23 14 = 9. The boxplot is:
10 15 20 25 30
The distribution is slightly right skewed. Therefore, it is NOT heavily positive skewed.
(d) On the one hand, we have Q11.5IQR = 141.5×9 = 0.5. On the other hand, we have Q3+1.5IQR =
23 + 1.5×9 = 36.5. As the minimum and maximum of values in the sample are 8 and 33, respectively,
and 0.5<8 and 36.5>33, we conclude that there are no outliers in the sample.
pf3
pf4
pf5

Vista previa parcial del texto

¡Descarga Estadística 06 2014 y más Exámenes en PDF de Estadística solo en Docsity!

Statistics I Final Exam, 24 June 2014. Degrees in ADE, DER-ADE, ADE-INF, FICO, ECO, ECO-DER.

EXAM RULES: 1) Use separate booklets for each problem. 2) Perform the calculations with at least two significant decimal places. 3) You may not leave the exam during the first 30 minutes.

  1. You are not allowed to leave the classroom without handing in the exam.
  1. The following data set represents the number of sold cars in a certain month in 23 car concessionaires:

9 8 14 23 23 27 18 18 8 21 20 18 14 20 29 18 33 19 23 24 24 14 19

(a) (0.5 points) Is the monthly sales mean of this data set larger than 20? Is the monthly sales median of this data set larger than 20? Is it true that the most frequent monthly sale is 20? Justify your answers. (b) (0.5 points) Compute the sample quasi-variance and coefficient of variation of the data set. (c) (1 point) Draw a boxplot for the data set, obtaining the appropriate numerical measures for that. Is it true that the distribution shape is heavily positive skewed? Justify your answer. (d) (0.5 points) Based on the boxplot drawn in the previous item, are there outliers in the data set? Justify your answer.

Solution.

(a) The sample mean is x = 19.30. The sample median is M = 19. The sample mode is 18. Then, the answer is NO for all the questions. (b) The sample quasi-variance is ̂σ^2 = 40.13043. The sample coefficient of variation is 0.3281. (c) The quartiles are Q 1 = 14, Q 2 = 19 and Q 3 = 23, respectively. Then, the sample IQR is IQR = 23 − 14 = 9. The boxplot is:

10 15 20 25 30

The distribution is slightly right skewed. Therefore, it is NOT heavily positive skewed. (d) On the one hand, we have Q 1 − 1. 5 IQR = 14− 1. 5 ×9 = 0.5. On the other hand, we have Q 3 +1. 5 IQR = 23 + 1. 5 × 9 = 36.5. As the minimum and maximum of values in the sample are 8 and 33, respectively, and 0. 5 < 8 and 36. 5 > 33, we conclude that there are no outliers in the sample.

  1. A young entrepreneur is starting a business in a city. Following certain information about similar businesses in the city, it is known that the 25% have annual income below 1 million euros (included), the 50% of them have annual income between 1 (not included) and 2 million euros (included), and the rest have annual income between 2 (not included) and 3 million euros (included). Given the previous information, and assuming that businesses are uniformly distributed among each of the considered anual income intervals, answer the following questions:

(a) (0.75 points) Obtain the cumulative distribution function of the business annual income (in million euros). (b) (0.25 points) Obtain the probability that the business annual income is between 0.5 (not included) and 1.5 million euros (included). (c) (0.75 points) Compute the expected value and the standard deviation of the business annual income. (d) (0.75 points) If the business is successful, the young entrepreneur plans to start another business smaller than the previous one and with annual income equal to a quarter of the first business annual income. Which are the expected annual income (in million euros) of the second business? and the standard deviation?

Solution.

(a) Let X be the business annual income (in million euros). Then, the probability density function is given by:

f (x) =

1 41 x^ ≤^1 21 1 < x^ ≤^2 4 2 < x^ ≤^3 0 en otro caso

Therefore, the cumulative distribution function can be computed as follows:

  • For x ≤ 0, F (x) = 0
  • For 0 < x ≤ 1, F (x) =

∫ (^) x

0

dx =

x 4

  • For 1 < x ≤ 2, F (x) =

0

dx +

∫ (^) x

1

dx =

(x − 1) =

x −

  • For 2 < x ≤ 3,

F (x) =

0

dx +

1

dx +

∫ (^) x

2

dx =

(x − 2) =

(x + 1)

  • For 3 < x, F (x) = 1 In summary,

F (x) =

0 x ≤ 0 x 1 4 0 < x^ ≤^1 2

x − (^12)

1 < x ≤ 2 1 4 (x^ + 1)^2 < x^ ≤^3 1 3 < x

(b) The probability that the business annual income is between 0.5 (not included) and 1.5 million euros (included) is given by:

Pr (0. 5 < x ≤ 1 .5) = F (1.5) − F (0.5) =

(c) The mean, variance and standard deviation of N is given by:

E [N ] = 25 × 0 .12 = 3 V [N ] = 25 × 0. 12 × 0 .88 = 2. 64 DT [N ] =

Therefore, the mean, variance and standard deviation of the total sales commission earned in a day are given by:

E [45N ] = 45 × E [N ] = 45 × 3 = 135 euros V [45N ] = 452 × V [N ] = 5346 euros^2 DT [45N ] = 45 × DT [N ] = 45 × 1 .6248 = 73.116 euros

(d) The number of sales in a workweek is M ∼ Bin (125, 0 .12). Therefore, the total commission earned in a workweek is W = 45M. The mean, variance and standard deviation of M is given by:

E [M ] = 125 × 0 .12 = 15 V [M ] = 125 × 0. 12 × 0 .88 = 13. 2 DT [M ] =

Using the CLT, we obtain:

Pr (W > 1000) = Pr (45M > 1000) = Pr (M > 22 .22) =

= Pr

M − 15

' Pr (Z > 1 .9872) =

= 1 − Pr (Z < 1 .9872) = 1 − 0 .9761 = 0. 0239

where Z ∼ N (0, 1).

  1. The clients of a certain bank may deposit or withdraw money. Let X be a transaction made by a client. Then, X is positive, if the money is deposited, or negative, if the money is withdrawn. The bank analysts assume that the expectation and the standard deviation of X are 0 and 100 euros, respectively.

(a) (1 point) Compute the probability that the average transaction made by 100 independent clients is between −3 and 3 euros. (b) (0.5 points) Without doing any additional calculations, decide if the probability in (a) will increase or decrease if we increase the sample size. Justify your answer. (c) (0.5 points) Let assume that exactly 100 clients will visit the bank in one day. Compute the probability that the sum of transactions at the end of the day is positive. (d) (0.5 points) Even though the bank assumed that the true expectation of X equals 0, the bank analysts suspect that lately the clients are mostly depositing and not withdrawing money. To check this suspicion, the analysts collect a sample of 25 transactions made by independent clients and obtain a sample mean of 50 euros. Obtain a 90% confidence interval for the mean assuming that X follows a normal distribution with standard deviation 100. How confident are you that the true mean is in fact larger than zero?

Solution:

(a) Since n ≥ 30, by CLT we have that X ∼ N

0 , 100 √n

= N (0, 10). Then, the required probability is given by:

Pr

− 3 < X < 3

= Pr

X

' Pr (− 0. 3 < Z < 0 .3) =

= Pr (Z < 0 .3) − Pr (Z < − 0 .3) = 0. 6179 − 0 .3821 = 0. 2358

where Z ∼ N (0, 1).

(b) If the sample size increases, the variance of X decreases. Therefore, the probability for X to be in the interval (− 3 , 3) will increase. (c) We want to compute the following probability:

Pr

( (^) n ∑

i=

Xi > 0

This is similar to: Pr

X > 0

Then, as in a), by CLT we have that X ∼ N

0 , 100 √n

= N (0, 10). Therefore, the required probability is given by:

Pr

( (^) n ∑

i=

Xi > 0

= Pr

X > 0

= Pr

X

' Pr (Z > 0) = 0. 5

where Z ∼ N (0, 1).

(d) The confidence interval for the mean is given by:

( 50 − 1. 65

Consequently, we are more than 90% confident that the true mean is positive.