



Prepara tus exámenes y mejora tus resultados gracias a la gran cantidad de recursos disponibles en Docsity
Gana puntos ayudando a otros estudiantes o consíguelos activando un Plan Premium
Prepara tus exámenes
Prepara tus exámenes y mejora tus resultados gracias a la gran cantidad de recursos disponibles en Docsity
Prepara tus exámenes con los documentos que comparten otros estudiantes como tú en Docsity
Encuentra los documentos específicos para los exámenes de tu universidad
Estudia con lecciones y exámenes resueltos basados en los programas académicos de las mejores universidades
Responde a preguntas de exámenes reales y pon a prueba tu preparación
Consigue puntos base para descargar
Gana puntos ayudando a otros estudiantes o consíguelos activando un Plan Premium
Comunidad
Pide ayuda a la comunidad y resuelve tus dudas de estudio
Ebooks gratuitos
Descarga nuestras guías gratuitas sobre técnicas de estudio, métodos para controlar la ansiedad y consejos para la tesis preparadas por los tutores de Docsity
Examen final statistics
Tipo: Exámenes
1 / 5
Esta página no es visible en la vista previa
¡No te pierdas las partes importantes!




Statistics I Final Exam, 24 June 2014. Degrees in ADE, DER-ADE, ADE-INF, FICO, ECO, ECO-DER.
EXAM RULES: 1) Use separate booklets for each problem. 2) Perform the calculations with at least two significant decimal places. 3) You may not leave the exam during the first 30 minutes.
9 8 14 23 23 27 18 18 8 21 20 18 14 20 29 18 33 19 23 24 24 14 19
(a) (0.5 points) Is the monthly sales mean of this data set larger than 20? Is the monthly sales median of this data set larger than 20? Is it true that the most frequent monthly sale is 20? Justify your answers. (b) (0.5 points) Compute the sample quasi-variance and coefficient of variation of the data set. (c) (1 point) Draw a boxplot for the data set, obtaining the appropriate numerical measures for that. Is it true that the distribution shape is heavily positive skewed? Justify your answer. (d) (0.5 points) Based on the boxplot drawn in the previous item, are there outliers in the data set? Justify your answer.
Solution.
(a) The sample mean is x = 19.30. The sample median is M = 19. The sample mode is 18. Then, the answer is NO for all the questions. (b) The sample quasi-variance is ̂σ^2 = 40.13043. The sample coefficient of variation is 0.3281. (c) The quartiles are Q 1 = 14, Q 2 = 19 and Q 3 = 23, respectively. Then, the sample IQR is IQR = 23 − 14 = 9. The boxplot is:
10 15 20 25 30
The distribution is slightly right skewed. Therefore, it is NOT heavily positive skewed. (d) On the one hand, we have Q 1 − 1. 5 IQR = 14− 1. 5 ×9 = 0.5. On the other hand, we have Q 3 +1. 5 IQR = 23 + 1. 5 × 9 = 36.5. As the minimum and maximum of values in the sample are 8 and 33, respectively, and 0. 5 < 8 and 36. 5 > 33, we conclude that there are no outliers in the sample.
(a) (0.75 points) Obtain the cumulative distribution function of the business annual income (in million euros). (b) (0.25 points) Obtain the probability that the business annual income is between 0.5 (not included) and 1.5 million euros (included). (c) (0.75 points) Compute the expected value and the standard deviation of the business annual income. (d) (0.75 points) If the business is successful, the young entrepreneur plans to start another business smaller than the previous one and with annual income equal to a quarter of the first business annual income. Which are the expected annual income (in million euros) of the second business? and the standard deviation?
Solution.
(a) Let X be the business annual income (in million euros). Then, the probability density function is given by:
f (x) =
1 41 x^ ≤^1 21 1 < x^ ≤^2 4 2 < x^ ≤^3 0 en otro caso
Therefore, the cumulative distribution function can be computed as follows:
∫ (^) x
0
dx =
x 4
0
dx +
∫ (^) x
1
dx =
(x − 1) =
x −
F (x) =
0
dx +
1
dx +
∫ (^) x
2
dx =
(x − 2) =
(x + 1)
F (x) =
0 x ≤ 0 x 1 4 0 < x^ ≤^1 2
x − (^12)
1 < x ≤ 2 1 4 (x^ + 1)^2 < x^ ≤^3 1 3 < x
(b) The probability that the business annual income is between 0.5 (not included) and 1.5 million euros (included) is given by:
Pr (0. 5 < x ≤ 1 .5) = F (1.5) − F (0.5) =
(c) The mean, variance and standard deviation of N is given by:
E [N ] = 25 × 0 .12 = 3 V [N ] = 25 × 0. 12 × 0 .88 = 2. 64 DT [N ] =
Therefore, the mean, variance and standard deviation of the total sales commission earned in a day are given by:
E [45N ] = 45 × E [N ] = 45 × 3 = 135 euros V [45N ] = 452 × V [N ] = 5346 euros^2 DT [45N ] = 45 × DT [N ] = 45 × 1 .6248 = 73.116 euros
(d) The number of sales in a workweek is M ∼ Bin (125, 0 .12). Therefore, the total commission earned in a workweek is W = 45M. The mean, variance and standard deviation of M is given by:
E [M ] = 125 × 0 .12 = 15 V [M ] = 125 × 0. 12 × 0 .88 = 13. 2 DT [M ] =
Using the CLT, we obtain:
Pr (W > 1000) = Pr (45M > 1000) = Pr (M > 22 .22) =
= Pr
' Pr (Z > 1 .9872) =
= 1 − Pr (Z < 1 .9872) = 1 − 0 .9761 = 0. 0239
where Z ∼ N (0, 1).
(a) (1 point) Compute the probability that the average transaction made by 100 independent clients is between −3 and 3 euros. (b) (0.5 points) Without doing any additional calculations, decide if the probability in (a) will increase or decrease if we increase the sample size. Justify your answer. (c) (0.5 points) Let assume that exactly 100 clients will visit the bank in one day. Compute the probability that the sum of transactions at the end of the day is positive. (d) (0.5 points) Even though the bank assumed that the true expectation of X equals 0, the bank analysts suspect that lately the clients are mostly depositing and not withdrawing money. To check this suspicion, the analysts collect a sample of 25 transactions made by independent clients and obtain a sample mean of 50 euros. Obtain a 90% confidence interval for the mean assuming that X follows a normal distribution with standard deviation 100. How confident are you that the true mean is in fact larger than zero?
Solution:
(a) Since n ≥ 30, by CLT we have that X ∼ N
0 , 100 √n
= N (0, 10). Then, the required probability is given by:
Pr
= Pr
' Pr (− 0. 3 < Z < 0 .3) =
= Pr (Z < 0 .3) − Pr (Z < − 0 .3) = 0. 6179 − 0 .3821 = 0. 2358
where Z ∼ N (0, 1).
(b) If the sample size increases, the variance of X decreases. Therefore, the probability for X to be in the interval (− 3 , 3) will increase. (c) We want to compute the following probability:
Pr
( (^) n ∑
i=
Xi > 0
This is similar to: Pr
Then, as in a), by CLT we have that X ∼ N
0 , 100 √n
= N (0, 10). Therefore, the required probability is given by:
Pr
( (^) n ∑
i=
Xi > 0
= Pr
= Pr
' Pr (Z > 0) = 0. 5
where Z ∼ N (0, 1).
(d) The confidence interval for the mean is given by:
( 50 − 1. 65
Consequently, we are more than 90% confident that the true mean is positive.