









Prepara tus exámenes y mejora tus resultados gracias a la gran cantidad de recursos disponibles en Docsity
Gana puntos ayudando a otros estudiantes o consíguelos activando un Plan Premium
Prepara tus exámenes
Prepara tus exámenes y mejora tus resultados gracias a la gran cantidad de recursos disponibles en Docsity
Prepara tus exámenes con los documentos que comparten otros estudiantes como tú en Docsity
Encuentra los documentos específicos para los exámenes de tu universidad
Estudia con lecciones y exámenes resueltos basados en los programas académicos de las mejores universidades
Responde a preguntas de exámenes reales y pon a prueba tu preparación
Consigue puntos base para descargar
Gana puntos ayudando a otros estudiantes o consíguelos activando un Plan Premium
Comunidad
Pide ayuda a la comunidad y resuelve tus dudas de estudio
Ebooks gratuitos
Descarga nuestras guías gratuitas sobre técnicas de estudio, métodos para controlar la ansiedad y consejos para la tesis preparadas por los tutores de Docsity
Introduction to statistical inference
Tipo: Apuntes
1 / 16
Esta página no es visible en la vista previa
¡No te pierdas las partes importantes!










A) Objetive: to study one or more variables in a population B) Method:
iii. Nonparametric tests for paired samples (sign test and Wilcoxon signed rank test) and nonparametric tests for independent random samples (Mann-Whitney U test and Wilcoxon rank sum test). Example: A) Variable to study: the income earned by an individual in Madrid B) Method:
B.1) Simple Random Sampling (s.r.s.) 2 : Each element in the population has the same probability of being selected. a. This allows for the sample resembling the structure of the population, on average b. It is appropiate when there is homogeneity in the population regarding the variable under study c. It is the easiest sampling method d. Two requirements: all the members of the population must be in a list and a random selection tool has to be implemented e. Two kind of sampling: i. WITH REPLACEMENT:
a. Strata are homogeneus inside them but heterogeneus among them b. After the strata are defined, a sample must be taken from each one of them. Although any sampling technique can be used, it is common to use s.r.s. c. The number of observations to select from each strata can be proportional either to its size or to its variability d. We can draw inferences about specific subgroups in the population e. This technique requires lower sample size than the s.r.s. method B.3) Cluster sampling: The population can be clasified in different clusters, which are groups of heterogeneous elements, hence providing a similar variability to that existing in the population analyzed. a. Then a number of clusters are selected randomly b. After that, one may either survey all the units included in the selected clusters or just a s.r.s. c. It is cheaper than s.r.s. in the whole population B.4) Sistematic sampling: it is a kind of s.r.s. The elements in the population must be listed a. Let k be the integer nearer to:
b. Then, an element of the population is chosen at random among the first k. That one will be the first observation in the sample: x 1 c. The second observation in the sample will be that occupying the x 1 + k position d. The third one will fall at the x 1 + 2k position, and so on till completing the sample: x 1 + (n-1)k Final considerations: o If there is lack of information about the variable under study in the population we will apply s.r.s. o In other case, population is divided in groups either homogeneus (stratum) or heterogeneus (clusters) and then apply s.r.s. o Whenever a sampling procedure is implemented, the technique employed must be clearly explained. o There are no good or bad samples, just good or bad sampling procedures. o The bigger the sample size, the better the estimation. However the improvement in precision decreases from certain sample size o The expenses involved must be taking into account.
SAMPLING DISTRIBUTIONS OF ESTIMATORS (s.r.s.) ONE POPULATION Sample mean:
ିଵ Sample variance ଶ ^ ଶ ଶ ଶ With mean: ଶ ଶ And variance (general case): ଶ ସ^ ସ ସ ସ ଶ ସ ସ ଷ where (^) And variance (only when ξ~N(μ;σ)): ଶ ସ ଶ Moreover (key issue) if ξ~N(μ;σ): ଶ ଶ ିଵ ଶ hence: ଶ ଶ ିଵ ଶ
A) Two normal populations (independent samples). Two independent samples coming, respectively, from the two variables to be compaired are taken: ଵ ଶ ௫ ௫ ଵ ଶ ௬ ௬ Difference between two sample means Case 1) The population variances are known: ௫ ௬ ௫ ଶ ௬ ଶ Case 2) The population variances are unknown but equal ௫ ௬ ∗ ାିଶ ∗ ଵ௫ ଶ ଵ௬ ଶ
or: ∗ ௫ ଶ ௬ ଶ Case 3) The population variances are unknown and different: ௫ ௬ ଵ௫ ଶ ଵ௬ ଶ ାିଶିு ଵ௫ ଶ ଵ௬ ଶ ଶ ଵ௫ ସ ଶ ଵ௬ ସ ଶ Quotient between sample variances ଵ௫ ଶ ௫ ଶ ଵ௬ ଶ ௬ ଶ (ିଵ ),(ିଵ ) : ௫ ଶ ௫ ଶ ௬ ଶ ௬ ଶ (ିଵ ),(ିଵ )
or: ௫ ௬ ௪ ିଵ C) Two non normal populations (independent samples with big size). Difference between two sample means General case: two independent samples with n and m high. The difference between two sample means will be aproximated to a normal distribution with mean μx-μy and variance being equal to the sum of the estimated variances of the respective sample means: ௫ ௬ ଵ௫ ଶ ଵ௬ ଶ Particular case: two independent samples with n and m high coming from a B(1;p 1 ) and a B(1;p 2 ). The difference between two sample means will be aproximated to the following normal distribution: ଵ ଶ ଵ ଶ ଵ ଵ ଶ ଶ Then after standardizing: ଵ ଶ ଵ ଶ ଵ ଵ ଶ ଶ