



Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Definitions for key terms related to statistics and parameter estimation, including parameters, statistics, estimators, accuracy, precision, point estimators, interval estimates, sample size, type i and ii errors, practical vs. Statistical significance, hypothesis tests, p-values, power, prediction intervals, inference for a single proportion, two-sample t-test assumptions, models, independent variables, observational studies, designed experiments, simple linear regression, least squares estimation, residuals, outliers, leverage points, influential points, collinearity, collinearity causes, and variance inflation factor (vif).
Typology: Quizzes
1 / 7
This page cannot be seen from the preview
Don't miss anything!




a numeric quantity that describes an important feature of a population always depend on the model selected to describe the population TERM 2
DEFINITION 2 a quantity calculated from a sample that describes an important feature of that sample random variable TERM 3
DEFINITION 3 a statistic used to estimate an unknown parameter of a population random variable want estimators to have accuracy and precision TERM 4
DEFINITION 4 closeness to true value classical statistics tends to measure accuracy through the concept of unbiasedness Unbiased estimator: when the expected value is equal to the parameter of interest TERM 5
DEFINITION 5 reproducibility of measurements In statistics, precision looks at the variance of the estimator an estimator is more precise if its sampling distribution has a smaller standard error
estimate specific values of a parameter xbar and s^ estimate mu and sigma^2 respectively most point estimates are continuous random variables, and therefore, have no chance of being correct...instead we use interval estimates TERM 7
DEFINITION 7 width of the interval should reflect two factors: 1. confidence in the interval 2. the variability of the estimator confidence refers to the reliability of the procedure, not the specific interval intervals are based on the t-distribution TERM 8
DEFINITION 8 to obtain the desired precision in a study n can be chosen so that the confidence interval is as small as desired. n>/= ((z_alpha/2*sigma/B))^2 Where B is the desired precision we round n up since n must be an integer TERM 9
DEFINITION 9 rejecting a true null hypothesis represented by alpha often called the significance level of the test typically worse than type II TERM 10
DEFINITION 10 failing to reject a false null hypothesis represented by Beta often called the power of the test
we need np>/= 5, and n(1-p)>/=5...preferably both greater than/equal to ten TERM 17
DEFINITION 17
DEFINITION 18 express the relationships among variables TERM 19
DEFINITION 19 usually called regressors or predictors used to try to predict the response or dependent variables TERM 20
DEFINITION 20 observation of a process no experimenter influence
manipulation of the regressors or factors with a measured response TERM 22
DEFINITION 22 single regressor (x-axis), single response (y-axis) first step...scatterplot TERM 23
DEFINITION 23 y_i = B_0 + B_1x_i + error B_0 is the intercept B_1 is the slope Assumptions: 1. random errors are independent 2. random errors have mean 0 and variance sigma^ TERM 24
DEFINITION 24 =0...response does not depend on the regressor...response and regressor are uncorrelated 0...values of response grow larger as the values of the regressor are increased....positively correlated TERM 25
DEFINITION 25 the differences between the observed and predicted values for the response an appropriate measure of the quality of the fit negative residual- point below line zero residual- point on line (good fit) positive residual- point above the line e_i = y_i
allows us to look at the joint relationships among a specified regressor and all the other regressors VIF greater than or equal to 10 indicates a strong problem with collinearity 5 TERM 32
DEFINITION 32 more data collection (collect data in areas missed before) subset models (assume that because the regressors are highly related we do not need all of them in the model) biased regression methods (think all the regressors are important and should appear in the model)