

The cheat sheet of econometrics


By Marcelo Moreno - King Juan Carlos University The Econometrics Cheat Sheet Project
Econometrics - a social science discipline whose objective is to quantify the relationships between economic agents, test economic theories, and evaluate and implement government and business policies.
Econometric model - a simplified representation of reality used to explain economic phenomena.
Ceteris paribus - holding all other relevant factors constant.
Cross section - data taken at a given moment in time, like a static snapshot. Order doesn't matter.
Time series - observations of variables across time. Order does matter.
Panel data - consists of a time series for each observation of a cross section.
Pooled cross sections - combines cross sections from different time periods.
Regression analysis studies and predicts the mean value of a variable (the dependent variable, y) conditional on fixed values of other variables (the independent variables, x's). In econometrics it is common to use Ordinary Least Squares (OLS) for regression analysis.
Correlation analysis doesn't distinguish between dependent and independent variables.
Simple correlation measures the degree of linear association between two variables:
$$r = \frac{\mathrm{Cov}(x, y)}{\sigma_x \cdot \sigma_y} = \frac{\sum_{i=1}^{n} (x_i - \bar{x}) \cdot (y_i - \bar{y})}{\sqrt{\sum_{i=1}^{n} (x_i - \bar{x})^2 \cdot \sum_{i=1}^{n} (y_i - \bar{y})^2}}$$
Partial correlation measures the degree of linear association between two variables while controlling for a third.
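As a minimal sketch of both measures, the snippet below computes the simple correlation from the sums in the formula above and a first-order partial correlation from the three pairwise correlations. The function names and the data are hypothetical, invented only for illustration.

```python
import math

def mean(values):
    return sum(values) / len(values)

def pearson_r(x, y):
    """Simple correlation: Cov(x, y) / (sigma_x * sigma_y)."""
    x_bar, y_bar = mean(x), mean(y)
    cross = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    ss_x = sum((xi - x_bar) ** 2 for xi in x)
    ss_y = sum((yi - y_bar) ** 2 for yi in y)
    return cross / math.sqrt(ss_x * ss_y)

def partial_r(x, y, z):
    """First-order partial correlation of x and y, controlling for z."""
    r_xy, r_xz, r_yz = pearson_r(x, y), pearson_r(x, z), pearson_r(y, z)
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))

# Hypothetical data, not taken from the cheat sheet.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.1, 5.9, 8.2, 9.8]
z = [1.0, 3.0, 2.0, 5.0, 4.0]

r_xy = pearson_r(x, y)             # linear association between x and y
r_xy_given_z = partial_r(x, y, z)  # same association, controlling for z
print(r_xy, r_xy_given_z)
```

Swapping the roles of x and y leaves both values unchanged, which is the sense in which correlation does not distinguish dependent from independent variables.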
Under these assumptions, the OLS estimator has good properties. Gauss-Markov assumptions:
Under the econometric model assumptions and the Central Limit Theorem (CLT):
If 1 to 4a hold: OLS is unbiased. $E(\hat{\beta}_j) = \beta_j$
If 1 to 4 hold: OLS is consistent. $\mathrm{plim}(\hat{\beta}_j) = \beta_j$ (with 4b but without 4a, that is, under weak exogeneity, OLS is biased but consistent)
If 1 to 5 hold: asymptotic normality of OLS (then 7 is necessarily satisfied): $u \overset{a}{\sim} N(0, \sigma_u^2)$
If 1 to 6 hold: unbiased estimate of $\sigma_u^2$. $E(\hat{\sigma}_u^2) = \sigma_u^2$
If 1 to 6 hold: OLS is BLUE (Best Linear Unbiased Estimator), or efficient.
If 1 to 7 hold: hypothesis testing and confidence intervals can be done reliably.
Objective - minimize the Sum of Squared Residuals (SSR): $\min \sum_{i=1}^{n} \hat{u}_i^2$, where $\hat{u}_i = y_i - \hat{y}_i$
[Figure: simple regression line of $y$ on $x$, with intercept $\beta_0$ and slope $\beta_1$]
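Equation: $y_i = \beta_0 + \beta_1 x_i + u_i$
Estimation: $\hat{y}_i = \hat{\beta}_0 + \hat{\beta}_1 x_i$
where: $\hat{\beta}_0 = \bar{y} - \hat{\beta}_1 \bar{x}$ and $\hat{\beta}_1 = \frac{\mathrm{Cov}(y, x)}{\mathrm{Var}(x)}$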
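The two estimator formulas translate almost directly into code. The following is a small sketch, not a production routine: `simple_ols` is a hypothetical helper name and the data are made up for illustration.

```python
def mean(values):
    return sum(values) / len(values)

def simple_ols(x, y):
    """Return (beta0_hat, beta1_hat) for y = beta0 + beta1 * x + u."""
    x_bar, y_bar = mean(x), mean(y)
    cov_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    var_x = sum((xi - x_bar) ** 2 for xi in x)
    beta1_hat = cov_xy / var_x             # Cov(y, x) / Var(x); the 1/n factors cancel
    beta0_hat = y_bar - beta1_hat * x_bar  # the fitted line passes through the means
    return beta0_hat, beta1_hat

# Hypothetical data, not taken from the cheat sheet.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.1, 5.9, 8.2, 9.8]

beta0_hat, beta1_hat = simple_ols(x, y)
residuals = [yi - (beta0_hat + beta1_hat * xi) for xi, yi in zip(x, y)]
print(beta0_hat, beta1_hat)
```

With an intercept in the model, the OLS residuals sum to zero, which is a quick sanity check on any implementation.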
[Figure: multiple regression of $y$ on $x_1$ and $x_2$, with intercept $\beta_0$]
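Equation: $y_i = \beta_0 + \beta_1 x_{1i} + \cdots + \beta_k x_{ki} + u_i$
Estimation: $\hat{y}_i = \hat{\beta}_0 + \hat{\beta}_1 x_{1i} + \cdots + \hat{\beta}_k x_{ki}$
where: $\hat{\beta}_0 = \bar{y} - \hat{\beta}_1 \bar{x}_1 - \cdots - \hat{\beta}_k \bar{x}_k$ and $\hat{\beta}_j = \frac{\mathrm{Cov}(y, \mathrm{resid}\, x_j)}{\mathrm{Var}(\mathrm{resid}\, x_j)}$, with $\mathrm{resid}\, x_j$ the residuals from regressing $x_j$ on the other regressors.
Matrix: $\hat{\beta} = (X^T X)^{-1} (X^T y)$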
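The residual-based formula for $\hat{\beta}_j$ can be checked numerically in the two-regressor case: regress $x_j$ on the other regressor, keep the residuals, and divide the covariance with $y$ by the residual variance. The sketch below assumes exactly two regressors; the helper names are hypothetical, and $y$ is constructed without an error term so the true coefficients are known.

```python
def mean(values):
    return sum(values) / len(values)

def simple_ols(x, y):
    """Return (intercept, slope) of a simple regression of y on x."""
    x_bar, y_bar = mean(x), mean(y)
    slope = (sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
             / sum((xi - x_bar) ** 2 for xi in x))
    return y_bar - slope * x_bar, slope

def partialled_out_slope(y, xj, other):
    """beta_j_hat = Cov(y, resid x_j) / Var(resid x_j), one other regressor."""
    a0, a1 = simple_ols(other, xj)
    resid = [v - (a0 + a1 * o) for v, o in zip(xj, other)]
    y_bar = mean(y)
    # resid has mean zero by construction, so it needs no centering
    cov = sum((yi - y_bar) * r for yi, r in zip(y, resid))
    var = sum(r * r for r in resid)
    return cov / var

# Hypothetical regressors; y is built exactly as 1 + 2*x1 + 3*x2 (no error term).
x1 = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
x2 = [2.0, 1.0, 4.0, 3.0, 6.0, 5.0]
y = [1 + 2 * a + 3 * b for a, b in zip(x1, x2)]

beta1_hat = partialled_out_slope(y, x1, x2)
beta2_hat = partialled_out_slope(y, x2, x1)
beta0_hat = mean(y) - beta1_hat * mean(x1) - beta2_hat * mean(x2)
print(beta0_hat, beta1_hat, beta2_hat)
```

With more than two regressors the auxiliary regression itself becomes a multiple regression, at which point the matrix form is usually the more practical route.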
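| Model | Dependent | Independent | $\beta_1$ interpretation |
|---|---|---|---|
| Level-level | $y$ | $x$ | $\Delta y = \beta_1 \Delta x$ |
| Level-log | $y$ | $\log(x)$ | $\Delta y \approx (\beta_1 / 100)(\%\Delta x)$ |
| Log-level | $\log(y)$ | $x$ | $\%\Delta y \approx (100 \beta_1) \Delta x$ |
| Log-log | $\log(y)$ | $\log(x)$ | $\%\Delta y \approx \beta_1 (\%\Delta x)$ |
| Quadratic | $y$ | $x + x^2$ | $\Delta y = (\beta_1 + 2 \beta_2 x) \Delta x$ |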
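The log interpretations are approximations that work best for small changes. As a hedged illustration for the log-level row, the sketch below compares the approximate reading $100 \beta_1 \Delta x$ with the exact percentage change $100 (e^{\beta_1 \Delta x} - 1)$. The coefficient value 0.08 is hypothetical.

```python
import math

def log_level_approx(beta1, delta_x):
    """Approximate percentage change in y: (100 * beta1) * delta_x."""
    return 100 * beta1 * delta_x

def log_level_exact(beta1, delta_x):
    """Exact percentage change in y implied by log(y) = beta0 + beta1 * x."""
    return 100 * (math.exp(beta1 * delta_x) - 1)

beta1 = 0.08  # hypothetical coefficient, chosen only for illustration
approx = log_level_approx(beta1, 1.0)
exact = log_level_exact(beta1, 1.0)
print(approx, exact)
```

The gap between the two readings widens as $\beta_1 \Delta x$ grows, so the approximate interpretation is safest for small coefficients.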
Sum of Sq. Residuals: $SSR = \sum_{i=1}^{n} \hat{u}_i^2 = \sum_{i=1}^{n} (y_i - \hat{y}_i)^2$
Explained Sum of Squares: $SSE = \sum_{i=1}^{n} (\hat{y}_i - \bar{y})^2$
Total Sum of Sq.: $SST = SSE + SSR = \sum_{i=1}^{n} (y_i - \bar{y})^2$
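The decomposition SST = SSE + SSR can be verified numerically. The sketch below uses hypothetical data; the fitted values are the OLS fit $\hat{y} = 0.09 + 1.97x$ for $x = 1, \dots, 5$, and the identity relies on the fit being OLS with an intercept.

```python
# Hypothetical observations and their OLS fitted values (y_hat = 0.09 + 1.97 * x).
y = [2.0, 4.1, 5.9, 8.2, 9.8]
y_hat = [2.06, 4.03, 6.00, 7.97, 9.94]

y_bar = sum(y) / len(y)
ssr = sum((yi - fi) ** 2 for yi, fi in zip(y, y_hat))  # unexplained variation
sse = sum((fi - y_bar) ** 2 for fi in y_hat)           # explained variation
sst = sum((yi - y_bar) ** 2 for yi in y)               # total variation

print(ssr, sse, sst)
```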
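Standard Error of the Regression: $\hat{\sigma}_u = \sqrt{\frac{SSR}{n - k - 1}}$
Standard Error of the $\hat{\beta}$'s: $se(\hat{\beta}) = \sqrt{\hat{\sigma}_u^2 \cdot (X^T X)^{-1}}$ (square roots of the diagonal elements)
Mean Squared Error: $MSE = \frac{\sum_{i=1}^{n} (y_i - \hat{y}_i)^2}{n}$
Absolute Mean Error: $AME = \frac{\sum_{i=1}^{n} |y_i - \hat{y}_i|}{n}$
Mean Percentage Error: $MPE = \frac{\sum_{i=1}^{n} |\hat{u}_i / y_i|}{n} \cdot 100$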
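The scalar error measures can be computed from the residuals in a few lines. This is a sketch on hypothetical data with one regressor (k = 1); the fitted values are assumed to come from an OLS fit.

```python
import math

# Hypothetical observations and fitted values from a one-regressor OLS fit.
y = [2.0, 4.1, 5.9, 8.2, 9.8]
y_hat = [2.06, 4.03, 6.00, 7.97, 9.94]
n, k = len(y), 1

residuals = [yi - fi for yi, fi in zip(y, y_hat)]
ssr = sum(u ** 2 for u in residuals)

sigma_hat_u = math.sqrt(ssr / (n - k - 1))  # standard error of the regression
mse = ssr / n                               # mean squared error
ame = sum(abs(u) for u in residuals) / n    # absolute mean error
mpe = sum(abs(u / yi) for u, yi in zip(residuals, y)) / n * 100  # in percent

print(sigma_hat_u, mse, ame, mpe)
```

Note that the standard error of the regression divides by the degrees of freedom, n - k - 1, while the MSE divides by n, so the two differ noticeably in small samples.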
The R-squared is a measure of goodness of fit, that is, of how well the regression fits the data:
$$R^2 = \frac{SSE}{SST} = 1 - \frac{SSR}{SST}$$
It measures the percentage of the variation of y that is linearly explained by the variations of the x's.
It takes values between 0 (no linear explanation of the variations of y) and 1 (total explanation of the variations of y).
When the number of regressors increases, the R-squared increases as well, whether or not the new variables are relevant. To solve this problem, there is an R-squared adjusted by degrees of freedom (or corrected R-squared):
$$\bar{R}^2 = 1 - \frac{n - 1}{n - k - 1} \cdot \frac{SSR}{SST} = 1 - \frac{n - 1}{n - k - 1} \cdot (1 - R^2)$$
For big sample sizes: $\bar{R}^2 \approx R^2$
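A short sketch of both measures follows. The function names and the SSR, SST, n and k values are hypothetical, chosen only to show how the degrees-of-freedom correction behaves.

```python
def r_squared(ssr, sst):
    """R^2 = 1 - SSR / SST."""
    return 1 - ssr / sst

def adjusted_r_squared(r2, n, k):
    """Adjusted R^2 = 1 - (n - 1) / (n - k - 1) * (1 - R^2)."""
    return 1 - (n - 1) / (n - k - 1) * (1 - r2)

# Hypothetical values: 5 observations, 1 regressor.
ssr, sst, n, k = 0.091, 38.9, 5, 1
r2 = r_squared(ssr, sst)
r2_adj = adjusted_r_squared(r2, n, k)
print(r2, r2_adj)

# With a large sample the correction factor (n - 1) / (n - k - 1) is close to 1.
r2_adj_large = adjusted_r_squared(0.5, 100000, 3)
```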
A hypothesis test is a rule designed to determine, from a sample, whether or not there is evidence to reject a hypothesis made about one or more population parameters.
Elements of a hypothesis test:
Null hypothesis ($H_0$) - the hypothesis to be tested.
Alternative hypothesis ($H_1$) - the hypothesis favored when the null hypothesis is rejected.
Test statistic - a random variable whose probability distribution is known under the null hypothesis.
Critical value - the value against which the test statistic is compared to determine whether the null hypothesis is rejected. It marks the frontier between the acceptance and rejection regions of the null hypothesis.
Significance level ($\alpha$) - the probability of rejecting the null hypothesis when it is true (Type I Error). It is chosen by whoever conducts the test, commonly 0.10, 0.05 or 0.01.
p-value - the lowest significance level at which the null hypothesis ($H_0$) can be rejected.
The rule is: if the p-value is less than $\alpha$, there is evidence to reject the null hypothesis at that given $\alpha$ (there is evidence in favor of the alternative hypothesis).
The t test checks whether a parameter is significantly different from a given value, $\vartheta$.
$H_0: \beta_j = \vartheta$
$H_1: \beta_j \neq \vartheta$
Under $H_0$: $t = \frac{\hat{\beta}_j - \vartheta}{se(\hat{\beta}_j)} \sim t_{n-k-1}$
If $|t| > |t_{n-k-1, \alpha/2}|$, there is evidence to reject $H_0$.
Individual significance test - tests whether a parameter is significantly different from zero.
$H_0: \beta_j = 0$
$H_1: \beta_j \neq 0$
Under $H_0$: $t = \frac{\hat{\beta}_j}{se(\hat{\beta}_j)} \sim t_{n-k-1}$
If $|t| > |t_{n-k-1, \alpha/2}|$, there is evidence to reject $H_0$.
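The decision rule can be sketched as follows. The estimate and standard error are hypothetical, and the critical value is passed in by hand: 3.182 is the two-sided 5% value of a t distribution with 3 degrees of freedom, read from a standard table rather than computed here.

```python
def t_statistic(beta_hat, se, hypothesized=0.0):
    """t = (beta_hat - hypothesized value) / se(beta_hat)."""
    return (beta_hat - hypothesized) / se

def reject_h0(t_stat, t_critical):
    """Two-sided rule: reject H0 when |t| exceeds the critical value."""
    return abs(t_stat) > abs(t_critical)

# Hypothetical estimate and standard error; n = 5, k = 1, so n - k - 1 = 3.
beta_hat, se = 1.97, 0.055
t_critical = 3.182  # t_{3, 0.025}, taken from a t table

t_zero = t_statistic(beta_hat, se)       # individual significance, H0: beta = 0
t_two = t_statistic(beta_hat, se, 2.0)   # H0: beta = 2

print(t_zero, reject_h0(t_zero, t_critical))
print(t_two, reject_h0(t_two, t_critical))
```

In this made-up case the coefficient is clearly different from zero, but there is no evidence that it differs from 2.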
The F test simultaneously tests multiple (linear) hypotheses about the parameters. It makes use of an unrestricted model and a restricted model:
Unrestricted model - the model on which we want to test the hypotheses.
Restricted model - the model on which the hypotheses we want to test have been imposed.
Then, looking at the errors, there are:
$SSR_{UR}$ - the SSR of the unrestricted model.
$SSR_R$ - the SSR of the restricted model.
Under $H_0$: $F = \frac{SSR_R - SSR_{UR}}{SSR_{UR}} \cdot \frac{n - k - 1}{q} \sim F_{q, n-k-1}$
where k is the number of parameters of the unrestricted model (excluding the intercept) and q is the number of linear hypotheses tested.
If $F > F_{q, n-k-1}$ (the critical value), there is evidence to reject $H_0$.
Global significance test - tests whether all the parameters associated with the x's are simultaneously equal to zero.
$H_0: \beta_1 = \beta_2 = \cdots = \beta_k = 0$
$H_1: \beta_1 \neq 0$ and/or $\beta_2 \neq 0$ ... and/or $\beta_k \neq 0$
In this case, we can simplify the formula for the F statistic.
Under $H_0$: $F = \frac{R^2}{1 - R^2} \cdot \frac{n - k - 1}{k} \sim F_{k, n-k-1}$
If $F > F_{k, n-k-1}$ (the critical value), there is evidence to reject $H_0$.
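The simplified global-significance formula is the general F statistic with q = k and a restricted model that contains only the intercept, whose SSR equals the SST. The sketch below checks that the two expressions agree; the function names and numbers are hypothetical.

```python
def f_statistic(ssr_r, ssr_ur, n, k, q):
    """General form: (SSR_R - SSR_UR) / SSR_UR * (n - k - 1) / q."""
    return (ssr_r - ssr_ur) / ssr_ur * (n - k - 1) / q

def f_statistic_global(r2, n, k):
    """Global significance form: R^2 / (1 - R^2) * (n - k - 1) / k."""
    return r2 / (1 - r2) * (n - k - 1) / k

# Hypothetical values: 5 observations, 1 regressor.
sst, ssr_ur, n, k = 38.9, 0.091, 5, 1
r2 = 1 - ssr_ur / sst

# Restricted model = intercept only, so SSR_R = SST and q = k.
f_general = f_statistic(sst, ssr_ur, n, k, q=k)
f_global = f_statistic_global(r2, n, k)
print(f_general, f_global)
```

The resulting statistic would then be compared against a critical value from an F table with (q, n - k - 1) degrees of freedom, which this sketch does not compute.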
The confidence intervals at the $(1 - \alpha)$ confidence level can be calculated as: $\hat{\beta}_j \pm t_{n-k-1, \alpha/2} \cdot se(\hat{\beta}_j)$
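A minimal sketch of the interval, assuming a hypothetical estimate and standard error. The critical value 3.182 is the two-sided 5% value for 3 degrees of freedom, taken from a t table.

```python
def confidence_interval(beta_hat, se, t_critical):
    """Return (lower, upper) = beta_hat -/+ t_critical * se."""
    margin = abs(t_critical) * se
    return beta_hat - margin, beta_hat + margin

# Hypothetical inputs: n - k - 1 = 3 degrees of freedom, 95% confidence.
lower, upper = confidence_interval(1.97, 0.055, 3.182)
print(lower, upper)
```

Any value inside the interval, such as 2 here, would not be rejected by a two-sided t test at the same significance level.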
Dummy (or binary) variables are used for qualitative information such as sex, marital status, country, etc.
They take the value 1 for a given category and 0 for the rest.
They are used to analyze and model structural changes in the model parameters.
If a qualitative variable has m categories, only (m − 1) dummy variables need to be included, because including all m alongside an intercept would cause perfect multicollinearity.
Structural change refers to changes in the values of the parameters of the econometric model produced by the effect of different sub-populations. Structural change can be included in the model through dummy variables. The location of the dummy variables (D) matters:
On the intercept (additive effect) - represents the mean difference between the values produced by the structural change. $y = \beta_0 + \delta_1 D + \beta_1 x_1 + u$
On the slope (multiplicative effect) - represents the effect (slope) difference between the values produced by the structural change. $y = \beta_0 + \beta_1 x_1 + \delta_1 D \cdot x_1 + u$
Chow's structural test - used when we want to analyze the existence of structural changes in all the model parameters. It is a particular expression of the F test, where the null hypothesis is $H_0$: No structural change (all $\delta = 0$).
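The additive effect is easiest to see in a stripped-down case. The sketch below assumes a model with only the dummy, $y = \beta_0 + \delta_1 D + u$ (the regressor $x_1$ is dropped for brevity); in that case OLS returns the mean of the D = 0 group as the intercept and the difference in group means as $\hat{\delta}_1$. The data and helper name are hypothetical.

```python
def mean(values):
    return sum(values) / len(values)

def simple_ols(x, y):
    """Return (intercept, slope) of a simple regression of y on x."""
    x_bar, y_bar = mean(x), mean(y)
    slope = (sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
             / sum((xi - x_bar) ** 2 for xi in x))
    return y_bar - slope * x_bar, slope

# Hypothetical sample: D = 1 marks the sub-population with the structural change.
d = [0, 0, 0, 1, 1, 1]
y = [10.0, 12.0, 11.0, 15.0, 17.0, 16.0]

beta0_hat, delta_hat = simple_ols(d, y)

mean_d0 = mean([yi for di, yi in zip(d, y) if di == 0])
mean_d1 = mean([yi for di, yi in zip(d, y) if di == 1])
print(beta0_hat, delta_hat, mean_d1 - mean_d0)
```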
Changes in the measurement units of the variables:
In the endogenous variable, $y^* = y \cdot \lambda$ - affects all model parameters, $\beta_j^* = \beta_j \cdot \lambda, \forall j = 0, 1, \dots, k$
In an exogenous variable, $x_j^* = x_j \cdot \lambda$ - only affects the parameter linked to that exogenous variable, $\beta_j^* = \beta_j / \lambda$, since $\beta_j x_j = (\beta_j / \lambda) \cdot x_j^*$
Same scale change on endogenous and exogenous - only affects the intercept, $\beta_0^* = \beta_0 \cdot \lambda$
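Changes in the measurement origin of the variables (endogenous or exogenous) only affect the model's intercept. For the endogenous variable, $y^* = y + \lambda$ gives $\beta_0^* = \beta_0 + \lambda$. For an exogenous variable, $x_j^* = x_j + \lambda$ gives $\beta_0^* = \beta_0 - \beta_j \cdot \lambda$.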
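The effect of rescaling or shifting a variable can be checked by refitting a simple regression on transformed data. This is a sketch on hypothetical data with $\lambda = 10$; `simple_ols` is an illustrative helper, not part of the cheat sheet.

```python
def simple_ols(x, y):
    """Return (intercept, slope) of a simple regression of y on x."""
    n = len(x)
    x_bar, y_bar = sum(x) / n, sum(y) / n
    slope = (sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
             / sum((xi - x_bar) ** 2 for xi in x))
    return y_bar - slope * x_bar, slope

# Hypothetical data and scale/shift factor.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.1, 5.9, 8.2, 9.8]
lam = 10.0

b0, b1 = simple_ols(x, y)
# Rescale y: intercept and slope are both multiplied by lam.
b0_ys, b1_ys = simple_ols(x, [lam * v for v in y])
# Rescale x: the slope is divided by lam, the intercept is unchanged.
b0_xs, b1_xs = simple_ols([lam * v for v in x], y)
# Rescale both: only the intercept changes (multiplied by lam).
b0_bs, b1_bs = simple_ols([lam * v for v in x], [lam * v for v in y])
# Shift y: only the intercept changes (plus lam).
b0_yo, b1_yo = simple_ols(x, [v + lam for v in y])
# Shift x: only the intercept changes (minus slope * lam).
b0_xo, b1_xo = simple_ols([v + lam for v in x], y)

print(b0, b1)
```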