Resolution for Problem Set 2 - Current Problems in Archaeology | ANTH 206, Assignments of Introduction to Cultural Anthropology

Material Type: Assignment; Class: CUR PROBS IN ARCH; Subject: Anthropology; University: University of California - Santa Barbara; Term: Unknown 1989;

Typology: Assignments

Pre 2010

Uploaded on 09/17/2009

koofers-user-fca-1
koofers-user-fca-1 🇺🇸

4

(1)

9 documents

1 / 6

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
ESM 206 Problem set 2
Solutions
Part A:
1) A regression of Highway MPG on weight in pounds has an estimated slope of -
0.0073. Thus a 100-pound reduction in weight should, on average, increase
mileage by 0.73 MPG.
2) The equation is
0 1i i i
H W
.
Variables Parameters Residual
The estimate of b0 is 51.58, the estimate of b1 is -0.0073, and the estimate of the
residual variance is 9.96. The 95% confidence interval for b0 is 48.10 to 55.05,
and for b1 is -0.0084 to -0.0062.
3) The interaction term shows how engine size affects the relationship between
weight and mileage. Since the parameter estimate is positive, increasing engine
size seems to decrease the negative effect of weight on mileage. Another way of
looking at it is that increasing engine size improves fuel economy after accounting
for car weight, and that this effect gets stronger the heavier the car is.
4) The fit improves slightly. The R2 goes from 0.65 to 0.68, and the F ratio for the
entire model goes from 171 to 194. Some slight curvature in the relationship is
eliminated, and the unusually large residuals at low weight are brought under
control. This makes sense, for I would expect that an increase in weight should
produce a proportional increase in fuel consumption, which is the inverse of
mileage. (Note that in metric countries, fuel efficiency is generally measured in
liters per 100 kilometers)
5) I first ran a model with weight, type, and the interaction between them. The P
values for the latter two were very large, so I removed the interaction (which had
the larger P). Then both weight and type were strongly significant (P < 0.0001;
figure 1). Thus, the different types have different inherent fuel efficiencies
(intercepts) but the rate at which fuel consumption increases with weight is the
same for all of them. This may have a lot to do with aerodynamics, for the vans
had the highest fuel consumption, given their weight.
Alternatively, the model with weight and the interaction describes the data
almost as well (F = 48.9, vs. 49.3 for the previous model). Vans again stand out,
having the steepest slope (figure 2). The reason both models do nearly equally
well is that the dominant effect is van’s increased consumption for their weight,
and that all the vans are heavy: there are no data on light vans that would tell us
whether the lines should be parallel.
The type-specific coefficients for both models are in table 1.
pf3
pf4
pf5

Partial preview of the text

Download Resolution for Problem Set 2 - Current Problems in Archaeology | ANTH 206 and more Assignments Introduction to Cultural Anthropology in PDF only on Docsity!

ESM 206 Problem set 2 Solutions Part A:

  1. A regression of Highway MPG on weight in pounds has an estimated slope of - 0.0073. Thus a 100-pound reduction in weight should, on average, increase mileage by 0.73 MPG.
  2. The equation is H^ i ^ ^0 ^  1 Wi ^  i. Variables Parameters Residual The estimate of b0 is 51.58, the estimate of b1 is -0.0073, and the estimate of the residual variance is 9.96. The 95% confidence interval for b0 is 48.10 to 55.05, and for b1 is -0.0084 to -0.0062.
  3. The interaction term shows how engine size affects the relationship between weight and mileage. Since the parameter estimate is positive, increasing engine size seems to decrease the negative effect of weight on mileage. Another way of looking at it is that increasing engine size improves fuel economy after accounting for car weight, and that this effect gets stronger the heavier the car is.
  4. The fit improves slightly. The R2 goes from 0.65 to 0.68, and the F ratio for the entire model goes from 171 to 194. Some slight curvature in the relationship is eliminated, and the unusually large residuals at low weight are brought under control. This makes sense, for I would expect that an increase in weight should produce a proportional increase in fuel consumption, which is the inverse of mileage. (Note that in metric countries, fuel efficiency is generally measured in liters per 100 kilometers)
  5. I first ran a model with weight, type, and the interaction between them. The P values for the latter two were very large, so I removed the interaction (which had the larger P). Then both weight and type were strongly significant (P < 0.0001; figure 1). Thus, the different types have different inherent fuel efficiencies (intercepts) but the rate at which fuel consumption increases with weight is the same for all of them. This may have a lot to do with aerodynamics, for the vans had the highest fuel consumption, given their weight. Alternatively, the model with weight and the interaction describes the data almost as well (F = 48.9, vs. 49.3 for the previous model). Vans again stand out, having the steepest slope (figure 2). The reason both models do nearly equally well is that the dominant effect is van’s increased consumption for their weight, and that all the vans are heavy: there are no data on light vans that would tell us whether the lines should be parallel. The type-specific coefficients for both models are in table 1.

Table 1: Type-specific intercepts (for the constant slope model) and slopes (for the constant intercept model) for the two models discussed in problem 5. Type Intercept Slope Compact 0.1340 7.266e- Large 0.1321 6.742e- Midsize 0.1344 7.389e- Small 0.1334 7.053e- Sporty 0.1356 7.802e- Van 0.1395 8.689e-

HiwayGPM 1500 2000 2500 3000 3500 4000 Weight Compact Large Midsize Small Sporty Van Figure 1: Highway fuel consumption as a function of weight, for the model with identical slopes but different intercepts.

There are many other variables that might be relevant. For example, I would expect gasoline consumption to decrease in response to increases in public spending on public transit infrastructure. I would expect consumption to increase as population size increases, because new housing tends to be built farther from the urban center (sprawl). Any variables that are rationalized in a similar way would be acceptable. 3) The OLS regression shows a highly significant relationship between consumption and price. However, the relationship is in the opposite direction to the one I predicted: consumption increases as prices increase (figure 3). Either our economic understanding is very wrong, or we are seeing biased results as a consequence of failing to include appropriate control variables. 4) The only control variable from section b that is in the data is income. Adding this to the model produces parameter estimates that are all strongly significant (P < 0.01 in all cases) and changes the sign of the price parameter to be negative: Consumption = 145 – 85.3 Price + 0.019 Income, R^2 = 0.775, P < 0.0001. Thus, the control variable Income does affect consumption (positively, as I predicted), and changes the sign of the relationship between consumption and price to go in the predicted direction. This 250 300 350 400 450 Consumption .2 .3 .4 .5 .6 .7 .8 .9 1 1. Price Figure 3: Gasoline consumption as a function of price. happens because including this variable removes the bias caused by the general increase in prosperity over the period of the data. 5) For full credit on this question, you should test all of the OLS assumptions. I will go through them for the model that I have created: Linearity in parameters: by inspection, this is satisfied. Normality of residuals: The residuals exhibit very strong kurtosis, and are clearly not normally distributed (figure 4). There is no power transformation that would fix this

problem. Although there may be some sort of exotic transformation that could work, let’s look at the rest of the assumptions first. -1.

-0. 0

1

.01 .05 .10 .25 .50 .75 .90 .95. -3 -2 -1 0 1 2 3 Normal Quantile Plot Figure 4: Studentized residuals from the regression of consumption on price and income. Constant Variance: The plot of residuals vs. predicted values shows no obvious trend in the variance. Thus this assumption is satisfied. Residuals have expected value zero: The plot of residuals vs. predicted values shows that this is egregiously violated: the average residual value has a roughly quadratic dependence on the fitted value (figure 5). Looking at the partial residual plots also shows quadratic dependence of the partial residuals on the independent variables, suggesting that we should square-root transform income and add a price squared term. However, the curious patterns in the Price partial residual plot suggest that there may be other things going on as well.

0 10 20 30 40 Consumption Residual 250 300 350 400 450 Consumption Predicted

0 25 Price Partial residuals .2 .3 .4 .5 .6 .7 .8 .9 1 1. Price 150 200 250 300 350 Income parital residuals 8000 10000 12000 14000 16000 18000 20000 Income Figure 5: Residual vs. fitted plot and the two partial residual plots for the model that includes price and income. Residuals are independent: There is strong positive temporal autocorrelation in the residuals, so this assumption is violated. If there is no additional variable that could