Prepare for your exams
Get points
Guidelines and tips
Sell on Docsity
Docsity AI

Prepare for your exams

Study with the several resources on Docsity

Earn points to download

Earn points by helping other students or get them with a premium plan

Guidelines and tips

Sell on Docsity

Docsity AI

Log in Sign up

Prepare for your exams

Study with the several resources on Docsity

Find documents

Prepare for your exams with the study notes shared by other students like you on Docsity

Search for your university

Find the specific documents for your university's exams

Docsity AINEW

Summarize your documents, ask them questions, convert them into quizzes and concept maps

Explore questions

Clear up your doubts by reading the answers to questions asked by your fellow students

Earn points to download

Earn points by helping other students or get them with a premium plan

Share documents

20 Points

For each uploaded document

Answer questions

5 Points

For each given answer (max 1 per day)

All the ways to get free points

Get points immediately

Choose a premium plan with all the points you need

Study Opportunities

Choose your next study program

Get in touch with the best universities in the world. Search through thousands of universities and official partners

Community

Ask the community

Ask the community for help and clear up your study doubts

Free resources

Our save-the-student-ebooks!

Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors

Data Analysis Density Estimation, Exercises - Engineering, Exercises of Advanced Data Analysis

Carnegie Mellon University (CMU)Advanced Data Analysis

Data Analysis Density Estimation, Exercises - Engineering, Data Analysis Catch-up And Consolidation day 1, Exercises - Engineering, Advanced Data Analysis

Typology: Exercises

2010/2011

Uploaded on 11/03/2011

bridge 🇺🇸

4.9

(13)

287 documents

1 / 2

This page cannot be seen from the preview

Don't miss anything!

Homework 6: Nice Demo City, But Will It Scale?

36-402, Advanced Data Analysis

Due at the start of class, 21 February 2011

For data-collection purposes, urban areas of the United States are divided

into several hundred “Metropolitan Statistical Areas” based on patterns of res-

idence and commuting; these cut across the boundaries of legal cities and even

states. In the last decade, the U.S. Bureau of Economic Analysis has begun

to estimate “gross metropolitan products” for these areas — the equivalent of

gross national product, but for each metropolitan area. (See Homework 2 for the

definition of “gross national product”.) Even more recently, it has been claimed

that these gross metropolitan products show a simple quantitative regularity,

called “supra-linear power-law scaling”. If Yis the gross metropolitan product

in dollars, and Nis the number of people in the city, then, the claim goes,

Y≈cNb(1)

where the exponent b > 1 and the scale factor c > 0. This homework will use

the tools built so far to test this hypothesis.

1. (15 points) A metropolitan area’s gross per capita pro duct is y=Y/N.

Show that if Eq. 1 holds, then

log y≈β0+β1log N

How are β0and β1related to cand b?

2. (15 points) The data files gmp 2006.csv and pcgmp 2006.csv on the class

website contain the total gross metropolitan product (Y) in millions of

dollars, and the per capita gross metropolitan product (y) in dollars, for

all metropolitan areas in the US in 2006. Read them in and use them to

calculate the metropolitan populations (N). If it’s done correctly, then

running summary on the population figures should give

Min. 1st Qu. Median Mean 3rd Qu. Max.

54980 135600 231500 680900 530900 18850000

(Your exact results may differ very slightly because of rounding and display

settings.) What is the variance of logy?

1

Discover Exercises of Advanced Data Analysis Carnegie Mellon University (CMU)

Partial preview of the text

Download Data Analysis Density Estimation, Exercises - Engineering and more Exercises Advanced Data Analysis in PDF only on Docsity!

Homework 6: Nice Demo City, But Will It Scale?

36-402, Advanced Data Analysis

Due at the start of class, 21 February 2011

For data-collection purposes, urban areas of the United States are divided into several hundred “Metropolitan Statistical Areas” based on patterns of res- idence and commuting; these cut across the boundaries of legal cities and even states. In the last decade, the U.S. Bureau of Economic Analysis has begun to estimate “gross metropolitan products” for these areas — the equivalent of gross national product, but for each metropolitan area. (See Homework 2 for the definition of “gross national product”.) Even more recently, it has been claimed that these gross metropolitan products show a simple quantitative regularity, called “supra-linear power-law scaling”. If Y is the gross metropolitan product in dollars, and N is the number of people in the city, then, the claim goes,

Y ≈ cN b^ (1)

where the exponent b > 1 and the scale factor c > 0. This homework will use the tools built so far to test this hypothesis.

(15 points) A metropolitan area’s gross per capita product is y = Y /N. Show that if Eq. 1 holds, then

log y ≈ β 0 + β 1 log N

How are β 0 and β 1 related to c and b?

(15 points) The data files gmp 2006.csv and pcgmp 2006.csv on the class website contain the total gross metropolitan product (Y ) in millions of dollars, and the per capita gross metropolitan product (y) in dollars, for all metropolitan areas in the US in 2006. Read them in and use them to calculate the metropolitan populations (N ). If it’s done correctly, then running summary on the population figures should give

Min. 1st Qu. Median Mean 3rd Qu. Max. 54980 135600 231500 680900 530900 18850000

(Your exact results may differ very slightly because of rounding and display settings.) What is the variance of log y?

(20 points) Estimating the power-law scaling model. Use lm to linearly regress log per capita product, log y, on log population, log N. How does estimating this statistical model relate to Equation 1? What are the es- timated coefficients? Are they compatible with the idea of supra-linear scaling? What is the mean squared error for log y?
(15 points) Plot per capita product y against N , along with the fitted power-law relationship from problem 3. (Be careful about logs!)
(15 points) Fit a non-parametric smoother to log y and log N. (You can use kernel regression, a spline, or any other non-parametric smoother.) What is the mean squared error for log y? Describe, in words, how this curve compares to the power-law model from problem 3.
(20 points) Using the method from lecture 10, section 1, test whether the power-law relationship is correctly specified. What is the p-value? What do you conclude about the validity of the power-law model, based not just on this problem but the previous ones as well?

Data Analysis Density Estimation, Exercises - Engineering, Exercises of Advanced Data Analysis

Related documents

Partial preview of the text

Download Data Analysis Density Estimation, Exercises - Engineering and more Exercises Advanced Data Analysis in PDF only on Docsity!

Homework 6: Nice Demo City, But Will It Scale?

36-402, Advanced Data Analysis

Due at the start of class, 21 February 2011