

Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Material Type: Assignment; Class: Data Analysis II; Subject: Statistics; University: University of Missouri - Columbia; Term: Spring 2007;
Typology: Assignments
1 / 2
This page cannot be seen from the preview
Don't miss anything!


Homework 3 STAT 8320 Due March 21, 2007
Problem 1: The following four measurements were made upon three species of iris (Fisher 1936): y 1 = sepal length, in mm. y 2 = sepal width, in mm. y 3 = petal length, in mm. y 4 = petal width, in mm. The data are attached in the file labelled problem 1. Additionally, they are an example data set in SAS. These data will be used to perform a discriminant analysis. a. First, perform a linear discriminant analysis on the complete data, assuming that the variance-covariance matrices are equal. Talk about the assumptions that you use. Addition- ally, discuss the resubstitution error estimates; specifically what is the estimate of the error rate, and what are the possible problems with using this procedure. b. Next, select 10 observations from each of the three species at random. Use these 30 observations to form a test set. Use the remaining observations to form a linear discriminant rule. Consider the measures of error from both resubstitution and based upon the test data set. Discuss their relative merits, and any differences in your conclusions. c. Finally, perform the analysis without assuming that the covariance matrices are equal. Look at the error rates from resubstitution and cross-validation. Discuss the differences between these methods. Additionally, what can you say about the assumption of equal variance-covariance matrix?
Problem 2: Suppose that we have data which we believe has arisen from the nonlinear model
Yi = θ 1 xθ 12 i exp(θ 3 x 2 i) + ≤i 1
a. Find the normal equations for this model, based upon minimizing the SSR. b. Consider the data attached. Find starting values for a convergence algorithm using ordi- nary linear regression. c. Find the forms necessary to perform the Gauss-Newton algorithm for a single update. Evaluate these forms based upon the data and your initial estimate from part b. Perform a single update of the parameters. d. Find the forms necessary to perform the Newton-Raphson algorithm for a single update. Evaluate these forms based upon the data and your initial estimates from part b. Perform a single update of the parameters. How does this update compare to that from part c? e. Find the final estimates θˆ for this data. Assume that the asymptotic results hold and form confidence intervals for each of the parameters. Additionally, form a prediction interval for y when x = (2, 2).