Problem Set 6: Mapping Data Noise to Solution Error | GEOS 567, Assignments of Geology

Material Type: Assignment; Class: Inverse Problems in Geophysics; Subject: GEOSCIENCES; University: University of Arizona; Term: Fall 2007;

Typology: Assignments

Pre 2010

Uploaded on 08/31/2009

koofers-user-tk6-1
koofers-user-tk6-1 🇺🇸

10 documents

1 / 2

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
Geos 567: Inverse Problems Fall 2007
Problem Set # 6: Mapping Data Noise to Solution Error
Due: Thursday, 11 October 2007
This homework builds on HW#5. Assume that the b
1
,b
2
data from HW#5 are actually
noisy y
1
,y
2
values to which you are finding the least squares straight line solution y = m
1
+ m
2
z. Assume that the y
1
and y
2
values were taken at z
1
=2, and z
2
=7, respectively.
This homework should help you see how noise propagates into the solution, and the
important role that data covariance and correlation coefficient matrices play.
1. Find (and print) the (unweighted) least squares solutions (m
1
,m
2
)
T
and predicted data
(y
1
,y
2
)
Tpredicted
for the mean values of y
1
,y
2
for Experiments 1-50, 51-100, and 1-100,
respectively, recalling that unweighted implies unit data covariance matrices for each
case. Calculate (and print) the a posteriori model covariance matrix (see Equation
2.63 (page 25) of the Notes) for each case. Calculate (and print) the model
correlation coefficient matrix (see Equation 2.60, page 24) for each case.
2. Find (and print) the weighted least squares solutions (m
1
,m
2
)
T
and predicted data
(y
1
,y
2
)
Tpredicted
for the mean values of the y
1
,y
2
values for Experiments 1-50, 51-100,
and 1-100, respectively, using the data covariance matrices from HW#5 for the
weighting. Calculate (and print) the a posteriori model covariance matrix and model
correlation coefficient matrices for each case (Equations 2.63 and 2.60). Are they
different from the ones you calculated in Problem 1? Are the ones from Problem 1 or
2 ‘better’, and why? Finally, are your solutions (m
1
,m
2
)
T
and predicted data
(y
1
,y
2
)
Tpredicted
different from those calculated in Problem 1 above? Does this surprise
you? Explain.
3. Now generate (m
1
,m
2
)
T
data sets for Experiments 1-50, 51-100, and 1-100,
respectively, using m = G
LS-1
d. That is, you will get a set of 50 (m
1
,m
2
)
T
pairs for
Experiments 1-50, etc. Use G
LS-1
from problem 1, and note that while d and m will
vary from experiment to experiment, G
LS-1
does not. Calculate (and print) the mean
value for (m
1
,m
2
)
T
for Experiments 1-50, 51-100, and 1-100, respectively, and
compare with your results in Problem 1.
4. Using Equation 2.59 of the Notes, calculate (and print) the a posteriori model
covariance matrix for each case (i.e., Experiments 1-50, 51-100, and 1-100).
Compare the results with those obtained in Problem 2 above. Do the results surprise
you? Explain.
pf2

Partial preview of the text

Download Problem Set 6: Mapping Data Noise to Solution Error | GEOS 567 and more Assignments Geology in PDF only on Docsity!

Geos 567: Inverse Problems Fall 2007

Problem Set # 6: Mapping Data Noise to Solution Error

Due: Thursday, 11 October 2007

This homework builds on HW#5. Assume that the b 1 ,b 2 data from HW#5 are actually noisy y 1 ,y 2 values to which you are finding the least squares straight line solution y = m (^1)

  • m 2 z. Assume that the y 1 and y 2 values were taken at z 1 =2, and z 2 =7, respectively. This homework should help you see how noise propagates into the solution, and the important role that data covariance and correlation coefficient matrices play.
  1. Find (and print) the (unweighted) least squares solutions (m 1 ,m 2 )T^ and predicted data (y 1 ,y 2 ) Tpredicted for the mean values of y 1 ,y 2 for Experiments 1-50, 51-100, and 1-100, respectively, recalling that unweighted implies unit data covariance matrices for each case. Calculate (and print) the a posteriori model covariance matrix (see Equation 2.63 (page 25) of the Notes) for each case. Calculate (and print) the model correlation coefficient matrix (see Equation 2.60, page 24) for each case.
  2. Find (and print) the weighted least squares solutions (m 1 ,m 2 )T^ and predicted data (y 1 ,y 2 ) Tpredicted for the mean values of the y 1 ,y 2 values for Experiments 1-50, 51-100, and 1-100, respectively, using the data covariance matrices from HW#5 for the weighting. Calculate (and print) the a posteriori model covariance matrix and model correlation coefficient matrices for each case (Equations 2.63 and 2.60). Are they different from the ones you calculated in Problem 1? Are the ones from Problem 1 or 2 ëbetterí, and why? Finally, are your solutions (m 1 ,m 2 )T^ and predicted data (y 1 ,y 2 ) Tpredicted different from those calculated in Problem 1 above? Does this surprise you? Explain.
  3. Now generate (m 1 ,m 2 )T^ data sets for Experiments 1-50, 51-100, and 1-100, respectively, using m = GLS -1^ d. That is, you will get a set of 50 (m 1 ,m 2 )T^ pairs for Experiments 1-50, etc. Use GLS -1^ from problem 1, and note that while d and m will vary from experiment to experiment, GLS -1^ does not. Calculate (and print) the mean value for (m 1 ,m 2 )T^ for Experiments 1-50, 51-100, and 1-100, respectively, and compare with your results in Problem 1.
  4. Using Equation 2.59 of the Notes, calculate (and print) the a posteriori model covariance matrix for each case (i.e., Experiments 1-50, 51-100, and 1-100). Compare the results with those obtained in Problem 2 above. Do the results surprise you? Explain.

Geos 567: Inverse Problems Fall 2007

  1. Plot the solutions (m 1 ,m 2 )T^ for Experiments 1-50, Experiments 51-100, and Experiments 1-100 as separate full-page figures on separate pages, as you did for the b 1 ,b 2 distributions in HW#5. Use different symbols for the 1-50 and 51-100 parts of the solution. Use MATLABís axis [-5 15 -2 4] command to have m 1 vary from -5 to 15 and m 2 vary from -2 to 4 for all three plots. Put a symbol at the location of the mean value for (m 1 ,m 2 )T^ on each plot. Put a title on each plot that clearly identifies what is being plotted.
  2. The data covariance matrix for Experiments 1-100 has zero off-diagonal elements, indicating that the errors in the y 1 ,y 2 values are uncorrelated. The a posteriori model covariance matrix and the correlation coefficient matrix for Experiments 1-100 have non-zero off-diagonal entries, indicating correlated errors for the solution. That is, if you have a solution where the intercept (m 1 ) is, say, above the mean value for all experiments, then it is likely that the slope (m 2 ) will be below the mean value for the experiments. This is a general property of least square straight line solutions with uncorrelated data errors, and not the result of the particular dataset you are working with in this problem set. How can model errors be correlated in this case when data errors are not? Explain why the model errors are correlated in the way (sign) that they are. This problem requires some thought, looking at the figures youíve created, the G matrix, the least squares inverse operator, and the mapping between data space and model space by the inverse operator.
  3. The off-diagonal terms of the model parameter correlation coefficient matrices (and plots in Problem 5) vary/change systematically between the Experiments 1-50, 51- 100, and 1-100 cases. What is it about the data covariance and (especially) data correlation coefficient matrices that explain the variations? (Hint: the answer to Problem 6 is helpful here.)