Data Filtering - Seismology - Lab Notes, Study notes of Geology

Fundamentals of these Lab Notes are as follows : Data Filtering, Resolution, Noise, Spreadsheet, Leave Column, Leap Years, Represented, Mathematically, General Form, Temperature

Typology: Study notes

2012/2013

Uploaded on 07/19/2013

pinkii
pinkii 🇮🇳

4.4

(7)

48 documents

1 / 3

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
Lab 2 :: Data Filtering and Noise Reduction GLY3160 / PHY3160
Page 1 of 3
LAB 2 :: DATA FILTERING AND NOISE REDUCTION
1. Set up an Excel spreadsheet and leave column A blank for now. Title column B, “Julian Day” and
populate the rest of the column with a list starting at 0 and ending after three years’ worth of days
have been entered (i.e. 1094 should be the last entry). Do not worry about leap years.
2. Weather.com states that the average daily high temperatures in Boone, NC range from 39oF in
January to 76oF in August. We will assume that these temperatures vary sinusoidally (i.e. they can be
mathematically represented by a sine function) and that day zero (Jan 1st) has the lowest daily high
temperature. Label column C “Temperature (F)” and calculate the predicted daily temperature
values using a sine function. Hint: a simple y = sin(x) will not work. The general form of a sine wave
of wavelength, λ, is: 𝑦 =sin ( 2𝜋 𝑥
𝜆 )
You will have to use your knowledge of mathematics to determine the equation for a sine wave that
has a maximum of 76 and a minimum of 39 with 39 occurring on day 0, 365, etc. Recall that like
virtually all programs, the sin function in Excel is in radians. Do not use a cosine function. While you
are tweaking your sin function, it will help you a great deal to make a plot to visually check your
results. I recommend that you start out plotting 𝑦 = sin (2𝜋 𝑥
𝜆) and then tweak this equation one
parameter at a time until you get the desired result. Make certain that your result matches the
appropriate wavelength, amplitude, and has the lowest daily temp on day 0, 365, etc… In your
write-up, you should state the type of model you are using, your model’s equation, and describe
what each portion of the equation represents. Do not write equations in Excel format in your
report! Your supervisor doesn’t care how Excel works. He/She wants to see the equations written
in normal mathematical notation.
3. Make a scatter plot with straight lines of your predicted daily high temperatures. Place this plot in
your spreadsheet so you will be able to see the graph change as you tweak parameters. Follow
these specific instructions: Title the plot “Figure 1” and label both axes. Make the horizontal and
vertical gridlines dotted @ 0.5 pt and have horizontal gridlines every 5o F and vertical gridlines every
half year. Make the graph span 0 - 1095 days in the horizontal and 30 - 85o F in the vertical direction.
Show minor tick marks on the vertical axis for each degree of temperature. Only label every 365
The following lab utilizes the computer program, Excel. In this exercise you will generate synthetic data sets based on a
simplified model of daily high temperatures in Boone and apply several filtering techniques to your data. A key to this lab is
that you use Excel in an efficient manner; otherwise, this exercise may take a long time to complete. A formal typed write-
up with referenced figures of your results is due by the beginning of the next lab period. The overarching purpose of this
lab is two-fold: 1) Perform some quantitative data processing and get results, and 2) Make a professional interpretation
and recommendation based on your results. This is more or less what consultants do.
Part I :: Noise & Resolution
Docsity.com
pf3

Partial preview of the text

Download Data Filtering - Seismology - Lab Notes and more Study notes Geology in PDF only on Docsity!

Page 1 of 3

LAB 2 :: DATA FILTERING AND NOISE REDUCTION

  1. Set up an Excel spreadsheet and leave column A blank for now. Title column B, “Julian Day” and populate the rest of the column with a list starting at 0 and ending after three years’ worth of days have been entered (i.e. 1094 should be the last entry). Do not worry about leap years.
  2. Weather.com states that the average daily high temperatures in Boone, NC range from 39oF in January to 76oF in August. We will assume that these temperatures vary sinusoidally (i.e. they can be mathematically represented by a sine function) and that day zero (Jan 1st) has the lowest daily high temperature. Label column C “Temperature (F)” and calculate the predicted daily temperature values using a sine function. Hint: a simple y = sin(x) will not work. The general form of a sine wave of wavelength, λ, is: 𝑦 = sin ( 2𝜋 𝑥𝜆 )

You will have to use your knowledge of mathematics to determine the equation for a sine wave that has a maximum of 76 and a minimum of 39 with 39 occurring on day 0, 365, etc. Recall that like virtually all programs, the sin function in Excel is in radians. Do not use a cosine function. While you are tweaking your sin function, it will help you a great deal to make a plot to visually check your results. I recommend that you start out plotting 𝑦 = sin (2𝜋 𝑥𝜆 ) and then tweak this equation one parameter at a time until you get the desired result. Make certain that your result matches the appropriate wavelength, amplitude, and has the lowest daily temp on day 0, 365, etc… In your write-up, you should state the type of model you are using, your model’s equation, and describe what each portion of the equation represents. Do not write equations in Excel format in your report! Your supervisor doesn’t care how Excel works. He/She wants to see the equations written in normal mathematical notation.

  1. Make a scatter plot with straight lines of your predicted daily high temperatures. Place this plot in your spreadsheet so you will be able to see the graph change as you tweak parameters. Follow these specific instructions: Title the plot “Figure 1” and label both axes. Make the horizontal and vertical gridlines dotted @ 0.5 pt and have horizontal gridlines every 5o^ F and vertical gridlines every half year. Make the graph span 0 - 1095 days in the horizontal and 30 - 85o^ F in the vertical direction. Show minor tick marks on the vertical axis for each degree of temperature. Only label every 365

The following lab utilizes the computer program, Excel. In this exercise you will generate synthetic data sets based on a simplified model of daily high temperatures in Boone and apply several filtering techniques to your data. A key to this lab is that you use Excel in an efficient manner; otherwise, this exercise may take a long time to complete. A formal typed write- up with referenced figures of your results is due by the beginning of the next lab period. The overarching purpose of this lab is two-fold: 1) Perform some quantitative data processing and get results, and 2) Make a professional interpretation and recommendation based on your results. This is more or less what consultants do.

Part I :: Noise & Resolution

Page 2 of 3

days on the horizontal axis. Leave the legend in the graph and call this series “Temp Model”. You will need it later. Plot your data with a dark red line at 2.25 pt. thickness.

  1. Although you now have a model to predict daily high temperatures in Boone, real data would never look like this due to noise. Let’s assume that your advisor is a cheapskate and wants to buy temperature gauges that have a +/- 10o^ F error. You task is to determine what level of error is acceptable if you are trying to predict the annual pattern of daily high temperatures. I.e. you do not care about accurately predicting each daily high temperature; you just want to be able to see the annual trends. Use the random number generator in Excel to add noise to your predictions simulating measurements made by a device with a +/- 10o^ F error. To do this, make a cell that will hold your error level variable. Label cell A1, “Temp Error +/-”, and below this enter the number 10. Before you attempt to use the random number generator in Excel, you should first play with the RAND() function to figure out how it works. What is the output range of RAND()? How can the range of RAND() be scaled? How can you use RAND() to give a +/- error range? This should be described in your write up. Once you have a good understanding on RAND(), use RAND() to add a +/- 10oF random noise to your data. Add this noisy data to your Figure 1 plot. Call this series “Noisy Data” and plot it with 3pt circles with no fill and a blue marker line color. Do not plot a connected line; only plot the symbols. Feel free to play with the error level in cell A2, but make sure that the plot you include in your write up has a +/- 10o^ F error level. Be sure to discuss the following questions in your write-up. a) What kind of modeling are you doing? What are the advantages and disadvantages of this type of model? You should discuss this in your report. b) Given the +/- 10o^ F error of the proposed temperature gauge, will the annual variation in daily high temperatures be detectable? c) At what level of error (in terms of +/-) does the annual pattern become completely obfuscated? d) What error level (given as +/-) would you recommend to your advisor and why? e) What is the minimum sampling rate (in days) needed to capture the annual variations in daily high temperatures? What sampling rate would you recommend to your advisor? f) Give one example of how this data set could become aliased and how you will avoid this in your study.

Because you were so helpful to your advisor in determining the necessary precision of temperature gauges, your advisor has given you the task of determining what types of filtering will be most effective at improving your data quality of your noisy temperature data (i.e. maximizing the signal to noise ratio).

Part II :: Stacking vs. Digital Filters