Smoothing Periodic Functional Data: Lecture 35 by Moo K. Chung, Study notes of Statistics

The smoothing of periodic functional data using diffusion methods. The lecture covers the motivation for considering periodic data, the laplacian on a unit circle, and the solution to the diffusion equation. The document also includes an explanation of the heat kernel expansion and the solution to the diffusion smoothing.

Typology: Study notes

Pre 2010

Uploaded on 09/02/2009

koofers-user-ex2
koofers-user-ex2 🇺🇸

5

(1)

10 documents

1 / 2

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
Stat 992: Lecture 35
Smoothing periodic functional data
Moo K. Chung [email protected]
April 23, 2004
1. Solution to Problem 34. Diffusion smoothing on
a unit circle. Motivation would be when we have
periodic functional data g(θ) = g(θ+L). This
periodic condition can be transformed to
g¡L
2π
2π
Lθ¢=g³L
2π¡2π
Lθ+ 2π¢´.
So any period Lfunction can be transformed to a
periodic function with period 2π. So without loss
of generality, we consider periodic data of the form
g(θ) = g(θ+ 2π)which is viewed as data defined
on a unit circle.
Laplacian on S1is trivially = 2
θ. This can
be seen from the line element dl = so the
Laplacian on a circle must be the Laplacian in 1D
Euclidean space. We need to solve tf=2
θf
with initial condition f(θ, 0) = g(θ). The so-
lution to an isotropic diffusion equation on mani-
fold is given as a heat kernel convolution by find-
ing the eigenfunctions of Laplacian first. We solve
2
θH+λH = 0. Trivially two independent so-
lutions are H= cos λθ, sin λθ with λ > 0.
To grantee the periodicity of the eigenfunctions,
λ=jZ+. So we have sequence of eigenfunc-
tions Hj= sinj θ, ˜
Hj= cosj θ.j= 0,1,2,···.
Note that R2π
0sin2 =R2π
0cos2 =π
from symmetry. So we normalize the eigenfunc-
tions Hj=1
πsin jθ, ˜
Hj=1
πcos . This gives an
orthonormal system of basis functions on S1.
From Lecture 21, heat kernel expansion,
Kt(θ1, θ2) = 1
π2
X
j=0
ej2t[sin(1) sin(jθ2)
+ cos(1) cos(jθ2)]
=1
π2
X
j=0
ej2tcos[j(θ1θ2)].
This makes intuitive sense since the kernel must
be isotropic. Then the solution to the diffusion
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
−2
−1.5
−1
−0.5
0
0.5
1
1.5
2
2.5
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
−6
−4
−2
0
2
4
6
Figure 1: Left: White noise N(0,1) and KσN(0,1).
Right: 1000 zero mean unit variance Gaussian field. We
are seeing the boundary effect of kernel smoothing.
smoothing is given by
f(θ, t) = Z2π
0
Kt(θ, φ)g(φ)
=1
π2
X
j=0
ej2tZ2π
0
g(φ) cos[j(θφ)] dφ.
Obviously it is much easier to discretize the diffu-
sion equation than implementing the above analyti-
cal solution exactly.
2. Solution to Problem 36 (Tulaya Limpiti). Simulate
zero mean isotropic Gaussian field in [0,1] and find
the corrected P-value numerically. First we gener-
ate 5000 Gaussian white noise N(0,1) apply iter-
ated Gaussian kernel smoothing.
weight= K(1,[-1 0 1])
>>weight =
0.2119 0.5761 0.2119
GRF=zeros(5000,99);
for k=1:5000
w=normrnd(0,1,101,1);
tem=w;
for j=1:20
for i=2:100
tem(i) = dot(w((i-1):(i+1)),weight);
end;
w=tem;
end;
pf2

Partial preview of the text

Download Smoothing Periodic Functional Data: Lecture 35 by Moo K. Chung and more Study notes Statistics in PDF only on Docsity!

Stat 992: Lecture 35

Smoothing periodic functional data

Moo K. Chung [email protected]

April 23, 2004

  1. Solution to Problem 34. Diffusion smoothing on a unit circle. Motivation would be when we have periodic functional data g(θ) = g(θ + L). This periodic condition can be transformed to

g

( L

2 π

2 π L

θ

= g

( L

2 π

( (^2) π L

θ + 2π

So any period L function can be transformed to a periodic function with period 2 π. So without loss of generality, we consider periodic data of the form g(θ) = g(θ + 2π) which is viewed as data defined on a unit circle. Laplacian on S^1 is trivially ∆ = ∂ θ^2. This can be seen from the line element dl = dθ so the Laplacian on a circle must be the Laplacian in 1D Euclidean space. We need to solve ∂tf = ∂^2 θ f with initial condition f (θ, 0) = g(θ). The so- lution to an isotropic diffusion equation on mani- fold is given as a heat kernel convolution by find- ing the eigenfunctions of Laplacian first. We solve ∂ θ^2 H + λH = 0. Trivially two independent so- lutions are H = cos

λθ, sin

λθ with λ > 0. To grantee the periodicity of the eigenfunctions,√ λ = j ∈ Z+. So we have sequence of eigenfunc- tions Hj = sin jθ, H˜j = cos jθ. j = 0, 1 , 2 , · · ·. Note that

∫ (^2) π 0 sin (^2) jθ dθ = ∫^2 π 0 cos (^2) jθ dθ = π from symmetry. So we normalize the eigenfunc- tions Hj = (^1) π sin jθ, H˜j = (^1) π cos jθ. This gives an orthonormal system of basis functions on S^1. From Lecture 21, heat kernel expansion,

Kt(θ 1 , θ 2 ) =

π^2

∑^ ∞

j=

e−j

(^2) t [sin(jθ 1 ) sin(jθ 2 )

  • cos(jθ 1 ) cos(jθ 2 )]

=

π^2

∑^ ∞

j=

e−j (^2) t cos[j(θ 1 − θ 2 )].

This makes intuitive sense since the kernel must be isotropic. Then the solution to the diffusion

−2 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

−1.

−0.

0

1

2

−6 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

0

2

4

6

Figure 1: Left: White noise N (0, 1) and Kσ ∗ N (0, 1). Right: 1000 zero mean unit variance Gaussian field. We are seeing the boundary effect of kernel smoothing.

smoothing is given by

f (θ, t) =

∫ (^2) π

0

Kt(θ, φ)g(φ) dφ

π^2

∑^ ∞

j=

e−j (^2) t

∫ (^2) π

0

g(φ) cos[j(θ − φ)] dφ.

Obviously it is much easier to discretize the diffu- sion equation than implementing the above analyti- cal solution exactly.

  1. Solution to Problem 36 (Tulaya Limpiti). Simulate zero mean isotropic Gaussian field in [0, 1] and find the corrected P-value numerically. First we gener- ate 5000 Gaussian white noise N (0, 1) apply iter- ated Gaussian kernel smoothing.

weight= K(1,[-1 0 1])

weight = 0.2119 0.5761 0.

GRF=zeros(5000,99); for k=1: w=normrnd(0,1,101,1); tem=w; for j=1: for i=2: tem(i) = dot(w((i-1):(i+1)),weight); end; w=tem; end;

−0.2−4 −3 −2 −1 0 1 2 3 4

0

1

Standard Normal Quantiles

Quantiles of Input Sample

QQ Plot of Sample Data versus Standard Normal

(^00) 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.

1

Figure 2: Left: QQ-plot of the suprima Gaussian ran- dom field. Right: Monte-Carlo version of P -value of the suprima field.

w=w(2:100); GRF(k,:)=w’; end; x=[1:99]/99; plot(x,w);

Generating random field in this way cause the boundary effect effect (Figure 1.). So we need to chop off fields near boundary. So we will only use w(10:90) and rerun the above code.

sup=max(GRF,[],2) pvalue=inline(’length(find(sup>=h))/5000’) x=[0:0.01:1.63]; for i=1:length(x) P(i)=pvalue(x(i),sup); end; plot(x,P,’linewidth’,2)

  1. Quantile-quantile plot (QQ-plot) QQ-plot was first introduced by Wilk and Gnanadesikan (1968). Quantile point q for random variable X is a point that satisfies P (X ≤ q) = FX (q) = p, where FX is the cumulative distribution function (cdf) of X. Assuming we can find the inverse cdf, q = F (^) X− 1 (p). There is a way to define the inverse of cdf (see Casella and Berger’s Statistical inference). Then we can define the quantile-quantile plot of two distributions X and Y to be a parametric curve (F (^) X− 1 (p), F (^) Y− 1 (p)) ∈ R^2 for p ∈ [0, 1]. Problem 48. Prove that the QQ plot of N (μ 1 , σ 12 ) and N (μ 2 , σ^22 ) is a straight line. What is the slope and the intersection? Bonus problem: Find the QQ plot of two lognormal distributions exp(N (μ 1 , σ 12 )), exp(N (μ 2 , σ 22 )) analytically.