Prepare for your exams
Get points
Guidelines and tips
Sell on Docsity
Docsity AI

Prepare for your exams

Study with the several resources on Docsity

Earn points to download

Earn points by helping other students or get them with a premium plan

Guidelines and tips

Sell on Docsity

Docsity AI

Log in Sign up

Prepare for your exams

Study with the several resources on Docsity

Find documents

Prepare for your exams with the study notes shared by other students like you on Docsity

Search for your university

Find the specific documents for your university's exams

Docsity AINEW

Summarize your documents, ask them questions, convert them into quizzes and concept maps

Explore questions

Clear up your doubts by reading the answers to questions asked by your fellow students

Earn points to download

Earn points by helping other students or get them with a premium plan

Share documents

20 Points

For each uploaded document

Answer questions

5 Points

For each given answer (max 1 per day)

All the ways to get free points

Get points immediately

Choose a premium plan with all the points you need

Study Opportunities

Choose your next study program

Get in touch with the best universities in the world. Search through thousands of universities and official partners

Community

Ask the community

Ask the community for help and clear up your study doubts

Free resources

Our save-the-student-ebooks!

Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors

Texture Synthesis: Implementing Efros and Leung's Method - Prof. David Jacobs, Assignments of Computer Science

University of Maryland Computer Science

Prof. David Jacobs

The steps to implement the texture synthesis method by efros and leung as described in their 1999 paper. The goal is to generate new pixels in an image using a neighborhood of already generated pixels. The sum of squared differences (ssd) method and its relationship to correlation, as well as implementing the findmatches function. Students are encouraged to use their own sample textures and modify the program for color images.

Typology: Assignments

Pre 2010

Uploaded on 02/13/2009

koofers-user-64r 🇺🇸

9 documents

1 / 9

This page cannot be seen from the preview

Don't miss anything!

Problem Set 3 Solutions

CMSC 426

Thursday, October 13

Background Subtraction 15 points

For this problem, you are given a set of 100 images of background, I1, I2, … I100. and one

test image, J. These are on the class web page in BackgroundImages.zip and

foreground_image.jpg. Your task is to classify each pixel in the test image as either

foreground or background. Suppose pixel J(x,y) has intensity k. To classify it, you

should compute:

Once you’ve computed this for each pixel, you’ll need to choose a threshold, T, so that

you classify all pixels as foreground when P(J(x,y)=k)<T. Choose a value of T by hand

that seems to produce pleasing results. Our results are in

BackgroundSubtractionResults.jpg. Turn in your code, a picture of your result, and

indicate which threshold you used. Our results are included on the class web site for

comparison.

Hint: As usual, you can achieve better performance by minimizing the amount of looping

you do. However, when dealing with 100 images, you must be a little careful to avoid

using up too much memory; in my implementation I do loop through the images rather

than trying to perform some operations on all of them at once.

We can do this with the assistance of the following two

functions. First, we really need a function to read in 100

images; doing this by hand would be too tedious.

function WT = read_waving_trees(st, en)

% There were really more than 100 images I was playing with. So I made

% this function general enough to read in any subset of them. st and

% en should be integers that indicate the first and last images that we

% want to use. These could just be set to 0 and 99.

WT = [];

% We will store the images in a 3D matrix. WT(:,:,i) will contain

% image i.

for i = st:en

is = int2str(i);

zeros = [];

for j = 1:(5-length(is))

zeros = [zeros, '0'];

end

% The file name will have either 2, 3 or 4 zeros depending on the

% number of characters needed to represent the current image

% number.

fn = ['b', zeros, is, '.bmp'];

( )( )



=











−−

== 100

12

2

)),((

exp

2100

1

,

i

iyxIk

kyxJP

σ

πσ

Discover Assignments of Computer Science University of Maryland

Partial preview of the text

Download Texture Synthesis: Implementing Efros and Leung's Method - Prof. David Jacobs and more Assignments Computer Science in PDF only on Docsity!

Problem Set 3 Solutions

CMSC 426

Thursday, October 13

Background Subtraction 15 points

For this problem, you are given a set of 100 images of background, I 1 , I 2 , … I 100. and one

test image, J. These are on the class web page in BackgroundImages.zip and

foreground_image.jpg. Your task is to classify each pixel in the test image as either

foreground or background. Suppose pixel J(x,y) has intensity k. To classify it, you

should compute:

Once you’ve computed this for each pixel, you’ll need to choose a threshold, T , so that

you classify all pixels as foreground when P(J(x,y)=k)<T. Choose a value of T by hand

that seems to produce pleasing results. Our results are in

BackgroundSubtractionResults.jpg. Turn in your code, a picture of your result, and

indicate which threshold you used. Our results are included on the class web site for

comparison.

Hint: As usual, you can achieve better performance by minimizing the amount of looping

you do. However, when dealing with 100 images, you must be a little careful to avoid

using up too much memory; in my implementation I do loop through the images rather

than trying to perform some operations on all of them at once.

We can do this with the assistance of the following two

functions. First, we really need a function to read in 100

images; doing this by hand would be too tedious.

function WT = read_waving_trees(st, en) % There were really more than 100 images I was playing with. So I made % this function general enough to read in any subset of them. st and % en should be integers that indicate the first and last images that we % want to use. These could just be set to 0 and 99. WT = []; % We will store the images in a 3D matrix. WT(:,:,i) will contain % image i. for i = st:en is = int2str(i); zeros = []; for j = 1:(5-length(is)) zeros = [zeros, '0']; end % The file name will have either 2, 3 or 4 zeros depending on the % number of characters needed to represent the current image % number. fn = ['b', zeros, is, '.bmp'];

=

100

1

2

exp

i

PJ x y k k Ii^ x y

σ π^ σ

% Notice we can concatenate strings together just as we do vectors, % since a string is a vector of characters. WT = cat(3, WT, rgb2gray(imread(fn))); % We convert everything to gray scale, since we don’t use color. % Notice that cat can be used to concatenate matrices in the 3 rd % dimension. end

Now we do the real work. B will contain the background images, in the form produced by read_waving_trees. I contains the foreground image (we’ll read this in and convert it to gray with interactive commands).

function P = background_probability(B, I, sigma)

% B will be a 3D matrix in which B(:,:,i) is an image.

P = zeros(size(I));

% We will sum the probabilities into this matrix.

ID = double(I);

for i = 1:size(B,3)

% I'm going to iterate to avoid using large amounts of memory.

D = ID - double(B(:,:,i));

% D now contains the difference between the foreground image and background image

% i.

P = P + (1/(100sigmasqrt(2pi))) * exp( (-(D.^2))./(2sigma^2));

% This implements the equation given in the problem.

end

We can turn this result into a binary array in which foreground is 1, with a command like:

P < SomeThreshold.

A sigma that is a small integer makes sense to use in computing this, but to find a good

value for SomeThreshold one really just has to use trial and error.

Challenge Problem: Up to 20 points

See if you can improve on this performance by using a statistical model that does not

assume that every pixel is independent. For example, you could try to model the

distribution of pairs of pixels, or make use of the fact that if I(x,y) = k, it is somewhat

likely that in the next image, I(x+1,y)=k. Describe what you did and whether it worked.

Note that this is a vague problem, and I’m not sure whether really good results can be

obtained. But any good ideas that you try will get some credit, whether they work or not.

Texture Synthesis

The goal of this problem is to implement the texture synthesis method of Efros and

Leung. This is described in the paper: ``Texture Synthesis by Non-parametric

Sampling’’, by Efros and Leung, in the International Conference on Computer Vision,

n i n

n

D x y j n Sx i y j Ti j

2

To see how this is related to correlation, let’s open up the expression:

^ (^ )^ (^ )^

=− =− =− =− = − =−

=− =−

n i n

n j n

n i n

n j n

n i n

n j n

n i n

n j n

n i n

n j n

S x i y j Ti jSx i y j Ti j

Sx i y j Ti j Sx i y j Ti j

Dx y Sx i y j Ti j

(^22)

2

The bottom line shows that to compute SSD we can combine 3 separate

summations. You should notice that the middle term is just -2 times the result of

correlating the template, T, with the sample image, S. We know how to compute

this, for example, by using imfilter. This also gives us a sense of the relationship

between SSD and correlation; in general, the higher the correlation between the

template and the sample at some location, the lower the SSD will be. This is why

correlation alone is sometimes used as a way of finding locations where the

template and sample are similar.

The third term does not depend on S, x, or y at all. It only depends on the

template, T. So it doesn’t have to be computed at every point in S , it just has to be

computed once (you should be able to compute it without looping).

So the only problem we have left is to compute the first term:

=− =− (^ + + )

n i n

n

j nS^ x i y j

2

This only depends on the squared value of the sample image. Given this squared

sample image, we need to add up all the values in a square region of S. You

should try to figure out how to do this using imfilter (and no loops).

Things get a little more complicated when we have a mask. We can write this as:

n i n

n

D ( x , y ) j n Sx i , y j T ( i , j ) M ( i , j )

2

In this case, for every summation, we only want to use the pixels in the template

that are part of the mask. For example, if we make some of the pixels in the

template 0, these will not have any effect when we correlate the template with S,

or when we compute the sum of squared values in the template. When we

compute the first of the three terms in SSD, we must filter the squared sample

values so that we will only count the valid values in the template.

You can test your function using the my_checkerboard function. Execute the call:

SSD(my_checkerboard(3,3),[0 1 1; 1 0 -1; 1 0 0], [1 1 1; 1 1 0; 1 1 1])

and my program produces:

SSD(my_checkerboard(3,3),[0 1 1; 1 0 -1; 1 0 0], [1 1 1; 1 1 0; 1 1 1])

ans =

Turn in a printout of the results, along with your code.

You may be worried about how to compute SSD when the template goes outside

the boundary. Don’t worry about this. You can do anything that produces

reasonable results. For example, you can just use the default values of imfilter,

which treats pixels outside S as if they were 0. This is what I did in the example

above. This will work fine, since these areas then will generally not match your

template very well.

2. 15 points: Using SSD and the pseudocode below, implement the function

FindMatches. This should find all candidate pixels in the sample that have a

neighborhood that is sufficiently similar to the template. In finding the best

matches it is also important that you not loop through the entire image. Here

is one useful Matlab trick for avoiding such loops. You can index into a

matrix using a binary array that is the same size as the matrix. You will then

get back a vector with all the matrix elements that are at locations where the

index was 1. For example:

a = rand(3,3)

a =

>> a(a>.5)

ans =

Pixel.value = BestMatch.value end return Image end

Function GetUnfilledNeighbors() returns a list of all unfilled pixels that have filled

pixels as their neighbors (the image is subtracted from its morphological dilation). The

list is randomly permuted and then sorted by decreasing number of filled neighbor pixels.

GetNeigborhoodWindow() returns a window of size WindowSize around a given pixel.

RandomPick() picks an element randomly from the list. FindMatches() is as follows:

function FindMatches(Template,SampleImage) ValidMask = 1s where Template is filled, 0s otherwise TotWeight = sum i,j ValidMask(i,j) for i,j do for ii,jj do dist = (Template(ii,jj)-SampleImage(i-ii,j-jj))^ SSD(i,j) = SSD(i,j) + distValidMask(ii,jj)GaussMask(ii,jj) end SSD(i,j) = SSD(i,j) / TotWeight end PixelList = all pixels (i,j) where SSD(i,j) <= min(SSD)*(1+ErrThreshold) return PixelList end

In our implementation the constant were set as follows: ErrThreshold = 0.1. Pixel

values are in the range of 0 to 1.

Solution

I’ll just put the whole solution here at the bottom, with comments.

function I = GrowImage(S, n, w, h) if n/2 == round(n/2) error('n must be odd'); end nh = (n-1)/2; Ipad = init(S,n,w,h,nh); S = double(S); PixelList = GetUnfilledNeighbors(Ipad); count = 0; while ~isempty(PixelList) count=count+ % I just print out count so I can see how far along we are. for i = 1:size(PixelList,2) Pixel = PixelList(:,i); Template = GetWindow(Pixel, Ipad, nh); % Template may contain -1 or -2; we don't use those pixels. BMs = FindMatches(Template, S); BM = RandomPick(BMs); Ipad(Pixel(1),Pixel(2)) = BM; end PixelList = GetUnfilledNeighbors(Ipad); end I = Ipad((nh+1):(h+nh), (nh+1):(w+nh));

function BM = RandomPick(BMs) % Pick a column at random ind = 1+floor(rand*length(BMs)); % This gives us a random number between 1 and length(BMs). BM = BMs(ind);

function BMs = FindMatches(Tem, S) % This function will compute the SSD between the template and every % equal-sized window in S. If the Template contains a -1, we want to % ignore that pixel in computing the SSD. Then we want to return the % pixel values in S at the center of every region that has a distance % to the template that is no more than 1.1 times the distance of the % nearest region. THRESHOLD = 1.1; ValidMask = Tem>=0; % We only use pixels in the template that are non-negative. D = SSD(S, Tem, ValidMask); Best = max(0,min(D(:))); % With round-off error, we can get a best distance that is negative, % and this causes trouble. BMs = S(D <= Best*THRESHOLD);

function D = SSD(S, Tem, M) ker = Tem.M; % This is the part of the template, Tem, that has valid values, as % indicated by the mask, M. Anything that is 0 in M is 0 in ker too. Ssq = S.^2; S2 = imfilter(Ssq, double(M)); % S2 will contain, for each pixel, the sum of squares of values in S % that are in the neighborhood of that pixel, and that play a part in % comparing the region around that pixel to the template in Tem. S_Tem = imfilter(S, ker); % This is the result of correlating S and Tem, again only using the % valid parts of Tem. Tem2 = sum(sum(ker.^2)); % Sum of squares of valid entries in Tem. Note that this is a scalar, % while S2 and S_Tem are matrices. This is because the value computed % in Tem2 doesn’t depend on pixel position. But we can still combine % these with no problem. D = S2 + Tem2 - 2S_Tem;

function Tem = GetWindow(P, I, nh); % Extract an 2nh+1x2nh+1 window from I that is centered on pixel P. Tem = I((P(1)-nh):(P(1)+nh), (P(2)-nh):(P(2)+nh));

function PL = GetUnfilledNeighbors(I) % We want to find every -1 that is next to something that's not -1. % For speed, it is important to do this without looping, using % imfilter and find. These are returned as a 2xn matrix, where every % column gives the position of a pixel that is -1, but next to % something not -1. The first row of PL gives the row position of the % pixels, the second row gives the column position. Id = imfilter(I>=0, [0 1 0; 1 0 1; 0 1 0]) > 0; % Id will indicate whether the number of neighboring pixels that are % non-negative is bigger than 0. Note that I only used four neighbors. % One could probably use a neighborhood of eight pixels equally well.

Texture Synthesis: Implementing Efros and Leung's Method - Prof. David Jacobs, Assignments of Computer Science

Related documents

Partial preview of the text

Download Texture Synthesis: Implementing Efros and Leung's Method - Prof. David Jacobs and more Assignments Computer Science in PDF only on Docsity!

Problem Set 3 Solutions

CMSC 426

Thursday, October 13

Background Subtraction 15 points

For this problem, you are given a set of 100 images of background, I 1 , I 2 , … I 100. and one

test image, J. These are on the class web page in BackgroundImages.zip and

foreground_image.jpg. Your task is to classify each pixel in the test image as either

foreground or background. Suppose pixel J(x,y) has intensity k. To classify it, you

should compute:

Once you’ve computed this for each pixel, you’ll need to choose a threshold, T , so that

you classify all pixels as foreground when P(J(x,y)=k)<T. Choose a value of T by hand

that seems to produce pleasing results. Our results are in

BackgroundSubtractionResults.jpg. Turn in your code, a picture of your result, and

indicate which threshold you used. Our results are included on the class web site for

comparison.

Hint: As usual, you can achieve better performance by minimizing the amount of looping

you do. However, when dealing with 100 images, you must be a little careful to avoid

using up too much memory; in my implementation I do loop through the images rather

than trying to perform some operations on all of them at once.

We can do this with the assistance of the following two

functions. First, we really need a function to read in 100

images; doing this by hand would be too tedious.

exp

PJ x y k k Ii^ x y

function P = background_probability(B, I, sigma)

% B will be a 3D matrix in which B(:,:,i) is an image.

P = zeros(size(I));

% We will sum the probabilities into this matrix.

ID = double(I);

for i = 1:size(B,3)

% I'm going to iterate to avoid using large amounts of memory.

D = ID - double(B(:,:,i));

% D now contains the difference between the foreground image and background image

% i.

P = P + (1/(100sigmasqrt(2pi))) * exp( (-(D.^2))./(2sigma^2));

% This implements the equation given in the problem.

end

We can turn this result into a binary array in which foreground is 1, with a command like:

P < SomeThreshold.

A sigma that is a small integer makes sense to use in computing this, but to find a good

value for SomeThreshold one really just has to use trial and error.

Challenge Problem: Up to 20 points

See if you can improve on this performance by using a statistical model that does not

assume that every pixel is independent. For example, you could try to model the

distribution of pairs of pixels, or make use of the fact that if I(x,y) = k, it is somewhat

likely that in the next image, I(x+1,y)=k. Describe what you did and whether it worked.

Note that this is a vague problem, and I’m not sure whether really good results can be

obtained. But any good ideas that you try will get some credit, whether they work or not.

Texture Synthesis

The goal of this problem is to implement the texture synthesis method of Efros and

Leung. This is described in the paper: ``Texture Synthesis by Non-parametric

Sampling’’, by Efros and Leung, in the International Conference on Computer Vision,

D x y j n Sx i y j Ti j

To see how this is related to correlation, let’s open up the expression:

^ (^ )^ (^ )^

S x i y j Ti jSx i y j Ti j

Sx i y j Ti j Sx i y j Ti j

Dx y Sx i y j Ti j

The bottom line shows that to compute SSD we can combine 3 separate

summations. You should notice that the middle term is just -2 times the result of

correlating the template, T, with the sample image, S. We know how to compute

this, for example, by using imfilter. This also gives us a sense of the relationship

between SSD and correlation; in general, the higher the correlation between the

template and the sample at some location, the lower the SSD will be. This is why

correlation alone is sometimes used as a way of finding locations where the

template and sample are similar.

The third term does not depend on S, x, or y at all. It only depends on the

template, T. So it doesn’t have to be computed at every point in S , it just has to be

computed once (you should be able to compute it without looping).

So the only problem we have left is to compute the first term:

=− =− (^ + + )

j nS^ x i y j

This only depends on the squared value of the sample image. Given this squared

sample image, we need to add up all the values in a square region of S. You

should try to figure out how to do this using imfilter (and no loops).

Things get a little more complicated when we have a mask. We can write this as: