Problem Set 6 with Solutions - Image Processing | CMSC 426 | Assignments Computer Science

Problem Set 6

CMSC 426

Assigned Thursday, November 10, Due Tuesday, November 22

1. Stereo Correspondence. For this problem set you will solve the stereo

correspondence problem using a shortest path algorithm, as described in class.

The goal of this algorithm is to find the lowest cost matching between the left and

right images, so that the matching obeys the epipolar, ordering, non-negative

disparity and uniqueness constraints. First, let’s define these:

a. The epipolar constraint tells us that we can match the images one row at a

time. So we have to solve a matching problem with 1D images, matching

pixels in a row in the left image to pixels in the same row of the right

image. Then we combine the results for every row.

b. The ordering constraint means that if pixel i in the left image matches

pixel j in the right image, then no pixel to the right of pixel i is allowed to

match a pixel to the left of pixel j.

c. The uniqueness constraint means that every pixel can match at most one

pixel. However, a pixel might be occluded, and match nothing.

d. Non-negative disparity means that no point should have negative disparity,

because all points are in front of the camera, and have positive depth.

e. Subject to these constraints we use a cost function to measure how good a

match is. If we match pixel i image I to pixel j in image J, the cost of this

match will be (I(i)-J(j))2. If any pixel is not matched, the cost of this is

OC, which is some constant occlusion cost. For the experiments below, I

suggest an occlusion cost of OC = 625, which is about the same as the cost

of matching two pixels with an intensity that differs by 25. However, feel

free to experiment with other values to try to improve the results.

We will define a graph with start and end nodes, so that a path from the start to the end

will encode a complete matching of the two images. At every step of the way, we can do

two possible things. One is to match two pixels, the second is to allow a pixel to go

unmatched. We will create a graph in which nodes represent which pixels have been

accounted for, edges represent possible matches or occlusions, and edge weights encode

the cost of matching. We will name a node in a way that indicates which pixels have

been matched so far. For example, if we reach node N(5,3) this will mean that the first

five pixels in image 1, and the first three pixels in image 2, have all been taken care of.

From N(5,3) we can go to N(6,4). This must mean that we take care of both nodes 6 and

4 in one step, by matching them together. Or we can go to node N(6,3). This means that

node 6 in the first image is taken care of by not matching it to anything. That is, node 6

in the first image is occluded. Likewise, we could go to N(5,4). We also need a special

start node, S. This is connected to N(0,1), N(1,0) and N(1,1). We need a special end

node, E. For example, if there are 9 pixels in each image, E will be N(9,9), indicating

that all pixels are accounted for. It will be connected to N(8,8), N(8,9), N(9,8).

Finally, we use edge weights to encode the cost of these choices.

E(N(i-1,j-1), N(i,j)) = (I1(i)-I2(j))2.

Problem Set 6 with Solutions - Image Processing | CMSC 426, Assignments of Computer Science

Related documents

Partial preview of the text

Download Problem Set 6 with Solutions - Image Processing | CMSC 426 and more Assignments Computer Science in PDF only on Docsity!

Problem Set 6

CMSC 426

Assigned Thursday, November 10, Due Tuesday, November 22

z=