Correlation and Convolution in Image Processing: CMSC 426 Class Notes - Prof. David Jacobs | Study notes Computer Science

Correlation and Convolution

Class Notes for CMSC 426, Fall 2005

David Jacobs

Introduction

Correlation and Convolution are basic operations that we will perform to extract

information from images. They are in some sense the simplest operations that we can

perform on an image, but they are extremely useful. Moreover, because they are simple,

they can be analyzed and understood very well, and they are also easy to implement and

can be computed very efficiently. Our main goal is to understand exactly what

correlation and convolution do, and why they are useful. We will also touch on some of

their interesting theoretical properties; though developing a full understanding of them

would take more time than we have.

These operations have two key features: they are shift-invariant, and they are linear.

Shift-invariant means that we perform the same operation at every point in the image.

Linear means that this operation is linear, that is, we replace every pixel with a linear

combination of its neighbors. These two properties make these operations very simple;

it’s simpler if we do the same thing everywhere, and linear operations are always the

simplest ones.

We will first consider the easiest versions of these operations, and then generalize. We’ll

make things easier in a couple of ways. First, convolution and correlation are almost

identical operations, but students seem to find convolution more confusing. So we will

begin by only speaking of correlation, and then later describe convolution. Second, we

will start out by discussing 1D images. We can think of a 1D image as just a single row

of pixels. Sometimes things become much more complicated in 2D than 1D, but luckily,

correlation and convolution do not change much with the dimension of the image, so

understanding things in 1D will help a lot. Also, later we will find that in some cases it is

enlightening to think of an image as a continuous function, but we will begin by

considering an image as discrete, meaning as composed of a collection of pixels.

Notation

We will use uppercase letters such as I and J to denote an image. An image may be

either 2D (as it is in real life) or 1D. We will use lowercase letters, like i and j to denote

indices, or positions, in the image. When we index into an image, we will use the same

conventions as Matlab. First, that means that the first element of an image is indicated by

1 (not 0, as in Java, say). So if I is a 1D image, I(1) is its first element. Second, for 2D

images we give first the row, then the column. So I(3,6) is the pixel in the third row of

the image, and the sixth column.

An Example

Correlation and Convolution in Image Processing: CMSC 426 Class Notes - Prof. David Jacobs, Study notes of Computer Science

Related documents

Partial preview of the text

Download Correlation and Convolution in Image Processing: CMSC 426 Class Notes - Prof. David Jacobs and more Study notes Computer Science in PDF only on Docsity!