























































Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
The first lecture slides from a computer vision course taught by marshall tappen at marshall university in fall 2009. The slides cover an introduction to the course, grading policies, required software environments, and an overview of image processing through convolution. The lecture also includes examples of image blurring using different kernel sizes and the mathematical representation of convolution.
Typology: Study Guides, Projects, Research
1 / 63
This page cannot be seen from the preview
Don't miss anything!
























































CAP 5415 – Computer Vision Marshall Tappen Fall 2009 Lecture 1
About Me − (Relatively) New Faculty Member (Three Years) − (^) Interested in Machine Vision and Machine Learning − (^) Happy to chat with you at almost any time (^) May want to e-mail me first − Office Hours: (^) Tuesday-Thursday after class
Finishing the problem sets will require access to an interpreted environment − (^) MATLAB − Octave − Numerical Python NO COMPILED LANGUAGES!!!!! − (^) No C/C++ − (^) No Java − No x86 Assembler My Compiled Languages Rant
MATLAB − Pro:Well-established package. You can find many tutorials on the net. − Con: Not free. If your lab does not already have it, talk to me about getting access. Octave − Free MATLAB look-alike − (^) Pro: Should be able to handle anything you will do in this class − Con: “Should be”. I'm not sure about support in Windows
We will use it We will be talking about mathematical models of images and image formation This class is not about proving theorems My goal is to have you build intuitions about the models Try and visualize the computation that each equation is expressing Basic Calculus and Basic Linear Algebra should be sufficient
We will use Szeliski Book – Free this year! Not required – more of a reference
For now, we won't worry about the physical aspects of getting images View image as an array of continuous values
What if we wanted to blur this image? We could take a local average − Replace each pixel with the mean of an NxN pixel neighborhood surrounding that pixel.
Original Averaged
Original Averaged
Input Image Kernel
Multiply corresponding numbers and add
Input Image
Mathematically expressed as Resulting Image Input Image Convolution Kernel
Let’s say i = 10 and j= Which location in K is multiplied by I (5,5)? I (5,4) Resulting Image Input Image Convolution Kernel