











































































Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
This report is for final year project to complete degree in Computer Science. It emphasis on Applications of Computer Sciences. It was supervised by Dr. Abhisri Yashwant at Bengal Engineering and Science University. Its main points are: Block, Diagram, Hamburg, Taxi, Sequence, Visual, Surveillance, Gradient, Video, Flow, Chart
Typology: Study Guides, Projects, Research
1 / 83
This page cannot be seen from the preview
Don't miss anything!












































































Figure 2 - 2 : Shows the color confidence maps for a certain frame in the sequence used
x
Abstract
The techniques studied, implemented and presented are all premeditated in detail and then put into p ractice in this thesis. The techniques deliberated and implemented so far are Template Matching, Fast Mean Shift and Kalman Filter where as for object detection background subtraction through frame differencing is implemented. Still there is room for improvement and future enhancements which can improve the efficiency and increase performance level. Object tracking through Kalman filter is known for accuracy and precision so as Fast Mean Shift is also a very receptive method to be applied in practical implementations where as it is found after implementation and study that Template Matching is a lengthy procedure due to its correlation operations but withstands occlusion to some extent. So this thesis is an inclusive introduction to object tracking hierarchy and object tracking techniques with their implementation complications and experimental results.
xi
events, and recognition of objects. Video tracking is a vital and active research area in computer vision.
In its simplest form, tracking can be defined as the problem of estimating the trajectory of an object in the image plane as it moves around a scene. In other words, a tracker assigns consistent labels to the objects in question in different frames. Object tracking no doubt is an exigent problem. Impenetrability in tracking can arise due to abrupt motion of object, changing directions and appearances of the object and the scene, non rigid or articulated object structures as birds and human beings, object- to -object and object- to -scene occlusions, and camera motion [ 5]. Tracking is generally performed in the milieu of higher-level applications that require the location and/or shape of the object in every frame. Normally, assumptions are made to constrict and lessen the tracking problem s.
1.2 Scope and Objective
Moving object detection is important in many real-time image processing applications such as autonomous robotics, traffic control, and driver assistance and surveillance systems. Usually high resolution gray-scale images must be processed; since each image pixel may belong to a moving object, pixel- wise processing is required. Now days, video surveillance is an important and challenging field in computer vision for both indoor and outdo or environments. Organizations which need a surveillance system can easily get low priced surveillance cameras but they still need many security agents to keep a constant watch on all monitors. This approach is not efficient, and in fact most of the time video tapes or files are replayed a number of times to check on a particular event after it has happened thus the automation of this system is highly desired.
The project Moving object tracking from video sequences is an attempt to study some algorithms, which are robust for the tracking of mobile but non rigid objects from the image sequences precisely called video. The Figure 1-1 shows the basic working grid of the processes involved in the system as Figure 1-1 also includes pre and post processing sequences as well which includes noise removal, image enhancement issues, organization and classification etc. but these issues are not of primary
importance as far as this project is concerned so are not addressed in detail in this project.
Figure 1-1 Block Diagram of system
1.4 Thesis Outline
Chapter 1 includes the introduction of the problem with its objective and scope along with applications. The concept of object detection and its methods along with the method studied and implemented which is background subtraction is all engrossed in Chapter 2. Object tracking in detail and its techniques are described in chapter 3. Similarly, chapter 4 depicts the modus operandi used for object tracking during this project along with their experimental results including Template matching and Fast Mean Shift. The Kalman filter and it necessary details along with implementation complexities, algorithm and results are described in chapter 5 .It also includes the comprehensive comparison of techniques used. In the end whole of the work is concluded and few future work aspects are mentioned in chapter 6.
One of the most difficult problems in image processing is detecting particular objects in an image. For a human observer it is very easy to identify any object, however it is far more difficult for a machine. Numerous methods exist for detecting objects of known type in a particular environment or image. However, in many cases, the visual characteristics of the objects are unknown, or it is necessary to detect objects that are very different from each other. This kind of method has applications in the domain of robotics, particularly for robots that are designed to operate in a hostile or unknown environment.
Videos are the sequences of frames that run fast enough to give an effect of continuity as human eye perceives the frame sequences moving with a particular speed as a video. As far as object detection is concerned techniques of image processing are applied to the frames in order to identify any change so as to state the motion of object detected through the change observed in the two consecutive frames after attaining the results of the image processing techniques used for identification purpose. A surveillance system can be implemented in three steps. The first step consists of detecting the objects in motion .Then tracking them and finally High- level interpretation of the ongoing events. First step of detecting the object in question is described in detail with the experimental results in order to elaborate the mechanism in Chapter 2. Object detection is the first step in the motion tracking phenomena. The object detection is performed through background subtraction algorithm in this project though many other detection methods have been already developed and are widely in use.
2.2.2 Optical Flow
The optical flow is the disarticulation field allied to each of the pixels in a sequence. Such displacement field results from the apparent motion of the image brightness in time. For the computation of optical flow it is assumed that image brightness is continuous and differentiable as many times as needed in both the spatial and temporal domain. Estimating the optical flow is fundamental problem in low- level vision, and can be undoubtedly serve for many applications in image sequence processing. Most of the algorithms for the estimation of the optical flow concentrate on the goal of estimating the motion field between succeeding images in a sequence, disregarding the estimates obtained for the previous image pair.
If the apparent brightness of moving objects remain constant then the image brightness E over time is given by
dE dt (^) 0 (2.2)
Figure 2-1: Application in Visual Surveillance: Optical Flow computed on Carthe Hamburg taxi sequence
Since image brightness E is regarded as a function of both spatial coordinates of the image plane, x and y , and of time, that is,
E E x , y , t (2.3)
Via a chain rule of differentiation, the total temporal derivative reads as
dE xtdt , yt , t Exdxdt Eydydt E t 0 (2. 4)
2.2.3 Color based Method
This method is based on background modeling and subtraction using both color and edge information. Both the color and edge models and subtraction are computed separately [10]. For storage of results confidence maps are used rep resent ing how confident the method is in recognizing that a pixel is a foreground obje ct. Color and intensities of the previous image are compared with every new coming image and this difference signifies the motion. Background subtraction is done by performing the color -based subtraction and the edge-based subtraction separately and then combining the re sults [10]. Color -based subtraction is performed by subtracting the current image from the mean image in each color channel. For each pixel, the confidence is computed as
c
c cc c c
c M D M
MD mm m D M
D m C 100
A significant change in any color channel indicates motion and thus a foreground region is detected.
50 100 150 200 250 300
50 100 150 200
Figure 2-2 : Shows the color confidence maps for a certain frame in the sequence used
2.2.6 Experimental Results
The results achieved after applying the background subtraction algorithm using Frame Differencing technique are illustrated as under.
Figure 2-4 : The video results showing background subtraction in Frame 10 and 15
Figure 2-5 : The video results showing background subtraction in Frame 42 and 45
Figure 2-6: The video results showing background subtraction in Frame 86 and 92
In few cases where the background illumination or brightness becomes higher up to a certain level that it becomes detectable after subtraction like the object it also appears as shown in figure 2 - 4.
Figure 2-7 : Video Results showing high illumination in the background in frame 56 and 58
So, the experimental results shown in Figure 2- 1,2 - 2,2 -3 show the object in question which is a plane clearly detected from the background .Similarly, Figure 2-4 shows another dimension of the background subtraction algorithm that it has to be robust enough in order to overcome the raising illuminat ion and brightness factors.
2.3 Conclusion
Object detection is the lying at the start in the hierarchy of motion based tracking mechanism. It basically deducts the background and focuses the object of interest from the scene. Many methods can be used for background subtraction depending on color, edge and such kind of features .Some of the methods are described in brief in the preceding chapter .It is effectively used in real time applications as well .It is also feasible in correcting images deserts caused by inappropriate illumination and brightness effects .Thus background subtraction is one of the widely used and important technique as far as object detection in concerned.
out in automatic mode. Once the target is confirmed the control of the system is transferred to tracking state.
3.2.2 Tracking state
This stage should use computationally inexpensive techniques. Current location extracted by locking state is used for processing. Next position of the target is identified, and that positional information is stored in history database. If the target does not exist in the predicted window area, then the system control is transferred to recovery state.
3.2.3 Recovery state
Quite often the moving object of interest may be lost temporarily or permanently. In this state if the target is lost, the system will try to recover the target from low- resolution image. If the target is recovered in a few frames, then the system will transfer control to tracking state; otherwise it remains in recovery state till its predefined time expires. After the time is elapsed, control transfers to locking state.
3.3 Different Approaches for Object Tracking
Different methods have been used for moving object tracking given as under.
3.3.1 Correlation based method
The correlation based method simply operates on the principal of intriguing correspondence between the previous frame and every up coming frame. This mechanism finally results in the correlation vector of the previous and new frame which in fact gives the measure of relationship between both frames. Thus the difference indicates the motion and determines the new direction for the object of interest. Relatively similar approach is followed in template matching which is described in detail in chapter 3.In template matching the correlation is taken between the template and every up coming frame in the sequence which gives the similarity ratio in the form of a single point which is then resolved by plummeting the components around it to make it template for the next frame. Correlation value differs between -1 to 1 so whenever correlation methods is used in tracking some threshold is set by the .This threshold servers as a reference as how much the images of previous ad current frame match each other or differ from each other which finally determines the motion of the object if interest.
3.3.2 Feature - based Method s
In feature-based object detection, standardization of image features and registration (alignment) of reference points are important. The images may need to be transformed to another space for handling changes in illumination, orientation and size. One or more features are extracted and the objects of interest are modeled in terms of these features. Object detection and recognition then can be transformed in to a graph - matching problem. All the methods we have already presented were using features, but they can be qualified of trivial features. This method considers more sophisticated ones. We consider: The moments computed on the image The contour li nes extracted from the image Color transition