+ All Categories
Home > Documents > Visual Object Tracking

Visual Object Tracking

Date post: 24-Feb-2016
Category:
Upload: konala
View: 90 times
Download: 0 times
Share this document with a friend
Description:
A review on:. Visual Object Tracking. Alireza Asvadi. Video-Vigilance and Biometrics. University of Coimbra. May 2014. Outline:. Definition and application. “Object tracking” from different points of view. Yilmaz et al. 06. Yang et al. 11. David Forsyth & Jean Ponce 12. - PowerPoint PPT Presentation
Popular Tags:
27
isual Object Trackin Alireza Asvadi 1 University of Coimbra May 2014 Video-Vigilance and Biometrics A review on:
Transcript
Page 1: Visual Object Tracking

1

Visual Object Tracking

Alireza Asvadi

University of Coimbra

May 2014

Video-Vigilance and Biometrics

A review on:

Page 2: Visual Object Tracking

2

Definition and application

“Object tracking” from different points of view

Yilmaz et al. 06

Yang et al. 11

David Forsyth & Jean Ponce 12

Outline:

Page 3: Visual Object Tracking

3

Definition and application

Page 4: Visual Object Tracking

4

What is object tracking?

Estimating the trajectory of an object over time by locating its position in every frame.

Page 5: Visual Object Tracking

5

motion-based recognition

automated surveillancevideo indexing

human-computer interaction

traffic monitoringvehicle navigation…

Applications:

Page 6: Visual Object Tracking

6

Object tracking from different points of view

Yilmaz et al. 06

Yang et al. 11

David Forsyth & Jean Ponce 12

Page 7: Visual Object Tracking

7

How to categorize? There are different points of view

Object tracking

Yilmaz et al. 06

Yang et al. 11

David Forsyth & Jean Ponce 12

Page 8: Visual Object Tracking

8

Object Tracking: A Survey

Object detection

Object tracking

Point detectorsBackground subtractionSegmentationSupervised learning

Point trackingKernel tracking

Silhouette tracking

Object representation

Feature selection for tracking

Yilmaz et al. 06

Page 9: Visual Object Tracking

9

Object representation:

Points, primitive geometric shapes, object silhouette and contour, articulated shape models and skeletal models.

Object representations are chosen according to the application domain.

Page 10: Visual Object Tracking

10

Feature selection for tracking:

Desirable property of a visual feature is its uniqueness so that the objects can be easily distinguished in the feature space.

many tracking algorithms use a combination of these features.

Color: RGB, L*u*v, L*a*b, HSV

Edges: less sensitive to illumination changes

Optical flow: displacement vectorsBrightness constraint [Horn & Schunk 81]

Texture: measure of the intensity variation of a surface

Page 11: Visual Object Tracking

11

Object detection:

Point detectors: are used to find interest points in images which have an expressive texture in their respective localities (ex. Harris & SIFT).A desirable interest point: invariance to illumination and camera viewpoint

Background subtraction: Object detection can be achieved by building a representation of the scene called the background model and then finding deviations from the model for each incoming frame.

Segmentation: The aim of image segmentation algorithms is to partition the image into perceptually similar regions and can be used for object detection(ex. Mean shift clustering & graph cut).

Supervised learning: Object detection can be performed by learning different object views automatically from a set of examples by means of a supervised learning mechanism (ex. SVM).

Every tracking method requires an object detection mechanism either in every frame or when the object first appears in the video.

Page 12: Visual Object Tracking

12

Taxonomy of tracking methods:

Page 13: Visual Object Tracking

13

Point tracking: Objects detected in consecutive frames are represented by points.

Deterministic Methods: Proximity, maximum velocity (r denotes radius), small velocity-change, common motion, rigidity constraints

Statistical Methods: Kalman filter, particle filter

Page 14: Visual Object Tracking

14

Kernel tracking: Kernel refers to the object shape and appearance. For example, the kernel can be a rectangular template or an elliptical shape with an associated histogram.

Template and density based appearance models: Ex. Template matching & mean shift tracking. Advantage of the mean shift is the elimination of a brute force search.

Multiview appearance models: The objects may appear different from different views. Different views of the object can be learned offline and used for tracking.

Page 15: Visual Object Tracking

15

Silhouette tracking: Silhouette-based object tracker try to find the object region in each frame by means of an object model generated using the previous frames.

Shape matching: Object model is in the form of an edge map. Shape matching performed similar to tracking based on template matching.

Contour tracking: Iteratively evolve an initial contour in the previous frame to its new position in the current frame. This contour evolution requires that some part of the object in the current frame overlap with the object region in the previous frame.

Page 16: Visual Object Tracking

16

Summary:

Point tracking: Good for finding Geometrical and 3D structure of object.Point correspondence is a complicated problem-specially in the presence of occlusions, misdetections, entries, and exits of objects.

Kernel tracking: Real time applicability.One of the limitations of primitive geometric shapes for object representation is thatparts of the objects may be left outside of the defined shape while parts of the background may reside inside it.

Silhouette tracking: Good for modeling object with complex shape.Sensitive to noise. Not capable to deal with object split and merge.

Page 17: Visual Object Tracking

17

Recent advances and trends in visual tracking: A review

Feature descriptors for visual tracking

Online learning based tracking methods

Yang et al. 11

Generative methods

Discriminative methods

Page 18: Visual Object Tracking

18

Feature descriptors for visual tracking:

Gradient features: statistical summarization of the gradients

Color features:

Texture features:

Spatio-temporal features:

Multiple features fusion: HOG-LBP, …

HOGmulti-resolution HOGSIFTSURF…

CSIFT (concatenation of the hue histogram with the SIFT descriptor. Invariant to light intensity)

LBP (Local Binary Patterns) a grayscale invariant textureMB-LBP (multi-scale block LBP)…

HOG/HOF (Histograms of oriented gradients & optic flow)HOG3D (histograms of 3D gradient orientations)ESURF (extends the image SURF descriptor to video)DLBP (dynamical Local Binary Patterns)…

Page 19: Visual Object Tracking

19

Online learning based tracking methods:

The generative method builds a model to describe the appearance of an object and then finds the object by searching for the region most similar to the reference model in each frame.The object model is often updated online to adapt to appearance changes.generative methods would easily fail within cluttered background.

Generative online learning methods:

Discriminative methods pose object tracking as a binary classification problem in which the task is to determine a decision boundary that distinguishes the object from the background without the need to a complex model characterizing the object.To handle appearance changes, the classifier is updated incrementally over time.A major shortcoming of discriminative methods is their noise sensitivity.

Discriminative online learning methods:

There are methods which combine these two methods.

Page 20: Visual Object Tracking

20

Object tracking

Computer Vision A Modern Approach 2nd Edition,Ch 11, TrackingForsyth et al. 12

Tracking by detection

Tracking with dynamics

Applying Data association

Page 21: Visual Object Tracking

21

Time t Time t+1 Time t+n

Tracking by detection:we have a strong model of the object, we detect the object independently in each frame and can record its position over time.

Occlusions

TimeSimilar Objects

Problems:

Page 22: Visual Object Tracking

22

Tracking with dynamics:

Key idea: Given a model of expected motion, predict where objects will occur in next frame.Filtering Problem: Estimate of c based on prediction a and measurement b

Observation (Detected object) + Dynamics

The information from the predictions and measurements are combined to provide the best possible estimate of the location of the train.

the product of two Gaussian functions is another Gaussian function.

R. Faragher , “Understanding the Basis of the Kalman Filter Via a Simple and Intuitive Derivation,” IEEE Signal Processing Magazine, September 2012.

The Kalman filter:

a bc

Page 23: Visual Object Tracking

23

Problems: Yet it is not good enough

So far, we’ve assumed the entire measurement to be relevant to determining the state. In reality, there may be uninformative measurements may belong to different tracked objects.

Data association: Task of determining which measurements go with which tracks.

Tracking Matching: Match should be close to predicted position

Gating: Omit MeasurementsOutside the gate

Page 24: Visual Object Tracking

24

Tracking: Detection(observation)+ dynamics +Data association

Applying Data association (Gating) Predicted Values By Kalman Filter (Green)

NCCNCC+KalmanNCC+Kalman+DA

Page 25: Visual Object Tracking

25

Tracking by detectiondetect the object independently in each frame tracking=Detection Detection Methods:

Tracking with dynamicsincorporate object dynamics to tracking Methods:tracking=Detection(observation)+dynamics

Applying Data associationEliminate highly unlikely measurementstracking=Detection(observation)+ dynamics +Data association

Summary:

Point detectorsTemplate matchingdensity Based appearance modelsBackground Subtraction…

Filtering MethodsKalman filter…

Methods:Tracking MatchingGating…

Page 26: Visual Object Tracking

26

Reference:

A. Yilmaz, O. Javed, and M. Shah, “Object tracking: A survey,” ACM Computing Surveys, Vol. 38, No. 4, pp. 1–45, December 2006.

H. Yang, L. Shao, F. Zheng, L. Wang, and Z. Song, “Recent advances and trends in visual tracking: A review,” Neurocomputing, Vol. 74, No. 18, pp. 3823-3831.

D. A. Forsyth, J. Ponce, “Computer Vision: A Modern Approach,” Prentice Hall,2nd Edition, 2012.

R. Faragher , “Understanding the Basis of the Kalman Filter Via a Simple and Intuitive Derivation,” IEEE Signal Processing Magazine, September 2012.

Page 27: Visual Object Tracking

27

Thank you for your attention


Recommended