+ All Categories
Home > Documents > CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete...

CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete...

Date post: 12-Mar-2020
Category:
Upload: others
View: 7 times
Download: 0 times
Share this document with a friend
59
CS 1674: Intro to Computer Vision Deformable Part Models Prof. Adriana Kovashka University of Pittsburgh November 9, 2016
Transcript
Page 1: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

CS 1674: Intro to Computer Vision

Deformable Part Models

Prof. Adriana KovashkaUniversity of Pittsburgh

November 9, 2016

Page 2: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Today: Object category detection

• Window-based approaches:– Last time: Viola-Jones detector

– Dalal-Triggs pedestrian detector

• Part-based approaches:– Implicit shape model

– Deformable parts model (DPM)

• Improvements:– Speeding up DPM

– Analyzing the failures of DPM

Page 3: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Dalal-Triggs pedestrian detector

1. Extract fixed-sized (64x128 pixel) window at each position and scale

2. Compute HOG (histogram of gradient) features within each window

3. Score the window with a linear SVM classifier

4. Perform non-maxima suppression to remove overlapping detections with lower scores

Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection, CVPR05

Page 4: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

• Histogram of gradient orientations

– Votes weighted by magnitude

Orientation: 9 bins

(for unsigned angles)Histograms in

8x8 pixel cells

Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection, CVPR05

Page 5: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Histograms of oriented gradients (HOG)

N. Dalal and B. Triggs, Histograms of Oriented Gradients for Human Detection,

CVPR 2005

10x10 cells

20x20 cells

Image credit: N. Snavely

Page 6: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Histograms of oriented gradients (HOG)

N. Dalal and B. Triggs, Histograms of Oriented Gradients for Human Detection,

CVPR 2005Image credit: N. Snavely

Page 7: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Histograms of oriented gradients (HOG)

N. Dalal and B. Triggs, Histograms of Oriented Gradients for Human Detection,

CVPR 2005

Page 8: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Histograms of oriented gradients (HOG)

N. Dalal and B. Triggs, Histograms of Oriented Gradients for Human Detection,

CVPR 2005

Page 9: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Slide by Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection, CVPR05

pos w neg w

Page 10: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

pedestrian

Slide by Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection, CVPR05

Page 11: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Resolving detection scores

Non-max suppression

Score = 0.1

Score = 0.8 Score = 0.8

Adapted from Derek Hoiem

Page 12: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Detection examples

Page 13: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Are we done?

• Single rigid template usually not enough to represent a category

– Many objects (e.g. humans) are articulated, or have parts that can vary in configuration

– Many object categories look very different from different viewpoints, or from instance to instance

Slide by N. Snavely

Page 14: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Deformable objects

Images from Caltech-256

Slide Credit: Duan Tran

Page 15: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Deformable objects

Images from D. Ramanan’s datasetSlide Credit: Duan Tran

Page 16: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Parts-based Models

Define object by collection of parts modeled by

1. Appearance

2. Spatial configuration

Slide credit: Rob Fergus

Page 17: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

How to model spatial relations?

• One extreme: fixed template

Derek Hoiem

Page 18: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Part-based template

• Object model = sum of scores of features at fixed positions

+3 +2 -2 -1 -2.5 = -0.5

+4 +1 +0.5 +3 +0.5 = 10.5

> 7.5?

> 7.5?

Non-object

Object

Derek Hoiem

Page 19: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

How to model spatial relations?

• Another extreme: bag of words

=

Derek Hoiem

Page 20: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

How to model spatial relations?

• Star-shaped model

=X X

X

Derek Hoiem

Page 21: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

How to model spatial relations?

• Star-shaped model

Root

Part

Part

Part

Part

Part

Derek Hoiem

Page 22: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Parts-based Models

• Articulated parts model

– Object is configuration of parts

– Each part is detectable and can move around

Adapted from Derek Hoiem, images from Felzenszwalb

Page 23: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Implicit shape models

• Visual vocabulary is used to index votes for

object position [a visual word = “part”]

B. Leibe, A. Leonardis, and B. Schiele, Combined Object Categorization and

Segmentation with an Implicit Shape Model, ECCV Workshop on Statistical

Learning in Computer Vision 2004

visual codeword with

displacement vectors

training image annotated with object localization info

Lana Lazebnik

Page 24: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Implicit shape models: Training

1. Build vocabulary of patches around

extracted interest points using clustering

Lana Lazebnik

Page 25: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Implicit shape models: Training

1. Build vocabulary of patches around

extracted interest points using clustering

2. Map the patch around each interest point to

closest word

Lana Lazebnik

Page 26: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Implicit shape models: Training

1. Build vocabulary of patches around

extracted interest points using clustering

2. Map the patch around each interest point to

closest word

3. For each word, store all positions it was

found, relative to object center

Lana Lazebnik

Page 27: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Recall: Generalized Hough transform

• Template representation:

for each type of landmark

point, store all possible

displacement vectors

towards the center

Model

Template

Svetlana Lazebnik

Page 28: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Implicit shape models: Testing

1. Given new test image, extract patches, match to

vocabulary words

2. Cast votes for possible positions of object center

3. Search for maxima in voting space

Lana Lazebnik

Page 29: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Perc

eptu

al

and S

enso

ry A

ugm

ente

d C

om

puti

ng

Vis

ual

Ob

ject

Reco

gn

itio

n T

uto

rial

K. Grauman, B. Leibe

Detection Results

• Qualitative Performance

Recognizes different kinds of objects

Robust to clutter, occlusion, noise, low contrast

Page 30: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Discriminative part-based models

P. Felzenszwalb, R. Girshick, D. McAllester, D. Ramanan, Object Detection

with Discriminatively Trained Part Based Models, PAMI 32(9), 2010

Root

filterPart

filtersDeformation

weights

Lana Lazebnik

Page 31: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Discriminative part-based models

P. Felzenszwalb, R. Girshick, D. McAllester, D. Ramanan, Object Detection

with Discriminatively Trained Part Based Models, PAMI 32(9), 2010

Multiple components

Lana Lazebnik

Page 32: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Discriminative part-based models

P. Felzenszwalb, R. Girshick, D. McAllester, D. Ramanan, Object Detection

with Discriminatively Trained Part Based Models, PAMI 32(9), 2010Lana Lazebnik

Page 33: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Scoring an object hypothesis

• The score of a hypothesis is

the sum of appearance scores

minus the sum of deformation costs

),,,()(),...,( 22

0 1

0 ii

n

i

n

i

iiiiin dydxdydxDpxWppscore

Appearance weights

Part

features

Deformation weights

Displacements

Adapted from Lana Lazebnik

i.e. how much the part pi moved from its

expected location in the x, y directions

i.e. how much we’ll penalize the part pi

for moving from its expected location

Page 34: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Detection

Lana Lazebnik

Page 35: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Training

• Training data consists of images with labeled

bounding boxes

• Need to learn the weights and deformation parameters

Lana Lazebnik

Page 36: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Training

• Our classifier has the form

• w are model parameters, z are latent hypotheses

• Latent SVM training:

• Initialize w and iterate:

• Fix w and find the best z for each training example

• Fix z and solve for w (standard SVM training)

),(max)( zxHwx z f

Lana Lazebnik

Page 37: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Car model

Component 1

Component 2

Lana Lazebnik

Page 38: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Car detections

Lana Lazebnik

Page 39: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Person model

Lana Lazebnik

Page 40: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Person detections

Lana Lazebnik

Page 41: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Cat model

Lana Lazebnik

Page 42: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Cat detections

Lana Lazebnik

Page 43: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Today: Object category detection

• Window-based approaches:– Last time: Viola-Jones detector

– Dalal-Triggs pedestrian detector

• Part-based approaches:– Implicit shape model

– Deformable parts model (DPM)

• Improvements:– Speeding up DPM

– Analyzing the failures of DPM

Page 44: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Speeding up detection: Restrict set of windows we pass through SVM to those w/ high “objectness”

Alexe et al., CVPR 2010

Page 45: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Objectness cue #1: Where people look

Alexe et al., CVPR 2010

Page 46: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Objectness cue #2: color contrast at boundary

Alexe et al., CVPR 2010

Page 47: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Objectness cue #3: no segments “straddling” the object box

Alexe et al., CVPR 2010

Page 48: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Boxes found to have high “objectness”

Alexe et al., CVPR 2010

Cyan = ground truth bounding boxes, yellow = correct and red = incorrect predictions for “objectness”

Only run the sheep / horse / chair etc. classifier on the yellow/red boxes.

Page 49: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Today: Object category detection

• Window-based approaches:– Last time: Viola-Jones detector

– Dalal-Triggs pedestrian detector

• Part-based approaches:– Implicit shape model

– Deformable parts model (DPM)

• Improvements:– Speeding up DPM

– Analyzing the failures of DPM

Page 50: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

How do detectors fail?

• Most errors that detectors make are reasonable

– Localization error and confusion with similar objects

– Misdetection of occluded or small objects

• Detectors have different sensitivity to different factors

– E.g. less sensitive to truncation than to size differences

• Failure analysis code and annotations available online– http://web.engr.illinois.edu/~dhoiem/projects/detectionAnalysis/

Adapted from Hoiem et al., ECCV 2012

Page 51: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Top false positives: Airplane (DPM)

3

27 37

1

4

5

30

33

26

7

Other Objects

11%

Background

27%

Similar Objects

33%

Bird, Boat, Car

Localization

29%

AP = 0.36

Hoiem et al., ECCV 2012

Page 52: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Top false positives: Dog (DPM)

Similar Objects

50%

Person, Cat, Horse

1 6 1642 5

8 22

Background

23%

93

10

Localization

17%

Other Objects

10%

AP = 0.03

Hoiem et al., ECCV 2012

Page 53: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Analysis of object characteristics

Additional annotations for seven categories: occlusion level, parts visible, sides visible

Hoiem et al., ECCV 2012

Page 54: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Object characteristics: AeroplaneOcclusion: poor robustness to occlusion, but little impact on overall performance

Easier (None) Harder (Heavy)Hoiem et al., ECCV 2012

Page 55: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Size: strong preference for average to above average sized airplanes

Object characteristics: Aeroplane

Easier Harder

X-SmallSmallX-LargeMediumLarge

Hoiem et al., ECCV 2012

Page 56: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Aspect Ratio: 2-3x better at detecting wide (side) views than tall views

Object characteristics: Aeroplane

TallX-TallMediumWideX-Wide

Easier (Wide) Harder (Tall)Hoiem et al., ECCV 2012

Page 57: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Sides/Parts: best performance = direct side view with all parts visible

Object characteristics: Aeroplane

Easier (Side) Harder (Non-Side)Hoiem et al., ECCV 2012

Page 58: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Detection in 2014

Object detection system overview. Our system (1) takes an input image, (2) extracts

around 2000 bottom-up region proposals, (3) computes features for each proposal

using a large convolutional neural network (CNN), and then (4) classifies each region

using class-specific linear SVMs. R-CNN achieves a mean average precision (mAP)

of 53.7% on PASCAL VOC 2010. For comparison, Uijlings et al. (2013) report 35.1%

mAP using the same region proposals, but with a spatial pyramid and bag-of-visual-

words approach. The popular deformable part models perform at 33.4%.

R. Girshick, J. Donahue, T. Darrell, and J. Malik, Rich Feature Hierarchies for Accurate Object

Detection and Semantic Segmentation, CVPR 2014. Lana Lazebnik

Page 59: CS 1674: Intro to Computer Visionkovashka/cs1674_fa16/vision_19_dpms.pdf · Adapted from Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

Summary

• Window-based approaches – Assume object appears in roughly the same

configuration in different images

– Look for alignment with a global template

• Part-based methods– Allow parts to move somewhat from their usual

locations

– Look for good fits in appearance, for both the global template and the individual part templates

– Speed up by only scoring boxes that look like any object

– Models prefer that objects appear in certain views


Recommended