+ All Categories
Home > Documents > Pedestrian Detection in Crowded Scenes

Pedestrian Detection in Crowded Scenes

Date post: 22-Jan-2016
Category:
Upload: mandek
View: 36 times
Download: 0 times
Share this document with a friend
Description:
Pedestrian Detection in Crowded Scenes. Dhruv Batra ECE CMU. Pedestrian Detection in Crowded Scenes. - PowerPoint PPT Presentation
Popular Tags:
34
Pedestrian Detection in Crowded Scenes Dhruv Batra ECE CMU
Transcript
Page 1: Pedestrian Detection in Crowded Scenes

Pedestrian Detection in Crowded Scenes

Dhruv BatraECE CMU

Page 2: Pedestrian Detection in Crowded Scenes

Pedestrian Detection in Crowded Scenes

1. Pedestrian Detection in Crowded Scenes. Bastian Leibe, Edgar Seemann, and Bernt Schiele. In IEEE International Conference on Computer Vision and Pattern Recognition (CVPR'05), San Diego, CA, June 2005.

2. An Evaluation of Local Shape-Based Features for Pedestrian Detection. Edgar Seemann, Bastian Leibe, Krystian Mikolajczyk, and Bernt Schiele. In British Machine Vision Conference (BMVC'05) Oxford, UK, September 2005.

3. Combined Object Categorization and Segmentation with an Implicit Shape Model. Bastian Leibe, Ales Leonardis, and Bernt Schiele. In ECCV'04 Workshop on Statistical Learning in Computer Vision, Prague, May 2004.

Page 3: Pedestrian Detection in Crowded Scenes

Theme of the Paper

Probabilistic top-down/bottom-up formulation of segmentation/recognition

Basic Premise: “[Such a] problem is too difficult for any type of feature or model alone”

Page 4: Pedestrian Detection in Crowded Scenes

Theme of the Paper

Open Question: How would you do pedestrian detection/segmentation?

Solution: integrate as many cues as possible from many sources

Original imageSupport of Segmentation from local featuresSegmentation from local featuresSupport of segmentation from global features (Chamfer Matching)Segmentation from global features (Chamfer Matching)

Page 5: Pedestrian Detection in Crowded Scenes

Theme of the Paper

Goal: Localize AND count pedestrians in a given image

Datasets

Training Set: 35 people walking parallel to the image planeTesting Set (Much harder!): 209 images of 595 annotated pedestrians

Page 6: Pedestrian Detection in Crowded Scenes

Theme of the Paper

Page 7: Pedestrian Detection in Crowded Scenes

Evaluation Criteria

Criteria 1: Relative Distance

Fixed aspect ratio- 11:15

Threshold d < 0.5

Page 8: Pedestrian Detection in Crowded Scenes

Evaluation Criteria

Criteria 2 & 3: Cover and Overlap

Threshold cover >50% overlap >50%

Page 9: Pedestrian Detection in Crowded Scenes

Initial Recognition Approach

First Step: Generate hypotheses from local features (Intrinsic Shape Models)

Training: Code book Approach (with spatial information)

Page 10: Pedestrian Detection in Crowded Scenes

Initial Recognition Approach

First Step: Generate hypotheses from local features (Intrinsic Shape Models)

Training:

Lowe’s DoG Detector 3x 3 patches

Resize to 25 x 25

Page 11: Pedestrian Detection in Crowded Scenes

Initial Recognition Approach

First Step: Generate hypotheses from local features (Intrinsic Shape Models)

Training: Agglomerative Clustering

Page 12: Pedestrian Detection in Crowded Scenes

Initial Recognition Approach

First Step: Generate hypotheses from local features (Intrinsic Shape Models)

Training: Agglomerative Clustering

Page 13: Pedestrian Detection in Crowded Scenes

Initial Recognition Approach

First Step: Generate hypotheses from local features (Intrinsic Shape Models)

Training: Agglomerative Clustering

Page 14: Pedestrian Detection in Crowded Scenes

Initial Recognition Approach

First Step: Generate hypotheses from local features (Intrinsic Shape Models)

Training: Agglomerative Clustering

Codebook entries store figure-ground masks for these entries

Page 15: Pedestrian Detection in Crowded Scenes

Initial Recognition Approach

First Step: Generate hypotheses from local features (Intrinsic Shape Models)

Training: But wait! We just lost spatial information … Run again

Lowe’s DoG Detector

Resize to 25 x 25

3x 3 patches

Find codebook patches

Learn Spatial Distribution

Page 16: Pedestrian Detection in Crowded Scenes

Initial Recognition Approach

First Step: Generate hypotheses from local features (Intrinsic Shape Models)

Testing: Initial Hypothesis: Overall

Page 17: Pedestrian Detection in Crowded Scenes

Initial Recognition Approach

First Step: Generate hypotheses from local features (Intrinsic Shape Models)

Testing: Initial Hypothesis (Probabilistic Hough Voting Procedure)

measuring similarity between patch and codebook entrylearnt from spatial distributions of codebook entries

Search for maximum in probability spaceUsing a fixed size search window

Page 18: Pedestrian Detection in Crowded Scenes

Initial Recognition Approach

First Step: Generate hypotheses from local features (Intrinsic Shape Models)

Testing: Initial Hypothesis: found as maxima in 3D voting space

maxima computed using Mean Shift Mode Estimationover this balloon density estimator

Uniform Cubicle Kernel

Page 19: Pedestrian Detection in Crowded Scenes

Initial Recognition Approach

First Step: Generate hypotheses from local features (Intrinsic Shape Models)

Testing: Initial Hypothesis: Overall

Page 20: Pedestrian Detection in Crowded Scenes

Initial Recognition Approach

First Step: Generate hypotheses from local features (Intrinsic Shape Models)

Testing: Initial Hypothesis: Overall

Page 21: Pedestrian Detection in Crowded Scenes

Initial Recognition Approach

First Step: Generate hypotheses from local features (Intrinsic Shape Models)

Testing: Probabilistic top down segmentation

Intermediate Goal: Find this

start here

Assumption: Uniform Priors

Estimate from training data

From similarity measure

Page 22: Pedestrian Detection in Crowded Scenes

Initial Recognition Approach

First Step: Generate hypotheses from local features (Intrinsic Shape Models)

Testing: Probabilistic top down segmentation

Marginalized over all patches in image

Substitute this here

to get this

Page 23: Pedestrian Detection in Crowded Scenes

Initial Recognition Approach

First Step: Generate hypotheses from local features (Intrinsic Shape Models)

Testing: Probabilistic top down segmentation

Page 24: Pedestrian Detection in Crowded Scenes

Initial Recognition Approach

First Step: Generate hypotheses from local features (Intrinsic Shape Models)

Testing: Probabilistic top down segmentation

Page 25: Pedestrian Detection in Crowded Scenes

Initial Recognition Approach

Second Step: Handling overlapping detections

Page 26: Pedestrian Detection in Crowded Scenes

Initial Recognition Approach

Second Step: Segmentation based Verification (Minimum Description Length)

Saving that can be achieved by explaining part of image by a particular hypothesis

Number of pixels N explained by hModel complexityCost of describing the error made by hypothesis h

Sum over all pixels hypothesized as figureProbability of being a background

Page 27: Pedestrian Detection in Crowded Scenes

Initial Recognition Approach

Second Step: Segmentation based Verification (Minimum Description Length)

With this framework we can resolve conflicts between overlapping hypothesis

Relative importance assigned to support of hypothesisBias term

Page 28: Pedestrian Detection in Crowded Scenes

Initial Recognition Approach

Second Step: Segmentation based Verification (Minimum Description Length)

Voila! It works

Page 29: Pedestrian Detection in Crowded Scenes

Initial Recognition Approach

Second Step: Segmentation based Verification (Minimum Description Length)

Caveat: it leads to another set of problems

ISM doesn’t know a person doesn’t have three legs!

Global Cues are needed

Or four legs and three arms

Page 30: Pedestrian Detection in Crowded Scenes

Assimilation of Global Cues

Distance Transform, Chamfer Matching

get Feature Image by an edge detectorget DT image by computing distance to nearest feature point Chamfer Distance between template and DT image

Page 31: Pedestrian Detection in Crowded Scenes

Assimilation of Global Cues (Attempt 1)

Distance Transform, Chamfer Matching

Chamfer distancebased matching

Use scale estimateto cut out surrounding region

Apply Cannydetector andcompute DT

Yellow is highestChamfer score

Initial hypothesisgenerated by local features

Page 32: Pedestrian Detection in Crowded Scenes

Assimilation of Global Cues (Attempt 2)

Maximize Chamfer Score AND overlap with overlap with hypothesized segmentation instead of pure Chamfer Score

Overlap expressed as Bhattacharya coeff.

Joint score is linear combinationof the two

Page 33: Pedestrian Detection in Crowded Scenes

Assimilation of Global Cues (Attempt 3)

Apply hypothesis saving MDL method again Boolean quadratic formulation

Page 34: Pedestrian Detection in Crowded Scenes

Results


Recommended