DataScience Lab 2017_Обзор методов детекции лиц на изображение

Survey of Face Detection Approaches

Yurii Pashchenko

DataScience Lab, Odessa, 2017

Classification vs. Detection

http://tutorial.caffe.berkeleyvision.org/caffe-cvpr15-detection.pdf 2

http://tutorial.caffe.berkeleyvision.org/caffe-cvpr15-detection.pdf

http://tutorial.caffe.berkeleyvision.org/caffe-cvpr15-detection.pdf

Evaluation

3

Evaluation metric. Receiver Operating Characteristic (ROC)

4

Benchmarks● FDDB● AFW● PascalFace● IJB-A● MALF● WIDER Face

5

FDDB: A Benchmark for Face Detection in Unconstrained Settings

● 2 845 images with a total of 5 171 faces; ● a wide range of difficulties:

○ occlusions ○ different poses○ low resolution○ out-of-focus faces

● the specification of face regions as elliptical regions

● both grayscale and color images.

http://vis-www.cs.umass.edu/fddb/ 6

http://vis-www.cs.umass.edu/fddb/

http://vis-www.cs.umass.edu/fddb/

FDDB. Annotation

http://vis-www.cs.umass.edu/fddb/fddb.pdf 7

http://vis-www.cs.umass.edu/fddb/fddb.pdf

http://vis-www.cs.umass.edu/fddb/fddb.pdf

FDDB.Evaluation

8

IARPA Janus Benchmark A (IJB-A)

• 5 712 images and 2085 videos, with an average of 11.4 images and 4.2 videos per subject

• full pose variation• joint use for face recognition and

face detection benchmarking • a mix of images and videos• wider geographic variation of

subjects• landmark locations

Brendan F Klare, Emma Taborsky, Austin Blanton, Jordan Cheney, Kristen Allen, Patrick Grother, Alan Mah, Mark Burge, and Anil K Jain. 2015. Pushing the frontiers of unconstrained face detection and recognition: IARPA Janus Benchmark A. In

Proc. IEEE Conf. Comput. Vis. Pattern Recognit. 1931–1939 9

http://www.cv-foundation.org/openaccess/content_cvpr_2015/papers/Klare_Pushing_the_Frontiers_2015_CVPR_paper.pdf




IJB-A. Evaluation

* False Accept and Detection Rate are computed per image

10

WIDER FACE: A Face Detection Benchmark

• It consists of 32 203 images with 393 703 labeled faces, which is 10 times larger than the current largest face detection dataset

• The faces vary largely in appearance, pose, and scale

• Annotated multiple attributes: occlusion, pose, and event categories, which allows in depth analysis of existing algorithms.

http://mmlab.ie.cuhk.edu.hk/projects/WIDERFace/ 11

http://mmlab.ie.cuhk.edu.hk/projects/WIDERFace/



WIDER FACE. Annotations

https://arxiv.org/pdf/1511.06523.pdf 12

https://arxiv.org/pdf/1511.06523.pdf


WIDER FACE. Evaluation results

13

Comparison of Face Detection Datasets




Viola-Jones Object Detector

• Very popular for Human Face Detection• May be trained for Cat and Dog Face detection• Available free in OpenCV library (http://opencv.org)

O. Parkhi, A. Vedaldi, C. V. Jawahar, and A. Zisserman. The Truth about Cats and Dogs // Proceedings of the International Conference on Computer Vision (ICCV), 2011. J.

Liu, A. Kanazawa, D. Jacobs, P. Belhumeur. Dog Breed Classification Using Part Localization // Lecture Notes in Computer Science Volume 7572, 2012, pp 172-185.

http://opencv.org/

Main Principles

● Scanning window● Features● Integral image● Boosted feature selection● Cascaded classifier

P.A. Viola, M.J. Jones, Rapid object detection using a boosted cascade of simple features, in: CVPR, issue 1, 2001, pp. 511–518.

16

Scaning window

17

Integral Image

18

Features

⚫Available features:⚫ HAAR⚫ LBP⚫ HOG

⚫Too many features!⚫ location, scale, type⚫ 180,000+ possible features

associated with each 24 x 24 window

⚫Not all of them are useful!

19

Feature selection

⚫ Idea: Combining several weak classifiers to generate a strong classifier

α1 α

2

α3 α

T

……

α1h

1+ α

2h

2 + α

3h

3 + … + α

Th

T ><

Tthreshol

d

weak classifier (feature, threshold)h

1 = 1 or 0

20

Cascaded Classifier

● A 1 feature classifier achieves 100% detection rate and about 50% false positive rate.

● A 5 feature classifier achieves 100% detection rate and 40% false positive rate (20% cumulative) – using data from previous stage.

● A 20 feature classifier achieve 100% detection rate with 10% false positive rate (2% cumulative)

21

Viola Jones Pipeline

https://habrahabr.ru/post/133826/22

https://habrahabr.ru/post/133826/

https://habrahabr.ru/post/133826/

Viola Jones. Evaluation Results on FDDB

23

A Convolutional Neural Network Cascade for Face Detection

● 12-net● 12-calibration-net● 24-net● 24-calibration-net● 48-net● 48-calibration-net

http://www.cv-foundation.org/openaccess/content_cvpr_2015/papers/Li_A_Convolutional_Neural_2015_CVPR_paper.pdf 24

http://www.cv-foundation.org/openaccess/content_cvpr_2015/papers/Li_A_Convolutional_Neural_2015_CVPR_paper.pdf


Cascade CNN. Calibration Net

The calibration pattern adjusts the window to be

N = 45 patterns, formed by all combinations of




Cascade CNN. Evaluation Results on FDDB

~14 fps on CPU ~100 fps on GPU




Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks (MTCCN)

• Improved previous approach• Joint face detection and alignment• Online Hard sample mining• Multi-source training

https://arxiv.org/pdf/1604.02878.pdf27


MTCNN. Evaluation on FDDB and WIDER



Faster R-CNN

29

Region proposal network

30

Bootstrapping Face Detection with Hard Negative Examples

• ResNet-50• Foreground ROI thr >=0.5• Background ROI in the interval [0.1, 0.5) • Balancing bg-fg RoIs: 3:1• Hard Negative mining




Face Detection using Deep Learning: An Improved Faster RCNN Approach (DeepIR)

• VGG16 architecture• Hard negative mining• Feature concatenation• Multi-scale training




DeepIR. Evaluation on FDDB

DeepIR




Finding Tiny Faces (HR-ER)




HR-ER. Approach

What about context?




HR-ER. Evaluation on WIDER and FDDB




THANK YOU FOR YOUR ATTENTION!

e-mail: [email protected]: george.pashchenko

37

mailto:[email protected]

Date post:	21-Jan-2018
Category:	Technology
Upload:	geekslab-odessa
View:	173 times
Download:	4 times

DataScience Lab 2017_Обзор методов детекции лиц на изображение

Technology