Object detection & classification for...

Object detection & classification for ADAS

✓Robust for Bad situations

✓Small object sizes

✓Robust for occlusion

✓Small model size

SVNet @ NVIDIA TX2

25/19/2017

Please click

Icon for Video

https://drive.google.com/open?id=0Bw8jwiAZjRXFT2VQZmc4UkNySXc

Robust detection for various situations

35/19/2017

Please click

Icon for Video

Night w/ Lamp

Please click

Icon for Video

Rain

Snow

Please click

Icon for Video

Fog

Please click

Icon for Video

https://drive.google.com/open?id=0Bw8jwiAZjRXFVjFGUVhxY3VTOU0

http://pan.baidu.com/s/1boFcblP

https://drive.google.com/open?id=0Bw8jwiAZjRXFNnl5aGtnR3E5d2M

http://pan.baidu.com/s/1mhUYG8w

https://drive.google.com/open?id=0Bw8jwiAZjRXFLW5KVVJIY2l4VVk

http://pan.baidu.com/s/1o7ZEq6Y

https://drive.google.com/open?id=0Bw8jwiAZjRXFNFJaX1A2aEtRWkk

http://pan.baidu.com/s/1cl8hmI

SVNet Algorithm Flow

Conv

Layer

Proposal

Layer

FC

Layer

FC layer :Fully Connected networks

Conv layer

: deep convolutional neural networksProposal layer

: multi-scale region proposal

ROI

pooling

Image

Feature map

Feature vectors

Candidate Regions

Detection

Results

(Bonding Box

, label)

✓Robust for Bad situations

✓Small object sizes

✓Robust for occlusion

✓Small model size

✓ optimal parameters of network

(size of kernels, # of layers, depth

of channels) for the target

platform

✓ optimal parameters of network

(# of layers, weight connections)

for the target platform

Labeling System

Input image Automatic Labeling

Detection

Success

Detection Failure

False Detection

Manual Correction Ground Truth

Pedestrian: 94%, Vehicle: 95%

Pedestrian: 6%, Vehicle: 5%

~1 in 5 min video

Manual Correction on 5% of the objects in input images

How we use GPU (Titan X and GTX1080) for training

Modelsdesigned by human experts

Target H/Wwhere we measure speed to

select candidates before training

GPUstrain candidate models &

evaluate their accuracy

~3 hours ~2 days ~2 weeks

GPU utilization last month

~2 months

Road Test

Pass <10%

Pass ~30%

Start from

>50 prototypes

CuDNN framework

Lower memory bandwidth

Faster kernel execution

NVIDIA TX2 (*)(*) Image from https://devblogs.nvidia.com/parallelforall/jetson-tx2-delivers-twice-intelligence-edge/

SVNet

https://devblogs.nvidia.com/parallelforall/jetson-tx2-delivers-twice-intelligence-edge/

Customized Development Examples

85/19/2017

Input Scene

PD/VD on Input Scene other than the Curved Mirror

PD/VD on Curved Mirror Image

Collision Warning at Blind Corner

Example: Collision Warning at Blind Corner using PD/VD on Curved Mirror

Publications

Local Decorrelation for Improved Pedestrian Detection

• Woonhyun Nam, Piotr Dollár, and Joon Hee Han.

Advances in Neural Information Processing Systems (NIPS), 27: 424-432, 2014.

Macrofeature Layout Selection for Pedestrian Localization and Its Acceleration Using GPU

• Woonhyun Nam, Bohyung Han, and Joon Hee Han

Computer Vision and Image Understanding (CVIU), 120: 46-58, 2014

• Canny Text Detector: Fast and Robust Scene Text Localization Algorithm

• Hojin Cho, Myungchul Sung, Bongjin Jun,

• 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2016), Las Vegas, USA, 2016 (to appear)

• Learning to Select Pre-trained Deep Representations with Bayesian Evidence Framework

• Yong-Deok Kim, Taewoong Jang, Bohyung Han, Seungjin Choi

• 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2016), Las Vegas, USA, 2016 (to appear)

• Scene Text Detection with Robust Character Candidate Extraction Method

• Myung-Chul Sung, Bongjin Jun, Hojin Cho, Daijin Kim,

• 13th International Conference on Document Analysis and Recognition (ICDAR 2015), 2015.

Plus 20+ papers @ major conference/journal from StradVision’s algorithm engineers @ POSTECH

Automotive Product Roadmap

2017 2018

Platform Features Camera 1M 2M 3M 4M 5M 6M 3Q 4Q 1Q 2Q 3Q 4Q

High Seg

NVIDIA PX2 PD/VD, LD, FSD Frontal

NVIDIA TX2 PD/VD, LD, FSD Frontal

NVIDIA TX1 PD/VD Frontal

Mid Seg

PD/VD Frontal

PD/VD Frontal

VD Side

PD Rear

PD/VD Rear

PD/VD AVM

Low Seg

ARM PCW, FCW, PD, VD, LD Frontal

ARM PD Frontal

ARM PD/VD Frontal

ARM POD Internal

ServerPC PD/VD, Attributes Frontal

PC PD/VD, Attributes Frontal

Left edge = First Prototype; Right edge = Second Prototype

Thanks for listening!

Any Questions / Comments, please contact [email protected]

mailto:[email protected]

Date post:	19-Mar-2020
Category:	Documents
Upload:	others
View:	9 times
Download:	0 times

Object detection & classification for...

Documents