+ All Categories
Home > Documents > ACCELERATE DEEP LEARNING - GPU Technology...

ACCELERATE DEEP LEARNING - GPU Technology...

Date post: 28-Apr-2018
Category:
Upload: lytu
View: 215 times
Download: 0 times
Share this document with a friend
26
WITH NVIDIA'S DEEP LEARNING PLATFORM | STEPHEN JONES | GTC’16 ACCELERATE DEEP LEARNING
Transcript
Page 1: ACCELERATE DEEP LEARNING - GPU Technology …on-demand.gputechconf.com/gtc/2016/presentation/s6847...2 DEEP LEARNING EVERYWHERE INTERNET & CLOUD Image Classification Speech Recognition

WITH NVIDIA'S DEEP LEARNING PLATFORM | STEPHEN JONES | GTC’16

ACCELERATE DEEP LEARNING

Page 2: ACCELERATE DEEP LEARNING - GPU Technology …on-demand.gputechconf.com/gtc/2016/presentation/s6847...2 DEEP LEARNING EVERYWHERE INTERNET & CLOUD Image Classification Speech Recognition

2

DEEP LEARNING EVERYWHERE

INTERNET & CLOUD

Image Classification Speech Recognition

Language Translation Language Processing Sentiment Analysis Recommendation

MEDIA & ENTERTAINMENT

Video Captioning

Video Search Real Time Translation

AUTONOMOUS MACHINES

Pedestrian Detection Lane Tracking

Recognize Traffic Sign

SECURITY & DEFENSE

Face Detection Video Surveillance Satellite Imagery

MEDICINE & BIOLOGY

Cancer Cell Detection Diabetic Grading Drug Discovery

Page 3: ACCELERATE DEEP LEARNING - GPU Technology …on-demand.gputechconf.com/gtc/2016/presentation/s6847...2 DEEP LEARNING EVERYWHERE INTERNET & CLOUD Image Classification Speech Recognition

3

DEEP LEARNING REVOLUTIONIZING COMPUTING Solves Problems Previously Unsolvable

Page 4: ACCELERATE DEEP LEARNING - GPU Technology …on-demand.gputechconf.com/gtc/2016/presentation/s6847...2 DEEP LEARNING EVERYWHERE INTERNET & CLOUD Image Classification Speech Recognition

4

A NEW COMPUTING MODEL

Traditional Computer Vision

Domain experts design feature detectors

Quality = patchwork of algorithms

Need CV experts and time

Deep Learning

DNN learn features from large data

Quality = data & training method

Needs lots of data and compute

Page 5: ACCELERATE DEEP LEARNING - GPU Technology …on-demand.gputechconf.com/gtc/2016/presentation/s6847...2 DEEP LEARNING EVERYWHERE INTERNET & CLOUD Image Classification Speech Recognition

5

DEEP LEARNING The Next Innovation Network?

1910 1920 1930 1940 1950 1960 1970 1980 1990 2000 2010 2020

VACUUM TUBES

TEST EQUIPMENT

MICROWAVES

INTEGRATED CIRCUITS

PERSONAL COMPUTERS

INTERNET

SMARTPHONES

DEEP LEARNING MACHINE INTELLIGENCE

(Source: Deep Learning Gold Rush of 2015, Tomasz Malisiewicz, November 07, 2015 (adapted from http://steveblank.com/secret-history/)

Page 6: ACCELERATE DEEP LEARNING - GPU Technology …on-demand.gputechconf.com/gtc/2016/presentation/s6847...2 DEEP LEARNING EVERYWHERE INTERNET & CLOUD Image Classification Speech Recognition

6

0%

10%

20%

30%

40%

50%

60%

70%

80%

90%

100%

2009 2010 2011 2012 2013 2014 2015 2016

THE AI RACE IS ON

IBM Watson Achieves Breakthrough in Natural Language Processing

Facebook Launches Big Sur

Baidu Deep Speech 2 Beats Humans

Google Launches TensorFlow

Microsoft & U. Science & Tech, China Beat Humans on IQ

Toyota Invests $1B in AI Labs

IMAGENET Accuracy Rate

Traditional CV Deep Learning

Page 7: ACCELERATE DEEP LEARNING - GPU Technology …on-demand.gputechconf.com/gtc/2016/presentation/s6847...2 DEEP LEARNING EVERYWHERE INTERNET & CLOUD Image Classification Speech Recognition

7 NVIDIA CONFIDENTIAL. DO NOT DISTRIBUTE.

$500B DEEP LEARNING OPPORTUNITY

Ad Service Technology

Investment

Media

Oil and Gas

Manufacturing

Retail

Other

SOURCE: “Deep Learning for Enterprise Applications, 4Q 2015, Tractica”

“The current cutting edge of deep learning processing platforms seems to be massively parallel GPU systems.”

— Tractica

Deep Learning Software Revenue by Industry World Markets: 2015-2024

Deep Learning Total Revenue by Segment World Markets: 2015-2024

2015 2016 2017 2018 2019 2020 2021 2022 2023 2024

($ M

illions)

$1,000

$2,000

$4,000

$5,000

$-

$120,000

$100,000

$80,000

$60,000

$40,000

$20,000

$-

2015 2016 2017 2018 2019 2020 2021 2022 2023 2024

($ M

illions)

DL-driven GPU Chip Revenue

Page 8: ACCELERATE DEEP LEARNING - GPU Technology …on-demand.gputechconf.com/gtc/2016/presentation/s6847...2 DEEP LEARNING EVERYWHERE INTERNET & CLOUD Image Classification Speech Recognition

8

EDUCATION

START-UPS

CNTK TENSORFLOW

DL4J

THE ENGINE OF MODERN AI

NVIDIA DEEP LEARNING PLATFORM

*U. Washington, CMU, Stanford, TuSimple, NYU, Microsoft, U. Alberta, MIT, NYU Shanghai

VITRUVIAN SCHULTS

LABORATORIES

TORCH

THEANO

CAFFE

MATCONVNET

PURINE MOCHA.JL

MINERVA MXNET*

CHAINER

BIG SUR WATSON

OPENDEEP KERAS

Page 9: ACCELERATE DEEP LEARNING - GPU Technology …on-demand.gputechconf.com/gtc/2016/presentation/s6847...2 DEEP LEARNING EVERYWHERE INTERNET & CLOUD Image Classification Speech Recognition

9 NVIDIA CONFIDENTIAL. DO NOT DISTRIBUTE.

NVIDIA DEEP LEARNING PLATFORM

TITAN X - DEVELOPERS

DEEP LEARNING SDK

DL FRAMEWORK (CAFFE, CNTK,TENSORFLOW, THEANO, TORCH…)

TESLA - DEPLOYMENT AUTOMOTIVE - DRIVEPX

EMBEDDED - JETSON

Page 10: ACCELERATE DEEP LEARNING - GPU Technology …on-demand.gputechconf.com/gtc/2016/presentation/s6847...2 DEEP LEARNING EVERYWHERE INTERNET & CLOUD Image Classification Speech Recognition

10

DEEP LEARNING FRAMEWORKS

VISION SPEECH BEHAVIOR

Object Detection Voice Recognition Language Translation Recommendation

Engines Sentiment Analysis

DEEP LEARNING

cuDNN

MATH LIBRARIES

cuBLAS cuSPARSE

MULTI-GPU

NCCL

cuFFT

Mocha.jl

Image Classification

NVIDIA Deep Learning SDK High Performance GPU-Acceleration for Deep Learning

Page 11: ACCELERATE DEEP LEARNING - GPU Technology …on-demand.gputechconf.com/gtc/2016/presentation/s6847...2 DEEP LEARNING EVERYWHERE INTERNET & CLOUD Image Classification Speech Recognition

11

NVIDIA Deep Learning SDK

Powerful developer tools and libraries for designing and deploying GPU-accelerated deep learning applications

High Performance Deep Learning for NVIDIA GPUs

Industry Vetted Deep Learning Algorithms

Easily integrated into deep learning applications

developer.nvidia.com/deep-learning

Page 12: ACCELERATE DEEP LEARNING - GPU Technology …on-demand.gputechconf.com/gtc/2016/presentation/s6847...2 DEEP LEARNING EVERYWHERE INTERNET & CLOUD Image Classification Speech Recognition

12

NVIDIA cuDNN

Building blocks for accelerating deep neural networks on GPUs

High performance deep neural network training

Accelerates Deep Learning: Caffe, CNTK, Tensorflow, Theano, Torch

Performance continues to improve over time

“NVIDIA has improved the speed of cuDNN

with each release while extending the

interface to more operations and devices

at the same time.”

— Evan Shelhamer, Lead Caffe Developer, UC Berkeley

developer.nvidia.com/cudnn

AlexNet training throughput based on 20 iterations, CPU: 1x E5-2680v3 12 Core 2.5GHz.

0.0x

2.0x

4.0x

6.0x

8.0x

10.0x

12.0x

2014 2015 2016

K40(cuDNN v1)

M40(cuDNN v3)

Pascal(cuDNN v5)

Page 13: ACCELERATE DEEP LEARNING - GPU Technology …on-demand.gputechconf.com/gtc/2016/presentation/s6847...2 DEEP LEARNING EVERYWHERE INTERNET & CLOUD Image Classification Speech Recognition

13

cuBLAS

GPU-accelerated Basic Linear Algebra Subroutines that delivers 6x to 17x faster performance than the latest MKL BLAS

Accelerated Level 3 BLAS: SGEMM, SYMM, TRSM, SYRK

Up to 7 TFlops Single Precision on a single M40

Multi-GPU BLAS support available in cuBLAS-XT

Accelerated Linear Algebra for Deep Learning

developer.nvidia.com/cublas

Page 14: ACCELERATE DEEP LEARNING - GPU Technology …on-demand.gputechconf.com/gtc/2016/presentation/s6847...2 DEEP LEARNING EVERYWHERE INTERNET & CLOUD Image Classification Speech Recognition

14

NCCL

A topology-aware library of accelerated collectives to improve the scalability of multi-GPU applications

Patterned after MPI’s collectives: includes all-reduce, all-gather, reduce-scatter, reduce, broadcast

Optimized intra-node communication

Supports multi-threaded and multi-process applications

Accelerating Multi-GPU Communications

github.com/NVIDIA/nccl

Page 15: ACCELERATE DEEP LEARNING - GPU Technology …on-demand.gputechconf.com/gtc/2016/presentation/s6847...2 DEEP LEARNING EVERYWHERE INTERNET & CLOUD Image Classification Speech Recognition

15

NVIDIA DIGITS

An interactive development environment for training deep neural networks

Prepare data quickly and easily for training

Visualize network behavior

Maximize training speed

Making Deep Learning Accessible

developer.nvidia.com/digits

Page 16: ACCELERATE DEEP LEARNING - GPU Technology …on-demand.gputechconf.com/gtc/2016/presentation/s6847...2 DEEP LEARNING EVERYWHERE INTERNET & CLOUD Image Classification Speech Recognition

16

What’s new in DIGITS 3?

Train neural network models with Torch support (preview)

Save time by quickly iterating to identify the best model

Manage multiple jobs easily to optimize use of system resources

Active open source project with valuable community contributions

Improves Deep Learning Training Productivity

Screenshot of new DIGITS

New Results Browser!

developer.nvidia.com/digits

Page 17: ACCELERATE DEEP LEARNING - GPU Technology …on-demand.gputechconf.com/gtc/2016/presentation/s6847...2 DEEP LEARNING EVERYWHERE INTERNET & CLOUD Image Classification Speech Recognition

17

Preview DIGITS Future

Object Detection Workflows for Automotive and Defense

Targeted at Autonomous Vehicles, Remote Sensing

Object Detection Workflow

Come see a live demo in the GTC Exhibit Hall!

developer.nvidia.com/digits

Page 18: ACCELERATE DEEP LEARNING - GPU Technology …on-demand.gputechconf.com/gtc/2016/presentation/s6847...2 DEEP LEARNING EVERYWHERE INTERNET & CLOUD Image Classification Speech Recognition

18

Deep Learning at GTC

Page 19: ACCELERATE DEEP LEARNING - GPU Technology …on-demand.gputechconf.com/gtc/2016/presentation/s6847...2 DEEP LEARNING EVERYWHERE INTERNET & CLOUD Image Classification Speech Recognition

19

Deep Learning at GTC

11:00am: Accelerate Deep Learning with NVIDIA's Deep Learning Platform

12:00pm: Hangout -- The DIGITS Roadmap

1:00pm: From Workstation to Embedded: Accelerated Deep Learning on NVIDIA Jetson TX1

3:00pm: A Tutorial on More Ways to Use DIGITS

4:00pm: Hangout – cuDNN -- Features, Roadmap and Q&A

Deep Learning at NVIDIA, Monday 4/4

Page 20: ACCELERATE DEEP LEARNING - GPU Technology …on-demand.gputechconf.com/gtc/2016/presentation/s6847...2 DEEP LEARNING EVERYWHERE INTERNET & CLOUD Image Classification Speech Recognition

20

Deep Learning at GTC

9:00am: Caffe: an Open Framework for Deep Learning

10:00am: TensorFlow: Scaling Up Machine Learning

2:00pm: Torch: A Flexible Platform for Deep Learning Research

3:00pm: Chainer: A Powerful, Flexible, and Intuitive Deep Learning Framework

4:00pm: Theano at a Glance: A Framework for Machine Learning

4:30pm: Deep Learning in Microsoft with CNTK

Monday, 4/4 at 3:00pm: MXNet: Flexible Deep Learning Framework from Distributed GPU Clusters to Embedded Systems

Frameworks Track, Wednesday 4/6

Page 21: ACCELERATE DEEP LEARNING - GPU Technology …on-demand.gputechconf.com/gtc/2016/presentation/s6847...2 DEEP LEARNING EVERYWHERE INTERNET & CLOUD Image Classification Speech Recognition

21

Deep Learning at GTC

Wednesday 4/6

1:00pm: Introduction to CNTK

2:00pm: Machine Learning Using TensorFlow

2:00pm: BIDMach Machine Learning Toolkit

3:30pm: Applied Deep Learning for Vision and Natural Language with Torch7

3:30pm: Caffe Hands-on Lab

Frameworks Hands-on Labs

Thursday 4/7

9:30am: Chainer Hands-on: Introduction To Train Deep Learning Model in Python

9:30am: Deep Learning With the Theano Python Library

1:00pm: IBM Watson Developers Lab

Monday 4/4

1:00pm: Train and Deploy Deep Learning for Vision, Natural Language and Speech Using MXNet

Page 22: ACCELERATE DEEP LEARNING - GPU Technology …on-demand.gputechconf.com/gtc/2016/presentation/s6847...2 DEEP LEARNING EVERYWHERE INTERNET & CLOUD Image Classification Speech Recognition

22

Deep Learning at GTC

Tuesday, 4/5

1:00pm: Distributed Deep Learning at Scale, Soumith Chintala, Research Engineer, Facebook AI Research

2:00pm: Generative Adversarial Networks, Ian Goodfellow, Senior Research Scientist, Google

3:00pm: Video Classification of Live Streams on Twitter's Periscope, Nicolas Koumchatzky, Engineer, Twitter

4:00pm: Training and Deploying Deep Neural Networks for Speech Recognition, Bryan Catanzaro, Senior Researcher, Baidu Research

Wednesday, 4/6, 9:30am: Deep Reinforcement Learning, Pieter Abbeel, Professor, UC Berkeley

Over 50 sessions on Deep Learning, highlights include --

Page 23: ACCELERATE DEEP LEARNING - GPU Technology …on-demand.gputechconf.com/gtc/2016/presentation/s6847...2 DEEP LEARNING EVERYWHERE INTERNET & CLOUD Image Classification Speech Recognition

23

Deep Learning at GTC

Monday 4/4

12:00pm: The DIGITS Roadmap

4:00pm: cuDNN--Features, Roadmap and Q&A

Tuesday 4/5

12:00pm: Dreaming Big: Scaling Up Deep Dream to Operate on Multi-Hundred Megapixel Images

Hangouts

Wednesday 4/6

10:00am: Deep Learning in Image and Video

1:00pm: Deep Learning Exploits Petabytes of DigitalGlobe GIS

1:00pm: Reinforcement Learning

Thursday 4/7

1:00pm: Large Vocabulary Speech Recognition with GPUs

1:00pm: NVIDIA Deep Learning Software

Page 24: ACCELERATE DEEP LEARNING - GPU Technology …on-demand.gputechconf.com/gtc/2016/presentation/s6847...2 DEEP LEARNING EVERYWHERE INTERNET & CLOUD Image Classification Speech Recognition

24

Deep Learning at GTC

S6515 - Listen, Attend and Spell, William Chan, PhD Candidate, Carnegie Mellon University

S6745 - VQA: Visual Question Answering, Aishwarya Agrawal, PhD Student, Virginia Tech

S6781 - Deep Neural Networks for Conversational Language Understanding, Kaheer Suleman, CTO, Maluuba Inc.

S6321 - How Deep Learning Works for Automated Customer Service, Chenghua (Kevin) Li, Chief Scientist of DNN Lab, JD.COM

H6127 - Hangout: Large Vocabulary Speech Recognition with GPUs, Yifang Xu, Senior Deep Learning Software Engineer, NVIDIA

NLP/NLU

Page 25: ACCELERATE DEEP LEARNING - GPU Technology …on-demand.gputechconf.com/gtc/2016/presentation/s6847...2 DEEP LEARNING EVERYWHERE INTERNET & CLOUD Image Classification Speech Recognition

25

Deep Learning at GTC

S6371 - Deep Convolutional Neural Networks for Spoken Dialect Classification of Spectrogram Images Using DIGITS, Nigel Cannings, Chief Technical Officer, Intelligent Voice Limited

S6383 - High Performance CTC Training for End-to-End Speech Recognition on GPU, Minmin Sun, GPU Architecture Engineer, NVIDIA

S6458 - A GPU-Based Cloud Speech Recognition Server for Dialog Applications, Alexei V. Ivanov, CTO, Verbumware Inc.

S6672 - Training and Deploying Deep Neural Networks for Speech Recognition, Bryan Catanzaro, Senior Researcher, Baidu Research (Highly-Rated Speaker)

S6515 - Listen, Attend and Spell, William Chan, PhD Candidate, Carnegie Mellon University

S6781 - Deep Neural Networks for Conversational Language Understanding, Kaheer Suleman, CTO, Maluuba Inc.

Speech

Page 26: ACCELERATE DEEP LEARNING - GPU Technology …on-demand.gputechconf.com/gtc/2016/presentation/s6847...2 DEEP LEARNING EVERYWHERE INTERNET & CLOUD Image Classification Speech Recognition

QUESTIONS?


Recommended