+ All Categories
Home > Documents > The Missing Link to Productivity? · 2019-03-29 · Gartner Emerging Tech Hype Cycle - 2015 Machine...

The Missing Link to Productivity? · 2019-03-29 · Gartner Emerging Tech Hype Cycle - 2015 Machine...

Date post: 28-Mar-2020
Category:
Upload: others
View: 9 times
Download: 0 times
Share this document with a friend
23
Data Science Workstations The Missing Link to Productivity? David Patschke, Dell AI/ML Strategy, Dell Precision Workstations S9996
Transcript

Data Science WorkstationsThe Missing Link to Productivity?

David Patschke, Dell

AI/ML Strategy, Dell Precision Workstations

S9996

© Copyright 2019 Dell Inc.2 of Y

IntroductionThe AI/ML Journey courtesy Gartner Hype Cycle

© Copyright 2019 Dell Inc.3 of Y

Gartner Emerging Tech Hype Cycle - 2015

Machine Learning

TIME

Innovation

Trigger

Peak ofInflated

Expectations

Trough of

DisillusionmentSlope of Enlightenment

Plateau of

Productivity

EXPECTATIONS

Citizen

Data

Scientist

Natural Language Question Answering

Source: Gartner (August 2015)

© Copyright 2019 Dell Inc.4 of Y

Gartner Emerging Tech Hype Cycle - 2016

Machine Learning

TIME

Innovation

Trigger

Peak ofInflated

Expectations

Trough of Disillusionment Slope of Enlightenment

Plateau of Productivity

EXPECTATIONSCognitive

Expert

Advisors

Natural Language Question Answering

Source: Gartner (August 2016)

© Copyright 2019 Dell Inc.5 of Y

Gartner Emerging Tech Hype Cycle - 2017

Machine Learning

TIME

Innovation

Trigger

Peak ofInflated

Expectations

Trough of Disillusionment Slope of Enlightenment

Plateau of

Productivity

EXPECTATIONSDeep Learning

Cognitive Expert Advisors

Cognitive Computing

Deep

Reinforcement

Learning

Source: Gartner (August 2017)

© Copyright 2019 Dell Inc.6 of Y

Gartner Emerging Tech Hype Cycle - 2018

TIME

Innovation

Trigger

Peak ofInflated

Expectations

Trough of Disillusionment Slope of Enlightenment

Plateau of

Productivity

EXPECTATIONS Deep Neural Nets

(Deep Learning)

Edge AI

AI PaaS

Conversational AI

Artificial

General

Intelligence

Honest question:

Why has ML/DL/AI been at the top of

the hype cycle for 4 years in a row?

Source: Gartner (August 2018)

© Copyright 2019 Dell Inc.7 of Y

“Hidden Technical Debt in Machine Learning Systems”— Sculley et al, Google

Serving

Infrastructure

Monitoring

Machine Resource

ManagementData Verification

Data Collection

Configuration Analysis Tools

Process Management Tools

ML Code

Feature Extraction

SEXY NOT SEXYSOLVED NOT SOLVED

© Copyright 2019 Dell Inc.8 of Y

A

Proper

Use Case

The

First Mile

DATA

The

Last Mile

DEPLOY

Transparency

&

Explainability

Accommodating

Diverse

Workflows

5 Pieces of the Unsolved Puzzle

© Copyright 2019 Dell Inc.9 of Y

A

Proper

Use Case

Starting with

Wrong Problems

Repeatable Decision

in point in time

Human Capital

Constrained

Same Question(s)

Asked

Measurable

Outcome

IT-Supportable

(Batch, Real-time)

© Copyright 2019 Dell Inc.10 of Y

The

First Mile

DATA

Make Data Come A.L.I.V.E. !

A ggregation

L ineage I ngestion V alidation

E nhancement

© Copyright 2019 Dell Inc.11 of Y

Accommodating

Diverse

Workflows

What’s Your (Data) Problem?

Big Data“Normal”

Data

Workflow depends on the data, right?

© Copyright 2019 Dell Inc.12 of Y

Forrester

Study

Respondents site the

following reasons for using

workstations in AI

workloads:

• Price/Performance

• R&D w/ Flexible

Timelines

• Offload server demand

© Copyright 2019 Dell Inc.13 of Y

Data Science Workflow Considerations

• Resources

• Experimentation

• Agility

• Scaling

• Performance

© Copyright 2019 Dell Inc.14 of Y

Resource Considerations

How long will

this take?

Will

resources be

available? Reality:If Data Scientists are having to

think about these questions, they

are not thinking about solving

the business problems!

© Copyright 2019 Dell Inc.15 of Y

Experimentation Considerations

The

Lower-cost,

“All-Inclusive”

Alternative

Science,

Experimentation,

and

Risk-taking

© Copyright 2019 Dell Inc.16 of Y

Agility Considerations

EDA

&

Baseline Modeling

Before Deep Learning, there were

techniques like: called:

• Chunking

• Subsampling

• Stratification

Surprisingly, they are still rather

successful today for:

• Exploratory Data Analysis (EDA)

• Feature Engineering

• Baseline Model Building

© Copyright 2019 Dell Inc.17 of Y

Containers lend themselves to:

• Reproducibility

• Portability

• Streamlined Model Deployment

Containers possess:

• Data Science libraries and toolkits with

complex dependencies

• Ability to simplify DevOps demands

IT (Container Ship)

Scaling Considerations (Containers)

DellEMC Storage Dell Precision Workstations DellEMC PowerEdge

© Copyright 2019 Dell Inc.18 of Y

Performance Considerations

~10 hours for Resnet50 (2 x GV100)

© Copyright 2019 Dell Inc.19 of Y

Success Considerations

OCT Data Collection OCT Retinal Layer Thickness

Revolutionizing Ophthalmic Image Analysis• Optical Coherence Tomography (OCT) allows for 3-D imaging of the Eye

• Neuro-degenerative diseases can be detected via OCT images (Alzheimer’s, Parkinson’s, ALS,

MS, etc.)

• Using Dell Precision Workstations w/ NVIDIA GV100 graphics cards, Voxeleron has trained Deep

Convolutional Neural Networks to detect known neuro-degenerative pathologies and incorporate

these models into their InSight and InSight3D software for primary-care ophthalmologists.

© Copyright 2019 Dell Inc.20 of Y

Success Considerations

Transforming Business with AI in weeks not years … with existing teams

• The World’s Only Unstructured Database

• An AI system in a box.

• No data scientist necessary to build deep learning

models.

• A data engineer retrieves the data.

• A domain expert ensures that value is being derived

• Models capable of being trained with mixed structure

data (relational, image, audio, video, etc.)

• Car damage – image, structured vehicle information

• Home values – images, geospatial, structured data

• Using Dell Precision Workstations w/ NVIDIA GV100s in

Proof of Concept engagements with customers.

• 20k rows x 6 million features -> trained in a day

Predicted house price

Structured only: r=0.6

ZIFF Holistic: r=0.92

© Copyright 2019 Dell Inc.21 of Y

Hardware:

GPU: 1x Quadro RTX 6000/8000 w/ 24GB/48GB

CPU: Intel Xeon W-2195, 18-core (2.3GHz, 4.3GHz turbo)

Memory: 128GB DDR4 2666MHz

HDD 1: M.2 1TB PCIe NVME Class 40 SSD

HDD 2: 3.5" 4TB 7200rpm Nearline SAS HDD

OS: Ubuntu 16.04

Hardware:

GPU: 2x Quadro RTX 6000/8000 w/ NVLink (48GB/96GB)

CPU: 2x Intel Xeon Gold 6140, 18-core (2.3GHz, 3.7GHz turbo)

Memory: 384GB DDR4 2666MHz

HDD 1: M.2 1TB PCIe NVME Class 40 SSD

HDD 2: 3.5" 4TB 7200rpm Nearline SAS HDD

OS: Ubuntu 16.04

Dell Workstation Offerings

Precision T5820 Precision T7920

Software Ecosystem:

Docker/Singularity

NVIDIA Docker Runtime

Anaconda Python/R

RAPIDS software

Deep Learning frameworks

NVIDIA GPU Cloud (NGC)

….. and more

© Copyright 2019 Dell Inc.22 of Y

Dell Mobile Workstation Offerings

Hardware:

CPU: Intel Xeon E-2176M, 6-core (2.7GHz, 4.4GHz turbo)

Or

CPU: Intel Core i9-8950HK, 6-core (2.9GHz, 4.8GHz turbo)

Memory: 64GB DDR4 2666MHz

HDD : M.2 1TB PCIe NVME Class 40 SSD

OS: Ubuntu 18.04

Hardware:

GPU: NVIDIA Quadro P5200, 16GB GPU memory

CPU: Intel Xeon E-2186M, 6-core (2.9GHz, 4.8GHz turbo)

Or

CPU: Intel Core i9-8950HK, 6-core (2.9GHz, 4.8GHz turbo)

Memory: 128GB DDR4 2666MHz

HDD : M.2 1TB PCIe NVME Class 40 SSD

OS: Ubuntu 18.04

Precision 5530 Precision 7730

Come Visit

Booth #1311


Recommended