+ All Categories
Home > Documents > MLOps: Machine Learning OperationalizationMLOps: Machine Learning Operationalization Webinar...

MLOps: Machine Learning OperationalizationMLOps: Machine Learning Operationalization Webinar...

Date post: 27-Jun-2020
Category:
Upload: others
View: 15 times
Download: 0 times
Share this document with a friend
35
MLOps: Machine Learning Operationalization
Transcript
Page 1: MLOps: Machine Learning OperationalizationMLOps: Machine Learning Operationalization Webinar recording and slides will be available shortly Share questions with panelists using the

MLOps: Machine Learning Operationalization

Page 2: MLOps: Machine Learning OperationalizationMLOps: Machine Learning Operationalization Webinar recording and slides will be available shortly Share questions with panelists using the

● Nisha Talagala, Co-Founder, CTO & VP Engineering,

ParallelM

● Boris Tvaroska, Global Artificial Intelligence Solutions

Lead, Lenovo

MLOps: Machine Learning Operationalization

Page 3: MLOps: Machine Learning OperationalizationMLOps: Machine Learning Operationalization Webinar recording and slides will be available shortly Share questions with panelists using the

● Webinar recording and slides will be available shortly

● Share questions with panelists using the Question panel

● Q&A session following presentations

MLOps: Machine Learning Operationalization

Page 4: MLOps: Machine Learning OperationalizationMLOps: Machine Learning Operationalization Webinar recording and slides will be available shortly Share questions with panelists using the

Track-record:

Polyglot:

Runtime Focus:

MLOps: Machine Learning Operationalization

Page 5: MLOps: Machine Learning OperationalizationMLOps: Machine Learning Operationalization Webinar recording and slides will be available shortly Share questions with panelists using the

Machine Learning Operationalization

Page 6: MLOps: Machine Learning OperationalizationMLOps: Machine Learning Operationalization Webinar recording and slides will be available shortly Share questions with panelists using the

Nisha TalagalaCo-Founder, CTO & VP EngineeringParallelM

[email protected]

Platform PresentationMLOps: Machine Learning

Operationalization

Page 7: MLOps: Machine Learning OperationalizationMLOps: Machine Learning Operationalization Webinar recording and slides will be available shortly Share questions with panelists using the

CONFIDENTIALCONFIDENTIAL

MLOps: The Last MileFrom Data Science to Business ROI

NISHA TALAGALACTO, ParallelM

Page 8: MLOps: Machine Learning OperationalizationMLOps: Machine Learning Operationalization Webinar recording and slides will be available shortly Share questions with panelists using the

CONFIDENTIAL

Source: “Artificial Intelligence: The Next Digital Frontier?”, McKinsey Global Institute, June 2017

Out of 160 reviewed AI use cases:

88% did not progress beyond the experimental stage

But successful early AI adopters report:

Profit margins

3–15% higher than industry average

20%

AI in Production

80%

Developing, Experimenting,ContemplatingSurvey of 3073 AI-aware C-level Executives

Growing AI Investments; Few Deployed at Scale

Page 9: MLOps: Machine Learning OperationalizationMLOps: Machine Learning Operationalization Webinar recording and slides will be available shortly Share questions with panelists using the

CONFIDENTIALCONFIDENTIAL

The ML Development and Deployment Cycle

Bulk of effort today is in the left side of this process (development) • Many tools, libraries, etc. • Democratization of Data Science• Auto-ML

Page 10: MLOps: Machine Learning OperationalizationMLOps: Machine Learning Operationalization Webinar recording and slides will be available shortly Share questions with panelists using the

CONFIDENTIALCONFIDENTIAL

What makes ML uniquely challenging in production?Part I : Dataset dependency

• ML ‘black box’ into which many inputs (algorithmic, human, dataset etc.) go to provide output.

• Difficult to have reproducible, deterministically ‘correct’ result as input data changes

• ML in production may behave differently than in developer sandbox because live data ≠ training data

Page 11: MLOps: Machine Learning OperationalizationMLOps: Machine Learning Operationalization Webinar recording and slides will be available shortly Share questions with panelists using the

CONFIDENTIALCONFIDENTIAL

What makes ML uniquely challenging in production?Part II : Simple to Complex Practical Topologies

• Multiple loosely coupled pipelines running possibly in parallel, with dependencies and human interactions

• Feature engineering pipelines must match for Training and Inference (CodeGen Pipelines can help here)

• Control pipelines, Canaries, A/B Tests etc.

• Further complexity if ensembles, federated learning etc are used

Page 12: MLOps: Machine Learning OperationalizationMLOps: Machine Learning Operationalization Webinar recording and slides will be available shortly Share questions with panelists using the

CONFIDENTIALCONFIDENTIAL

What makes ML uniquely challenging in production?Part III : Heterogeneity and Scale

• Possibly differing engines (Spark, TensorFlow, Caffe, PyTorch, Sci-kit Learn, etc. )

• Different languages (Python, Java, Scala, R ..)

• Inference vs Training engines• Training can be frequently batch

• Inference (Prediction, Model Serving) can be REST endpoint/custom code, streaming engine, micro-batch, etc.

• Feature manipulation done at training needs to be replicated (or factored in) at inference

• Each engine presents its own scale opportunities/issues

Page 13: MLOps: Machine Learning OperationalizationMLOps: Machine Learning Operationalization Webinar recording and slides will be available shortly Share questions with panelists using the

CONFIDENTIALCONFIDENTIAL

What makes ML uniquely challenging in production?Part IV : Compliance, Regulations…

• Established: Example: Model Risk Management in Financial Services

• https://www.federalreserve.gov/supervisionreg/srletters/sr1107a1.pdf

• Emerging: Example GDPR on Reproducing and Explaining ML Decisions

• https://iapp.org/news/a/is-there-a-right-to-explanation-for-machine-learning-in-the-gdpr/

• Emerging: New York City Algorithm Fairness Monitoring

• https://techcrunch.com/2017/12/12/new-york-city-moves-to-establish-algorithm-monitoring-task-force/

Page 14: MLOps: Machine Learning OperationalizationMLOps: Machine Learning Operationalization Webinar recording and slides will be available shortly Share questions with panelists using the

CONFIDENTIALCONFIDENTIAL

What makes ML uniquely challenging in production?Part V : Collaboration, Process

COLLABORATION

• Expertise mismatch between Data Science & Ops complicates handoff and continuous management and optimization

PROCESS

• Many objects to be tracked and managed (algorithms, models, pipelines, versions etc.)

• ML pipelines are code. Some approach them as code, some not

• Some ML objects (like Models and Human approvals) are not best handled in source control repositories

Page 15: MLOps: Machine Learning OperationalizationMLOps: Machine Learning Operationalization Webinar recording and slides will be available shortly Share questions with panelists using the

CONFIDENTIALCONFIDENTIAL

ML Orchestration

ML Health

Business Impact

Model Governance

Continuous Integration/Deployment

DatabaseMachine Learning

Models

Business Value

MLOps – Automating the Production ML Lifecycle

Page 16: MLOps: Machine Learning OperationalizationMLOps: Machine Learning Operationalization Webinar recording and slides will be available shortly Share questions with panelists using the

CONFIDENTIALCONFIDENTIAL

MLOps, DevOps and SDLC• Integrate with SDLC (Source control repositories, etc.) for code

• Integrate with DevOps for Automation, Scale and Collaboration

Automate

Scale

Measure Business Success

Manage Risk

Compliance & Governance

Automate

Scale

Manage ML Application

Collaborate

MLOps

DevOps

Page 17: MLOps: Machine Learning OperationalizationMLOps: Machine Learning Operationalization Webinar recording and slides will be available shortly Share questions with panelists using the

CONFIDENTIALCONFIDENTIAL

Models, RetrainingControl, StatisticsEvents, AlertsData

Data Science Platforms

Data Streams Data Lakes

MCenter

MCenter Server

AnalyticEngines

MCenter Developer Connectors

MCenterAgent

MCenterAgent

MCenterAgent

MCenterAgent

MCenter Agent

MCenterAgent

)CDSW(

How it Works – MCenter Architecture

Page 18: MLOps: Machine Learning OperationalizationMLOps: Machine Learning Operationalization Webinar recording and slides will be available shortly Share questions with panelists using the

CONFIDENTIALCONFIDENTIAL

Summary

• We are at the beginnings of ML Operationalization

• Much like databases (backbone of production applications) need DBAs and software needs DevOps, ML needs MLOps (specialized operationalization practices, tools and training)

• For more information

• https://www.mlops.org for MLOps resources

• https://www.parallelm.com

Page 19: MLOps: Machine Learning OperationalizationMLOps: Machine Learning Operationalization Webinar recording and slides will be available shortly Share questions with panelists using the

CONFIDENTIALCONFIDENTIAL

Thank [email protected]

Page 20: MLOps: Machine Learning OperationalizationMLOps: Machine Learning Operationalization Webinar recording and slides will be available shortly Share questions with panelists using the

Machine Learning Operationalization

Page 21: MLOps: Machine Learning OperationalizationMLOps: Machine Learning Operationalization Webinar recording and slides will be available shortly Share questions with panelists using the

Boris TvaroskaGlobal Artificial Intelligence Solutions,Lenovo

[email protected]

Platform PresentationMLOps: Machine Learning

Operationalization

Page 22: MLOps: Machine Learning OperationalizationMLOps: Machine Learning Operationalization Webinar recording and slides will be available shortly Share questions with panelists using the

222018 Lenovo Internal. All rights reserved.

Integrating data science into SDLC

Boris Tvaroska

September 2018

Page 23: MLOps: Machine Learning OperationalizationMLOps: Machine Learning Operationalization Webinar recording and slides will be available shortly Share questions with panelists using the

23

Evolution of AIMoving from research papers to applications

Reports using ML/DL

2018 Lenovo Internal. All rights reserved.

AI in products & services

Research about AI

Page 24: MLOps: Machine Learning OperationalizationMLOps: Machine Learning Operationalization Webinar recording and slides will be available shortly Share questions with panelists using the

242018 Lenovo Internal. All rights reserved.

What can happen?

I did not change a single line of code.

Junior Software Engineer after breaking the build

Page 25: MLOps: Machine Learning OperationalizationMLOps: Machine Learning Operationalization Webinar recording and slides will be available shortly Share questions with panelists using the

252018 Lenovo Internal. All rights reserved.

Different lifecycles

•Starts with change in code

•Established practice•Iterations in days / weeks

•Starts which change in code, data or metrics

•Emerging practice•Iterations as fast as possible, several times per day

Define KPI

Collect Data

Transform Data

Model

Evaluate

Page 26: MLOps: Machine Learning OperationalizationMLOps: Machine Learning Operationalization Webinar recording and slides will be available shortly Share questions with panelists using the

262018 Lenovo Internal. All rights reserved.

Main challenges

Test

•The wrong result is acceptable•Need to test for False Positives•Need to test for False Negatives

•Longer test times•More test cases needed

Build & Deploy

•More artifacts to work with•Frequent changes•Versioning of artifacts and source data

Page 27: MLOps: Machine Learning OperationalizationMLOps: Machine Learning Operationalization Webinar recording and slides will be available shortly Share questions with panelists using the

272018 Lenovo Internal. All rights reserved.

Training in test/build cycle

Code Build Test

Dataset

Experiment

Cross-Valid

Train CV Test

Code

Hyperparameters

Train Build Test

Independent cycles

Possible for simple models with small amount of data

Existing toolset

Risks:- Slow CI/CD cycle- More failing builds

Page 28: MLOps: Machine Learning OperationalizationMLOps: Machine Learning Operationalization Webinar recording and slides will be available shortly Share questions with panelists using the

282018 Lenovo Internal. All rights reserved.

Model as a service

Model is independent

Fit languages/frameworks

Risks:- Interface is vector- Pre-mature service

boundaries- Multi-step application

Code Build Test

Dataset

Experiment

Cross-Valid

Train CV Test

Code

Hyperparameters

Train

Build Test

Independent cycles

Test

Integrate

Page 29: MLOps: Machine Learning OperationalizationMLOps: Machine Learning Operationalization Webinar recording and slides will be available shortly Share questions with panelists using the

292018 Lenovo Internal. All rights reserved.

SW emerged in Data Science

Clearly defined service

Data Science toolsetData Science framework

Risks:- Culture clash

Code Build Test

Dataset

Experiment

Cross-Valid

Train CV Test

Code

Train Test

Build

Page 30: MLOps: Machine Learning OperationalizationMLOps: Machine Learning Operationalization Webinar recording and slides will be available shortly Share questions with panelists using the

302019 Lenovo Internal. All rights reserved.

Practical exampleLibraries

Transform Train Validate Build Test Deploy

Page 31: MLOps: Machine Learning OperationalizationMLOps: Machine Learning Operationalization Webinar recording and slides will be available shortly Share questions with panelists using the

312017 Lenovo Internal. All rights reserved.

Boris TvaroskaGlobal Solution Lead for Lenovo

AI Innovation Centers

20 years of experience running engineering teams across Europe, North and South America, Middle East, India

Email: [email protected]@tvaroska.sk

Linkedin: www.linkedin.com/in/boristvaroska

Twitter: @btvaroska

Page 32: MLOps: Machine Learning OperationalizationMLOps: Machine Learning Operationalization Webinar recording and slides will be available shortly Share questions with panelists using the

Q & A

Page 33: MLOps: Machine Learning OperationalizationMLOps: Machine Learning Operationalization Webinar recording and slides will be available shortly Share questions with panelists using the

Making Machine Learning AccessibleMLOps: Machine Learning

Operationalization

● Nisha Talagala, Co-Founder, CTO & VP Engineering,

ParallelM

● Boris Tvaroska, Global Artificial Intelligence Solutions

Lead, Lenovo

Page 34: MLOps: Machine Learning OperationalizationMLOps: Machine Learning Operationalization Webinar recording and slides will be available shortly Share questions with panelists using the

● Learn more about our Platform: https://www.activestate.com/platform

● Watch a demo: https://www.youtube.com/watch?v=c5AIxN9ehrI

● Contact [email protected] for more information.

Page 35: MLOps: Machine Learning OperationalizationMLOps: Machine Learning Operationalization Webinar recording and slides will be available shortly Share questions with panelists using the

Tel: 1.866.631.4581

Website: www.activestate.com

Twitter: @activestate

Facebook: /activestatesoftware

Platform Presentation

Where to find us


Recommended