+ All Categories
Home > Software > Hadoop on Cloud: Why and How?

Hadoop on Cloud: Why and How?

Date post: 13-Jan-2017
Category:
Upload: cloudera-inc
View: 1,334 times
Download: 1 times
Share this document with a friend
44
1 © Cloudera, Inc. All rights reserved. Hadoop on Cloud. Why and How? Andrei Savu | Tech Lead, Cloudera Director Silicon Valley Cloud Computing Group | Nov 18, 2015
Transcript
Page 1: Hadoop on Cloud: Why and How?

1© Cloudera, Inc. All rights reserved.

Hadoop on Cloud. Why and How?Andrei Savu | Tech Lead, Cloudera DirectorSilicon Valley Cloud Computing Group | Nov 18, 2015

Page 2: Hadoop on Cloud: Why and How?

2© Cloudera, Inc. All rights reserved.

About me

Tech Lead on Cloudera Director

Previously founder of axemblr.com

Contributed to Apache Whirr (PMC) & jclouds.

Twitter: @andreisavu

Page 3: Hadoop on Cloud: Why and How?

3© Cloudera, Inc. All rights reserved.

Cloudera Directorcloudera.com/director

Deploy and manage enterprise-grade Hadoop in the cloud

AWS & Google CloudExtensible via plugins

Page 4: Hadoop on Cloud: Why and How?

Journey to the Cloud

Page 5: Hadoop on Cloud: Why and How?

5© Cloudera, Inc. All rights reserved.

Do you use a public or private cloud?

How do you run and manage Hadoop?

Page 6: Hadoop on Cloud: Why and How?

6© Cloudera, Inc. All rights reserved.

What is this talk about?

State of the WorldArchitectural PatternsImagine the Future

Page 7: Hadoop on Cloud: Why and How?

7© Cloudera, Inc. All rights reserved.

Gartner's 2015 Hype Cycle for Emerging Technologies (source)

Advanced AnalyticsHybrid CloudInternet of Things

Page 8: Hadoop on Cloud: Why and How?

8© Cloudera, Inc. All rights reserved.

Hybrid Clouds

Cloud ExchangeApplication PortabilityPrivate-PublicPublic-Public

Page 9: Hadoop on Cloud: Why and How?

9© Cloudera, Inc. All rights reserved.

Cloud Wars

AWSMicrosoft AzureGoogle CloudVMWareOpenstacketc.

Page 10: Hadoop on Cloud: Why and How?

10© Cloudera, Inc. All rights reserved.

Data has Mass and Gravity

Page 11: Hadoop on Cloud: Why and How?

11© Cloudera, Inc. All rights reserved.

Hadoop EnvironmentsOn-Premise versus Cloud

On-Premise CloudStorage Direct Attached Direct Attached or Object Store

Data Not shared across clusters Shared across multiple clusters

Sizing Fixed-size Dynamic based on load

Usage Model All users share cluster Clusters created as needed for apps/users

Resource Management (YARN)

HDFS

Process Discover Model Serve

Industry Standard Servers (CPU, Memory, & Direct Attached Storage)

Resource Management (YARN)

HDFS

Process Discover Model Serve

Industry Standard Servers (CPU & Memory)

Object Storage

Page 12: Hadoop on Cloud: Why and How?

12© Cloudera, Inc. All rights reserved.

Cloud providers shipping distributions of Hadoop

IntegrationUnlock Query EnginesMigration workloads

Is that a sustainable advantage? Or just a temporary stop gap?

Page 13: Hadoop on Cloud: Why and How?

13© Cloudera, Inc. All rights reserved.

Maturity level

On-prem vs. CloudMonitoringDev / Test / ProdAvailabilityDurability

Page 14: Hadoop on Cloud: Why and How?

14© Cloudera, Inc. All rights reserved.

Common Architectural Patterns in the Cloud

Object Storage

Source Data Seed Data Backup/DR

ETL/MODELING(Spark, MapReduce)

• Short-running clusters• Elastic workload• No local storage

necessary

|WASB |SWIFT |BLOB

• Long-running clusters• Sized to demand• Some local storage

BI/ANALYTICS(Impala, Solr)

• Fixed clusters • Periodic sync• Default to local

storage

APP DELIVERY(HBase, Kudu)

Page 15: Hadoop on Cloud: Why and How?

15© Cloudera, Inc. All rights reserved.

Cluster lifecycle management

Create / TerminateDiscoveryMetadataMonitoring

Page 16: Hadoop on Cloud: Why and How?

16© Cloudera, Inc. All rights reserved.

Work Queue

WorkflowsDispatchTrackingDecoupledFault Tolerant

Page 17: Hadoop on Cloud: Why and How?

17© Cloudera, Inc. All rights reserved.

Common Architectural Patterns in the Cloud

Object Storage

Source Data Seed Data Backup/DR

ETL/MODELING(Spark, MapReduce)

• Short-running clusters• Elastic workload• No local storage

necessary

|WASB |SWIFT |BLOB

• Long-running clusters• Sized to demand• Some local storage

BI/ANALYTICS(Impala, Solr)

• Fixed clusters • Periodic sync• Default to local

storage

APP DELIVERY(HBase, Kudu)

Page 18: Hadoop on Cloud: Why and How?

18© Cloudera, Inc. All rights reserved.

Multi-user

SecureIsolatedFriendly

Page 19: Hadoop on Cloud: Why and How?

19© Cloudera, Inc. All rights reserved.

Elastic

Grow or shrinkBusiness hoursNumber of usersStorage vs. ComputeCost efficient

Page 20: Hadoop on Cloud: Why and How?

20© Cloudera, Inc. All rights reserved.

Common Architectural Patterns in the Cloud

Object Storage

Source Data Seed Data Backup/DR

ETL/MODELING(Spark, MapReduce)

• Short-running clusters• Elastic workload• No local storage

necessary

|WASB |SWIFT |BLOB

• Long-running clusters• Sized to demand• Some local storage

BI/ANALYTICS(Impala, Solr)

• Fixed clusters • Periodic sync• Default to local

storage

APP DELIVERY(HBase, Kudu)

Page 21: Hadoop on Cloud: Why and How?

21© Cloudera, Inc. All rights reserved.

Advanced Monitoring

LatencyResource utilizationConsistent performance

Page 22: Hadoop on Cloud: Why and How?

22© Cloudera, Inc. All rights reserved.

High availability and failure domains

Data durabilityRepair within SLAHost-to-instance

Page 23: Hadoop on Cloud: Why and How?

23© Cloudera, Inc. All rights reserved.

Backup and disaster recovery

Object store centricActive-Standby

Page 24: Hadoop on Cloud: Why and How?

24© Cloudera, Inc. All rights reserved.

Imagine the Future

Portable ExperienceSelf-serviceSelf-healingGranular SecurityAdvanced GovernanceComplete Management

What’s your vision?

Page 25: Hadoop on Cloud: Why and How?

25© Cloudera, Inc. All rights reserved.

Thank [email protected]

Page 26: Hadoop on Cloud: Why and How?

26© Cloudera, Inc. All rights reserved.

Resources

Cloudera Director: http://www.cloudera.com/director

Interested in API level integration and scripting?

https://github.com/cloudera/director-sdk

https://github.com/cloudera/director-scripts

Interested in integration with another cloud platform?

https://github.com/cloudera/director-spi

https://github.com/cloudera/director-google-plugin

Page 28: Hadoop on Cloud: Why and How?

Cloudera Director Screenshots

Page 29: Hadoop on Cloud: Why and How?

© 2014 Cloudera, Inc. All rights reserved.

Page 30: Hadoop on Cloud: Why and How?

© 2014 Cloudera, Inc. All rights reserved.

Page 31: Hadoop on Cloud: Why and How?

© 2014 Cloudera, Inc. All rights reserved.

Page 32: Hadoop on Cloud: Why and How?

© 2014 Cloudera, Inc. All rights reserved.

Page 33: Hadoop on Cloud: Why and How?

© 2014 Cloudera, Inc. All rights reserved.

Page 34: Hadoop on Cloud: Why and How?

© 2014 Cloudera, Inc. All rights reserved.

Page 35: Hadoop on Cloud: Why and How?

© 2014 Cloudera, Inc. All rights reserved.

Page 36: Hadoop on Cloud: Why and How?

© 2014 Cloudera, Inc. All rights reserved.

Page 37: Hadoop on Cloud: Why and How?

© 2014 Cloudera, Inc. All rights reserved.

Page 38: Hadoop on Cloud: Why and How?

© 2014 Cloudera, Inc. All rights reserved.

Page 39: Hadoop on Cloud: Why and How?

© 2014 Cloudera, Inc. All rights reserved.

Page 40: Hadoop on Cloud: Why and How?

© 2014 Cloudera, Inc. All rights reserved.

Page 41: Hadoop on Cloud: Why and How?

© 2014 Cloudera, Inc. All rights reserved.

Page 42: Hadoop on Cloud: Why and How?

© 2014 Cloudera, Inc. All rights reserved.

Page 43: Hadoop on Cloud: Why and How?

© 2014 Cloudera, Inc. All rights reserved.

Page 44: Hadoop on Cloud: Why and How?

44© Cloudera, Inc. All rights reserved.

Thank [email protected]


Recommended