+ All Categories
Home > Documents > Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

Date post: 24-Jun-2020
Category:
Upload: others
View: 2 times
Download: 0 times
Share this document with a friend
93
Cloud compting 101 Fabien Hermenier image credit http://eyepluscamera.files.wordpress.com/ 1
Transcript
Page 1: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

Cloud compting101

Fabien Hermenierimage credit http://eyepluscamera.files.wordpress.com/

1

Page 2: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

was cloud computing needed ?

2

Page 3: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

Mainframes

3

Page 4: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

Then came with affordable PCs

Then we spread out the load for security, performance, manageability

Then we bought tons of servers to support load spikes

4

Page 5: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

5

Page 6: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

Amazon X-mas 2013 426 items sold each second6

Page 7: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

Where is energy spent ?

7

Page 8: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

episode 0 rise of the cloud8

Page 9: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

Cloud computing is a model for enabling ubiquitous, convenient, on-demand network access to a shared pool of configurable computing resources that can be rapidly provisioned and released with minimal management effort or service provider interaction.

2011

“”9

Page 10: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

1 self-provisioning, no human intervention

On-demand self-services

10

Page 11: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

11

Page 12: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

2availability over the network

standard mechanisms

broad network access

12

Page 13: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

3multi-tenant

virtual or physical resources on-demand allocation

location independance

resource pooling

13

Page 14: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

reserved instances (yearly based) on-demand instances (hourly based)

hotspot instances (market based) 14

Page 15: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

Amazon EC2 HotSpot instances

bid over the market price to get the instance15

Page 16: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

4fast (de-)allocation of resources scale to infinity

rapid elasticity

16

Page 17: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

verti

cal e

lastic

ity

Tiers 1

Tiers 2

Tiers 3

17

Page 18: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

verti

cal e

lastic

ity

Tiers 1

Tiers 2

Tiers 3

17

Page 19: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

horizontal elasticity

Tiers 1

Tiers 2

Tiers 3

18

Page 20: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

horizontal elasticity

Tiers 1

Tiers 2

Tiers 3

18

Page 21: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

5 metering capabilities transparent reporting

measured service

19

Page 22: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

20

Page 23: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

and I will call it cloud computing

21

Page 24: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

Cloud Computingorigins

22

Page 25: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

If computers of the kind I have advocated become the computers of the future, then computing may someday be organized as a public utility just as the telephone system is a public utility... The computer utility could become the basis of a new and important industry.

John McCarthy, 1961

“”

23

Page 26: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

cluster computingloosely coupled co-located servers

single tenant non-interactive workload

rigid jobs80s

24

Page 27: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

Clo

ud o

r no

t ?

on demand self-services

broad network access

resource pooling

rapid elasticity

measured service

25

Page 28: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

Clo

ud o

r no

t ?

on demand self-services

broad network access

resource pooling

rapid elasticity

measured service

26

Page 29: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

grid computingIan Foster et al. 2001

27

Page 30: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

Pow

er G

rid

Ana

logy

28

Page 31: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

virtual organisation heterogeneous hw. multiple applications abstract resources

doing * at

Power grid Computing grid

multiple providers heterogeneous sources

multiple clients abstract source

large scaleindependencelocation

live consumption batch jobs

29

Page 32: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

Throughput(MB/s)

TransferThroughput2014-10-2212:40to2014-10-2312:40UTC

alice atlas cms lhcb

13:00

14:00

15:00

16:00

17:00

18:00

19:00

20:00

21:00

22:00

23:00

00:00

01:00

02:00

03:00

04:00

05:00

06:00

07:00

08:00

09:00

10:00

11:00

12:00

0k

5k

10k

15k

20k

25k

Worldwide LHC Computing grid

170 centres to analyse 30 PB / year30

Page 33: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

Clo

ud o

r no

t ?

on demand self-services

broad network access

resource pooling

rapid elasticity

measured service

31

Page 34: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

Clo

ud o

r no

t ?

on demand self-services

broad network access

resource pooling

rapid elasticity

measured service

32

Page 35: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

Application Service Provider

service oriented

pay as you go

95+

1Client

2Client

3Client

remote access to dedicated applications

33

Page 36: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

Clo

ud o

r no

t ?

on demand self-services

broad network access

resource pooling

rapid elasticity

measured service

34

Page 37: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

Clo

ud o

r no

t ?

on demand self-services

broad network access

resource pooling (not real hw resources)

rapid elasticity

measured service

35

Page 38: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

computers on demand.2002Deploy full custom stacks (OS to applications)

36

Page 39: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

to (re)deploy reproducible network experiments

multi-tenant, (limited on purpose) resource pooling,

37

Page 40: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

to (re)deploy reproducible network experiments

100Mb/s10ms

10ms50ms,

5% loss

38

Page 41: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

to (re)deploy reproducible network experiments

10ms

10ms

50ms, 5% loss

39

Page 42: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

Clo

ud o

r no

t ?

on demand self-services

broad network access

resource pooling

rapid elasticity

measured service

40

Page 43: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

Clo

ud o

r no

t ?

on demand self-services

broad network access

resource pooling

rapid elasticity

measured service

41

Page 44: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

S O Aervice

orientedrchitecture

2001+

composable unassociated, loosely coupled units

42

Page 45: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

exponential grows since 2001 private and public services to support its growth

43

Page 46: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

Two pizza rule

If a team can’t be fed by two pizzas then it is to big

- Jeff Bezos (founder/ CEO of amazon.com)

44

Page 47: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

800 x

tons of API, mini-services devoted to automation, flexibility, on-demand services for public and private use

45

Page 48: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

2006

scalable web services for other websites or client-side applications

46

Page 49: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

SOAP & REST over HTTP pay as you go elastic *-oriented services

*data, network or computation47

Page 50: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

Clo

ud !

on demand self-services

broad network access

resource pooling

rapid elasticity

measured service

48

Page 51: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

RECAP49

Page 52: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

I have a dream, it was about Utility Computing “ ”John McCarthy - 1961

50

Page 53: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

web + grid computing + resources on demand + service oriented architectures

cloud computing (2006)

51

Page 54: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

can we talk about cloud computing now ?

52

Page 55: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

?aaS53

Page 56: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

web access to commercial sw. “one to many” model

customers don’t handle upgrades API for integration

Software as a Service

Saa

S

54

Page 57: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

55

Page 58: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

56

Page 59: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

jailed runtime available to host applications generic or provider-specific APIs no control over the environment

Platform as a Service

Paa

S

57

Page 60: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

58

$ heroku login…

$ git clone https://github.com/heroku/java-getting-started.git$ cd java-getting-started

$ heroku createCreating warm-eyrie-9006... done, stack is cedar-14http://warm-eyrie-9006.herokuapp.com/ | [email protected]:warm-eyrie-9006.gitGit remote heroku added

$ git push heroku master… http://warm-eyrie-9006.herokuapp.com/ deployed to Heroku

$ heroku ps:scale web=1

Page 61: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

59

Page 62: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

Infrastructure as a Service

low-level resources to deploy arbitrary software stacks complete control over its network, storage and OS

IaaS

60

Page 63: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

61

Page 64: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

62

Page 65: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

63

Things will crash. Deal with it!

Assume you could start with super reliable servers (MTBF of 30 years) Build computing system with 10 thousand of those Watch one fail per day

Dean Keynote, LADIS 2009

Page 66: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

64

~0.5 overheating (power down most machines in <5 mins, ~1-2 days to recover) ~1 PDU failure (~500-1000 machines suddenly disappear, ~6 hours to come back) ~1 rack-move (plenty of warning, ~500-1000 machines powered down, ~6 hours) ~1 network rewiring (rolling ~5% of machines down over 2-day span) ~20 rack failures (40-80 machines instantly disappear, 1-6 hours to get back) ~5 racks go wonky (40-80 machines see 50% packetloss) ~8 network maintenances (4 might cause ~30-minute random connectivity losses) ~12 router reloads (takes out DNS and external vips for a couple minutes) ~3 router failures (have to immediately pull traffic for an hour) ~dozens of minor 30-second blips for dns ~1000 individual machine failures ~thousands of hard drive failures slow disks, bad memory, misconfigured machines, flaky machines, etc. Long distance links: wild dogs, sharks, dead horses, drunken hunters, etc.

Typical first year for a new google cluster

Page 67: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

Leslie Lamport

A distributed system is one in which the failure of a computer you didn't even know existed can render your own computer unusable

65

“”

Page 68: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

66

Building fault tolerant services

deal with failures deal with inconsistency

be pessimistic

at every level

Page 69: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

67

October, 21th 2016: dynDNS targeted by a DDoS

Affected services:

1.2 Tb/s of DNS lookups

Page 70: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

68

Page 71: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

applications

runtimes

integration/security

database

servers

virtualisation

server HW

storage

network

you

man

age ol’

school IT

69

Page 72: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

SaaS

applications

runtimes

integration/security

database

servers

virtualisation

server HW

storage

network

man

aged

by ve

ndor

70

Page 73: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

PaaS

applications

runtimes

integration/security

database

servers

virtualisation

server HW

storage

network

you manage

man

aged

by ve

ndor

71

Page 74: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

Open-source PaaS stacks

Page 75: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

IaaS

applications

runtimes

integration/security

database

servers

virtualisation

server HW

storage

network

you

man

age

man

aged

by ve

ndor

73

Page 76: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

Open-source IaaS stacks

(2008+)

cloudstack

(2008+)

(2010+) (2012+)

74

Page 77: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

75

vendor lock-in

IaaS PaaS SaaS

Page 78: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

Deployment models

76

Page 79: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

public cloudgeneral availability to everyone

the “real” cloud reduced costs trust issues ?

77

Page 80: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

78

cloud computing vs.

fog of war

Page 81: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

79

Trust in megive me your code & data

Page 82: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

80

I’m aware read my mails

what is my is hacked ?

Page 83: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

private cloudself hosted cloudworldcompany SA

might reduce TCO stronger trust

better manageability

81

Page 84: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

worldcompany SA hybrid cloud

82

Page 85: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

multi-cloudsyou spread your application

avoid Single Point of Failures* take the benefits of each cloud

LB

83

Page 86: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

inter-cloudsthey outsource your components

agreements between the providers“cloud of clouds”

84

Page 87: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

fog computing

85

Page 88: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

community cloudprivate cloud by and for

multiple organizations

86

Page 89: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

RECAP87

Page 90: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

CLOUD IS ABOUT REDUCING COSTS

88

Page 91: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

CLOUD IS ABOUT SCALABILITY

89

Page 92: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

90

CLOUD IS ABOUT RESILIENCY

Page 93: Fabien Hermenier · 64 ~0.5 overheating (power down most machines in

CLOUD IS ABOUT TRUST


Recommended