+ All Categories
Home > Documents > Keeping up with the architects. - IEEE Computer Society€¦ · Andrew Warfield, UBC and Coho Data....

Keeping up with the architects. - IEEE Computer Society€¦ · Andrew Warfield, UBC and Coho Data....

Date post: 21-Aug-2020
Category:
Upload: others
View: 1 times
Download: 0 times
Share this document with a friend
35
Keeping up with the architects. Andrew Warfield, UBC and Coho Data
Transcript
Page 1: Keeping up with the architects. - IEEE Computer Society€¦ · Andrew Warfield, UBC and Coho Data. About this keynote. (And the things I'm not going to talk about.) Not going to

Keepingupwiththearchitects.

AndrewWarfield,UBCandCohoData

Page 2: Keeping up with the architects. - IEEE Computer Society€¦ · Andrew Warfield, UBC and Coho Data. About this keynote. (And the things I'm not going to talk about.) Not going to

Aboutthiskeynote.(AndthethingsI'mnotgoingtotalkabout.)

Page 3: Keeping up with the architects. - IEEE Computer Society€¦ · Andrew Warfield, UBC and Coho Data. About this keynote. (And the things I'm not going to talk about.) Not going to

Notgoingtotalkaboutanyofthisstuffrightnow(buthappytointhehallwaytrack)• FinishedPhDatCambridgein2006• Workedinindustrialresearch(AT&TandIntel)• Twostartups(XenSource andCohoData)• AssociateprofatUBC• Threekids• Iwentheli skiinglastFriday.

Page 4: Keeping up with the architects. - IEEE Computer Society€¦ · Andrew Warfield, UBC and Coho Data. About this keynote. (And the things I'm not going to talk about.) Not going to

Here'swhatIamgoingtodo

• Makesomeprettyobviousobservationsabouttechnologydirections.• Drawsomedodgyandhighlyspeculativeconclusionsfromthoseobservations.• Trytoinfluenceyourresearch.

• Disclaimer:thisisnotaconferencetalk,norisit5stapledtogetherconferencetalks.• Anotherdisclaimer:I'mgoingtogiveyoumoreproblemsthansolutions.

Page 5: Keeping up with the architects. - IEEE Computer Society€¦ · Andrew Warfield, UBC and Coho Data. About this keynote. (And the things I'm not going to talk about.) Not going to

Solet'sgo…

Page 6: Keeping up with the architects. - IEEE Computer Society€¦ · Andrew Warfield, UBC and Coho Data. About this keynote. (And the things I'm not going to talk about.) Not going to

Section5:Evaluation.

• (Attheendoftheday,allsystemspapersareaboutperformance.)• Probablybecauseit'soneoftheonlythingsweknowhowtomeasure.• Therearetwotypesofperformanceresults:

1. Smallimprovementsinaverylargesystem.2. Speedupsthataresosignificantthattheychangefunctionality.

• GoogleandFacebookandAmazonandMicrosoftareprobablyalotbetteratsolvingmeaningfulproblemswiththeirsystemsthanyouare.

Page 7: Keeping up with the architects. - IEEE Computer Society€¦ · Andrew Warfield, UBC and Coho Data. About this keynote. (And the things I'm not going to talk about.) Not going to

Herearethehigh-leveltrends/ideasbehindthistalk1. Diminishingscarcity.2. Practical/sensibletoownyourownhardwareagain.3. Thesoftwarewehaveisturningouttobeabigger,slower,more

onerousburdenthanthehardwareitrunson.• Itisapoormatchforchangingperformanceandfailurecharacteristicsofhardware.

• Itisapoormatchfortheoperationalneedsofusers.

Page 8: Keeping up with the architects. - IEEE Computer Society€¦ · Andrew Warfield, UBC and Coho Data. About this keynote. (And the things I'm not going to talk about.) Not going to

Consequencesoftheseideas

• Thegoalpostsaremovingintermsofwhatwedesignsystemsfor.

• Humancostsassociatedwithrunningoursystemsareabiggerexpenseandinconvenience,atalllevels,thanthepiecewiseperformanceofcomponents.• Theyareactuallyabarrier.

• Theendofscarcitymarksthebeginningofapushforefficientpredictability.• Thisiswhystoragecustomersbyflash.It’salsoahardsystemsproblem.

Page 9: Keeping up with the architects. - IEEE Computer Society€¦ · Andrew Warfield, UBC and Coho Data. About this keynote. (And the things I'm not going to talk about.) Not going to

Sowhatdoweneedtounderstand,assystemsresearchers,tohelp?

Page 10: Keeping up with the architects. - IEEE Computer Society€¦ · Andrew Warfield, UBC and Coho Data. About this keynote. (And the things I'm not going to talk about.) Not going to

Onesignificanthardwarechage:Rackscale

Page 11: Keeping up with the architects. - IEEE Computer Society€¦ · Andrew Warfield, UBC and Coho Data. About this keynote. (And the things I'm not going to talk about.) Not going to

11

Thisisagoogledatacentercirca2001.GFS(2003):largestdeploymentshadover1,000storagenodes,hundredsofclients,

300TBofstoragespace

Page 12: Keeping up with the architects. - IEEE Computer Society€¦ · Andrew Warfield, UBC and Coho Data. About this keynote. (And the things I'm not going to talk about.) Not going to

http://itq.nl/intels-take-open-compute-project-rack-scale-architecture/

https://www.supermicro.com/solutions/SRSD.cfm

Page 13: Keeping up with the architects. - IEEE Computer Society€¦ · Andrew Warfield, UBC and Coho Data. About this keynote. (And the things I'm not going to talk about.) Not going to

Whatis"rackscale"?

• Everythinginarackwillshareahighperformancebus.• Withinarack,opticalinterconnectsareexpectedtoreachterabitbandwidthintheneartermwithsub-microsecondlatencies.

• Theserverasweknowitwillbecompletelydisaggregated.• CPUs,GPUs,storage,networkinterfaces,andvolatilememorywilleachmovetoindependentphysicalenclosures.Arbitrarycompositionandindependentscale.

• Rackresourceswillbeverydense.• Like,really dense.• Asaballpark,withinarackwearelikelytoseethousandsofcores,tensofpetabytesofpersistentmemory,andterabytesofRAM.

• Inshort,asingledatacenterrackwithacapitalvalueinthelowmillionsofdollars,willbeascapableasentirefirst-generation(e.g.2003-era)"warehouse"datacentersfrompubliccloudproviders

Page 14: Keeping up with the architects. - IEEE Computer Society€¦ · Andrew Warfield, UBC and Coho Data. About this keynote. (And the things I'm not going to talk about.) Not going to

Consequencesoftherackscaletrendonsoftwaredesign.

Page 15: Keeping up with the architects. - IEEE Computer Society€¦ · Andrew Warfield, UBC and Coho Data. About this keynote. (And the things I'm not going to talk about.) Not going to

What’schanging?

1. Storageisbecomingdense.• Problematicallydense!

2. Thememoryhierarchyishavinganidentitycrisis.3. Applicationlatencyisacrueltaskmaster.

Page 16: Keeping up with the architects. - IEEE Computer Society€¦ · Andrew Warfield, UBC and Coho Data. About this keynote. (And the things I'm not going to talk about.) Not going to

Trend1:Densenonvolatilestoragecapacity.

Page 17: Keeping up with the architects. - IEEE Computer Society€¦ · Andrew Warfield, UBC and Coho Data. About this keynote. (And the things I'm not going to talk about.) Not going to

DenseNonvolatileCapacity

• Flashvendorshavefinallystartedtorelaxaboutthedurabilityproblem.• Thejawdroppingbit:wewillsee4PBin1uinasmallnumberofyears.• Atapricethatapproachesspinningdisk.

• Thebadnews:intheimmediateterm,interconnectionwillbeaproblem.• Andinthelongertermitmaynotgetawholelotbetter.

Page 18: Keeping up with the architects. - IEEE Computer Society€¦ · Andrew Warfield, UBC and Coho Data. About this keynote. (And the things I'm not going to talk about.) Not going to

TrendsSSD Cap/1u Xputperdata

2TB 64TB 312MB/s/TB

8TB 256TB 78MB/s/TB

32TB 1PB 20MB/s/TB

128TB 4PB 5MB/s/TB

18

Page 19: Keeping up with the architects. - IEEE Computer Society€¦ · Andrew Warfield, UBC and Coho Data. About this keynote. (And the things I'm not going to talk about.) Not going to

TrendsSSD Cap/1u Xputperdata

2TB 64TB 312MB/s/TB

8TB 256TB 78MB/s/TB

32TB 1PB 20MB/s/TB

128TB 4PB 5MB/s/TB

NVMedevice:x4PCIeBroadwellCPU:40PCIelanes

19

Page 20: Keeping up with the architects. - IEEE Computer Society€¦ · Andrew Warfield, UBC and Coho Data. About this keynote. (And the things I'm not going to talk about.) Not going to

TrendsSSD Cap/1u Xputperdata

2TB 64TB 312MB/s/TB

8TB 256TB 78MB/s/TB

32TB 1PB 20MB/s/TB

128TB 4PB 5MB/s/TB

NVMedevice:x4PCIeBroadwellCPU:40PCIelanes

TORcross-racklinkstypicallyoversubscribedat3or4:1

20

Page 21: Keeping up with the architects. - IEEE Computer Society€¦ · Andrew Warfield, UBC and Coho Data. About this keynote. (And the things I'm not going to talk about.) Not going to

Thisisverydifferentfromallthestoragesystemsthatwe'vebuiltinthepast.• Noseekpenalty.

• MeansthatbackgroundI/Oisactuallyreasonabletodo.• Migrationforperformance.• Alternaterepresentations(e.g.materializedviews,intentionalDUPlication)oftenforperformance.

• Metadataalldaylong.• Sprinklerheadsareaproblem.

• 4PBisanawfullyscaryfailuredomain.• Sensibleapplicationoferasurecodingneedsfiveormorenodes.• East/westtrafficisconstrained.

• Capacity-motivateddeletionissillyinmostcases.• Butrealdeletionprobablyneedstobeencryptionbased.

Page 22: Keeping up with the architects. - IEEE Computer Society€¦ · Andrew Warfield, UBC and Coho Data. About this keynote. (And the things I'm not going to talk about.) Not going to

Mirador (FAST’17)

Centralizedthree-stagepipelinecontinuouslyoptimizesplacement22

Page 23: Keeping up with the architects. - IEEE Computer Society€¦ · Andrew Warfield, UBC and Coho Data. About this keynote. (And the things I'm not going to talk about.) Not going to

Trend2:Themagicofpersistentmemory.

Page 24: Keeping up with the architects. - IEEE Computer Society€¦ · Andrew Warfield, UBC and Coho Data. About this keynote. (And the things I'm not going to talk about.) Not going to

PersistentMemory

• Everyoneisexcitedabout3DXpoint.• (Whattheheckis3dxpoint?)

• Badnews:persistentRAMisatotalPITA.• Becauseit'snotreallypersistentRAM:ramasyouthinkaboutitisatotalilusion.• It'sreallyasuperduperfastdisk.• Infact,it'sasuperduperfast*single**unreliable*disk.Butmoreonthisinasec.

• Butwait,thisdoesn'tmeanthatXPoint isn'taspectacularlygoodidea.• Withit,RAMisabouttobreakthroughthememorywall(coretocapacityratio).• TechnologieslikeXPoint willgiveusamultiplieronworkingset.• Persistencewillmassivelyspeeduprestarttimes,especiallyforread-onlydata.

Page 25: Keeping up with the architects. - IEEE Computer Society€¦ · Andrew Warfield, UBC and Coho Data. About this keynote. (And the things I'm not going to talk about.) Not going to

Onemorespanner:Disaggregation.

• Somesignificantamountofmemoryisabouttomoveoffhost.• Nobodyseemstoagreeonhowthisisgoingtohappen.

• "remote"memoryvssharedphysicalbusvsRack-scaleCCNUMA• Allofthesethingsareinterestingintwoways.

• First,failuredomainsareverydifferent...inwaysthatAppsandOSesareNOTusedtoreasoningabout.

• Second,theyaffordanentirelynew(andexciting!)formofdynamism.• MapReduceandSparkhaveagoodbutverycoarse-grainednotionofpartitioning.• Thesesystemshavethepotentialtobesomuchmoredynamic.• Sameforscaleoutdatastores.• SameforstatereplicationandHA

Page 26: Keeping up with the architects. - IEEE Computer Society€¦ · Andrew Warfield, UBC and Coho Data. About this keynote. (And the things I'm not going to talk about.) Not going to

Sowhat'sgoingtohappenhere

• Totalchaos.• Persistentmemorylookslikeareallyfastdisk.Disaggregatedmemorylookslikeanextensionofthecachehierarchy.• Ourviewofmemory,locality,andpersistenceisintrouble.• Interfacesandabstractionsreallyneedtochangeinsupportofthis.• Oneprediction:filesystemandvirtualmemorywillmerge.

• Loadsofreasonstodothis-- serializationoverheads,reboots,sharing.• butstillmanyopenquestions.

Page 27: Keeping up with the architects. - IEEE Computer Society€¦ · Andrew Warfield, UBC and Coho Data. About this keynote. (And the things I'm not going to talk about.) Not going to

Trend2:Applicationlatency.

Page 28: Keeping up with the architects. - IEEE Computer Society€¦ · Andrew Warfield, UBC and Coho Data. About this keynote. (And the things I'm not going to talk about.) Not going to

Latency

• Tellmeifyou’veheardthisonebefore:CPUsaren'tgettingfaster• I/Oisgettingfasterandwider.• Latencyisbecomingadominantmetric.

• Directimpactone.g.purchaseprobability.• Butit'samuchhardermetrictoworkwiththanthroughput.

• ShrinkingI/Olatenciesresultsinincreasedcomputationaldensity.• BecauseI/Owaitgoesaway(e.g.DBMS)

• Butalatencyfocusimposesalotofconstraintsonsoftwaredesign.• Especiallytail-latencySLOs.• Needtoreasonabouttheslowpathasacommoncase.

Page 29: Keeping up with the architects. - IEEE Computer Society€¦ · Andrew Warfield, UBC and Coho Data. About this keynote. (And the things I'm not going to talk about.) Not going to

0

200

400

600

800

1000

1200

1400

1600

1800

2000

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16

ContentionFree SingleLock

NumberofCores

Throughp

ut(K

IOPS)

THE COST OF CONTENTION

Page 30: Keeping up with the architects. - IEEE Computer Society€¦ · Andrew Warfield, UBC and Coho Data. About this keynote. (And the things I'm not going to talk about.) Not going to

Core

DPDK

TCP

SPDK

BlockI/O

DecibelLogic

Userspace

Kernel

CoreCore

HardwareQueues

Decibel(NSDI‘17)

• Howshouldwestructureastoragesystemtoprovidevirtuallocaldisks?

• Partitionlikecrazy,crusadeagainstlatency,pushallunnecessaryfunctionalityupthestack.

• Thisgeneralizestoapplications.

Page 31: Keeping up with the architects. - IEEE Computer Society€¦ · Andrew Warfield, UBC and Coho Data. About this keynote. (And the things I'm not going to talk about.) Not going to

0

100

200

300

400

500

600

700

800

900

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16

Local Decibel(DPDK) Decibel(Legacy)

0

100

200

300

400

500

600

700

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16

Local Decibel(DPDK) Decibel(Legacy)

DecibelPerformance(70/30MixedWorkload)

422 vs450 vs490μs

Throughput(KIOPS) Latency(μs)

NumberofCores NumberofCores

Page 32: Keeping up with the architects. - IEEE Computer Society€¦ · Andrew Warfield, UBC and Coho Data. About this keynote. (And the things I'm not going to talk about.) Not going to

Everythinghurtslatency

• Redundancyisagoodexampleofwhythisgetshard.• Forinmemory,networkRTTwillapproachmediastoretime.• Soaremotewritedoublesthecost.• Worse:Replicationatlowerlayersofthesystemisinvariablyamplified.• Thisiswhyemergingdatastoresdon'tdoit.

• Areallatencyfocusdrivessoftwarearchitectureinaveryspecificdirection.• Contentionisasourceofhard-to-reason-aboutperformancevariance.• Soavoidcontentionatallcosts.Designitoutupfront.• (Ifyoudothisright,youbenefitfromnothavingtohiredevelopersthatunderstandlocking.)

• Doingthisrightmeansdesigningdataandcode-levelpartitioningverycarefully.• LessacademicallyrewardingthanOCCandlockfreedom,butseeparentheticpointabove.

Page 33: Keeping up with the architects. - IEEE Computer Society€¦ · Andrew Warfield, UBC and Coho Data. About this keynote. (And the things I'm not going to talk about.) Not going to

Andwiththat,I’mmostlydone.

Page 34: Keeping up with the architects. - IEEE Computer Society€¦ · Andrew Warfield, UBC and Coho Data. About this keynote. (And the things I'm not going to talk about.) Not going to

Herearethehigh-leveltrends/ideas

1. Diminishingscarcity.2. Practical/sensibletoownyourownhardwareagain.3. Softwareneedstochange.

Page 35: Keeping up with the architects. - IEEE Computer Society€¦ · Andrew Warfield, UBC and Coho Data. About this keynote. (And the things I'm not going to talk about.) Not going to

Closingthought.

• Nobodyisgoingtoadoptyourstuffunlessyoumakeitaseasyasheckforthemtodoit.• Exposeyourresearchresultsasaservice,orassomethingasclosetoaserviceasispossible.• Putthemincontainers,hostthemonAWS.

• Solveapplicationproblems.• Earlyexperiencesworkingwithphysicalscientists.


Recommended