+ All Categories
Home > Documents > The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st,...

The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st,...

Date post: 21-May-2020
Category:
Upload: others
View: 5 times
Download: 0 times
Share this document with a friend
43
The Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science and Engineering (NAISE) Senior Fellow, University of Chicago Computa6on Ins6tute
Transcript
Page 1: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science

TheConvergenceofHPCandBigData

PeteBeckmanSeniorScien6st,ArgonneNa6onalLaboratoryCo-Director,Northwestern/ArgonneIns6tuteforScienceandEngineering(NAISE)SeniorFellow,UniversityofChicagoComputa6onIns6tute

Page 2: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science

2

ArgonneNa0onalLaboratory

§  $675M/yrbudget§  3,200employees§  1,450scien6sts/eng§  750Ph.D.s

Page 3: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science

3

Argonne’sNextBigSupercomputer:Aurora

Page 4: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science

4

Following the International Exascale Software Initiative (IESP 2008-2012 èBig Data and Extreme Computing workshops (BDEC)

http://www.exascale.org/bdec/ Overarching goal: 1.  Create an international collaborative process focused on the co-design of software infrastructure for

extreme scale science, addressing the challenges of both extreme scale computing and big data, and supporting a broad spectrum of major research domains,

2.  Describe funding structures and strategies of public bodies with Exascale R&D goals worldwide 3.  Establishing and maintaining a global network of expertise and funding bodies in the area of

Exascale computing 1 – BDEC Workshop, Charleston, SC, USA, April 29-May1, 2013 2 – BDEC Workshop, Fukuoka, Japan, February 26-28, 2014 3 – BDEC Workshop, Barcelona, Spain, January 28-30, 2015 4 – BDEC Workshop, Frankfurt, Germany, June 15-17, 2016

Europe-USA-AsiaInterna6onalseriesofWorkshopsonExtremeScaleScien6ficCompu6ng

Page 5: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science

5

Page 6: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science

6

Courtesy:MarkAsch

Page 7: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science

Courtesy:MarkAsch

Page 8: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science
Page 9: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science

Hardware&OS

Applica0ons

Shared

Page 10: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science
Page 11: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science

11

Page 12: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science

12

WhyConverge?Independentpaths:MoreCost,LessScience,

•  $mul6plehardwareso]wareinfrastructures•  $developingso]warefortwocommuni6es•  $learningtwocompu6ngmodels•  $smallerdiscoverycommunity,fewerideas•  Lessscience

Page 13: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science

13Hardware&OS

Applica0ons

Shared

Page 14: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science

14

ANL:PeteBeckman(PI),MarcSnir(ChiefScien,st),PavanBalaji,RinkuGupta,KamilIskra,FranckCappello,RajeevThakur,KazutomoYoshii

LLNL:MayaGokhale,EdgarLeon,BarryRountree,Mar6nSchulz,BrianVanEssenPNNL:SriramKrishnamoorthy,RobertoGioiosaUC:HenryHoffmannUIUC:LaxmikantKale,EricBohm,RamprasadVenkataramanUO:AllenMalony,SameerShende,KevinHuckUTK:JackDongarra,GeorgeBosilca,ThomasHerault

See http://www.argo-osr.org/ for more information

AnExascaleOpera0ngSystemandRun0meSoJwareResearch&DevelopmentProject

Developingvendorneutral,open-sourceOS/Rso]ware

Page 15: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science

15

WhatOS/RGapsMustWeAddress?•  Extremein-nodeparallelism

–  Poormechanismsforpreciseresourcemanagement(cores,power,memory,network)–  Legacythreads/tasksimplementa6onsperformpoorlyatscale

•  DynamicvariabilityofplaPorm;Powerisconstrained–  Poorrun6memechanismsformanagingdynamicoverclocking,provisioningpower,

adjus6ngworkloads–  Nomechanismsformanagingpowerdynamically,globally,andincoopera6onwith

user-levelrun6melayers

•  Hierarchicalmemory–  Poorinterfaces/strategiesformanagingdeepeningmemory

•  NewmodesforHPC–  Noportableinterfacesforeasilybuildingworkflows,in-situanalysis,coupledphysics,

advancedI/O,applica6onresilience

10/28/16 ArgoOSRPeteBeckman 15

Page 16: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science

16HierarchyofEnclaves

connectedviaaBackplane

Elas6cintranodecontainerswithresourceknobs

.

.

. Lightweightthread/tasksdesignedforcontainers,messaging,andmemoryhierarchy

Adap6ve,learning,integratedcontrolsystem

ArgoExplora6onstoAddressExascaleGaps

Page 17: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science

17Hardware&OS

Applica0ons

Shared

Page 18: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science

18

Understanding Cities

80% GDP

70% Energy

70% GHG

Page 19: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science

19

Whymeasureci6es?

Page 20: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science

20

PM 2.5, 10, 100

Acollabora6veproject:ArgonneNa6onalLaboratory,theUniversityofChicago,andtheCityofChicago

Supportedbycollabora6ngins6tu6onsandtheU.S.Na6onalScienceFounda6on.IndustryIn-Kindpartners:AT&T,Cisco,Intel,Microso],MotorolaSolu6ons,SchneiderElectric,Zebra

Page 21: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science

21

Page 22: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science

22

Waggle:AnOpenPlamormforIntelligentSensorsExploi6ngDisrup6veTechnology,EdgeCompu,ng,ResilientDesign

MachineLearningComputerVision

NovelSensorsNano/MEMS

LowPowerCPUsGPU/Smartphones

Page 23: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science

23

RelaysCurrentSensors

Cont

rol

Proc

esso

r

Real6meclock&Internalsensors

Mul6plebootmedia(μSD/eMMC)

4-coreARM

NodeControl&Communica6ons

4+4-coreARM8-coreGPU

In-Situ/EdgeProcessing

Powerful,Resilient&Hackable

Hear

tbea

tMon

itors

Re

setp

ins

“DeepSpaceProbe”DesignLinuxDevelopmentEnvironment

Page 24: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science

24

Page 25: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science

25

Page 26: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science

26

Waggle/AoTRobustTes0ng

Page 27: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science

27

EdgeCompu0ng:AnalysisandFeatureRecogni0onPreservingPrivacy……

•  ParallelCompu6ng•  OpenPlamorm•  DeepLearning

Page 28: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science

28

Page 29: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science
Page 30: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science

30

WaggleMachineLearning&EdgeCompu6ng•  WeareexploringCaffe&OpenCV

–  Convolu6onalNeuralNetworks

•  TrainingwillbedoneonsystemsatArgonne

•  Classifica6ononWaggle

Page 31: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science
Page 32: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science

32

TheData

Page 33: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science

33

PM2.5Alert

PowerOutage

Damen&Ashland

Page 34: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science

34Pete Beckman, Charlie Catlett, Rajesh Sankaran (ANL)

Page 35: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science

35

Page 36: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science

36

Page 37: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science

37

Waggle:APlaPormforResearch

•  OpenSource/OpenPlaPorm–  Reusable,extensibleso]warecommuni6es

•  MachineLearning:ComputerVision–  Datamustbereducedin-situ

•  NovelSensors:Nano/MEMS/μfluidics–  Explosionofnano/MEMS&imagingtech

•  Low-PowerCPUs:GPU/Smartphones–  Powerful,low-power,smartphoneCPUs

Opportunity:BigData+Predic6veModelsSmartSensors+Supercomputers/CloudCompu6ng=predic6onsandanalysis

Page 38: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science

38

WhyHPCGeeksShouldCare•  Newsensorsareprogrammableparallelcomputers

–  Mul6core+GPUs&OpenCLorOpenMP–  Newalgorithmsforin-situdataanalysis,featuredetec6on,compression,deeplearning–  Neednewprogmodfor“stackable”in-situanalysis(forsensorsandHPC)–  NeedadvancedOS/Rresilience,cybersecurity,networking,over-the-airprogramming

•  1000sofnodesmakeadistributedcompu0ng“instrument”–  Newstreamingprogrammingmodelneeded–  Newtechniquesformachinelearningforscien6ficdatarequired

•  Bothforwithina“node”andcollec6velyacross6meseries

•  HowwillHPCstreaminganaly0csandsimula0onbeconnectedtolivedata?–  CanwetriggerHPCsimula6onsa]erfirstapproxima6ons?(weather,energy,transporta6on)–  UnstructureddatabasewithprovenanceandmetadataforQA/collabora6on

•  UsenovelHPChardwaretosolvepowerissue?–  CanweuseneuromorphicorFPGAstoreducepowerforin-situanalysis&compression?

•  Wearetradingprecision&costforgreaterspa6alresolu6on:Whatispossible?

Page 39: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science

39

CloudDatabase

NearRealTimeHPCSimula0ons

DataAggrega0onMul0pleSources

DataAnalysisandHPCsimula0ons

ParallelComputa0onattheEdge

NewEdgeAlgorithms

Page 40: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science

40

RISCversionofConvergenceStory:Startbyenabling…removeroadblocks(theneveryone’swishlistfollows)

•  Applica6ons(ScienceDrivers):So]wareneeds&workflowpauerns•  Opera0ons

–  Supportreal-6meandstreamingfromfastnetworks–  Supportnodesharing,long-livedservices,storagerequestsforyears…

•  Architecture–  Mothballcurrentparallelfilesystems,replacewithpersistentstorageservices

(databases,KV,etc.)–  Acceleratemoveofstorageintocomputeinfrastructure

•  SoJware–  Linuxso]waredevelopmentenvironment.–  Na6vesupportforlow-levelinfrastructure:Docker,VMs,Mesos,etc.–  NewfocusonQoS;So]wareDefinedStorage,on-demandservices,etc.

Page 41: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science

41

Courtesy:MarkAsch

BOFSC2016

Page 42: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science

42

ChicagoPittsburgh

New York Portland

Atlanta

Boston

Delhi Chattanooga

2016-17Phase2Pilots

Developingapilotprojectstrategyaimedatempoweringpartneruniversi,esandna,onallaboratoriestoworkwith

theirlocalci,es.

Chicago

2016Phase1Pilots

Seattle Bristol

Newcastle

Developingapilotprojectstrategyaimedatempoweringpartneruniversi,esandna,onallaboratoriestoworkwith

theirlocalci,es.

Denver

Ini6aldiscussions

Page 43: The Convergence of HPC and Big DataThe Convergence of HPC and Big Data Pete Beckman Senior Scien6st, Argonne Na6onal Laboratory Co-Director, Northwestern / Argonne Ins6tute for Science

43

Ques6ons?


Recommended