+ All Categories
Home > Technology > Opportunities for X-Ray science in future computing architectures

Opportunities for X-Ray science in future computing architectures

Date post: 10-May-2015
Category:
Upload: ian-foster
View: 572 times
Download: 5 times
Share this document with a friend
Description:
The world of computing continues to evolve rapidly. In just the past 10 years, we have seen the emergence of petascale supercomputing, cloud computing that provides on-demand computing and storage with considerable economies of scale, software-as-a-service methods that permit outsourcing of complex processes, and grid computing that enables federation of resources across institutional boundaries. These trends shown no signs of slowing down: the next 10 years will surely see exascale, new cloud offerings, and terabit networks. In this talk I review various of these developments and discuss their potential implications for a X-ray science and X-ray facilities.
Popular Tags:
52
Opportunities for X-ray science in future computing architecture Ian Foster Computation Institute University of Chicago & Argonne National Laboratory
Transcript
Page 1: Opportunities for X-Ray science in future computing architectures

Opportunities for X-ray science in

future computing architecture

Ian FosterComputation Institute

University of Chicago & Argonne National Laboratory

Page 2: Opportunities for X-Ray science in future computing architectures

1940 1950 1960 1970 1980 1990 2000 2010

Year Introduced

1E+2

1E+5

1E+8

1E+11

1E+14

1E+17

Doubling time = 1.5 yr.

ENIAC (vacuum tubes)UNIVAC

IBM 701IBM 704

IBM 7090 (transistors)

IBM Stretch

CDC 6600 (ICs)

CDC 7600

CDC STAR-100 (vectors) CRAY-1

Cyber 205 X-MP2 (parallel vectors)

CRAY-2X-MP4

Y-MP8

i860 (MPPs)

ASCI White, ASCI Q

Petaflop

Blue Gene/L

Blue Pacific

DeltaCM-5 Paragon

NWT

ASCI Red OptionASCI Red

CP-PACS

Earth

VP2600/10SX-3/44

Red Storm

ILLIAC IV

SX-2

SX-4

SX-5

S-810/20

T3D

T3E

multi-Petaflop

Thunder

Fastest supercomputer(floating point ops/sec) Argonne

My laptop

Page 3: Opportunities for X-Ray science in future computing architectures

Acquire funding

Build apparatus

Collect data

30 years? yearsBrahe

Page 4: Opportunities for X-Ray science in future computing architectures

Acquire funding

Build apparatus

Collect data

30 years? years

Publish Analyzedata

Acquire data

10 years6 years2 years

Brahe

Kepler

Page 5: Opportunities for X-Ray science in future computing architectures

Acquire funding

Build apparatus

Collect data

30 years? years

Publish Analyzedata

Acquire data

10 years6 years2 years

Brahe

Kepler

Steal data

Poisonadvisor

Page 6: Opportunities for X-Ray science in future computing architectures

7

Computers at Harvard, 1890

Page 7: Opportunities for X-Ray science in future computing architectures

8

Sloan Digital Sky Survey

Page 8: Opportunities for X-Ray science in future computing architectures
Page 9: Opportunities for X-Ray science in future computing architectures

Aggregate SkyServer monthly traffic from 2001 to 2006. (Singh et al., 2006)

Sloan Digital Sky Survey publication statistics, Chen et al., 2009.

Page 10: Opportunities for X-Ray science in future computing architectures

Three discontinuities:

1) Massive parallelism

2) Large data

3) Economics of aggregation

Page 11: Opportunities for X-Ray science in future computing architectures

Intel x86 processor trends

Page 12: Opportunities for X-Ray science in future computing architectures

Gordon Bell prize winners

Year Flop/s Processors Application1988 109 101 Static finite element analysis1998 1012 103 Metal magnetic atoms2008 1015 105 Superconductive materials2018 1018? 107? ??

Page 13: Opportunities for X-Ray science in future computing architectures

Time

Com

plex

ity

Dim

ensio

ns

1

2

3

Times

cale

Shor

t

Long

Mul

tisca

le

Reso

lutio

n

Coar

se

Fine

A

dapti

ve

Coup

led

(& n

on-li

near

) equ

ation

s

Few

M

any

1

M

any

Para

met

ers o

r ens

embl

e m

embe

rs

No

Yes

Erro

r ana

lysis

No

Y

es

Optim

izatio

n

Algo

rithm

s

Simpl

e

C

ompl

ex

Dan Katz

Page 14: Opportunities for X-Ray science in future computing architectures

1515

DesignMaterials with desired

properties based on computation and data

CreateSynthesis and processing

methods informed by computation; generate data

UnderstandRelationship between

materials properties and structure

Rational design of catalytic materials(Curtis, Greely, Zapol, Kumaran)

Page 15: Opportunities for X-Ray science in future computing architectures

Identifying optimal candidates

Page 16: Opportunities for X-Ray science in future computing architectures

17

High-throughput screening on BG/P

[SC08] “Towards Loosely-Coupled Programming on Petascale Systems”

Page 17: Opportunities for X-Ray science in future computing architectures

Three discontinuities:

1) Massive parallelism

2) Large data

3) Economics of aggregation

Page 18: Opportunities for X-Ray science in future computing architectures

PC disk drive capacity

Page 19: Opportunities for X-Ray science in future computing architectures

Data generation and analysis costs outpace Moore’s Law

$900,000

Wilkening et al, IEEE Cluster09

Page 20: Opportunities for X-Ray science in future computing architectures

Data complexity also increasing

ID MURA_BACSU STANDARD; PRT; 429 AA.DE PROBABLE UDP-N-ACETYLGLUCOSAMINE 1-CARBOXYVINYLTRANSFERASEDE (EC 2.5.1.7) (ENOYLPYRUVATE TRANSFERASE) (UDP-N-ACETYLGLUCOSAMINEDE ENOLPYRUVYL TRANSFERASE) (EPT).GN MURA OR MURZ.OS BACILLUS SUBTILIS.OC BACTERIA; FIRMICUTES; BACILLUS/CLOSTRIDIUM GROUP; BACILLACEAE;OC BACILLUS.KW PEPTIDOGLYCAN SYNTHESIS; CELL WALL; TRANSFERASE.FT ACT_SITE 116 116 BINDS PEP (BY SIMILARITY).FT CONFLICT 374 374 S -> A (IN REF. 3).SQ SEQUENCE 429 AA; 46016 MW; 02018C5C CRC32; MEKLNIAGGD SLNGTVHISG AKNSAVALIP ATILANSEVT IEGLPEISDI ETLRDLLKEI GGNVHFENGE MVVDPTSMIS MPLPNGKVKK LRASYYLMGA MLGRFKQAVI GLPGGCHLGP RPIDQHIKGF EALGAEVTNE QGAIYLRAER LRGARIYLDV VSVGATINIM LAAVLAEGKT IIENAAKEPE IIDVATLLTS MGAKIKGAGT NVIRIDGVKE LHGCKHTIIP DRIEAGTFMI

[source: GlaxoSmithKline]

Page 21: Opportunities for X-Ray science in future computing architectures

Volume

Complexity

Analysisdemands

Page 22: Opportunities for X-Ray science in future computing architectures

23

Page 23: Opportunities for X-Ray science in future computing architectures
Page 24: Opportunities for X-Ray science in future computing architectures

Bob Grossman

Page 25: Opportunities for X-Ray science in future computing architectures

“light sources alone are not enough … Enormous data sets of diffracted signals in reciprocal space and across wide energy ranges must

be collected and analyzed in real time so that they can guide the ongoing

experiments.”

Page 26: Opportunities for X-Ray science in future computing architectures

27

Diamond Light Source

National Crystallography Service (NCS)

Local Earth Sciences Lab University of Cambridge

Function International service -multiple communities

UK service - multiple institutions. Also uses Diamond

Lone researcher at institution - uses NCS and ISIS large-scale facility

Administration Peer-reviewed proposal required

Paper-based records –experiments, safety ERA, instrument time

Multiple proposals, multiple forms

Metadata Core Scientific MetaData Model

eBank/eCrystals schema

?

Identifiers Beam-line number DOI InChI ?

Workflow Formulaic and bespoke

Formulaic, unrecorded Complex, unrecorded

Software In-house scripts In-house scripts + open-source suite

In-house scripts + open-source suite

Raw data In-house GDA store ATLAS data-store Laptop / local server

Derived data Taken offsite on laptop / USB stick

eCrystals repository Laptop / local server / USB stick

Source: Liz Lyon

Page 27: Opportunities for X-Ray science in future computing architectures

Pattern recognition in x-ray spectromicroscopy• Kevin Boyce, U. Chicago: study of the evolution of tree types,

including now-extinct species that dominated in the “coal age” (carboniferous). Acetate peel of fossilized wood.

• Shows how well we can separately map cellulose-derived material from lignin-derived material in plant cell walls, with implications for cellulosic ethanol production from biomass.

Lignin-derived and cellulose-derived regions in 400 million year old chert: Boyce et al., Proc. Nat. Acad. Sci. 101, 17555 (2004), with subsequent pattern recognition analysis by Lerotic, Jacobsen, Schäfer, and Vogt, Ultramicroscopy 100, 35 (2004).

Page 28: Opportunities for X-Ray science in future computing architectures

LDRD: “Next Generation Data Exploration - Intelligence in Data Analysis, Visualization, & Mining”

• “Here’s a cell in this tissue. How much zinc does it have? In the rest of the tissue, how many cells are there like this, and what is their distribution of zinc content?”– Fluorescence and absorption spectral imaging– Databases to combine results of multiple experiments and

instruments– Multivariate statistical analysis and pattern recognition

• People:– APS: Stefan Vogt (PI), Lydia Finney, Chris Jacobsen, Chris Roerhig,

Claude Saunders, Jesse Ward; Mathematics and Computer Science, ANL: Sven Leyffer, Stefan Wild, Mark Hereld; Northwestern: Rachel Mak

Page 29: Opportunities for X-Ray science in future computing architectures

fc *

Wavelength Division Multiplexing

“Lambdas”

Page 30: Opportunities for X-Ray science in future computing architectures

Rapid evolution of 10GbE port pricesmakes campus-Scale 10 Gbps affordable

2005 2007 2009 2010

$80K/port Chiaro(60 Max)

$ 5KForce 10(40 max)

$ 500Arista48 ports

~$1000(300+ Max)

$ 400Arista48 ports

Source: Philip Papadopoulos, SDSC, UCSD

Page 31: Opportunities for X-Ray science in future computing architectures

32

Page 32: Opportunities for X-Ray science in future computing architectures

33

Page 33: Opportunities for X-Ray science in future computing architectures

Three discontinuities:

1) Massive parallelism

2) Large data

3) Economics of aggregation

Page 34: Opportunities for X-Ray science in future computing architectures

Software-as-a-Service (SaaS)

Platform-as-a-Service (PaaS)

Infrastructure-as-a-Service (IaaS)

Page 35: Opportunities for X-Ray science in future computing architectures

Economies of scale in operations

ResourceCost for

medium scaleCost for

large scale Ratio

Network $95 / Mbps / month $13 / Mbps / month ~7x

Storage $2.20 / GB / month $0.40 / GB / month ~6x

Administration

≈140 servers/admin >1000 servers/admin ~7x

Page 36: Opportunities for X-Ray science in future computing architectures

Time-consuming tasks in business

Web presence Email (hosted Exchange) Calendar Telephony (hosted VOIP) Human resources and payroll Accounting Customer relationship mgmt Data analytics Content distribution …

SaaS

Page 37: Opportunities for X-Ray science in future computing architectures

Time-consuming tasks in business

Web presence Email (hosted Exchange) Calendar Telephony (hosted VOIP) Human resources and payroll Accounting Customer relationship mgmt Data analytics Content distribution …

SaaS

IaaS

Page 38: Opportunities for X-Ray science in future computing architectures

Time-consuming tasks in science

Run experimentsCollect dataManage dataMove dataAcquire computersAnalyze dataRun simulationsCompare experiment with simulationSearch the literature

• Communicate with colleagues

• Publish papers• Find, configure, install

relevant software• Find, access, analyze

relevant data• Order supplies• Write proposals• Write reports• …

Page 39: Opportunities for X-Ray science in future computing architectures

40From http://geekandpoke.typepad.com

Page 40: Opportunities for X-Ray science in future computing architectures

Globus ToolkitBuild the Grid

Components for building custom grid solutions

globustoolkit.org

Globus OnlineUse the Grid

Cloud-hostedfile transfer service

globusonline.org

Page 41: Opportunities for X-Ray science in future computing architectures

Time-consuming tasks in science

Run experimentsCollect dataManage dataMove dataAcquire computersAnalyze dataRun simulationsCompare experiment with simulationSearch the literature

• Communicate with colleagues

• Publish papers• Find, configure, install

relevant software• Find, access, analyze

relevant data• Order supplies• Write proposals• Write reports• …

Page 42: Opportunities for X-Ray science in future computing architectures

Time-consuming tasks in science

Run experimentsCollect dataManage dataMove dataAcquire computersAnalyze dataRun simulationsCompare experiment with simulationSearch the literature

• Communicate with colleagues

• Publish papers• Find, configure, install

relevant software• Find, access, analyze

relevant data• Order supplies• Write proposals• Write reports• …

Page 43: Opportunities for X-Ray science in future computing architectures

Datastore

A peek inside Globus Online

GridFTP

GridFTP

Profiles+ state

ConsumerConsumer

ConsumerConsumerRequest

collector

Notificationtarget

WorkerWorker

WorkerWorker

Worker

Page 44: Opportunities for X-Ray science in future computing architectures
Page 45: Opportunities for X-Ray science in future computing architectures
Page 46: Opportunities for X-Ray science in future computing architectures

Task ID : bc6d776c-2af4-11e0-9a1d-12313916526cTask Type : TRANSFERParent Task ID : n/aStatus : SUCCEEDEDRequest Time : 2011-01-28 15:39:04ZDeadline : 2011-01-29 15:39:04ZCompletion Time : 2011-01-28 16:17:12ZTotal Tasks : 500Tasks Successful : 500Tasks Expired : 0Tasks Canceled : 0Tasks Failed : 0Tasks Pending : 0Tasks Retrying : 0Command : transfer (+500 input lines)Files : 500Directories : 0Bytes Transferred: 1073741824000MBits/sec : 3754.342

ALCF-NERSCtask

summary

Page 47: Opportunities for X-Ray science in future computing architectures

48

11 x 125 files200 MB each

11 users12 sites

Page 48: Opportunities for X-Ray science in future computing architectures

Keith Cheng’s phenome project

GordonKindlmann

3000 zebra fish mutants

Page 49: Opportunities for X-Ray science in future computing architectures

Argonne / U Chicago Grid Supercomputing Facility

APS Beamline

Data Acquisition

Argonne National LabAdvanced Photon Source

GridFTP Server

HPC Cluster

GridFTP Server

GridFTP Server

Globus Online - hosted service for high-speed, reliable, secure data movement

Penn State UniversityPhenome Project Coordination

1 Gbps Network link

10 Gbps Network link

Regular Internet link

Beamline data flow

SAN

Graphics Workstations

Users

NASPattern Recognition

Segmentation & Visualization

Software Develop.

DAS

TOMOGRAPHIC RECONSTRUCTION, DERINGING,

SEGMENTATION, MORPHOMETRIC

S & VISUALIZATION

Page 50: Opportunities for X-Ray science in future computing architectures

Argonne / U Chicago Grid Supercomputing Facility

APS Beamline

Data Acquisition

Argonne National LabAdvanced Photon Source

GridFTP Server

HPC Cluster

GridFTP Server

GridFTP Server

Globus Online - hosted service for high-speed, reliable, secure data movement

Penn State UniversityPhenome Project Coordination

1 Gbps Network link

10 Gbps Network link

Regular Internet link

Beamline data flow

SAN

Graphics Workstations

Users

NASPattern Recognition

Segmentation & Visualization

Software Develop.

DAS

TOMOGRAPHIC RECONSTRUCTION, DERINGING,

SEGMENTATION, MORPHOMETRIC

S & VISUALIZATION

Page 51: Opportunities for X-Ray science in future computing architectures

Four theses

• Ultrascale computing enables new problem-solving methods

• Research data management is an essential service like electricity and networking

• Economies of scale motivate highly aggregated computing and storage

• Automation of science processes accelerates discovery and yields competitive advantage

Page 52: Opportunities for X-Ray science in future computing architectures

53

Thank you!

[email protected]


Recommended