The UK e-Science Programme
A View from the National e-Science Centre
Malcolm AtkinsonDirector of NeSC
Universities of Edinburgh and Glasgow
CANARIE 7Toronto
29th November 2001
Contents
What is e-Science?What do we expect from the Grid?
The UK e-Science Programme
NeSC’s Role & Structure
The Road ahead
What is e-Science?
An acceleration of a trend?
A sea change in scientific method?
A new opportunity for science?
Accelerating Trend
More and More dataInstrument resolution doubling /12 monthsInstrument and telemetry speeds increasingStorage capacity doubling / 12 months
More and More ComputationComputations available doubling / 18 monthsAnalyses and simulations increasing
Faster networksRaw bandwidth doubling / 9 months
These IntegrateMore interplay between computation and dataMore collaboration among scientistsMore international collaboration
Sea Change in Scientific Method
In Silico discoveryExploration of data and models predicts resultsVerified by directed experiments Combinatorial chemistry Gene function Protein Structure, …
Shared ResourcesResearcher’s Workbench Laboratory team Multi-national network of labs + modellers Public instruments, repositories and simulations
1. Prior test against data and models2. Experimental Procedures3. Sanity check on results against data and
models
But …
Skilled scientists and computer scientistsRoughly static in numberDiminishing in available attention for any taskDistributed systems remain hard
E.g. component failures and latency are always with us
Important data still in documents
More subjects experiencing the Data delugeAnalysis avalancheSimulation bonanzaCollaboration growth
Must therefore find general solutionsAnd make technology easier to use
The New Behaviour
Shared InfrastructureIntrinsically distributedIntrinsically multi-organisationalMultiple uses interwoven
Shared SoftwareA new attempt at making distributed computing economic, dependable and accessibleScientists from all disciplines share in its design and use
Immediate benefitFaster transfer of ideas and techniques between disciplinesAmortisation of development, operation and education
Not Just Scientists
EngineersThey already travel the same path
Finance, economy, politics, …We can expect best use of data and models to guide the decisions that affect our lives
MedicineSee above
IndustrySee above
The UK Office of Science & TechnologyHas this extension firmly in mind
Several Assumptions
The Technology is ReadyNot true — its emerging
Joining in the task of building middleware Of Advancing Standards Of Developing Dependability
The Scientists / Engineers want thisNot universally true
Addressed by Pilot projects and Demonstrators Addressed by The e-Science Institute
One Size Fits AllNot true
Addressed by a minimum set of composable virtual services But starting with Globus
It’s only for “big” scienceNo — “small” science collaborates too!
We know how we will use gridsNo — Disruptive technology
Contents
What is e-Science?What do we expect from the Grid?
The UK e-Science Programme
NeSC’s Role & Structure
The Road ahead
e- Science and the Grid‘e- Science is about global collaboration in key areas of science, and the next generation of infrastructure that will enable it.’
‘e- Science will change the dynamic of the way science is undertaken.’
J ohn TaylorDirector General of Research Councils
Offi ce of Science and Technology
From presentation by Tony Hey
UK e-Science Initiative (1)
£120M 3 Year Programme to create the next generation IT infrastructure to support e-Science and Business
SR2000 – Funded UK e-Science Grid and Grid Support Centre, e-Science Application research projects and industrial collaboration
SR2002 – Bidding for additional funding to extend scope of e-Science programme
Essential that UK plays a leading role in Global Grid development with the USA, EU and Asia
From presentation by Tony Hey
UK e-Science Initiative (2)
£120M Programme over 3 years
£75M is for Grid Applications in all areas of science and engineering
£10M for Supercomputer upgrade
£35M for development of ‘industrial strength’ Grid middleware
Require £20M ‘matching’ funds from industry
Prof. Tony HeyDirector of the Core Programme
From presentation by Tony Hey
Cambridge
Newcastle
Edinburgh
Oxford
Glasgow
Manchester
Cardiff
Southampton
London
BelfastDL
RALHinxton
UK Grid Network
From Tony Hey 27 July 01
Scotland via Glasgow
NNW
Northern Ireland
MidMAN
TVN
South WalesMAN
SWAN&BWEMAN
WorldComGlasgow
WorldComEdinburgh
WorldComManchester
WorldComReading
WorldComLeeds
WorldComBristol
WorldComLondon
WorldComPortsmouth
Scotland via Edinburgh
YHMAN
NorMAN
EMMAN
EastNet
External Links
LMN
KentishMANLeNSE
10Gbps
622Mbps155Mbps
SuperJanet4, June 2002 20Gbps
2.5Gbps
From presentation by Tony Hey
Access Grid Nodes
Technology Developed by Rick Stevens’ group at Argonne National Laboratory
Access Grid will enable informal and formal group to group collaboration
Distributed lectures and seminarsVirtual meetingsComplex distributed grid demos
Uses MBONE and MultiCast Internet Technologies
Access Grid
From presentation by Tony Hey
Grid Middleware R&D£16M funding available for industrial collaborative projects£11M allocated to Centres projects plus £5M for ‘Open Call’ projects- approved £0.5M ‘Centre’ project with Imperial College Sun Centre of ExcellenceSet up two Task Forces- Database Task Force (Chaired by Norman Paton from Manchester Centre)- Architecture Task Force (Chaired by Malcolm Atkinson, Director of NeSC)
From presentation by Tony Hey
Equator: Technological innovation in physical and digital life
AKT: Advanced Knowledge Technologies
DIRC: Dependability of Computer-Based Systems
MIAS: From Medical Images and Signals to Clinical Information
IRC ‘Grand Challenge’ Project IRC ‘Grand Challenge’ Project
From presentation by Tony Hey
e-Healthcare Grand Challengee-Healthcare Grand Challenge
Funding £0.5M projects to give Grid dimension to these IRCsFunding £2M Joint IRC projects with MIAS on e-Healthcare application
Example: Breast cancer surgery – normalization of mammography and
ultrasound scans- FE modelling of breast tissue Deliver useful clinical information to
surgeon ensuring privacy and security
From presentation by Tony Hey
UK e-Science Projects
£75M for e-Science application ‘pilots’- spans all sciences and engineeringParticle Physics and Astronomy (PPARC)- £20M GridPP and £6M AstroGridEngineering and Physical Sciences (EPSRC)- funding 6 projects at around £3M eachBiology, Medical and Environmental Science- projects with total value of £20M will be announced soon
From presentation by Tony Hey
Particle Physics and Astronomy e-Science Projects
GridPPlinks to EU DataGrid, CERN LHC Computing Project, US GriPhyN and PPDataGrid Projects, and iVDGL Global Grid Project
AstroGridlinks to EU AVO and US NVO projects
From presentation by Tony Hey
Comb-e-Chem:Structure-Property Mapping
Southampton, Bristol, Roche, Pfizer, IBM
DAME: Distributed Aircraft Maintenance Environment
York, Oxford, Sheffield, Leeds, Rolls Royce
Reality Grid: A Tool for Investigating Condensed Matter and Materials
QMW, Manchester, Edinburgh, IC, Loughborough, Oxford, Schlumberger, …
EPSRC e-Science Projects (1)
From presentation by Tony Hey
EPSRC e-Science Projects (2)
My Grid: Personalised Extensible Environments for Data Intensive in silico Experiments in Biology
Manchester, EBI, Southampton, Nottingham, Newcastle, Sheffield, GSK, Astra-Zeneca, IBM, Sun
GEODISE: Grid Enabled Optimisation and Design Search for Engineering
Southampton, Oxford, Manchester, BAE, Rolls Royce
Discovery Net: High Throughput Sensing Applications
Imperial College, Infosense, …
From presentation by Tony Hey
Comb-e-Chem:Structure-Property Mapping
Goal is to integrate structure and property data sources within knowledge environment to find new chemical compounds with desirable properties- Accumulate, integrate and model extensive range of primary data from combinatorial methods- Support for provenance and automation including multimedia and metadataSouthampton, Bristol, Cambridge Crystallographic Data Centre, Roche Discovery, Pfizer, IBM
From presentation by Tony Hey
MyGrid e-Science Workbench
Goal is to develop ‘workbench’ to support:Experimental process of data accumulationUse of community informationScientific collaboration
Provide facilities for resource selection, data management and process enactmentBioinformatics applications
Functional genomics, pattern database annotation
Manchester, EBI, Newcastle,Nottingham, Sheffield, SouthamptonGSK, AstraZeneca, Merck, IBM, Sun, ...
From presentation by Tony Hey
e-Science DemonstratorsDynamic Brain AtlasBiodiversityChemical StructuresMouse GenesRobotic AstronomyCollaborative Visualisation Climateprediction.comMedical Imaging/VR
From presentation by Tony Hey
Contents
What is e-Science?What do we expect from the Grid?
The UK e-Science Programme
NeSC’s Role & Structure
The Road ahead
NeSC’s context
NeSC
eSIGSC
Application Pilots IRCs …e-Science Centrese-Scientists, Grid users, Grid services & Grid Developers
UK Core Directorate Global Grid Forum …
CS Research
TAGDBTF ATF
GNT
Coordination
NeSC’s Roles
Stimulation of Grid & e-Science ActivityUsers, developers, researchersEducation, Training, SupportThink Tank & Research
Coordination of Grid & e-Science ActivityRegional Centres, Task Forces, Pilots & IRCsTechnical and Managerial ForaSupport for training, travel, participation
Developing a High-Profile e-Science Institute
MeetingsVisiting ResearchersInternational Collaboration
Regional SupportPortfolio of Industrial Research Projects
e-Science Institute
The Story so FarAugust & September
3 workshops week 1: DF1, GUM1 & DBAG1 HEC2 and the Grid preGGF3 & DF2
October Steve Tuecke Globus tutorial (oversubscribed) 4-day workshop Getting Going with Globus (G3)
– Reports on DataGrid & GridPP experience Biologist Grid Users’ Meeting 1 (BiGUM1)
November GridPP Configuration management
December Architecture & Strategy with Ian Foster et al. AstroGrid
eSI Highlights cont.2002 & 2003
January Steve Tuecke et al. 4 day Globus Developers’ Workshop Pilot project workshop
February — closed for renovationMarch
Digital Libraries, Librarians, Museums and the Grid Protein folding Workshop 14th to 17th IBM sponsor
May Mind and Brain Workshop
21st to 26th July GGF5 & HPDC 11 EICCAugust Research Festival14th to 16th April 2003 Dependability
Contents
What is e-Science?What do we expect from the Grid?
The UK e-Science Programme
NeSC’s Role & Structure
The Road ahead
Where to Concentrate
International & Industrial CollaborationIdeas, experiments, software, standards
Integrating Data across the GridData growth demands new methodsData ownership expects respect & securityData is hard to scan — indexing & queryData is hard to move — query & move codeHuman attention is scarce but essential
Machine-assisted annotation, provenance, archiving Machine-assisted data mining Machine-assisted ontology construction & integration
Human-factors must drive designs
Dynamic, Dependable and Virtual FabricImproved Programming Models
For more Information
Ask me
www.nesc.ac.uk
Thank you for your attentionor for arriving early for the next talk