+ All Categories
Home > Documents > Www.nordugrid.org NorduGrid: a collaboration, the middleware and related infrastructure projects...

Www.nordugrid.org NorduGrid: a collaboration, the middleware and related infrastructure projects...

Date post: 31-Dec-2015
Category:
Upload: lora-cooper
View: 222 times
Download: 2 times
Share this document with a friend
Popular Tags:
27
www.nordugrid.org NorduGrid: NorduGrid: a collaboration, the a collaboration, the middleware and related middleware and related infrastructure projects infrastructure projects Balázs Kónya Technical Coordinator Nordugrid Collaboration/Lund University OSG Consortium Meeting, August 22, 2006
Transcript

www.nordugrid.org

NorduGrid: NorduGrid: a collaboration, the a collaboration, the

middleware and related middleware and related infrastructure projectsinfrastructure projects

Balázs KónyaTechnical Coordinator

Nordugrid Collaboration/Lund UniversityOSG Consortium Meeting, August 22, 2006

www.nordugrid.org 2

outlineoutline

NorduGrid Collaboration The Advanced Resource Connector (ARC)

middleware– Design & main components– Status & future development– Interoperability & Standards

Infrastructure projects using ARC

www.nordugrid.org 3

NorduGrid collaboration: some NorduGrid collaboration: some historyhistory

2001-2002: a research project of the NORDUNet2 program aimed to enable Grid in the Nordic countries

Since end-2002 is a research collaboration between Nordic academic institutes– Open to anybody, non-binding

Since end-2003 focuses on middleware– Develops own Grid

middleware: the Advanced Resource Connector (ARC)

– Provides middleware to research groups and national Grid projects

ARC is now installed on ~50 sites (~5000 CPUs) in 14 countries all over the World

www.nordugrid.org 4

The NorduGrid Collaboration The NorduGrid Collaboration

From ... ... To

– EDG >ARC– Tesbed >50 sites– HEP >multidisciplinary

+Bio,Chem.,..– 4 Nordic >13 countries– 20 cpu’s >5000 cpu’s– 2001 >2003

... from a research project to a research collaboration

...from a Grid testbed to a major middleware providerNOT an infrastructure, does not operate or control resources

www.nordugrid.org 5

How did ARC appearHow did ARC appear

Back in 2001...High Energy Physics Institutes from Scandinavia wanted to share their computing resources and jointly contribute to CERN/LHC computing– They needed a Grid!– The Grid hype just begun– Globus was regarded as the

“de facto standard” middleware NO production ready middleware

was available or seen on the horizonas of November 2001:– Very alpha Globus GT-2.0 (GRAM-1.5, MDS-2.0); nevertheless

Globus & IBM already started to work on OGSA/I, i.e. GT v.3 (which was announced in February 2002)

– EDG middleware was in an extremely embryonic phase Since May 2002 ARC has been used in production Data

Challenges

www.nordugrid.org 6

Design philosophy (1/2)Design philosophy (1/2)

1. The system must be:a) Light-weightb) Portable & modularc) Non-intrusive on the

resource side:• Resource owners retain

full control• No requirements w.r.t.

OS, resource configuration, etc.

• Clusters need not be dedicated

• Runs independently of other existing Grid installation

d) Special attention to functionality & performance

“Traditionally, Scandinavian design has been associated with

simple,

uncomplicated designs, functionality and a democratic approach”

www.scandesign.org

www.nordugrid.org 7

Design philosophy (2/2)Design philosophy (2/2)

a) Flexible & powerful on the client part • must be easily installable

by a novice user• trivial tasks must be

trivial to perform• no dependency on

central services• No central client(s),

create a real distributed system

1. Strategy: start with something simple that works for users and add functionality gradually Source of design illustrations:

“Scandinavian Design beyond the Myth”

www.scandesign.org

www.nordugrid.org 8

ARC componentsARC components

Goal: no single point of failure

www.nordugrid.org 9

Architecture key pointsArchitecture key points

Each resource has a front-end– Authenticates users, interprets tasks, interacts with

LRMS, publishes information, moves data– Resources are Grid-enabled by the ARC layer deployed

on the front-end, no middleware components behind the front-end!

Each user can have an independent lightweight brokering client (or many)– Resource discovery, matchmaking, job submission and

manipulation, monitoring Grid topology is achieved by an hierarchical,

multi-rooted set of indexing services Monitoring relies entirely on the information

system Ad-hoc data management, for the beginning

www.nordugrid.org 10

Computing resources: Grid-enabled via ARC layer on head node (front-end):– Custom GridFTP server for all the communications– Grid Manager handles job management upon client request, interfaces to LRMS– Performs most data movement (stage in and out), cache management, manages

user work area– Publishes resource and job information via LDAP

Computing service: the key Computing service: the key componentcomponent

no GRAM!!!

www.nordugrid.org 11

Components: ClientsComponents: Clients

Client: a lightweight User Interface with the built-in Resource Broker– A set of command line utilities– Minimal and simple– Under the hood: resource

discovery, matchmaking, optimization, job submission

– Complete support for single job management

– Basic functionality for multiple job management

– Support for single file manipulations

– Built upon ARCLIB Portals and GUI clients are being

developed

www.nordugrid.org 12

Components: InfosystemComponents: Infosystem

Information System: based on Globus-patched OpenLDAP: it uses GRIS and GIIS back-ends

– Keeps strict registration hierarchy– Multi-rooted– Effectively provides a pseudo-mesh architecture,

similar to file sharing networks– Information is only kept on the resource; never

older than 30 seconds– Own schema and providers

www.nordugrid.org 13

Components: StoragesComponents: Storages Storage: any kind of storage system with a disk front-

end– Conventional Storage:

• Own GridFTP server implementation with pluggable back-ends• Ordinary file system access• Grid Access Control Lists (GACL) based access

– “Smart" Storage Element: WS based data service with direct support for Indexing Services (Globus’ RC, RLS)

– no tape storage systems in use so far

www.nordugrid.org 14

Functionality overviewFunctionality overview

Provides reliable implementation of fundamental Grid services:– The usual grid security: single sign on, Grid ACLs (GACL),

VOs (VOMS) – Job submission: direct or via matchmaking and brokering – Information services: resource aggregation,

representation, discovery and monitoring– Implements core data management functionality

• Automated seamless input/output data movement• Data Indexing (RLS, Fireman), client-side data movement

– Job monitoring & management– Logging service

Builds upon standard open source solutions and protocols– Globus Toolkit® pre-WS libraries (no services, no GRAM!)– OpenLDAP, OpenSSL, SASL, SOAP, GridFTP, GSI

www.nordugrid.org 15

What is ARC today; statusWhat is ARC today; status

General purpose Open Source European Grid middleware– Being developed & maintained by the NorduGrid Collaboration– Deployment support– Extensive documentation– Support for most of the popular Linux distributions

Lightweight architecture for a dynamic heterogeneous system– non-intrusive on the server side: coexistence with other

middlewares on the same resource– standalone client (13 MB, pre-configured tar-ball)

User- & performance-driven development– Production quality software since May 2002– First middleware ever to contribute to HEP data challenge

Middleware of choice by many national academic projects due to its technical merits– SWISS Grid(s), Finnish M-Grid, NDGF, etc…– Majority of ARC users now are NOT from the HEP community

www.nordugrid.org 16

ARC development futureARC development future

Production sites run stable releases 0.4.x– Released in April 2004, took 2 years to develop, Globus 2, pre-WS

technology, most basic functionality Release 0.6 (second production ARC release) should be out real soon

– Development branch 0.5.x is already used as a release candidate e.g. in ATLAS’ Dulcinea executor and other clients.

– Currently working on documentation and packaging

Ambitious development plan to turn ARC into a next generation Grid middleware– SOA, WS interfaces based on current powerful core components

– Extend data management functionalities

ARC's developer force: Open source community governed by Nordugrid Collaboration with dedicated personnel:– KnowARC project (17 FTEs)

– NDGF software developer team (6 FTE)

– NorduNet3 (5 Phd students)

– Academic research groups in Umeå (SE), Aalborg (DK)

www.nordugrid.org 17

The KnowARC projectThe KnowARC project

EU FP6 Specific Targeted Research Project– "Grid-enabled Know-how Sharing Technology Based on ARC Services

and Open Standards" – Started: June 1st, 2006 runs for three years– Partners: NorduGrid members and research teams in medicine,

bioinformatics, physics, engineering, automotive industry apps, IT (10 partners from 7 countries)

Objectives:– to create a novel, powerful Next Generation Grid middleware based on

ARC, widely respected for its simplicity, non-invasiveness and cost-efficiency;

– to promote Grid standardization and interoperability;– to contribute to Grid technologies take-up, bridging the gaps between

business and academia in Grid development Will be a major contributor to the development of

the next generation ARC middleware that will be the next step after ARC 0.6, addressing current limitations and shortcomings

www.nordugrid.org 18

KnowARC development areasKnowARC development areas

Core Services– Next generation Grid middleware

architecture survey and design– Web Service interfaces over ARC

services– Back-ends– Sandboxing & virtualization– Security framework (delegation)

Higher level services– Self-healing flexible storage and

user-friendly storage interface– Self healing grid jobs: job

migration & job manager– P2P-like information backbone,

novel brokering approaches– Scalable accounting service– Dynamic application framework

management– ARC-enabled Taverna and

flowGuide (workflow engines)

Standards & Interoperability– OGSA– gLite gateway

Applications– Automotive industry– Medical image processing– Statistical genomics– Engineering portal (construction

industry) Quality assurance, software

distribution– Build system, testing, support– Profiling, performance analysis,

usability studies– Pilot Grid system– ARC in major Linux distributions– New platforms: Windows, Solaris,

Mac OS-X

www.nordugrid.org 19

On interoperability of “pre-On interoperability of “pre-standard” middlewaresstandard” middlewares

For simplicity, let’s “forget” most of the major middlewares (UNICORE, Avaki, SRB, GT4, Condor, etc) and take a look at only on the Glite(LCG) vs ARC interoperability issue:– both are production level middlewares– both are pre-standard implementations (non WS-based

systems)

Security

Job submission

Job description

Information

Data management

Data transfer

Basic software stack

Service/component

ARC protocol via GridFTPmixture of Condor & GRAM

GSI, VOMS, GACLGSI, VOMS, MyProxy, CAS (?)

RSL, soon JSDL supportJDL (based on classAds)

LDAP, ARC schema, ARC-GIIS, interested in “Glue2”

LDAP, GLUE1.1, BDII-GIIS, R-GMA, interested (2) “Glue2”

RC, RLS, FiremanEDG RLS, Fireman & Co, LFC

GridFTP, SRM v1.1 clientGridFTP, SRM v? (DPM)

pre-WS GT libraries, own patches

GT2 solutions from VDT

ARCGlite (LCG)

www.nordugrid.org 20

InteroperabilityInteroperability

Main difficulty– middleware stacks are continuously changing– components have closed, undocumented interfaces

GGF GIN effort aims to collect the low-hanging fruits– so far targets only GRAM-based systems, ARC is the exception– Ninf-G invoke remote process module for ARC has been developed

Glue schema initiative– Glue v 1.2 is rather limited– Glue 1.2 vs. ARC mapping: http://www.nordugrid.org/documents/arc_infosys.pdf

– NorduGrid has been interested in a major Glue revision since Glue v1.1• Glue schema 1.2 meeting notes: “Earliest possible 2.0 discussions

October (2005),might be possible to start discussion with NorduGrid on some common design”

• Glue 2.0 is being postponed• NorduGrid Glue 2.0 thoughts https://twiki.cern.ch/twiki//bin/view/EGEE/Balazs-

thoughts

Job submission gateways through Condor

www.nordugrid.org 21

gLite to ARC gatewaygLite to ARC gateway

www.nordugrid.org 22

Standardization: the NorduGrid Standardization: the NorduGrid perspectiveperspective

Interoperability efforts: good starting point in understanding different existing systems and documenting their interfaces

Fragmentation of the standardization landscape and process– by far too many (non-implemented) proposals of numerous

Standard Development Organizations– integration concerns: can all these proposals be implemented and

used together? It is desirable that major middleware providers become more

dedicated in influencing and implementing emerging standards– major players: Globus, Condor (OSG), Unicore, Glite (EGEE), OMII,

Chinese middleware(s), ObjectWeb (Proactive), ARC (NorduGrid)– there is a threat that Grid community will keep producing non-

implemented, conflicting standards NorduGrid would like to avoid emerging “de facto standards”

defined by implementations Interfaces, Interfaces, Interfaces

www.nordugrid.org 23

Standardization: the NorduGrid Standardization: the NorduGrid perspectiveperspective

Areas where standardization & implementations are urgently needed:

– Job description language (JSDL1.x?)

• JSDL 1.0 is deliberately narrowly focused

– Representation of Grid-related objects (Glue2?, CIM?)• Computing & storage resources

• Grid jobs ( simple things such as “job states” are not defined!!!)

– Standard interface to computing resources (execution service)

• Job submission, Job management, Job monitoring (OGSA-BES?)

– Standard interface to Storages (SRM?)

– (pre-deployed) Grid Application environments/frameworks

– Description of Virtual Organizations

– Standards related to Grid economy (e.g. Usage info)

• GGF-UR group declared that “… will not be a Grid usage record”

www.nordugrid.org 24

ARC-enabled Grid ARC-enabled Grid infrastructure(s)infrastructure(s)

Loosely coupled collection of Grid(s) based on ARC middleware– Multidisciplinary Grids/VOs: Swiss

Bio, Finnish Material Science Grids

– Very low operational costs!!! Grid of National Grids

– SWEGRID, DCGC, NorGrid, M-Grid, SwissGrid, NDGF, etc..

– National operational procedures Easy to Join: assistance in Grid

deployment outside the Nordic area

No central operational centre,no centrally forced VOs, large autonomy

NorduGrid Collaboration is just a provider of the middleware, it does not control resources or operation

www.nordugrid.org 25

Nordic DataGrid Facility Nordic DataGrid Facility

NOS-NDK

SF NS

Nordic Data Grid Facility

NDGF == “Nordic Data Grid Facility”– Idea conceived in 2002

simultaneously with LCG– Goal: create a Nordic Grid

infrastructure, primarily for LHC Grid computing (Tier1)

– 2003-2006: pilot project funded by the 4 Nordic countries (Denmark, Finland, Norway, Sweden)

– NorduGrid/ARC middleware chosen as the basis June 1st 2006: NDGF is launched

– Nordic production Grid, leveraging national grid resources– Common framework for Nordic production Grid– Co-ordinates & hosts major Grid projects (e.g. the Nordic LHC Tier-1)– Develops Grid middleware (ARC contributor)– Single Point of Entry for collaboration, middleware development/deployment, e-

Science projects– Represents the Nordic Grid community internationally

NDGF 2006-2010– Funded (2 MEUR/year) by National Research Councils of the Nordic countries

(NOS-N) NDGF coordinates activities - does not own resources or middleware

www.nordugrid.org 26

SummarySummary

NorduGrid Collaboration develops, maintains & supports an open source Grid middleware– ARC middleware is approaching the milestone of the second

stable release (v0.6)

– Ambitious development plan after the 0.6 release to be carried out by newly launched projects (KnowARC, NDGF, NorduNet3)

ARC is a reliable, robust, easy-to-use Grid middleware, enabling distributed production facilities already for almost 3 years, non-stop.– ARC is a choice of middleware for numerous national and

cross-national Grid projects. The NorduGrid Community is involved in and devoted

towards Grid interoperability and standardization efforts– Interoperability and standardization needs are among the

major driving forces behind further ARC development

www.nordugrid.org 27

referencesreferences

Advanced Resource Connector middleware for lightweight computational Grids, Future Generation Computer Systems (in press)– http://dx.doi.org/10.1016/j.future.2006.05.008

Science Grid This Week article on KnowARC– http://www.interactions.org/sgtw/2006/0809/

knowarc_more.html web sites:

– www.nordugrid.org– www.knowarc.eu– www.ndgf.org


Recommended