Introduction to Globus - XSEDE14 Tutorial

Post on 11-May-2015

183 views 0 download

Tags:

description

Introduces the Globus software-as-a-service for file transfer and data sharing. Includes step-by-step instructions for creating a Globus account, transferring a file, and setting up a Globus endpoint on your laptop.

transcript

Globus Introduction

Raj Kettimuthu Steve Tuecke Vas Vasiliadis

“I need a good place to store / backup / archive my (big) research data, at a reasonable price.”

Public Cloud Archive Mass Store Campus Store

“I need to easily, quickly, & reliably move or mirror portions of my data to other places.”

Research  Compu.ng  HPC  Cluster  

Lab  Server  

Campus  Home  Filesystem  

Desktop  Worksta.on  

Personal  Laptop  

XSEDE  Resource  Public  Cloud  

“I need to easily and securely share my data with my colleagues at other institutions.”

“I need to get data from a scientific instrument to my analysis server.”

Next Gen Sequencer

Light Sheet Microscope

MRI Advanced Light Source

Challenge: Manage research data as easily as…

…our  pictures  

…home  entertainment  …our  e-­‐mail  

What is Globus?

Big data transfer, and sharing… … delivered via SaaS …

… that is simple, secure, and fast… … directly from your own storage

systems

Reliable, secure, high-performance file transfer & synchronization

•  “Fire-and-forget” transfers

•  Automatic fault recovery

•  Seamless security integration

•  Powerful GUI and APIs

Data Source

Data Destination

User initiates transfer request

1

Globus moves and syncs files

2

Globus notifies user

3

Simple, secure sharing off existing storage systems

Data Source

User A selects file(s) to share, selects user or group, and sets permissions

1

Globus tracks shared files; no need to move files to cloud storage!

2

User B logs in to Globus and

accesses shared file

3

•  Easily share large data with any user or group

•  No cloud storage required

Globus is SaaS

•  Web, command line, and REST interfaces •  Reduced IT operational costs •  New features automatically available •  Consolidated support & troubleshooting •  Easy to add your laptop, server, cluster,

supercomputer, etc. with Globus Connect

8,000 active endpoints

(in the past year)

Globus increasingly used to build campus-wide data services

Source: University of Nebraska Holland Computing Center

Enable  campus  compu4ng  facili4es  to  be7er  u4lize  

high  performance  network  infrastructure  

10GE

10GE

10GE

10GE

10G

Border Router

WAN

Science DMZSwitch/Router

Enterprise Border Router/Firewall

Site / CampusLAN

Per-service security policy control points

Clean, High-bandwidth

WAN path

Site / Campus access to Science

DMZ resources

perfSONAR

perfSONAR

perfSONAR

High performanceData Transfer Node

with high-speed storage

Typical deployment

Science DMZ

+ Globus

Details at: fasterdata.es.net

Demonstration

1.  Go to: globus.org/signup 2.  Create your Globus account 3.  Validate e-mail address 4.  Optional: Login with your

campus/InCommon identity

Exercise 1: Account Signup

1.  Install Globus Connect Personal 2.  Move file(s) from esnet#anl-diskpt1 to your

laptop 3.  Sign up for a free Globus Plus trial 4.  Create a shared endpoint on your laptop 5.  Grant your neighbor permissions on your shared

endpoint 6.  Access your neighbor’s shared endpoint 7.  Optional: Create group, and grant share access

Exercise 2: Transfer, Sharing, Group Management

Our challenge:

Sustainability

We are a non-profit, delivering a production-grade service to the non-profit research community

Globus Provider Subscriptions •  Managed Endpoints

–  Priority support –  Management console –  Usage reports –  Mass Storage System optimization –  Host shared endpoints –  Integration support

•  Branded Web Site

•  Alternate Identity Provider (InCommon is standard)

globus.org/provider-plans

NET+ Globus

•  Internet2 members get discounted Globus Provider subscriptions

•  Completing “Service Validation” phase – Sponsors: Cornell, U.Michigan, Yale,

U.Missouri, and U.Chicago

•  Available to “Early Adopters” soon

Globus Platform-as-a-Service

Identity, Group, Profile Management Services

Sharing Service

Transfer Service

Globus Toolkit

Glo

bu

s A

PIs

Glo

bu

s C

on

nec

t

globus genomics

Flexible, scalable, affordable

genomics analysis for all biologists

+ Data management

PaaS

Next-gen sequence analysis SaaS

+ Scalable IaaS

Globus is moving beyond transfer and sharing to data publication and

discovery

Globus Data Publication (coming soon)

•  SaaS for publishing large research data •  Bring your own storage •  Extensible metadata •  Publication and curation workflows •  Public and restricted collections •  Rich discovery model

Identified Described Curated

Verifiable Accessible Preserved

Enables data to be easily…

Search Browse Access

…across collections, endpoints

…and facilitates rich discovery

Metadata Access Control

License Storage

Curation Workflow

Policies Collection

Globus’ view of data publishing

Metadata

Data Metadata

Data

Metadata

Data

Dataset Dataset

Dataset

Community

Argonne Storage System

Univ. of Chicago Argonne IIT UIUC

Exemplar Use Case

3. Assemble Dataset (Transfer Data)

Argonne Curator

2. Describe Submission

Scientist

Shared Endpoint

4. Curate Dataset

1. Publish Data 6. Download

5. Search

Demonstration

Globus CLI

1.  Optional: Generate SSH key 2.  Go to: globus.org/account/

ManageIdentities 3.  Add your SSH key to your Globus identity 4.  SSH to cli.globusonline.org 5.  Check on the status of your earlier

transfer(s) 6.  Transfer a file using the scp command

Exercise 3: Globus CLI

Thank you to our sponsors!

U . S . D E PA RT M E N T O F

ENERGY