Globus Toolkit 4.0 Grid Resource Allocation Manager (GRAM) Job submission

Post on 04-Jan-2016

41 views 5 download

description

Globus Toolkit 4.0 Grid Resource Allocation Manager (GRAM) Job submission. ITCS 4146/5146 Grid Computing, 2007, UNC-Charlotte, B. Wilkinson. Jan 24, 2007. Globus Open Source Grid Software. G T 4. Delegation Service. Community Scheduler Framework [contribution]. Python WS Core - PowerPoint PPT Presentation

transcript

1b.1

Globus Toolkit 4.0

Grid Resource Allocation Manager (GRAM)

Job submission

ITCS 4146/5146 Grid Computing, 2007, UNC-Charlotte, B. Wilkinson. Jan 24, 2007

Data Management

SecurityCommonRuntime

Execution Management

Information Services

Web Services

Components

Non-WS

Components

Pre-WSAuthenticationAuthorization

GridFTP

GridResource

Allocation Mgmt(Pre-WS GRAM)

Monitoring& Discovery

System(MDS2)

C CommonLibraries

GT2

WSAuthenticationAuthorization

ReliableFile

Transfer

OGSA-DAI[Tech Preview]

GridResource

Allocation Mgmt(WS GRAM)

Monitoring& Discovery

System(MDS4)

Java WS Core

CommunityAuthorization

ServiceGT3

ReplicaLocationService

XIO

GT3

CredentialManagement

GT4

Python WS Core[contribution]

C WS Core

CommunitySchedulerFramework

[contribution]

DelegationService

GT4

Globus Open Source Grid Software

I Foster

GRAM

1b.3

Job submission componentsGT4 container

(hosts GT 4 web services)

GRAM services

GRAM adapter (for particular

scheduler)

Local scheduler

Compute nodes

User job submission

Local scheduler on coit-grid02.uncc.edu is Condor(Sun Grid Engine on some other compute nodes)

1b.4

How to specify jobThe job is specified in a file which gives details such as:

• Job description– Name of executable– Arguments– Input files– Output files

• Resource requirements– Memory

Uses a “job description” language• Resource Specification Language (RSL)

Then issue a command that includes name of job specification file.

1b.5

Job submission

• GT 2 job submission file– RSL version 1 language.– Traditional job description language

• GT 3.2/4.0 job submission file– RSL version 2 language– XML based job description language

1b.6

Resource Specification LanguageRSL version 1

• A meta-language describing job and its required execution.

Provides a specification for:• Resource requirements - machine type,

number of nodes, memory, etc.• Job description - directory, executable,

arguments, environment

1b.7

RSL Version 1 examplesConstraints Example

Conjunction (AND): &

• To create 3-5 instances of myProg, each on a machine with at least 64 Mbytes memory available to me for 1 hours:

& (executable=myProg)

(count>=3)(count<=5)(memory>=64)

(max_time=60)

1b.8

Constraints ExampleDisjunction (OR): |

• To create 5 instances of myProg, each on a machine with at least 64 Mbytes memory or 7 instances of myProg, each on a machine with at least 32 Mbytes memory :

&(executable=myProg)(|(&(count=5)(memory>=64))(&(count=7)(memory>=32)))

1b.9

Requesting multiple resourcesmultirequest: +

• To execute 5 instances of myProg1 on a machine with at least 64 Mbytes memory and execute 2 instances of myProg2:

+(&(count=5)(memory>=64))

(executable=myProg1))

(&(count=2)(executable=myProg2))

1b.10

Resource Specification Language, RSL version 2

• GT3 and GT 4 use RSL version 2.– Some differences in RSL language

specification in GT4, so not completely interchangeable!!!.

• RSL Version 2 is an XML language.

1b.11

Resource Specification Language Version 2 (RSL -2)

• XML version of RSL 1

• Can specify everything from executable,

paths, arguments, input/output, error file,

number of processes, max/min execution

time, max/min memory, job type etc. etc.

1b.12

RSL-2

• Much more elegant and flexible, and in keeping with systems using XML.

• Can use XML parsers.

• Allows more powerful mechanisms with job schedulers.

• Resource scheduler/broker applies specification to local resources.

1b.13

RSL-2 ExamplesSpecifying Executable

(executable=/bin/echo)<gram:executable> <rsl:path> <rsl:stringElement value="/bin/echo"/> </rsl:path></gram:executable>

In GT 4 version of RSL-2, can simply write:

<executable>/bin/echo</executable>

1b.14

Specifying Directory(directory=“/bin”)

<gram:directory>

<rsl:path>

<rsl:stringElement value="/bin/"/>

</rsl:path>

</gram:directory>

In GT 4 version of RSL-2, can simply write:

<directory>/bin/</directory>

1b.15

Specifying Number(count=1)

<gram:count>

<rsl:integer value="1"/>

</gram:count>

In GT 4 version of RSL-2, can simply write:

<count>1</count>

1b.16

Specifying Arguments(arguments=“Hello”)

<gram:arguments> <rsl:string> <rsl:stringElement value="Hello World"/></rsl:string></gram:arguments>

In GT 4 version of RSL-2, can simply write: <argment>hello world</argument>

1b.17

RSL and GT 3.2 RSL-2 comparison for echo program

&((executable=/bin/echo)

(directory="/bin")

(arguments="Hello World")

(stdin=/dev/null)

(stdout="stdout")

(stderr="stderr")

(count=1)

)

<?xml version="1.0" encoding="UTF-8"?>• <rsl:rsl xmlns:rsl="http://www.globus.org/namespaces/2003/04/rsl"• xmlns:gram="http://www.globus.org/namespaces/2003/04/rsl/gram"• xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"• xsi:schemaLocation="• http://www.globus.org/namespaces/2003/04/rsl• c:/ogsa-3.0/schema/base/gram/rsl.xsd• http://www.globus.org/namespaces/2003/04/rsl/gram• c:/ogsa-3.0/schema/base/gram/gram_rsl.xsd">• <gram:job>• <gram:executable> <rsl:path>• <rsl:stringElement value="/bin/echo"/> </rsl:path>• </gram:executable>• <gram:directory> <rsl:path>• <rsl:stringElement value="/bin"/> </rsl:path>• </gram:directory>• <gram:arguments>• <rsl:string> <rsl:stringElement value="Hello World"/> </rsl:string>• </gram:arguments>• <gram:stdin> <rsl:path>• <rsl:stringElement value="/dev/null"/> </rsl:path> </gram:stdin>• <gram:stdout>• <rsl:pathArray>• <rsl:path>• <rsl:substitutionRef name="HOME"/>• <rsl:stringElement value="/stdout"/>• </rsl:path>• </rsl:pathArray>• </gram:stdout>• <gram:stderr>• <rsl:pathArray>• <rsl:path>• <rsl:substitutionRef name="HOME"/>• <rsl:stringElement value="/stderr"/>• </rsl:path>• </rsl:pathArray>• </gram:stderr>• <gram:count> <rsl:integer value="1"/> </gram:count>• <gram:jobType>• <gram:enumeration>• <gram:enumerationValue> <gram:multiple/> </gram:enumerationValue>• </gram:enumeration>• </gram:jobType>• <gram:gramMyJobType>• <gram:enumeration>• <gram:enumerationValue> <gram:collective/> </gram:enumerationValue>• </gram:enumeration>• </gram:gramMyJobType>• <gram:dryRun> <rsl:boolean value="false"/> </gram:dryRun>• <gram:saveState> <rsl:boolean value="true"/> </gram:saveState>• <gram:twoPhase> <rsl:integer value="600"/> </gram:twoPhase>• </gram:job>• </rsl:rsl>

1b.18

GT 4.0 RSL-2 for echo program

<?xml version="1.0" encoding="UTF-8"?><job> <executable>/bin/echo</executable> <directory>${GLOBUS_USER_HOME}</directory> <argument>Hello</argument> <argument>World</argument>

<stdout>${GLOBUS_USER_HOME}/stdout</stdout>

<stderr>${GLOBUS_USER_HOME}/stderr</stderr> <count>1</count> <jobType>multiple</jobType></job>

1b.19

Comments on Linux command structure

Linux commands typically consist of the command (program) name, a list of optional and required named “flags”, possibly with arguments.

Flags are preceded with a hyphen and are traditionally a single letter, but can be more letters and may have alterative full names.

Flags case sensitive.

1b.20

Examples

ls –a List files, -a says list all files including hidden files (those starting with a period).

cc –o outputfile prog1compile C program prog1 and create executable called outputfile

1b.21

Comments on Linux command structure

• Command can be quite long and must be entered without a return.

• If need to go onto next line, use \ (backslash) first. Cause next character to be taken “literally”.

• Will not use \ for next line continuation in slides, for clarity.

1b.22

GT4 job submission command globusrun-ws

• Submit and monitor GRAM jobs

• Replaces (java) managed-job-globusrun

• Written in C, faster startup and execution

• Supports multiple and single job submission

• Handles credential management

• Streaming of job stdout/err during execution

1b.23

Simple job submission

• Step 1: Create proxy with grid-proxy-int command.

• Step 2: Issue globusrun-ws with parameters to specify job.

1b.24

Some globusrun-ws flags (options) for job submission

1b.25

Running GT 4 Jobusing XML job description file

• Command:

globusrun-ws –F hostname:port –submit –f prog.xml

where prog.xml specifies the job.

1b.26

Job submission-submit flag

-submit Submits (or resubmits) a job to a job host in one of three output modes:batchinteractive, or interactive-streaming.

Default (without additional flags to specify) is “interactive.”

-submit flag is always required to submit a job.

1b.27

Specifying where job is submitted

-F Specifies the “contact” for the job submission.

Defaulthttps://localhost:8443/wsrf/services/

ManagedJobFactoryService

In assignment 2, simply localhost and container port used, i.e.

-F localhost:8440Jeremy decided on this port.

1b.28

Selecting a different host

Example

globusrun-ws –submit –F

https://140.221.65.193:4444/wsrf/

services/managedJobFactoryService

–f prog1.xml

1b.29

Job description

Two ways:

-f job-description-file Job description read from given file (as in previous slides)

-c takes remaining globusrun-ws arguments and generates job description with named program and arguments. Must be last flag.

1b.30

-f flag

–f job description held in a file.

Example

globusrun-ws –submit –f echo.xml

where echo.xml is an RSL-2 file describing job.

1b.31

Contents of echo.xml

<job>

<executable>/bin/echo</executable>

<argument>hello</argument>

</job>

1b.32

Input/OutputRSL file can specify where stdout/err goes.

Example<job> <executable>/bin/echo</executable> <directory>/tmp</directory> <argument>hello</argument> <stdout>job.out</stdout> <stderr>job.err</stderr>…</job>

1b.33

Stream Input/Output

-s The standard output and standard error files of the job are monitored and data is written to the corresponding output of globusrun-ws.

Allows streaming stdout/err during executing to the terminal.

1b.34

Stream output data files

Can also “stream” output data files. Specify in RSL file where to.

1b.35

Example<job>

<fileStageOut>

<transfer>

<sourceUrl>file:///tmp/job.out</sourceUrl>

<destinationUrl>gsiftp://host.domain:2811/

tmp/stage.out</destinationUrl>

</transfer>

</fileStageOut>

</job>

1b.36

Submitting a single jobwithout XML job description file

-c flag

-c Causes globusrun-ws to generate a simple job description with the named program and arguments.

This flag, if used, must be the last flag.

Only useful for very simple single jobs.

1b.37

ExampleSubmit program echo with argument hello to default local host.

% globusrun-ws –submit –c /bin/echo hello

Submitting job...Done.Job ID: uuid:d23a7be0-f87c-11d9-a53b-0011115aae1fTermination time: 07/20/2005 17:44 GMTCurrent job state: ActiveCurrent job state: CleanUpCurrent job state: DoneDestroying job...Done.

1b.38

In previous example, output would be lost, as stdout not displayed.

To specify a file for the output, need two flags:

-s for streaming output

and

–so to specify output file

1b.39

globusrun-ws -submit -F localhost:8440

-s -so echo_output -c /bin/echo hello

name of file holding output

Argument for echo

1b.40

Batch Submission

Job queued for execution.

-batch Results in ManagedJob EPR as the sole standard output (unless in quiet mode) and then exits.

-o filename Created ManagedJob EPR written to file (if submission successful)

EPR “End point Reference” – the way to locate a web service, more on that later.

1b.41

Batch Job Submission

$ globusrun-ws –submit –batch –o job_epr –s /bin/sleep 50

Submitting job…DoneJoB ID: uuid:f9544174-60c5-11d9-97e3-0002a5ad41e5Termination time: 01/08/2005 16:05 GMT

1b.42

Monitoring Batch Submission

-monitor Attaches to an existing job in interactive or interactive-streaming output modes.

-j filename EPR for ManagedJob read from file.

1b.43

Monitoring Batch Job

globusrun-ws –monitor –j job_epr

job state: ActiveCurrent job state: CleanUpCurrent job state: DoneRequesting original job description...Done.Destroying job...Done

1b.44

Batch Submission

-status Reports the current state of the job and exits

-kill Requests immediate cancellation of job and exits.

1b.45

1b.46

Many other flagsExample

-term time

Set an absolute termination time, or a time relative to the successful job creation

1b.47

Sources of GT 4 information

http://www.globus.org/toolkit/docs

Also: “GRAM, RFT & Job Submission, Execution Management for GT4 Developers,”

S. Martinb and P. Plaszczak, globusWorld, 2005, www.globusworld.org

1b.48

To learn more about Linux commands

“Linux in a Nutshell,” by E. Siever, S. Figgins, and A. Weber, O’Reilly, (4th edition 2003). 928 pages!!

1b.49

Multiple choice quizWhen one issues the GT4.0 command:

globusrun-ws -submit -F localhost:8440 -s -so hello1 -c /bin/echo hello

what is hello?

(a) A java class(b) An xml file containing the description of the job to

be run(c) The executable to run in Globus(d) The argument for the program that will be

executable

1b.50

When one issues the GT4.0 command:

globusrun-ws -submit -F localhost:8440 -s

-so hello1 -c /bin/echo hello

is the order of the flags important, and if so why?

(a) Not important

(b) Important: -c must be last as it uses the remaining arguments

(c) Important: -s must be before -so

(d) Important: -F must be first

1b.51

When one issues the GT4.0 command:

globusrun-ws -submit -F localhost:8440 -s

-so hello1 -c /bin/echo hello

what is localhost?

(a) The server logged into running globusrun-ws.

(b) The computer you are using to log into the server

(c) None of the other answers.

1b.52

What does the tag <count> specify in an RSL-2 file?

(a) The number of different jobs submitted.

(b) The number of computers to use.

(c) The number of identical jobs to submit.

(d) The number of arguments.

1b.53

Questions