Karpjoo Jeong( jeongk@konkuk.ac.kr ) Applied Grid Computing Center

Post on 16-Mar-2016

57 views 4 download

Tags:

description

X-SIGMA (An X ML based S imple data I ntegration system for G athering, M anaging and A ccessing scientific experimental data in grid environments). Karpjoo Jeong( jeongk@konkuk.ac.kr ) Applied Grid Computing Center Department of Advanced Technology Fusion Konkuk University, Seoul, Korea. - PowerPoint PPT Presentation

transcript

X-SIGMA(An XML based Simple data Integration system for Gathering, Managing and Accessing scientific experimental data in grid

environments)

Karpjoo Jeong(jeongk@konkuk.ac.kr)Applied Grid Computing Center

Department of Advanced Technology FusionKonkuk University, Seoul, Korea

© Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ.

Motivations

Two Application Projects– KOCED. TeleScience and Data Sharing Environments for Civil

Engineering Research in Korea– Glyco-MGrid. Collaborative Molecular Simulation Grid

Environments for GlycomicsRequired to support scientific data management

– Multiple data models, but not many. They may be changed and new models may be added, but not frequently

– Legacy data (just files)and analysis software (just code with input and output files), but they are kind of simple

Conventional Systems– DBMS-based Systems. Too inflexible– Semantic Web Systems. Too advanced and complicated– File based Systems. Too flexible. Custom approach

© Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ.

Scientific Data Management

Data Models– Metadata– Experimental Data

Indexing & Searching– Metadata Management– Data Indexing and Searching

Data Repository– Storage for Experimental Data – Access Management

Data Repository

Data Models

ExperimentalData Metadata

Store

Register

Search

Explains

•3D Structure•Trajectory

•Scientist: John•Parameter: #$@•Data locationexplain

© Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ.

Goals: Data Models

Multiple Data ModelsData Model EvolutionData Integration

Data Repository

Data ModelA

ExperimentalData Metadata

Data ModelB

ExperimentalData Metadata

Query

© Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ.

Metadata Model (Context Data)

Experimental Context– Information about experiments (e.g. owners and parameter

settings)Logical View of Experimental Data

– Logical organization of physical experimental data– Their location information– Associated software information

© Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ.

X-SIGMA Metadata Model Example

< (b) User Interface for Editor >

© Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ.

Logical View of Experimental Data

Location, Format and Associated Software

© Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ.

Goals: Federated Data RepositoryDistributed RepositoryLocal Site Autonomic ManagementDecoupling between Metadata and Experimental DataLegacy Data Access Support

Global MetadataManagement

Distributed Access to Experimental Data

Site ASite B

© Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ.

X-SIGMA System StructureGlobal X-SIGMAGlobal X-SIGMA

GlobalSchema

Management

QueryProcessor

ExperimentalData Access

System

Local X-SIGMA

LocalSchema

Management

Local QueryProcessing

System

LocalExperimental

Access System

Storage(File, Legacy)

Schema &Context Data

SchemaIntegration

DistributedQuery System(OGSA-DAI)

DistributedAccess System(SRB,GRID-FTP)

Register/Insert

Search Access

Local X-SIGMA

LocalSchema

Management

Local QueryProcessing

System

LocalExperimental

Access System

Storage(File, Legacy)

Schema &Context Data

Register/Insert

Search Access

Local X-SIGMA

LocalSchema

Management

Local QueryProcessing

System

LocalExperimental

Access System

Storage(File, Legacy)

Schema &Context Data

Register/Insert

Search Access

ContextData

GlobalSchema

Search& Access

© Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ.

X-SIGMA System Architecture

XMLDatabase

< Site A >

Storage XML Database

< Site B >

Storage XML Database

< Site C >

Storage

Global Schema Management

Distributed Query Processor

ExperimentalData Access System

Schema & ContextDataManagement

Query Process

Real Data Access

Schema & ContextDataManagement

Query Process

Real Data Access

Schema & ContextDataManagement

Query Process

Real Data Access

Global X-SIGMA

RDF Database

Local X-SIGMA Local X-SIGMA Local X-SIGMA

OGSA-DAI SRB

Grid Middleware

Integrated Access Layer

ContextData

SearchingContext Data

AccessingReal Data

© Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ.

Interfaces

GUI-based InterfacesWeb Services based Interfaces

© Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ.

Glyco-MGrid

Glyco-MGrid is a molecular simulation computing and data grid portal for glycomics

It provides shared and integrated cyber-environments which support simulation, databases, and trajectory analysis in a collaborative way

Data sharing is based on X-SIGMA

© Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ.

Glyco-MGrid

Glyco-MGridDatabase Simulation Trajectory Active

Projects

MGrid-SDG(X-SIGMA)

MGrid-CG AnalysisComputing

PSE

XMLDocument

GridFTP/RFTRealData

© Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ.

References

“X-SIGMA: XML based Simple data Integration system for Gathering, Managing, and Accessing Scientific Experimental Data in Grid Environments”, 2nd Conference on eScience and Grid Computing, 2006

“Glyco-MGrid : A Collaborative Molecular Simulation Grid for e-Glycomics”, To appear in 3rd Conference on eScience and Grid Computing, 2007