OOI Cyberinfrastructure and Semantics
OOI CI Architecture & Design Team, UCSD/Calit2
Ocean Observing Systems Semantic Interoperability Workshop, November 18, 2008
Outline
• OOI Overview
• Cyberinfrastructure Project Plan
• Scientific Investigation Scenario
• Data Collection Workflow
• Data Management Services
• Technologies
OOI Overview
Cyberinfrastructure Project Plan
• Current status: past Final Design Review (FDR); prototyping during the pilot period; construction planned to start 7/2010
• Spiral development process, five releases, six subsystems
• OOI observatories: Coastal, Global, Regional
• External observatories: IOOS, Neptune CA, VENUS, …
[Figure: release schedule. Releases R-1 to R-5 plotted against project years Y1 to Y5, with Construction Start marked. Release labels, in order: Data Distribution Network, Managed Instrument Network, OnDemand Measurement Processing, Integrated Modeling Network, Interactive Ocean Observatory.]
Science Activity Model
Scientific Investigation
Data Processing and Availability
Data Management Subsystem
• Dynamic Data and Information Distribution
  − Common data and metadata model
  − Stream and message based data distribution network (publish/subscribe)
  − Efficient dataset wire transport
  − Data/information catalog and repository
  − Persistent archive
• Data Integration
  − Real-time QA/QC’ed data from OOI observatories
  − External observatory and dataset integration
[Figure: the six CI subsystems (Sensing & Acquisition, Data Management, Planning & Prosecution, Analysis & Synthesis, Common Execution Infrastructure, Common Operating Infrastructure) and the Capability Container.]
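The stream and message based distribution named above can be illustrated with a minimal in-process publish/subscribe sketch. This is not the OOI CI implementation: the `Broker` class, the topic names, and the message fields are all hypothetical, and a real data distribution network would run over a networked message broker rather than in one process.

```python
from collections import defaultdict
from typing import Any, Callable, Dict, List

Message = Dict[str, Any]
Handler = Callable[[Message], None]


class Broker:
    """Hypothetical in-process broker that routes messages by topic."""

    def __init__(self) -> None:
        self._subscribers: Dict[str, List[Handler]] = defaultdict(list)

    def subscribe(self, topic: str, handler: Handler) -> None:
        """Register a handler to be called for every message on a topic."""
        self._subscribers[topic].append(handler)

    def publish(self, topic: str, message: Message) -> int:
        """Deliver a message to all subscribers; return the delivery count."""
        handlers = list(self._subscribers.get(topic, []))
        for handler in handlers:
            handler(message)
        return len(handlers)


broker = Broker()
received: List[Message] = []

# A consumer subscribes to a (hypothetical) temperature stream.
broker.subscribe("ctd.temperature", received.append)

# A producer publishes without knowing who is listening.
delivered = broker.publish(
    "ctd.temperature",
    {"value": 11.8, "units": "degC", "time": "2008-11-18T12:00:00Z"},
)

# Nobody subscribed to this topic, so nothing is delivered.
undelivered = broker.publish("ctd.salinity", {"value": 33.9, "units": "psu"})
```

The point of the pattern is the decoupling: producers address a topic, not a consumer, so new subscribers can be attached to a stream without changing the instrument side.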
Data Management Subsystem
• Data and Information Access
  − Public data access in various syntactical formats
  − Flexible search and navigation
  − Faceted queries using ontologies
• Transformation and Mediation
  − Syntactical transformation (in and out of the DDN)
  − Aggregation of datasets
  − Attribution and association
  − Semantic mediation using ontologies/vocabularies
  − Manage provenance, lineage and policy
[Figure: the six CI subsystems (Sensing & Acquisition, Data Management, Planning & Prosecution, Analysis & Synthesis, Common Execution Infrastructure, Common Operating Infrastructure) and the Capability Container.]
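As a rough illustration of semantic mediation followed by a faceted query, the sketch below maps source-specific parameter names onto one shared term and then filters a catalog by facets. Everything in it is assumed for the example: the `TERM_MAP` table, the catalog entries, and the function names are hypothetical, and a real system would draw the mappings from an ontology or vocabulary service rather than a hard-coded dictionary.

```python
from typing import Dict, List

# Hypothetical mapping from source-specific names to a shared vocabulary term.
TERM_MAP: Dict[str, str] = {
    "temp": "sea_water_temperature",
    "water_temp": "sea_water_temperature",
    "sal": "sea_water_salinity",
}


def mediate(record: Dict[str, str]) -> Dict[str, str]:
    """Return a copy of the record with 'parameter' mapped to the shared term."""
    mediated = dict(record)
    mediated["parameter"] = TERM_MAP.get(record["parameter"], record["parameter"])
    return mediated


def faceted_query(catalog: List[Dict[str, str]], **facets: str) -> List[Dict[str, str]]:
    """Keep the mediated entries that match every requested facet value."""
    mediated = [mediate(entry) for entry in catalog]
    return [e for e in mediated if all(e.get(k) == v for k, v in facets.items())]


# Hypothetical catalog: two sources name the same quantity differently.
catalog = [
    {"id": "ds-1", "parameter": "temp", "observatory": "Coastal"},
    {"id": "ds-2", "parameter": "water_temp", "observatory": "Regional"},
    {"id": "ds-3", "parameter": "sal", "observatory": "Coastal"},
]

temperature = faceted_query(catalog, parameter="sea_water_temperature")
coastal_temperature = faceted_query(
    catalog, parameter="sea_water_temperature", observatory="Coastal"
)
```

Because the query runs against mediated terms, a single facet value finds both temperature datasets even though neither source used that name.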
Transformation Service Data Model
• Relationship between the constituents of the Transformation Service and the Information Container Model
[Figure: Transformation Service data model.
 Transformation Services: Format Conversion, Data Parser, Mediation, Meta-Data Extraction, V&V. Each service "operates" on elements of the Information Model.
 Information Model: Information Block; Information Content; Meta-data (L2), which describes the Information Content; Header (Meta-data L3), which describes the Body.
 Syntax relies on a Standard; Semantics relies on an Ontology.]
Technology Mapping

Functional Component                   Technology                          TRL
-------------------------------------  ----------------------------------  -----
Dataset Catalog                        THREDDS                             7
Semantic Framework                     VSTO Semantic Framework             8
Semantic Query                         ESG Faceted Search                  8
Data Integration with Applications     NetCDF lib                          8
Data Integration with Applications     Matlab lib for OpenDAP access       8
Dataset Management & Distribution      OpenDAP Hyrax Server                7
Dataset Preservation                   iRODS                               7
General Purpose Database               MySQL cluster                       9
Data Grid File Transfer                GridFTP                             9
Dataset Access Protocol                DAP                                 9
Dataset File Format                    NetCDF                              9
Metadata Conventions                   CF Metadata (Climate & Forecast)    9
Dataset Aggregation Language           NcML                                9
Query language for RDF                 SPARQL                              8
Knowledge Discovery Model              URIQA                               9
Oceanographic Vocabularies & Mappings  MMI                                 7
External Data Presentation             OGC Services                        9 & 7
Summary
• NSF’s OOI CI Goals
  − Build the next generation cyberinfrastructure for ocean research, education and public participation
• Timeline
  − Prototyping ongoing
  − OOI construction planned for 7/2010 to 6/2015
• Semantics
  − Ontology based query and mediation
  − Success depends on the integration of MMI vocabularies and semantic framework, the VSTO semantic framework and faceted search
Thanks!
Information Container Model
• Encapsulates all kinds of information resources, such as: scientific data, user identities, process definitions, virtual machine images, etc.
• Multiple levels of meta-data
• Separation of concerns between Information services
[Figure: Information Container structure]
Information Container
  Meta-data (L1): describes the Information Block
  Information Block
    Meta-data (L2): describes the Information Content
    Information Content
      Header (optional; Meta-data L3): describes the Body
      Body (Content)
[Figure: Information Services (Ingestion, Transformation) operate on the Information Model; Process Spec and Science Data are shown as labels.]
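The nesting of container, block, and content, each with its own level of meta-data, can be sketched as plain data classes. This is only an illustration under assumptions: the class and field names are invented for the example, and the meta-data keys (`owner`, `format`, `columns`, and so on) are hypothetical placeholders rather than fields defined by the model.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, Optional


@dataclass
class InformationContent:
    body: Any                                 # the content itself
    header: Optional[Dict[str, Any]] = None   # optional header (meta-data L3)


@dataclass
class InformationBlock:
    content: InformationContent
    metadata_l2: Dict[str, Any] = field(default_factory=dict)  # describes the content


@dataclass
class InformationContainer:
    block: InformationBlock
    metadata_l1: Dict[str, Any] = field(default_factory=dict)  # describes the block


# Hypothetical example: a small science-data payload wrapped in a container.
container = InformationContainer(
    metadata_l1={"owner": "example-user", "version": 1},
    block=InformationBlock(
        metadata_l2={"format": "csv", "parameter": "sea_water_temperature"},
        content=InformationContent(
            header={"columns": ["time", "value"]},
            body="2008-11-18T12:00:00Z,11.8\n",
        ),
    ),
)
```

Keeping the levels separate is what lets different services touch different layers: a registrar could work with L1 alone, while a parser needs only the header and body.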
Ingestion
• Provides the basic mechanisms for identifying data streams and their formats, parsing the content, identifying the associated meta-data, adding version information, and registering the streams with an ISN Repository
[Figure: Ingestion components: Versioning, Exchange, Data Format Detector, Registrar, Metadata Extractor, Data Parser.]
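The ingestion steps listed above (detect the format, parse, extract meta-data, add a version, register) can be chained as in the sketch below. It is a toy under stated assumptions: the `Repository` class stands in for the repository the slide mentions, the format detector is a one-character heuristic, and all function names, stream identifiers, and field names are hypothetical.

```python
import csv
import io
import json
from typing import Any, Dict, List


def detect_format(raw: str) -> str:
    """Guess the format from the first non-blank character (toy heuristic)."""
    return "json" if raw.lstrip().startswith(("{", "[")) else "csv"


def parse(raw: str, fmt: str) -> List[Dict[str, Any]]:
    """Parse raw text into a list of row dictionaries."""
    if fmt == "json":
        data = json.loads(raw)
        return data if isinstance(data, list) else [data]
    return list(csv.DictReader(io.StringIO(raw)))


def extract_metadata(rows: List[Dict[str, Any]], fmt: str) -> Dict[str, Any]:
    """Derive simple descriptive meta-data from the parsed rows."""
    fields = sorted(rows[0].keys()) if rows else []
    return {"format": fmt, "fields": fields, "row_count": len(rows)}


class Repository:
    """Hypothetical stand-in for the repository that streams register with."""

    def __init__(self) -> None:
        self.entries: Dict[str, List[Dict[str, Any]]] = {}

    def register(self, stream_id: str, metadata: Dict[str, Any]) -> Dict[str, Any]:
        versions = self.entries.setdefault(stream_id, [])
        entry = dict(metadata, version=len(versions) + 1)  # versioning step
        versions.append(entry)
        return entry


def ingest(repo: Repository, stream_id: str, raw: str) -> Dict[str, Any]:
    """Run detection, parsing, meta-data extraction, and registration in order."""
    fmt = detect_format(raw)
    rows = parse(raw, fmt)
    return repo.register(stream_id, extract_metadata(rows, fmt))


repo = Repository()
first = ingest(repo, "ctd-01", "time,temp\n2008-11-18T12:00:00Z,11.8\n")
second = ingest(repo, "ctd-01", '[{"time": "2008-11-18T12:05:00Z", "temp": 11.9}]')
```

Each registration of the same stream gets the next version number, so the repository keeps a history per stream instead of overwriting earlier entries.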
Ingestion Service Data Model
• Relationship between the constituents of the Ingestion Service and the Information Container Model
[Figure: Ingestion Service data model.
 Ingestion Services: Versioning, Data Format Detector, Registrar, Meta-Data Extractor, Data Parser. Each service "operates" on elements of the Information Model.
 Information Model: Information Container; Information Block; Meta-data (L1), which describes the Information Block and holds Registration Information, Version, Ownership, Authorship, Policies, and Annotations.]