Date post: 13-Dec-2015
Observations Data Model 2.0 Jeff Horsburgh, USU. Project PI. Anthony K. Aufdenkampe, Stroud Water Research Center Kerstin Lehnert, IEDA/Columbia Emilio Mayorga, UW-APL Ilya Zaslavsky, SDSC David Valentine, SDSC David Tarboton, USU David Lubinski, UC-Boulder A community information model for interoperability among feature-based earth observations
Transcript
Page 1: Observations Data Model 2.0 Jeff Horsburgh, USU. Project PI. Anthony K. Aufdenkampe, Stroud Water Research Center Kerstin Lehnert, IEDA/Columbia Emilio.

Observations Data Model 2.0

Jeff Horsburgh, USU. Project PI.
Anthony K. Aufdenkampe, Stroud Water Research Center
Kerstin Lehnert, IEDA/Columbia
Emilio Mayorga, UW-APL
Ilya Zaslavsky, SDSC
David Valentine, SDSC
David Tarboton, USU
David Lubinski, UC-Boulder

A community information model for interoperability among feature-based earth observations

Page 2

Critical Zone Science

[Figure labels: Atmosphere, Biosphere, Hydrosphere, Lithosphere]

Earth's permeable near-surface layer from the tops of the trees to the bottom of actively cycling groundwater.

• Where rock, soil, water, air, and living organisms interact and shape the Earth's surface.
• Critical to the Earth's sustaining services:
  • Clean water
  • Productive soil
  • Balanced atmosphere

[Figure labels, spatial scale: Hillslope, Catchment, Watershed]
[Figure labels, time scale: Minutes, Decades, Millennia, Eons]

Page 3

CZO Disciplines

• Biogeochemistry
• Biology/Ecology
• Biology/Molecular
• Climatology/Meteorology
• Data Management/CyberInfrastructure
• Engineering/Method Development
• Geochemistry/Mineralogy
• Geology/Chronology
• Geomorphology
• Geophysics
• GIS/Remote Sensing
• Hydrology
• Modeling/Computational Science
• Outreach/Education Research
• Soil Science/Pedology
• Water Chemistry

Page 4

CZO Disciplines

[Figure labels: Big Data, Long Tail Data]

Biogeochemistry

Biology/Ecology

Biology/Molecular

Climatology/Meteorology

Data Management/CyberInfrastructure

Engineering/Method Development

Geochemistry/Mineralogy

Geology/Chronology

Geomorphology

Geophysics

GIS/Remote Sensing

Hydrology

Modeling/Computational Science

Outreach/Education Research

Soil Science/Pedology

Water Chemistry

Page 5

CZO Disciplines

[Figure labels: Big Data, Long Tail Data]

Sample-based

Sensor-based

Geospatial Grids & Vectors

Categorical

Page 6

ODM2: Common to Most Data Types

Common Semantics for Earth Observations

[Diagram labels]
• ODM2 components: Observations Core, Feature Model, Sensor Extension, Equipment & Lab Extensions, Generic Extension
• Domain Cyberinfrastructures: CUAHSI HIS, EarthChem, CZOData, IOOS

Page 7

ODM2: Common to All Components

[Diagram labels]
• Components: Catalog, Data Server, Clients, Metadata Catalog, Data Storage
• Functions: Metadata Harvesting, Data Discovery, Data Delivery
• Connections: Metadata Transfer, Data Transfer, Database Encoding, XML Schema Encoding
• Legend: Data and Metadata Transfer; Information Model

Page 8

ODM2: Additional Goals

• Driven by Community & Use Cases:
  • 3 workshops + ~12 data models + much feedback
  • Use cases: CZOData, Little Bear River, PetDB, IOOS
• Balance between general and understandable
• External unique identifiers, vocabularies & taxonomies
• Rich Specimen, Site & other Sampling Features
• Granular Methods, Data Quality & Equipment
• Dataset publishing & archiving via:
  • Result “packages”, Versions, Citations, Provenance
• Strong Annotations & general extensibility

Page 9

ODM2Core

Page 10

ODM2Core

Page 11

ODM2SamplingFeatures

Page 12

ODM2Results

Page 13

ODM2ExternalIdentifiers

Page 14

ODM2Provenance

Page 15

ODM2Annotations

Page 16

ODM2Equipment

Page 17

ODM2DataQuality

Page 18

ODM2LabAnalyses

Page 19

ODM2Sensors

Page 20

NSF Scientific Software Integration

BiG CZ SSI project (2014-2015): The community-driven BiG CZ software system for integration and analysis of bio- and geoscience data in the critical zone

• Community Engagement in Software Design through co-design, training & testing workshops.

• BiG CZ Portal web application for high-performance map-based discovery, visualization, access & publication of data on critical zone structure & function

• BiG CZ Toolbox to enable cyber-savvy CZ scientists & data managers to manage and publish the data they produce through a single scientist-focused toolkit

• BiG CZ Central software stack to bridge data systems developed for multiple critical zone domains

Page 21

Thank You

Funded by the National Science Foundation

EAR 1224638, EAR 1332257, ACI 1339834

ODM2 is on GitHub: https://github.com/UCHIC/ODM2

Page 22

ODM2: Object-Relation Map

Page 23

What can we do with ODM2? (that we couldn’t do before)

• Add multiple comments/annotations to any entity

• Represent Actions and sequences of Actions that lead to observation Results

• More granularly represent people and organizations

• Store information about Actions that do not have Results
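
The points on this slide can be illustrated with a minimal Python sketch. It is not ODM2's actual schema: the class and field names (`Action`, `Result`, `Annotation`, `parent`, `lineage`) and the sample people, organization, and variable are assumptions made for this example. It shows a sequence of Actions in which only the last one yields a Result, and several annotations attached to one entity.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Annotation:
    text: str


@dataclass
class Result:
    variable: str
    annotations: List[Annotation] = field(default_factory=list)


@dataclass
class Action:
    action_type: str
    performed_by: str                     # person, kept separate from the organization
    organization: str
    parent: Optional["Action"] = None     # previous Action in the sequence, if any
    results: List[Result] = field(default_factory=list)        # may stay empty
    annotations: List[Annotation] = field(default_factory=list)

    def lineage(self) -> List[str]:
        """Return action types from the first Action in the chain to this one."""
        chain = []
        node = self
        while node is not None:
            chain.append(node.action_type)
            node = node.parent
        return list(reversed(chain))


# Hypothetical sequence: the site visit has no Result but is still recorded.
visit = Action("Site visit", "A. Technician", "Example Lab")
collect = Action("Specimen collection", "A. Technician", "Example Lab", parent=visit)
analyze = Action("Specimen analysis", "B. Analyst", "Example Lab", parent=collect)

analyze.results.append(Result("Nitrate concentration"))
analyze.annotations.append(Annotation("Instrument recalibrated before run"))
analyze.annotations.append(Annotation("Duplicate sample analyzed"))

print(analyze.lineage())
```

Linking each Action to its predecessor is one simple way to keep the sequence that led to a Result; a relational design would more likely use a separate relationship table.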

Page 24

What can we do with ODM2? (that we couldn’t do before)

• Separate Results from ResultValues – enables multiple ResultTypes

• Move DataValues out of the Core – better facilitates cataloging

• Add taxonomic classifiers to Results, adding an additional dimension to observations

• Create relationships among Results and store provenance

• Group Results into Datasets
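
A rough sketch of the separation described on this slide, using Python's built-in `sqlite3` module. The table and column names below are simplified assumptions for illustration, not the published ODM2 schema, and the inserted rows are made-up sample data. The idea shown is that a catalog query can describe Results, including their type, taxonomic classifier, and Dataset membership, without reading any data values.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE Results (              -- metadata only: what was observed
    ResultID INTEGER PRIMARY KEY,
    ResultType TEXT NOT NULL,       -- e.g. 'Time series', 'Measurement'
    VariableName TEXT NOT NULL,
    TaxonomicClassifier TEXT        -- optional extra dimension
);
CREATE TABLE ResultValues (         -- the numeric values, kept apart from metadata
    ValueID INTEGER PRIMARY KEY,
    ResultID INTEGER NOT NULL REFERENCES Results(ResultID),
    ValueDateTime TEXT,
    DataValue REAL NOT NULL
);
CREATE TABLE Datasets (
    DatasetID INTEGER PRIMARY KEY,
    Title TEXT NOT NULL
);
CREATE TABLE DatasetResults (       -- groups Results into Datasets
    DatasetID INTEGER NOT NULL REFERENCES Datasets(DatasetID),
    ResultID INTEGER NOT NULL REFERENCES Results(ResultID),
    PRIMARY KEY (DatasetID, ResultID)
);
""")

# Made-up sample rows.
conn.execute("INSERT INTO Results VALUES (1, 'Time series', 'Water temperature', NULL)")
conn.execute("INSERT INTO Results VALUES (2, 'Measurement', 'Biomass', 'Salmo trutta')")
conn.executemany(
    "INSERT INTO ResultValues (ResultID, ValueDateTime, DataValue) VALUES (?, ?, ?)",
    [
        (1, "2014-06-01T00:00", 11.2),
        (1, "2014-06-01T00:30", 11.4),
        (2, "2014-06-01T09:00", 0.35),
    ],
)
conn.execute("INSERT INTO Datasets VALUES (1, 'Example stream survey')")
conn.executemany("INSERT INTO DatasetResults VALUES (1, ?)", [(1,), (2,)])

# Catalog query: reads only the metadata tables, never ResultValues.
catalog = conn.execute("""
    SELECT r.ResultID, r.ResultType, r.VariableName, r.TaxonomicClassifier
    FROM Results r
    JOIN DatasetResults dr ON dr.ResultID = r.ResultID
    WHERE dr.DatasetID = 1
    ORDER BY r.ResultID
""").fetchall()

# Values are fetched separately, per Result, only when needed.
value_counts = dict(
    conn.execute("SELECT ResultID, COUNT(*) FROM ResultValues GROUP BY ResultID").fetchall()
)
print(catalog)
print(value_counts)
```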

Page 25

What can we do with ODM2? (that we couldn’t do before)

• Store information about the equipment used to create observations

• Add extension properties to any record in any entity

• Link many entities to external identifier systems

• Support SamplingFeatures of multiple types - Sites and Specimens, among others

• Not limited to a single spatial offset
• Not limited to a single qualifier
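
One way to picture extension properties, external identifiers, and typed SamplingFeatures is the small Python sketch below. It is an illustration under stated assumptions rather than ODM2's actual design: the class names, field names, the property key `bottle_material`, and the identifier value `EXAMPLE0001` are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class ExternalIdentifier:
    system: str        # name of the external identifier system
    identifier: str    # identifier issued by that system


@dataclass
class SamplingFeature:
    feature_type: str  # "Site", "Specimen", or another type
    name: str
    external_ids: List[ExternalIdentifier] = field(default_factory=list)
    # Free-form key/value pairs let a record carry attributes
    # that the fixed fields do not anticipate.
    extension_properties: Dict[str, str] = field(default_factory=dict)


site = SamplingFeature("Site", "Example stream gauge")
specimen = SamplingFeature("Specimen", "Example water sample")

# Hypothetical identifier and extension property.
specimen.external_ids.append(ExternalIdentifier("IGSN", "EXAMPLE0001"))
specimen.extension_properties["bottle_material"] = "HDPE"

# Both feature types live in the same collection and are told apart by type.
names_by_type: Dict[str, List[str]] = {}
for feature in (site, specimen):
    names_by_type.setdefault(feature.feature_type, []).append(feature.name)

print(names_by_type)
```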

Page 26

Observations Data Model 2.0

• NSF-funded project; PI: Jeff Horsburgh
• “Developing a Community Information Model and Supporting Software to Extend Interoperability of Sensor and Sample Based Earth Observations”
• To achieve interoperability between IEDA, EarthChem, CUAHSI HIS, and other data systems
• Better support for samples and unique identifiers (IGSN/SESAR)
• Extensibility to table attributes
• Better annotation and provenance
• Enable integrated web service based publication of a broader class of CZO data

Page 27

ODM2 Functional Use Cases

[Diagram labels]
• Information Model (All)
• Storage Encoding (USU/LDEO)
• Catalog Encoding (SDSC)
• Web Service Interface (UW)
• Archival Encoding (USU)
• XML Schema Encoding (SDSC)

Page 28

Future Directions for CZO Science

• Develop a unifying theoretical framework of CZ evolution;

• Develop coupled systems models to explore how CZ services respond to anthropogenic, climatic, and tectonic forcings;

• Develop four dimensional data sets that
  • document differing CZ geologic and climatic settings,
  • inform our theoretical framework,
  • constrain our conceptual and coupled systems models,
  • test model-generated hypotheses.

Report prepared by CZO community, Dec. 2010

Page 29

EarthCube Critical Zone Domain Workshop

Engaging the Critical Zone community to bridge long tail science with big data

Organizing Committee:

• Kerstin Lehnert, IEDA/Columbia
• Ilya Zaslavsky, SDSC
• David Tarboton, USU
• Jeff Horsburgh, USU
• Emilio Mayorga, UW-APL
• James Syvitski, CSDMS
• Susan Brantley, PSU & SH-CZO
• Susan Gill, SWRC

Convened by A.K. Aufdenkampe, C.J. Duffy, G.E. Tucker
Univ. of Delaware: Jan. 21-23, 2013

Page 30

103 Participants from 16 Disciplines

• Biogeochemistry (30)
• Biology/Ecology (15)
• Biology/Molecular (3)
• Climatology/Meteorology (15)
• Data Management/CyberInfrastructure (46)
• Engineering/Method Development (8)
• Geochemistry/Mineralogy (13)
• Geology/Chronology (14)
• Geomorphology (15)
• Geophysics (8)
• GIS/Remote Sensing (31)
• Hydrology (46)
• Modeling/Computational Science (36)
• Outreach/Education Research (7)
• Soil Science/Pedology (16)
• Water Chemistry (14)

Early-Career (28)
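
The per-discipline counts on this slide add up to well over 103, which suggests (as an assumption, since the slide does not say so) that participants could list more than one discipline. The short Python tally below, using only the numbers shown on the slide, makes the arithmetic explicit.

```python
# Counts copied from the slide; the multi-discipline interpretation is an assumption.
discipline_counts = {
    "Biogeochemistry": 30,
    "Biology/Ecology": 15,
    "Biology/Molecular": 3,
    "Climatology/Meteorology": 15,
    "Data Management/CyberInfrastructure": 46,
    "Engineering/Method Development": 8,
    "Geochemistry/Mineralogy": 13,
    "Geology/Chronology": 14,
    "Geomorphology": 15,
    "Geophysics": 8,
    "GIS/Remote Sensing": 31,
    "Hydrology": 46,
    "Modeling/Computational Science": 36,
    "Outreach/Education Research": 7,
    "Soil Science/Pedology": 16,
    "Water Chemistry": 14,
}
total_participants = 103

total_selections = sum(discipline_counts.values())
mean_per_participant = total_selections / total_participants

print(len(discipline_counts), total_selections, round(mean_per_participant, 2))
```

Under that assumption, each participant identified with roughly three disciplines on average.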

