+ All Categories
Home > Data & Analytics > FAIR data overview

FAIR data overview

Date post: 12-Jan-2017
Category:
Upload: luiz-olavo-bonino-da-silva-santos
View: 652 times
Download: 0 times
Share this document with a friend
53
FAIR DATA OVERVIEW Luiz Olavo Bonino - [email protected]
Transcript
Page 1: FAIR data overview

FAIR DATA OVERVIEW

Luiz Olavo Bonino - [email protected]

Page 2: FAIR data overview

SUMMARY What is FAIR data?

The FAIR ecosystem

Plans and how to realise

Page 3: FAIR data overview

(FAIR) DATA STEWARDSHIP

Page 4: FAIR data overview

DATA STEWARDSHIP Combination of all expertise to treat data well in

a project:■ Experiment design and data-design;■ Re-use of existing data where possible;■ Planning of the storage, networking and

computing infrastructure;■ Data acquisition and processing;■ Data publishing in a format that allows functional

interlinking of data(sets) as well as in a format suitable for long-term preservation.

Page 5: FAIR data overview

FAIR DATA STEWARDSHIP Combination of all expertise to treat data well in

a project:■ Experiment design and data-design;■ Re-use of existing data where possible;■ Planning of the storage, networking and computing

infrastructure;■ Data acquisition and processing;■ Data publishing in a format that allows functional

interlinking of data(sets) as well as in a format suitable for long-term preservation.

FAIR Data

Page 6: FAIR data overview
Page 7: FAIR data overview

DATA STEWARDSHIP – PROCESS VIEW

Page 8: FAIR data overview

DATA STEWARDSHIP – SUSTAINABILITY VIEW

Page 9: FAIR data overview
Page 10: FAIR data overview

Produces Consumes

Page 11: FAIR data overview

Produces Consumes

storage

sustainability

maintenance

license

privacy security

stewardship

access

?

Page 12: FAIR data overview

Produces Consumes

RDFMIAPE

DBMS Excel

APISQL SPARQL Metadat

a

DICOM

MIRIAMSemantics

Page 13: FAIR data overview

Produces Consumes

access

find

query

format

licenseintegrate

Page 14: FAIR data overview

WHAT IS FAIR DATA?FAIR Data aims to support existing communities in their attempts to enable valuable scientific data and knowledge to be published and utilised in a ‘FAIR’ manner.

Findable - (meta)data is uniquely and persistently identifiable. Should have basic machine readable descriptive metadata.

Accessible - data is reachable and accessible by humans and machines using standard formats and protocols.

Interoperable - (meta)data is machine readable and annotated with resolvable vocabularies/ontologies.

Reusable - (meta)data is sufficiently well-described to allow (semi)automated integration with other compatible data sources.

Page 15: FAIR data overview

THE FAIR ECOSYSTEM

FAIR Data Principles

FAIR Data Protocol

FAIR Data Resources

FAIR Data Core Technologies

FAIR Data Systems/Tools

Normative

Artefact

Software

Page 16: FAIR data overview

FAIR ECOSYSTEM - NORMATIVE LEVEL FAIR Data Principles - general principles guiding

FAIR data solutions;

FAIR Data Protocol - complying with the FAIR Data Principles, provide guidelines for implementing FAIR data solutions, e.g., standards, APIs, technologies, …;

Page 17: FAIR data overview

FAIR DATA PROTOCOL

Findable - standards for describing the dataset with the relevant metadata;

Accessible - standards for represent and access the data according to the defined usage license;

Interoperable - standards for machine readable descriptions of the (meta)data and (semantic)annotation;

Reusable - standards for semantic annotation of the (meta)data supporting machine reasoning, and standards for defining data provenance and support citation;

The standards include technologies (e.g., RDF, nano pub, JSON, OWL, etc.) as well as protocols and APIs.

Page 18: FAIR data overview

FAIR ECOSYSTEM - ARTEFACT LEVEL FAIR Data Resource - datasets expressed using

one of the prescribed standards of the FAIR Data Protocol and with metadata complying with the protocol.

Annotation Ontology - reference conceptual model used to provide semantics to elements of FAIR Data Resources through annotation.

Controlled vocabularies, dictionaries, etc.

Page 19: FAIR data overview

FAIR DATA RESOURCEDatasets expressed using one of the prescribed standards of the FAIR Data Protocol, with metadata complying with the protocol and license. The original dataset is transformed into a FAIR format and proper metadata and license are added to produce a FAIR Data Resource. The original and the FAIR version can co-exist, each one fulfilling its own purpose.

FAIR Conversion

FAIR Data Resource

Page 20: FAIR data overview

FAIR DATA RESOURCE

Data Creation

FAIR Data Resource

FAIR Data Creation

FAIR Data Resource

Page 21: FAIR data overview
Page 22: FAIR data overview
Page 23: FAIR data overview

SHARING DATAI would like to exploit common genotype-phenotype relations between Alzheimer’s

Disease and Huntington’s Disease…I need to combine AD and HD data…

I can help with that!I can help with that!

Source: Marcos Roos

Page 24: FAIR data overview

SHARING DATA

Source: Marcos Roos

???

Here’s my data, have

fun!Here’s my data, have

fun!

Page 25: FAIR data overview

SHARING LINKABLE DATA

Source: Marcos Roos

I can go straight to answering my questions with data from multiple data owners!

Patients will be so pleased with this speed-up!

Here’s my Linked Data,

have fun!Here’s my

Linked Data, have fun!

Page 26: FAIR data overview

Raw data(many formats)

Page 27: FAIR data overview

Raw data(many formats)

Processed data(primary storage format)

Initial transformation

Page 28: FAIR data overview

Raw data(many formats)

Processed data(primary storage format)

ProvenanceInitial transformation

Page 29: FAIR data overview

Raw data(many formats)

Processed data(primary storage format)

FAIR transformation

FAIR (meta)data(RDF,XML etc.)

ProvenanceInitial transformation

Page 30: FAIR data overview

Raw data(many formats)

Processed data(primary storage format)

FAIR transformation

FAIR (meta)data(RDF,XML etc.)

ProvenanceInitial transformation

Page 31: FAIR data overview

Raw data(many formats)

FAIR download(in local format)

Processed data(primary storage format)

FAIR transformation

FAIR (meta)data(RDF,XML etc.)

ProvenanceInitial transformation

Page 32: FAIR data overview

Raw data(many formats)

FAIR download(in local format)

Processed data(primary storage format)

FAIR transformation

FAIR (meta)data(RDF,XML etc.)

High-PerformanceAnalysis

ProvenanceInitial transformation

Analysis transformation

Page 33: FAIR data overview

FAIR DATA APPLICATION ECOSYSTEM (NL APPROACH)

Page 34: FAIR data overview

FAIR DATA RESOURCE

FAIR transformation

FAIR Data Resource

Page 35: FAIR data overview

BRING YOUR OWN DATA - BYOD Goals:

■ Learn how to make data linkable “hands-on” with experts

■ Create a “telling story” to demonstrate its use

Composition:■ Data owners – specialists on given datasets■ Data interoperability experts■ Domain experts

Source: Marcos Roos

Page 36: FAIR data overview

BYOD

Page 37: FAIR data overview

FAIRIFIER

Page 38: FAIR data overview

FAIRIFIER

Page 39: FAIR data overview

FAIR DATA MODEL REGISTRY

Page 40: FAIR data overview

FAIRIFIER AND FAIR DATA MODEL REGISTRY

Page 41: FAIR data overview

A particular class of FAIR Data System that provides access to published datasets. The datasets can be external or internal to the FAIR Data Point. Also, the source data can be a regular (non-FAIR) dataset or a FAIR Data Resource. If the source data is non-FAIR, the FAIR Data Point needs to made the necessary FAIR transformations on the fly.

Page 42: FAIR data overview

FAIR DATA POINT

Page 43: FAIR data overview
Page 44: FAIR data overview

A particular class of FAIR Data System to provide support for data interoperability;

Supports publication and access to FAIR data. Fosters an ecosystems of applications and

services; Federated architecture: different FAIRports

(and other FAIR Data Systems) are interconnectable;

Supports citations of datasets and data items; Provides metrics for data usage and citation;

Page 45: FAIR data overview

FAIR DATA PUBLICATION

Page 46: FAIR data overview

FAIR DATA ACCESS

Page 47: FAIR data overview

DISTRIBUTED ARCHITECTURE

Page 48: FAIR data overview

F A IR

Page 49: FAIR data overview

FAIRPORT ECOSYSTEM

Page 50: FAIR data overview

FAIRPORT

Page 51: FAIR data overview

WORK ORGANISATION (NL APPROACH)

Page 52: FAIR data overview

HOW TO REALISEDTL

National FAIR data engineering team

BBMRI

2.0

Elixir

OtherProject

ODE

X4A

LL

ENPADASI

TRAI

TEA

TRIS

FAIR Data IG(P.I.s with

data)

Data SACG.M., B.M., M.G.

Data Core Team

FAIR Data Executive TeamL.B. (CTO), R.H., P.B., M.S., J.B., J.W.

engineers in theFAIR Data virtual team

Core FAIR

TechnologyFAIR Data V.T.

FAIR Data V.T.

FAIR Data V.T.

FAIR Data V.T.

FAIR Data V.T.

FAIR Data V.T.

Local “DTLs”/ projectsprojec

ts

Page 53: FAIR data overview

QUESTIONS?


Recommended