Date post: | 11-Jun-2015 |
Category: |
Technology |
Upload: | brigitte-joerg |
View: | 68 times |
Download: | 1 times |
Data in Context
Co-chairs: Brigitte Jörg, Keith Jeffery
RDA 3rd Plenary, March, 26th - 28th, 2014 Dublin
Brief History• 1st Plenary Gothenburg Preparing a WG Proposal/Case
Statement „Contextual Metadata“• A lot of interest• Revision of Initial Use Cases• Use Cases as specific as possible• Alignment with other WGs / Activities• Four revised use cases:
– Researcher: Find data ..– Manager: Indicate to funder – Provenance: Allow to take segments from streamed data
workflows– Interoperability: Exchange of contextual metadata
• Rename Group to „Data in Context“
Data in Context IG Approach• Lifecycle Approach
– Linear Sequence of Elements– Cyclic Repetition of Elements
• Investigate Lifecycle Models– DCC: Conceptualize; Create;
Access; Use; Appraise; Select; Dispose; etc
– DDI: Discovery & Planning; Initial Data Collection; etc.
– Research Lifecycle (Jisc): Research Process: Simulate Experiment; Manage Data; Analyse; etc.
– etc. ??
• Investigate contextually or subcontextually-aware standardization work– OAIS; CASRAI; CERIF; VIVO; PROV;
PREMIS; MARC; CKAN; DCAT; ISO; W3C; OMG; Research Objects, etc.
• Investigate / Prioritize Reusable Requirements
• Deliverables: – M6: Overview of contextually-
aware standardization work– M12: Priority List of Requirements
• Goal: – Set up of a Working Group– Implementation of Standardized
Profiles
• Long-term Goal: – Automated Transformation
Between Standards
Collaboration / Exchange
• RDA Foundation and Terminology• RDA Metadata Standards Directory WG• RDA PID Information Types WG• ICSU Open Metadata Catalogue and Knowledge
Networks WG• RDA/WDS Workflows for Publishing Data IG• RDA Data Description Registry Interoperability• RDA Semantic Interoperability Activity• RDA Metadata Interest Group• Various W3C groups (LOD, SW....)
Requirements / Needs
• Stakeholders• Data Producers• Data Consumers
• Standardized Open Vocabularies• Standardized Formal Data Profiles• Standardized Formal Semantics
Template
First Steps taken with developing a Template
Apply
DCC – The Curation Lifecycle
Stakeholders
Data Producer
Data Consumer
Standardized Open Vocabularies
Standardized Formal Data Profiles
Standardized Formal Semantics
http://www.dcc.ac.uk/digital-curation/what-digital-curation
DDI Lifecycle
http://www.ddialliance.org/Specification/DDI-CV/
DDI Controlled VocabulariesAnalysis Unit; Character Set; Commonality Type; Lifecycle Event Type; Response Unit; Software Package; Summary Statistic TypeTime Method
Stakeholders
Data Producer
Data Consumer
Standardized Open Vocabularies
Standardized Formal Data Profiles
Standardized Formal Semantics
Data Assets Framework
Stakeholders
Data Producer
Data Consumer
Standardized Open Vocabularies
Standardized Formal Data Profiles
Standardized Formal Semanticshttp://www.data-audit.eu/
Research Lifecycle
DDI Controlled VocabulariesAnalysis Unit; Character Set; Commonality Type; Lifecycle Event Type; Response Unit; Software Package; Summary Statistic TypeTime Method
Stakeholders
Data Producer
Data Consumer
Standardized Open Vocabularies
Standardized Formal Data Profiles
Standardized Formal Semantics
http://www.jisc.ac.uk/whatwedo/campaigns/res3/jischelp.aspx
RDA Practical Policy WG
Stakeholders
Data Producer
Data Consumer
Standardized Open Vocabularies
Standardized Formal Data Profiles
Standardized Formal Semantics
Src: Slide Extract Rainer Stotzka, Reagan Moore provided for „Data in Context“ session, RDA 3rd Plenary
Data Lifecycle
Stakeholders
Data Producer
Data Consumer
Standardized Open Vocabularies
Standardized Formal Data Profiles
Standardized Formal Semantics
DATA
Collaboration &
Visualisation
Dissemination &
Sharing
Archiving &
Preserving
Analysis&
Data Mining
Acquisition &
Modeling
Src: Keynote Tony Hey at RDA 3rd Plenary
Experimental Context, Publishing and Research Objects
Proposal
Approval
SchedulingExperiment/Investigation
Data storage
Record Publication
Scientist submits application for
beamtime
Facility committee approves
applicationFacility registers,
trains, and schedules
scientist’s visit
Scientists visits facility, run’s experiment
Subsequent publication
registered with facility
Raw data filtered, and stored
Data analysis
Tools for processing made
available
Investigation as a first class object
Src: Slide extract Brian Matthews, STFC provided for „Data in Context“ session, RDA 3rd Plenary
Liberalised Meta-DataIs a network
13
Citation
Coverage(Temporal,
Spatial, Topic)
Use, Caveats, Lineage,
Methods, and Licenses
Publisher
People
Institutions
RDI Outputs/ Online
Resources
Projects
Initiatives
Networks
Funders
Relationships are contributed by (1) meta-data mining (2) information from websites conforming to schema (3) social-media-type sites and VREs (4) existing network contributions (5) scraping existing websites (6) ontologies and vocabularies (…)
Src: Slide Extract Wim Hugo, ICSU WDS provided for „Data in Context“ session, RDA 3rd Plenary
Etc.
• Data Curation Profiles (Purdue University)• ODP Model (ISO Reference Model for Open
Distributed Processing)
Standards
Jeffery et. al. 2013 http://resources.metapress.com/pdf-preview.axd?code=vl5422n2u7112669&size=largest
• e.g. • OAIS• CASRAI• CERIF• VIVO• PROV• PREMIS• MARC
• CKAN• DCAT• ISO• W3C• OMG• ODP• etc.
Emerging e-Infrastructure
Discovery
Contextual
Discovery
Jeffery et. al. 2013 http://resources.metapress.com/pdf-preview.axd?code=vl5422n2u7112669&size=largest
AgendaSession 1: Thursday, March 27 - 15:30 - 17:00
• Introduction and Overview from Co-Chairs • Contributions from RDA Members
– Data Publishing Workflows, DCC Data Profiles (Angus Whyte) – Data Description Registry Interoperability (Amir Aryani)– Long-tail Data IG, Data Publishing IG (Jochen Schirrwagen)– WDS Knowledge Network activity (Wim Hugo) – Experimental Context, Publishing and Research Objects (Brian Matthews)– Reference Model Proposal (Yin Chen)
• Discussion
Notes Taking: Alessia Bardi, RDA Early Career Researchers Programme recipient.
AgendaSession 2: Friday, March 28 - 11:00 – 12:30
• Recap and Overview from Co-Chairs • Contributions from RDA Members
– Semantic Interoperability, (Gery Berg-Cross) – Metadata WGs (Keith Jeffery, Rebecca Koskela)– Practical Policy Sessions (Slides Reagan Moore)
• Discussion
Notes Taking: Alessia Bardi, RDA Early Career Researchers Programme recipient.
Rough Work Plan
• M6: Overview of contextually aware standardization work
• M12: Priority List of Requirements
From there set up a RDA Working Group Requirements-drivenImplementation of Standards WG Plan