Date post: | 15-Apr-2017 |
Category: |
Government & Nonprofit |
Upload: | casrai |
View: | 79 times |
Download: | 1 times |
rdc-drc.ca @rdc_drc
Research Data Canada is supported by CANARIE, an organization dedicated to advancing Canada's knowledge and innovation infrastructure.
National Data Services: ReviewMark Leggott, Executive Director | ReConnect 16| Oct 25, 2016Let’s connect: [email protected] | @mleggott
rdc-drc.ca @rdc_drc 2
> Context
rdc-drc.ca @rdc_drc 3
Publish or Perish
Open by
Default
rdc-drc.ca @rdc_drc 5
Data InputData
Enhancement
Data Validation
Reproducibility
Discoverability
Serendipity
Linkages
Innovation
Impact
Training
Reusability
rdc-drc.ca @rdc_drc 6
National Data Services˃Storage & Preservation Services˃Computational & Analysis Services˃Discovery Services˃Identifier Services˃Dissemination Services˃Support and Training Services˃Policy Rationalization and Development˃Communication and Coordination
rdc-drc.ca @rdc_drc 7
National Data Services - Level˃Level
• National• Regional• Consortial• Institutional• Project
˃Design• Centralized• Federated• Hybrid
rdc-drc.ca @rdc_drc 8
National Data Services - Scope˃Data has no boundaries
• Data as Research Outputs• Data as Research Inputs
˃Functions for managing data are pretty much the same for both
˃Can we use same infrastructure for both?
9
Interoperability
Linda Naughton, Jisc. June 2016,
Jisc - RDM shared services
Linda Naughton, Jisc. June 2016,
Front End/ User Interface
Middle Layer
Storage Layer
Preservation Layer
Basic Metadata EntryIngest UI
Registry/catalogue search function
Data discovery UILanding page with DOI,
Discovery Metadata, and metrics
Data Publication UI
Data Registry/ Catalogue/ Repository
API’s CRIS, DataCite, ORCID, LOD, funders Etc.
Archival Management
Access Data Storage
Access Data Storage
Archive Data Storage
Archive Data Storage
Preservation/ Curation Metadata
File Format Identification
tools
File/ media migration/
transformation tools
Emulation tools
Other preservation/ Curation tools
rdc-drc.ca @rdc_drc 11 11
rdc-drc.ca @rdc_drc 15
Storage and Preservation - Current˃Institutional IRs and Domain repositories• NRC Gateway – Canadian Repositories• DANS Easy, EUDAT B2SAFE, Research Data Australia
˃Repositories• CARL Portage Network/Compute Canada FRDR and
integrations with Archivematica and Islandora• Scholar’s Portal, BC, U of Alberta, Dataverse systems• Open Science Framework
˃Services• Cybera/CANARIE/CC DAIR, WestGrid ownCloud
rdc-drc.ca @rdc_drc 16
Storage and Preservation – Possible?˃Pronom-like authority for identifying/transforming research data files and outputs.
˃Policy-based replication of all research outputs to regional and international storage.
˃One-Click acquisition of storage resources from a national shared infrastructure.
˃Synchronization of Active Data Management Plans and auto-provision of storage/compute resources.
˃Create preservation storage via backend allocation of a % of active storage from all institutions.
rdc-drc.ca @rdc_drc 17
Compute & Analysis Services - Current
˃Integration between HPC and Data platforms• EUDAT B2STAGE (iRODS/GridFTP)• VRE4EIC• Compute Canada Globus Portal
˃Integration of Science Workflow systems for computation AND RDM• Taverna, VisTrails, Kepler
˃Visualization Tools• Ninaliit
rdc-drc.ca @rdc_drc 18
Compute & Analysis Services – Possible?˃Automatic selection and analysis of slice of big data based on English language query
˃Virtual Research Data Centres – secure and accessible
˃EU Open Science Cloud˃BitTorrent for Live Research Data?
rdc-drc.ca @rdc_drc 19
Discovery Services - Current˃National/International Federated Metadata Repos• SHARE, DANS Search, DLI Service• EUDAT B2SHARE, B2FIND
˃CARL Portage/Compute Canada• FRDR System, Discovery Paper• UBC Open Collections system
˃Federal/Provincial/Municipal Data• GoC Open Data Portal, Alberta OG, DataBC, Toronto
rdc-drc.ca @rdc_drc 20
Discovery Services – Possible?˃Siri for Research – AI Interfaces to all Outputs
˃Index fulltext/intelligent harvest of all outputs in domain/region
˃Rich Linked Data repository of all outputs• ResearchLink• Research Connection
˃Other Interesting Technologies• ContentMine, Research Data Switchboard
rdc-drc.ca @rdc_drc 21
Identifier Services - Current˃Integration of ORCID into wide range of systems• ORCID CA Project• ORCID CA Feedback Form
˃Research Networking tools and systems˃RDC Best Practices Document
• Unique Identifiers: Current Landscape and Future Trends
˃Canadian Services• UBC DOI Services• DataCite Canada
rdc-drc.ca @rdc_drc 22
Identifier Services – Possible?˃Automatic collaborator detection engine based on description of new research approach.
˃Auto-selection of peer reviewers attached to open peer review system.
˃Simpler harvest of disparate research/data systems via a single API (e.g. ORCID).
˃Development of lightweight ID minting services that can be integrated into any SW platform.
˃Adoption of ORCID by all Canadian organizations and uptake by 100% of researchers.
rdc-drc.ca @rdc_drc 23
Dissemination Services - Current˃Data Sharing
• EUDAT B2DROP• Compute Canada Globus Portal
˃Data Publication• OpenTrials, Open Lab/Note Books, Zenodo, Open Data
Button• Default publication of all results
– JNRBM, JNR, PLOS Missing Pieces• Danish Open Access Barometer
rdc-drc.ca @rdc_drc 24
Dissemination Services – Possible?˃CI service with full compute environment & data
˃Default to Containers for Reproducible Research• GUIdock, SSI, OSF Container Strategies Workshop,
ReproZip˃Innovation in data/outputs/alerting/editing• Biosharing• nowomics-style updates on the latest outputs• symplur-style “flattening” of data from all sources• Dokieli-style article publishing
rdc-drc.ca @rdc_drc 25
Support and Training - Current˃Support Networks
• Portage– DMP Tool, RDM Services, Network of Expertise
• GoC Open Data eXchange
rdc-drc.ca @rdc_drc 26
Support and Training – Possible?˃A modular international curriculum˃Development of an Open Textbook for RDM
˃Use of Open Notebooks and related Open Data frameworks as learning platforms
rdc-drc.ca @rdc_drc 27
Policy - Current˃Principles and Policies
• TC3 OA Policy and RDM Guidelines• RDC RDM Principles
˃Research Information Infrastructure• OpenRIF semantic efforts• CASRAI Community
rdc-drc.ca @rdc_drc 28
Policy – Possible?˃Allocation of 2% of total R&D annual spend by public institutions.
˃Adoption of a common set of RDM Principles by all publicly funded organizations by 2026.
˃Adoption of RDM and Open by Default Policies by 50% of publicly funded institutions by 2020.
˃Synchronization of Canadian policy frameworks with EU and other partners by 2020.
˃Require immediate data sharing for public health emergencies
rdc-drc.ca @rdc_drc 29
RDC Portage
CASRAI RDA
Re-Use
Research Data
Research Information
LCDICC
CANARIE
NRC
COU
ISED
CUCCIO
CARL CAULODC TC3+
Open Information
Open Data
ONC
Comms & Coordination - Current
rdc-drc.ca @rdc_drc 30
Comms & Coordination – Possible?˃A single source of coordination for Canada’s RDM and DRI organizations, with representation from all core organizations.
˃A coordination of funding for National Data Services.
rdc-drc.ca @rdc_drc 31
Portage
RDC
Coordination
rdc-drc.ca @rdc_drc 32
> Research Data Canada works with stakeholders to ensure research data is available to support innovation that benefits all Canadians.
rdc-drc.ca @rdc_drc 33
The DCC Curation Lifecycle Model: http://www.ijdc.net/index.php/ijdc/article/viewFile/69/48.
Universities
Federal Funding Agencies
Federal Research Agencies
Provincial Funding Agencies
Provincial Research Agencies
Open Data Organizatio
ns
Non-Profit & NGO
Research Organizatio
ns
Commercial Research
Organizations
International Agencies and Collaborators
rdc-drc.ca @rdc_drc 34
Role of RDC˃Engage full stakeholder community
• Organizations that receive public research funds• Organizations that give public research funds• Organizations that facilitate these efforts
˃Facilitation and Coordination˃Outreach and Communication˃Development and Promotion of Best Practices
˃International Liaison
rdc-drc.ca @rdc_drc 35
researchlink.rdc-drc.ca/vivo
rdc-drc.ca @rdc_drc 36
RDC Outputs
National Data Services
Framework Requirements
& Best PracticesMar 2017
Portage/CC/CASRAI Outputs
& Systems
Jul 2016 Jun 2017
RDA Outputs
Federal & Provincial Outputs
Other Canadian Outputs
Jisc OutputsOther
International Outputs
Vision for a National Data
Services FrameworkNov 2016
National Data
Services and Federated Research
Data Repository Framework
RDM Ecosystem Map
Semantic
Repository Pilot
ORCID-CA +
CAF SPs
DOI Service
s
rdc-drc.ca @rdc_drc 37
Brainstorming Session˃Charge
• Where do we want to be in 10 years?• Let’s Blue Sky, worry about how at the next meeting!• There will be a prize for the team that generates the
most ideas!˃Not allowed
• But there are privacy issues…• That would be so expensive…• Who would do that?
rdc-drc.ca @rdc_drcContact me:
Research Data Canada is supported by CANARIE, an organization dedicated to advancing Canada's knowledge and innovation infrastructure.
[email protected] | @mleggott