Post on 17-Oct-2014
Building Communities & Services in Support of Data-Intensive Research
From Flickr by oennuja
NISO webinar 17 Sept 2013
Carly Strasser | California Digital Library @carlystrasser
Why is data curation a hot topic?
From Flickr by Velo Steve
Back in the day…
Da Vinci
Curie
Newton
classicalschool.blogspot.com
Darwin
Digital data
From Flickr by Flickmor
From Flickr by US Army Environmental Command
From Flickr by DW0825
C. Strasser
Courtesy of WHOI
From Flickr by deltaMike
From Flickr by ~Minnea~
Data management
Documentation
Reproducibility
From Flickr by Michael Tinkler
Data Curation
Data curation is a continuation of the library’s long-standing mission to connect patrons with content in meaningful ways across barriers of space and time.
- C. Tenopir et al. 2012
From Flickr by neilio
From Flickr by Richard Eriksson
Culture Shift Ahead
Data are being recognized as first-class products of research
From Flickr by Richard Moross
Data management plans
Data sharing mandates
Data publications
Data citation
From Flickr by torkildr
Data Life Cycle: Plan, Collect, Describe, Analyze, Preserve, Share
Plan > Collect > Describe > Analyze > Preserve > Share
dmptool.org
Step-by-step wizard, open to community
Create, edit, re-use, share, & save data management plans
Customization:
• Suggested answers
• Help text
• Resources
DMPTool Uptake
[Chart. Axes: Number of Institutions (0-800); Number of Plans (solid) & Unique Users (dashed) (0-6000). Series: Unique Users, Plans, Institutions. Data labels shown: 736; 5211; 4519.]
DMPTool2: Responding to the Community
• Administrator interface
• Open API / Interoperability
• Improved functionality
Winter 2013-2014
dataup.cdlib.org
Open source tool to describe, manage, and share tabular data
Features:
• Best practices check
• Generate metadata
• Get identifier & citation
• Post data to repository
• NSF funding via DataONE
• Partnership with Microsoft Research, SDSC
• Enable customization
From animationresources.org
merritt.cdlib.org
Repository for preservation & access to digital assets
• Open to the UC community and external partners
• Content-agnostic
• Dark archive for long-term preservation
• Bright archive for sharing
was.cdlib.org
Analysis tools
Full-text search
10,772 web sites
Preserve & store websites
“The New Internet” from siliconangle.com
n2t.net/ezid
Create persistent identifiers
Manage identifiers & associated metadata
Resolve identifiers
DOI: 10.1890/1540-9295-10.2.59
ARK: 90135/q13f4mjk
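As a rough illustration of what "resolving" an identifier means in practice, the sketch below turns a DOI or ARK string into a URL that a resolver could redirect to the object's current location. It is only a sketch under stated assumptions: the function name `resolver_url` is hypothetical, the `https://doi.org/` and `https://n2t.net/` base URLs and the `ark:/` prefix are assumed conventions rather than anything stated on the slides, and nothing here checks that the example identifiers actually resolve.

```python
from urllib.parse import quote

# Assumed resolver base URLs (not taken from the slides).
DOI_RESOLVER = "https://doi.org/"
ARK_RESOLVER = "https://n2t.net/"


def resolver_url(identifier: str) -> str:
    """Build a resolver URL from a string such as 'doi:10.x/y' or 'ARK: 12345/abc'."""
    scheme, _, value = identifier.strip().partition(":")
    scheme = scheme.strip().lower()
    # Tolerate 'ARK: 90135/...' as well as 'ark:/90135/...'.
    value = value.strip().lstrip("/")
    if not value:
        raise ValueError(f"no identifier value in {identifier!r}")
    if scheme == "doi":
        return DOI_RESOLVER + quote(value, safe="/")
    if scheme == "ark":
        return ARK_RESOLVER + "ark:/" + quote(value, safe="/")
    raise ValueError(f"unsupported identifier scheme: {scheme!r}")


if __name__ == "__main__":
    print(resolver_url("DOI: 10.1890/1540-9295-10.2.59"))
    print(resolver_url("ARK: 90135/q13f4mjk"))
```

The point of the indirection is that the identifier string stays fixed while the resolver's target can be updated when the object moves.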
Where are these identifiers from?
[Diagram labels: EZID clients, EZID, ARKs, DOIs, IDF, Resolver, Website with “object”]
Identifiers & Data Citation
• Allows readers to find data products
• Get credit for data and publications
• Promotes reproducibility
• Better measure of research impact
Example: Sidlauskas, B. 2007. Data from: Testing for unequal rates of morphological diversification in the absence of a detailed phylogeny: a case study from characiform fishes. Dryad Digital Repository. doi:10.5061/dryad.20
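The example citation above has five parts: author, year, title, repository, and identifier. A minimal sketch of assembling a citation in that same pattern might look like the following. The `DataCitation` class and its field names are hypothetical, chosen only for illustration, and the output format simply mirrors the example rather than any particular citation standard.

```python
from dataclasses import dataclass


@dataclass
class DataCitation:
    """Hypothetical container for the parts of a data citation."""
    author: str      # e.g. "Sidlauskas, B."
    year: int
    title: str
    repository: str
    doi: str         # bare DOI, without the "doi:" prefix

    def format(self) -> str:
        # Author Year. Title. Repository. doi:DOI
        return (
            f"{self.author} {self.year}. {self.title}. "
            f"{self.repository}. doi:{self.doi}"
        )


if __name__ == "__main__":
    citation = DataCitation(
        author="Sidlauskas, B.",
        year=2007,
        title="Data from: Testing for unequal rates of morphological "
              "diversification in the absence of a detailed phylogeny: "
              "a case study from characiform fishes",
        repository="Dryad Digital Repository",
        doi="10.5061/dryad.20",
    )
    print(citation.format())
```

Keeping the identifier as its own field makes it easy to pair the printed citation with a resolvable link.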
Website: carlystrasser.net
Email: carlystrasser@gmail.com
Tweet: @carlystrasser
Slides: slideshare.net/carlystrasser
CDL Blog: datapub.cdlib.org
cdlib.org/services/uc3