NISO Webinar on data curation services at the CDL

Post on 17-Oct-2014

420 views 1 download

Tags:

description

"Building communities and Services in Support of Data-Intensive Research". Webinar on 18 Sept 2013 for the NISO Webinar Series. This was part 2 of 2 for Data Curation

transcript

Building  Communities  &  Services    in  Support  of  Data-­‐Intensive  Research  

From  Flickr  by  oennuja  

NISO  webinar  17  Sept  2013  

Carly  Strasser    |    California  Digital  Library    @carlystrasser  

Why  is  data  curation  a  hot  topic?  

From  Flickr  by  Velo  Steve  

Back in the day…

Da  Vinci  

Curie  

Newton  

classicalschool.blogspot.com  

Darwin  

Digital  data  From

 Flickr  by  Flickm

or  

From

 Flickr  by  US  Arm

y  En

vironm

ental  C

omman

d  

From

 Flickr  by    DW08

25  

C.  Strasser  

Courtesey  of  W

HOI  

From

 Flickr  by    deltaMike  

From  Flickr  by  ~Minnea~  

Data  management  Documentation  Reproducibility  

From  Flickr  by  Michael  Tinkler  

From  Flickr  by  Michael  Tinkler  

Data  Curation  

 

Data  curation  is  a  continuation  of  the  library’s  long-­‐standing  mission  to  connect  patrons  with  content  in  meaningful  ways  across  barriers  of  space  and  time.  

-­‐  C  Tenopir  et  al.  2012  

From

 Flickr  by  ne

ilio  

From

 Flickr  by  Rich

ard  Eriksson

 

Culture  Shift  Ahead  

Data  are  being  recognized  as  first  class  products  of  research  

From  Flickr  by  Richard  Moross  

Data  management  plans  

Data  sharing  mandates  

Data  publications  

Data  citation  

From  Flickr  by  torkildr  

Plan  

Collect  

Describe  

Analyze  

Preserve  

Share   Data  Life  Cycle  

Plan  >  Collect  >  Describe  >  Analyze  >  Preserve  >  Share  

dmptool.org  

Step-­‐by-­‐step  wizard,  open  to  community    Create,  edit,  re-­‐use,  share,  &  save    

data  management  plans  

Plan  >  Collect  >  Describe  >  Analyze  >  Preserve  >  Share  

Customization:  •  Suggested  answers  •  Help  text  •  Resources  

DMPTool  Uptake  

0

100

200

300

400

500

600

700

800

0

1000

2000

3000

4000

5000

6000

Num

ber o

f Ins

titut

ions

Num

ber o

f Pla

ns (s

olid

) & U

niqu

e U

sers

(das

hed)

Unique Users Plans Institutions

736

5211

4519

DMPTool2:    Responding  to  the  Community  

Administrator  interface  Open  API  /  Interoperability  Improved  functionality  

Winter 2013-2014

Plan  >  Collect  >  Describe  >  Analyze  >  Preserve  >  Share  

dataup.cdlib.org  

Plan  >  Collect  >  Describe  >  Analyze  >  Preserve  >  Share  

Open  source  tool  to  describe,  manage,  and  

share  tabular  data  

Features  Best  practices  check  Generate  metadata  

Get  identifier  &  citation  Post  data  to  repository  

•  NSF  funding  via  DataONE  

•  Partnership  with  Microsoft  Research,  SDSC  

•  Enable  Customization  From  animationresources.org  

Plan  >  Collect  >  Describe  >  Analyze  >  Preserve  >  Share  

merritt.cdlib.org  

Plan  >  Collect  >  Describe  >  Analyze  >  Preserve  >  Share  

Repository  for  preservation    &  access  to  digital  assets    

•  Open  to  the  UC  community  and  external  partners  

•  Content-­‐agnostic  •  Dark  archive  for  long-­‐term  

preservation  •  Bright  archive  for  sharing  

Plan  >  Collect  >  Describe  >  Analyze  >  Preserve  >  Share  

was.cdlib.org  

Analysis  tools  Full-­‐text  search  10,772  web  sites  

Preserve  &  store  websites  

“The  New  Internet”  from  siliconangle.com  

Plan  >  Collect  >  Describe  >  Analyze  >  Preserve  >  Share  

n2t.net/ezid  

Plan  >  Collect  >  Describe  >  Analyze  >  Preserve  >  Share  

Create  persistent  identifiers  Manage  identifiers  &  associated  metadata  

Resolve  identifiers  

DOI:    10.1890/1540-­‐9295-­‐10.2.59  ARK:  90135/q13f4mjk    

Res

olve

r  

Website  with  

“object”  

 ARKs  

DOIs  IDF  

EZID  CLIENTS  

DOIs  

DOIs  

Where  are  these  identifiers  from?  

EZID  CLIENTS  

Identifiers  &  Data  Citation  

Allows  readers  to  find  data  products  Get  credit  for  data  and  publications  

Promotes  reproducibility  Better  measure  of  research  impact  

Example:  Sidlauskas,  B.  2007.  Data  from:  Testing  for  unequal  rates  of  morphological  diversification  in  the  absence  of  a  detailed  phylogeny:  a  case  study  from  characiform  fishes.  Dryad  Digital  Repository.  doi:10.5061/dryad.20  

Website  Email  Tweet  Slides  

CDL  Blog  

carlystrasser.net  carlystrasser@gmail.com  @carlystrasser    slideshare.net/carlystrasser  datapub.cdlib.org  

cdlib.org/services/uc3