+ All Categories
Home > Documents > Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific...

Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific...

Date post: 06-Jan-2018
Category:
Upload: tyrone-hawkins
View: 219 times
Download: 1 times
Share this document with a friend
Description:
Data publication opportunities
46
Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill Michener, Board of Directors, [email protected] Meredith Morovati, Executive Director, [email protected] Todd Vision, Chair, Board of Directors, [email protected] @datadryad http://datadryad.org
Transcript
Page 1: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

Evolving a Community Digital Repository: Lessons from Dryad

Making data underlying scientific publications discoverable, freely reusable, and citable

Bill Michener, Board of Directors, [email protected]

Meredith Morovati, Executive Director, [email protected]

Todd Vision, Chair, Board of Directors, [email protected]

@datadryadhttp://datadryad.org

Page 2: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

e-science

http://research.microsoft.com/en- us/collaboration/fourthparadigm/4th_paradigm_book_jim_gray_transcript.pdf

Page 3: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

http://www.stm-assoc.org/2011_12_5_ODE_Report_On_Integration_of_Data_and_Publications.pdf

Data publication opportunities

Page 4: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

Tenopir C et al. (2011) Data Sharing by [n=1329] Scientists: Practices and Perceptions. PLoS ONE doi:10.1371/journal.pone.0021101

Researcher attitudes toward data reuse

Agree strongly or somewhat

I would be willing to share data across a broad group of researchers who use data in different ways

78.2%

It is important that my data are cited when used by other researchers.

88.9%

It is appropriate to create new datasets from shared data

73.7%

Page 5: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

Volu

me

Rank frequency of data type

Specialized repositories(e.g. GenBank)

After Heidorn (2008) http://hdl.handle.net/2142/9127

Long tail dataDryad’s vision is a world where research data is openly available, integrated with the scholarly literature, and routinely re-used to create knowledge. Dryad’s mission is to provide the infrastructure for, and promote the re-use of, data underlying the scholarly literature.

Page 6: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

Why archive data at the time of publication ?

Vines TH et al. (2013) Current Biology DOI:10.1016/j.cub.2013.11.014

Page 7: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

Roadmap

• Dryad • Lessons• Challenges

Page 8: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

Dryad

Page 9: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

Joint Data Archiving Policy (JDAP) - 2011• Data are important products of the scientific

enterprise, and they should be preserved and usable for decades in the future

• As a condition for publication, data supporting the results in the article should be deposited in an appropriate public archive

• Authors may elect to embargo access to the data for a period up to a year after publication

• Exceptions may be granted at the discretion of the editor, especially for sensitive information

http://datadryad.org/jdap

Page 10: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

Credit

10

Page 11: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

Integration of manuscript and data submission

Page 12: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

Curation

Page 13: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

Dryad links to journalsProvides citation instructions

Page 14: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

Datasets are being cited

Page 15: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

Portals for integrated journals (beta)http://datadryad.org/journal/amNat

Page 16: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

• All data are from a vetted scientific publication (such as a peer-reviewed article, thesis/dissertation, or book) and receive professional curation

• Submission integration with journals makes deposit easy for authors and curators provide user support

• Flexible to journal data policy (e.g. on embargoes, review, standards) • Reciprocal linkage between article and data via a persistent,

resolvable data DOI • Data are citable, and preserved for the long term• Data are free to download & reuse due to modest data publication

charges • Backed by a nonprofit organization sustained and governed by its

diverse stakeholders

Dryad Digital Repository

Page 17: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

How to participate in Dryad• Become a member

– Elect the Board of Directors and approve changes to ByLaws 

– Stay informed through the Annual Community Meeting– Get discounts on submission fees– Financially sustain the repository– Help steer the future direction of the organization

• Integrate your journal with Dryad– Ensure the article and data are bidirectionally linked– Lower the burden on authors to make their data

available– Improve compliance with the journal’s data policy– The process is tailored to each journal (e.g. embargo

option)• Sponsor data publication charges

– As a service to your authors/researchers

Page 18: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

Members

NORDIC SOCIETY OIKOS

Page 19: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

Metrics

Page 20: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

Growth (2015 = ~3,800 +)

Page 21: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

Building a valuable science resource

Page 22: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

lessons

Page 23: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

1. “You can't build a great building on a weak foundation.” Gordon B. Hinckley

Page 24: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

Understand costs & diversify funding streams

• Data publication charges (DPC) – primary source of

revenue– enable free access in perpetuity

• Membership fees – fund annual

membership meetings – provide a cost savings

on DPC• Project grants – support R&D activities

Page 25: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

Sponsoring Data Publication ChargesIndividuals can deposit data associated with an article on their own, regardless of payment plan. If an author finds the journal it is submitting to does not have a payment plan, they can elect to pay $90 for deposit. Supporting payment plans on behalf of your authors makes it easy for authors and saves money.

Page 26: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

Membership tiers

Page 27: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

Partner & leverage

Page 29: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

Remote work

Page 30: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

Skype, webex, google hangout, etc.

Page 31: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

Slack

Page 33: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.
Page 34: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

Turn the plan into action

Page 35: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

Trello

Page 36: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

Challenges

Page 37: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.
Page 38: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

Table 1. Journal and publication year of 100 reviewed studies with associated data publicly archived in the digital repository Dryad (http://datadryad.org/).

Roche DG, Kruuk LEB, Lanfear R, Binning SA (2015) Public Data Archiving in Ecology and Evolution: How Well Are We Doing?. PLoS Biol 13(11): e1002295. doi:10.1371/journal.pbio.1002295http://journals.plos.org/plosbiology/article?id=info:doi/10.1371/journal.pbio.1002295

Page 39: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

Fig 2. Completeness and reusability scores.

Roche DG, Kruuk LEB, Lanfear R, Binning SA (2015) Public Data Archiving in Ecology and Evolution: How Well Are We Doing?. PLoS Biol 13(11): e1002295. doi:10.1371/journal.pbio.1002295http://journals.plos.org/plosbiology/article?id=info:doi/10.1371/journal.pbio.1002295

Page 40: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

Student training in the classroom

Page 41: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

Training resources & workshops

Page 42: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

PLoS Comp. Biology, Oct. 22, 2015, “Ten Simple Rules for Creating a Good Data Management Plan,” WK Michener. DOI:10.1371/journal.pcbi.1004525

(1) Determine the research sponsor requirements;

(2) Identify the data to be collected;(3) Define how the data will be

organized;(4) Explain how the data will be

documented;(5) Describe how data quality will be

assured;(6) Present a sound data storage and

preservation strategy;(7) Define the project’s data policies;(8) Describe how the data will be

disseminated; (9) Assign roles and responsibilities; and (10)Prepare a realistic budget.

Page 43: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

NSF Public Access Plan• Applies to proposals submitted in 2016• Restates longstanding policy that

– “Investigators are expected to share with other researchers, at no more than incremental cost and within a reasonable time, the primary data, samples, physical collections and other supporting materials created or gathered in the course of work under NSF grants”

– Allows costs of archiving within grants• Restates 2011 Data Management Plan requirement

– Further requires archiving plan in DMP to be followed• Restates 2013 Biosketch policy allowing data to

count as a product

http://www.nsf.gov/pubs/2015/nsf15052/nsf15052.pdf

Page 44: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

Efficacy of funder vs journal data policy

Figure 5. Availability of archived phylogenetic data as a function of age.

After: Magee et al. (2014) The Dawn of Open Access to Phylogenetic Data. PLoS ONE 9(10): e110268. doi:10.1371/journal.pone.0110268

20022011 2014

year

Prop

ortio

n of

dat

a av

aila

ble

20022011 2014

20022011 2014

20022011 2014

Page 45: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

Roche DG, Kruuk LEB, Lanfear R, Binning SA (2015) Public Data Archiving in Ecology and Evolution: How Well Are We Doing?. PLoS Biol 13(11): e1002295. doi:10.1371/journal.pbio.1002295http://journals.plos.org/plosbiology/article?id=info:doi/10.1371/journal.pbio.1002295

Other ideas ?

Page 46: Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.

To learn more• Dryad Digital Repository: http://datadryad.org• Dryad News & Views blog:

http://blog.datadryad.org• Twitter: @datadryad• Feedback (Ideas Forum):

http://datadryad.org/feedback


Recommended