Creating Citable Data Identifiers

Date post: 23-Feb-2016
Transcript
Page 1: Creating Citable Data Identifiers

Creating Citable Data Identifiers
Ryan Scherle
Mark Diggory

Page 2: Creating Citable Data Identifiers

Mimosa house 807 South Virginia Dare Trail Kill Devil Hills, NC USA 27948

Page 3: Creating Citable Data Identifiers

1903-12-17 36.019705 N, 75.668769 W

Page 4: Creating Citable Data Identifiers

79330-S84-A41 WP0ZZZ99ZTS392124

Page 5: Creating Citable Data Identifiers

Loxosceles reclusa

Page 6: Creating Citable Data Identifiers

Citing identifiers
Mimosa house
807 South Virginia Dare Trail
1903-12-17
27948
Loxosceles reclusa
36.019705 N, 75.668769 W
79330-S84-A41
WP0ZZZ99ZTS392124

Page 7: Creating Citable Data Identifiers

Identifiers matter
- Some identifiers are machine-friendly, some are human-friendly
- For citations, you need to strike a balance
- Good identifiers are a critical selling point for a repository

Page 8: Creating Citable Data Identifiers
Page 9: Creating Citable Data Identifiers

http://purl.dlib.indiana.edu/iudl/lilly/slocum/LL-SLO-009276

Page 10: Creating Citable Data Identifiers
Page 11: Creating Citable Data Identifiers
Page 12: Creating Citable Data Identifiers

Principles of citable identifiers

Page 13: Creating Citable Data Identifiers


Page 14: Creating Citable Data Identifiers


Page 15: Creating Citable Data Identifiers

1. Use DOIs
http://dx.doi.org/10.5061/dryad.123ab
- Scientists are familiar with DOIs
- DOIs are supported by many tools and services

Current support:
EPrints: No
DSpace: No
Fedora: With work
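
The slides write the same kind of identifier in two forms: as a resolver URL (http://dx.doi.org/10.5061/dryad.123ab) and with a doi: prefix (doi:10.5061/dryad.bb7m4). A minimal sketch of converting between them follows. The function names `bare_doi` and `doi_url` are hypothetical, and the code handles only the two forms that appear in this deck.

```python
# Hypothetical helpers; only the two notations used in the slides are handled.
KNOWN_PREFIXES = ("doi:", "http://dx.doi.org/")


def bare_doi(identifier: str) -> str:
    """Strip a known prefix and return the bare DOI, e.g. '10.5061/dryad.123ab'."""
    text = identifier.strip()
    for prefix in KNOWN_PREFIXES:
        if text.lower().startswith(prefix):
            text = text[len(prefix):]
            break
    # Every DOI name begins with the directory indicator "10."
    if not text.startswith("10."):
        raise ValueError(f"not a DOI: {identifier!r}")
    return text


def doi_url(identifier: str) -> str:
    """Return the resolver URL form used throughout the slides."""
    return "http://dx.doi.org/" + bare_doi(identifier)


print(doi_url("doi:10.5061/dryad.bb7m4"))
```

Keeping one canonical bare form internally and adding the prefix only at display time is one way to avoid storing the same identifier under two spellings.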

Page 16: Creating Citable Data Identifiers

2. Keep identifiers simple
http://dx.doi.org/10.5061/dryad.123ab
- Complex identifiers are fine for machines, but they’re bad for humans.
- Despite best intentions, humans sometimes need to work with identifiers manually.

http://dx.doi.org/10.1179/1743131X11Y.0000000009

http://dx.doi.org/10.1016/B978-0-12-220851-5.00003-4

Page 17: Creating Citable Data Identifiers

2. Keep identifiers simple

Current support:
EPrints: Yes
DSpace: Yes
Fedora: Yes

Page 18: Creating Citable Data Identifiers

3. Use syntax to illustrate relationships
http://dx.doi.org/10.5061/dryad.123ab/3
- Adding a tiny bit of semantics to an identifier is incredibly useful
http://files.eprints.org/691/
http://files.eprints.org/447/
http://files.eprints.org/556/
- Useful for various human “hacks”
- Useful for statistics
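
As a rough illustration of why this helps, the sketch below splits an identifier such as 10.5061/dryad.123ab/3 into a package part and a trailing component, then rolls per-file counts up to the package. It assumes that the trailing "/3" names one item inside the package 10.5061/dryad.123ab and that the repository's own suffixes contain no other slash. Both are assumptions made for illustration, not rules of the DOI system. The names `split_package_doi` and `downloads_per_package` are hypothetical.

```python
from collections import Counter
from typing import Iterable, NamedTuple, Optional


class ParsedDoi(NamedTuple):
    package: str         # identifier of the whole data package
    part: Optional[str]  # trailing component such as "3", or None


def split_package_doi(doi: str) -> ParsedDoi:
    """Split '10.5061/dryad.123ab/3' into ('10.5061/dryad.123ab', '3')."""
    pieces = doi.split("/")
    if len(pieces) < 2 or not pieces[0].startswith("10."):
        raise ValueError(f"unexpected identifier: {doi!r}")
    package = "/".join(pieces[:2])
    part = "/".join(pieces[2:]) or None
    return ParsedDoi(package, part)


def downloads_per_package(downloaded: Iterable[str]) -> Counter:
    """Roll per-file download events up to their package (the statistics use)."""
    return Counter(split_package_doi(doi).package for doi in downloaded)


events = ["10.5061/dryad.123ab/1", "10.5061/dryad.123ab/3", "10.5061/dryad.123ab"]
print(downloads_per_package(events))
```

The same split is the kind of human "hack" the slide mentions: trimming the last component of a file identifier by hand to reach its package.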

Page 19: Creating Citable Data Identifiers

3. Use syntax to illustrate relationships

Current support:
EPrints: No
DSpace: No
Fedora: With work

Page 20: Creating Citable Data Identifiers

4. When “meaning-bearing” content changes, create a versioned identifier
- Scientists want data to be invariant to enable reuse by machines
- Even a single bit makes a difference
- Watch out for implicit abstractions…

http://dx.doi.org/10.5061/dryad.123ab/thumbnail

What about DOI conventions?

Page 21: Creating Citable Data Identifiers

5. When “meaningless” content changes, retain the current identifier
- Descriptive metadata must be editable without creating a new identifier.
- Humans rarely care about metadata changes, especially for citation purposes!

Caveat: machine-oriented systems may consider the “metadata” to be data, which requires identifier changes
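
Principles 4 and 5 together amount to a decision rule, sketched below under stated assumptions. The data bytes are treated as the "meaning-bearing" content and compared by checksum, so a single changed bit produces a new version. The metadata dictionary is treated as "meaningless" for citation purposes. The `DataPackage` class, the function names, and the ".2" version suffix are all hypothetical and are not taken from the slides.

```python
import hashlib
from dataclasses import dataclass, replace


@dataclass
class DataPackage:
    base_doi: str      # e.g. "10.5061/dryad.123ab"
    version: int       # starts at 1
    data_sha256: str   # checksum of the meaning-bearing bytes
    metadata: dict     # descriptive metadata, editable in place


def checksum(data: bytes) -> str:
    return hashlib.sha256(data).hexdigest()


def citable_identifier(package: DataPackage) -> str:
    # Hypothetical syntax: version 1 keeps the plain DOI, later versions add ".N".
    if package.version == 1:
        return package.base_doi
    return f"{package.base_doi}.{package.version}"


def apply_update(package: DataPackage, new_data: bytes, new_metadata: dict) -> DataPackage:
    """Return the updated package, bumping the version only if the data changed."""
    new_sum = checksum(new_data)
    if new_sum != package.data_sha256:
        # Principle 4: meaning-bearing content changed, so mint a new version.
        return replace(package, version=package.version + 1,
                       data_sha256=new_sum, metadata=dict(new_metadata))
    # Principle 5: only metadata changed, so the identifier is retained.
    return replace(package, metadata=dict(new_metadata))


original = DataPackage("10.5061/dryad.123ab", 1, checksum(b"a,b\n1,2\n"), {"title": "Old"})
retitled = apply_update(original, b"a,b\n1,2\n", {"title": "New"})
corrected = apply_update(original, b"a,b\n1,3\n", {"title": "Old"})
print(citable_identifier(retitled), citable_identifier(corrected))
```

A machine-oriented system, as the caveat notes, might instead include the metadata in the checksum, which would make every metadata edit a new version.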

Page 22: Creating Citable Data Identifiers

Current versioning support
EPrints: Support for flexible versioning/relationships, but no support for expressing these relationships in identifiers.
DSpace: None.
Fedora: Implicit versioning of all data and metadata. This is highly useful, but it is too granular for citation purposes.

Page 23: Creating Citable Data Identifiers

Principles of citable identifiers
1. Use DOIs
2. Keep identifiers simple
3. Use syntax to illustrate relationships
4. When “meaning-bearing” content changes, create a versioned identifier
5. When “meaningless” content changes, retain the current identifier

Page 24: Creating Citable Data Identifiers

Hacking DSpace to support…
- DOI identifier registration
- Semantics in identifiers
- Citation publication
- Versioning

Page 25: Creating Citable Data Identifiers

DSpace identifier services
- Handle system independence
  - More future identifier systems will come.
- Granular control
  - Separate reservation from registration
- Citation
  - Registration of metadata with external services
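
To make "separate reservation from registration" concrete, here is a small in-memory sketch. It is not the DSpace API: the class `InMemoryIdentifierService`, its methods, and the counter-based suffix are all hypothetical. It shows one possible reading of the idea, where an identifier is reserved early (for example during submission) and its metadata is sent to an external service only at registration.

```python
from enum import Enum


class State(Enum):
    RESERVED = "reserved"      # identifier set aside, nothing published yet
    REGISTERED = "registered"  # metadata sent to the external service


class InMemoryIdentifierService:
    """Hypothetical stand-in for an identifier service; not DSpace code."""

    def __init__(self, prefix: str) -> None:
        self.prefix = prefix
        self._states = {}
        self._metadata = {}
        self._counter = 0

    def reserve(self) -> str:
        """Set aside a new identifier without registering any metadata."""
        self._counter += 1
        identifier = f"{self.prefix}.{self._counter}"
        self._states[identifier] = State.RESERVED
        return identifier

    def register(self, identifier: str, metadata: dict) -> None:
        """Publish metadata for a previously reserved identifier."""
        if identifier not in self._states:
            raise KeyError(f"identifier was never reserved: {identifier}")
        self._metadata[identifier] = dict(metadata)
        self._states[identifier] = State.REGISTERED

    def state(self, identifier: str) -> State:
        return self._states[identifier]


service = InMemoryIdentifierService("10.5061/example")
reserved = service.reserve()
print(reserved, service.state(reserved))
service.register(reserved, {"title": "Example data package"})
print(reserved, service.state(reserved))
```

Hiding the identifier scheme behind an interface like this is also one way to get the independence from any single system that the slide asks for, since another scheme could be swapped in behind the same two calls.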

Page 26: Creating Citable Data Identifiers

DSpace identifier services

Page 27: Creating Citable Data Identifiers

DataCite content service

Page 28: Creating Citable Data Identifiers

Promoting accurate citations
Added suggested citation formats up front
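
A suggested citation can be generated from metadata the repository already holds. The sketch below is only an illustration: the function name `suggested_citation`, the "Authors (Year) Title. Repository. doi:..." layout, and the sample values are assumptions, not the format shown on the slide.

```python
from typing import Sequence


def suggested_citation(authors: Sequence[str], year: int, title: str,
                       repository: str, doi: str) -> str:
    """Build a one-line citation string; the layout is an assumed example."""
    names = ", ".join(authors)
    return f"{names} ({year}) {title}. {repository}. doi:{doi}"


# Placeholder values for illustration only.
print(suggested_citation(["A. Author", "B. Author"], 2011,
                         "Example data package", "Example Repository",
                         "10.5061/dryad.123ab"))
```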

Page 29: Creating Citable Data Identifiers

Versioning
- Versioning is item “editioning”
- Creation of new versions is a “user mediated” process (submitter or reviewer)
- Versioning does not alter the original item
- Version relationships are maintained independent of the item’s metadata

Page 30: Creating Citable Data Identifiers

Submission-based revisions

Page 31: Creating Citable Data Identifiers
Page 32: Creating Citable Data Identifiers
Page 33: Creating Citable Data Identifiers

Result: Citable data versions
doi:10.5061/dryad.bb7m4

Page 34: Creating Citable Data Identifiers

Future technical directions
- Add metadata versioning under the hood (may need to rethink some of the current system)
- Integrate our changes to core DSpace
  - Moving these features into the core requires further discussion with the DSpace user community

Page 35: Creating Citable Data Identifiers

How are we doing?
For 186 articles associated with Dryad deposits:
- 77% had “good” citations to the data
- 2% had “bad” citations to the data
- 21% had no data citations

Standards for data citation are still evolving. Journals have yet to agree on where to place data citations, and authors are just starting to become familiar with the concept.

Page 36: Creating Citable Data Identifiers
Page 37: Creating Citable Data Identifiers
Page 38: Creating Citable Data Identifiers

What should you do now?
- Analyze how data is used and cited outside the repository
- Determine whether use is more machine-oriented or more human-oriented
- Design identifiers and identifier management to facilitate the observed uses

Page 39: Creating Citable Data Identifiers

Thanks!

Ryan Scherle

Mark Diggory

