
Share: discovery: a focus on papers

Date post: 08-May-2015
Upload: lorcan-dempsey
Description:
A presentation about aggregation and discovery issues presented to the ARL Fall Forum. It covers some issues prompted by the Share proposal. Considers metadata aggregation and the general move from 'strings to things' in general Internet services. Touches on linked data, metadata processing, user expectation. Concludes with some general issues to consider.
@LorcanD Lorcan Dempsey, OCLC 11 October 2013 ARL Fall Forum: Mobilizing the research enterprise #ARLforum13 SHARE Discovery: Focus on papers
Transcript
Page 1: Share: discovery: a focus on papers

@LorcanD

Lorcan Dempsey, OCLC 11 October 2013

ARL Fall Forum: Mobilizing the research enterprise

#ARLforum13

SHARE Discovery: Focus on papers

Page 2: Share: discovery: a focus on papers

Aggregation is a pain

Page 3: Share: discovery: a focus on papers

Shenghui Wang (OCLC), Antoine Isaac (Europeana), Valentine Charles (Europeana), Rob Koopman (OCLC), Anthi Agoropoulou (Europeana), and Titia van der Werf (OCLC). Hunting for Semantic Clusters: Hierarchical Structuring of Cultural Heritage Objects within Large Aggregations. 17th International Conference on Theory and Practice of Digital Libraries (TPDL), 22-26 September 2013, Valletta (Malta).

Page 4: Share: discovery: a focus on papers

Duplicates

Page 5: Share: discovery: a focus on papers

Duplicates? Same object: different providers

Page 6: Share: discovery: a focus on papers

Duplicates? Same page: different digital copies

Page 7: Share: discovery: a focus on papers

Cataloging error

Harvested – points to repository splash page

Analytic – essay in book

Catalan translation

Loaded from Crossref

Loaded from Elsevier

Three ‘expressions’. Cataloging now fixed

Page 8: Share: discovery: a focus on papers

Cross repository record matching issues – confused identities

• Different data models
  – Mapping is lossy.
  – Relationship issues, e.g.
    • Preprint
    • Published article
    • Publisher splash page
    • Repository splash page
    • …
• Replication of content across repositories
• Different content and ‘fullness’ standards
• Granularity issues
  – What is being described?
• ‘Business’ issues
  – Publisher wants separate display?
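
The matching issues above can be made concrete with a small sketch. It is an illustration only, not a method described in the presentation: the record fields (`source`, `kind`, `title`, `year`, `doi`), the sample records, and the matching rule (use a DOI when present, otherwise a normalized title plus year) are all assumptions.

```python
import re
import unicodedata
from collections import defaultdict


def normalize_title(title: str) -> str:
    """Lowercase, strip accents and punctuation, collapse whitespace."""
    decomposed = unicodedata.normalize("NFKD", title)
    no_accents = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    cleaned = re.sub(r"[^a-z0-9 ]+", " ", no_accents.lower())
    return " ".join(cleaned.split())


def match_key(record: dict) -> str:
    """Assumed rule: prefer a DOI; otherwise fall back to title plus year."""
    doi = (record.get("doi") or "").strip().lower()
    if doi:
        return "doi:" + doi
    title = normalize_title(record.get("title", ""))
    return "title:" + title + "|" + str(record.get("year", ""))


def cluster_records(records: list) -> dict:
    """Group records that share a match key."""
    clusters = defaultdict(list)
    for record in records:
        clusters[match_key(record)].append(record)
    return dict(clusters)


# Hypothetical records for one paper, as three providers might describe it.
records = [
    {"source": "repository-a", "kind": "preprint",
     "title": "Aggregation Is a Pain!", "year": 2013},
    {"source": "repository-b", "kind": "repository splash page",
     "title": "aggregation is a pain", "year": 2013},
    {"source": "publisher", "kind": "published article",
     "title": "Aggregation is a pain", "year": 2013, "doi": "10.1234/EXAMPLE"},
]

clusters = cluster_records(records)
for key, members in clusters.items():
    print(key, [m["source"] for m in members])
```

In this sketch the two title-only records fall into one cluster, while the publisher record, keyed by its DOI, lands in a separate one. The same paper ends up with two identities simply because the providers supply different identifiers, which is one small instance of the confusion the slide lists.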

Page 9: Share: discovery: a focus on papers

From strings to things … an emerging pattern?

Search engines

Linked data

Page 10: Share: discovery: a focus on papers

The social graph

Page 11: Share: discovery: a focus on papers

Three benefits according to Google:

1. Find the right thing
2. Get the best summary
3. Go deeper and broader

Within a discovery service …

1. Aspire to a singular identity for entities/things (people, works, places, organizations, …)
2. Gather data associated with those identities (e.g. ‘cards’)
3. Create relationships between identities.

Page 12: Share: discovery: a focus on papers
Page 13: Share: discovery: a focus on papers
Page 14: Share: discovery: a focus on papers
Page 15: Share: discovery: a focus on papers
Page 16: Share: discovery: a focus on papers
Page 17: Share: discovery: a focus on papers
Page 18: Share: discovery: a focus on papers
Page 19: Share: discovery: a focus on papers

Make data work harder so that the user doesn’t

• Create singular identity (‘entification’)
• Gather information about entities (e.g. cards)
• Create relationships between entities (navigation – citation, co-creation, derivative, affiliation, recommendations, …)

• Strongly leverage four types of metadata about things …
  – ‘Professional’
  – Crowdsourced (claiming profiles, …)
  – Programmatically promoted (entity extraction, categorization, clusters, …)
  – Usage (relationships based on usage)

• Now: shredding records
• Future: manage entities in linked data world

• Plural - Work with what you have.
• Wikipedia – an addressable knowledgebase
• Wikidata/Freebase – source of structured data
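
As a rough illustration of the three steps on this slide (create a singular identity, gather information into a card, create relationships), here is a minimal in-memory sketch. The `Entity` and `EntityStore` classes, the identifier scheme (`person:1`, `work:1`), and the sample values are all hypothetical; nothing here reflects an actual OCLC or SHARE data model.

```python
from dataclasses import dataclass, field


@dataclass
class Entity:
    entity_id: str                              # hypothetical local identifier
    kind: str                                   # e.g. "person", "work", "organization"
    label: str
    facts: dict = field(default_factory=dict)   # the 'card': name -> [(value, source)]
    links: list = field(default_factory=list)   # (relation, target entity_id)


class EntityStore:
    def __init__(self):
        self.entities = {}

    def entify(self, entity_id, kind, label):
        """Return the single entity for this id, creating it on first sight."""
        if entity_id not in self.entities:
            self.entities[entity_id] = Entity(entity_id, kind, label)
        return self.entities[entity_id]

    def gather(self, entity_id, source, **facts):
        """Add facts to an entity's card, recording which metadata type supplied them."""
        card = self.entities[entity_id].facts
        for name, value in facts.items():
            card.setdefault(name, []).append((value, source))

    def relate(self, subject_id, relation, object_id):
        """Record a navigable relationship between two known entities."""
        if object_id not in self.entities:
            raise KeyError(object_id)
        self.entities[subject_id].links.append((relation, object_id))


store = EntityStore()
store.entify("person:1", "person", "A. Author")
store.entify("work:1", "work", "An Example Paper")
store.entify("person:1", "person", "Author, A.")    # same id: no second identity is created

store.gather("work:1", "professional", year=2013)   # e.g. cataloged metadata
store.gather("work:1", "usage", downloads=42)       # e.g. usage-derived metadata
store.relate("person:1", "creator_of", "work:1")
```

Each fact keeps its source, so the four metadata types on the slide (professional, crowdsourced, programmatically promoted, usage) can sit side by side on one card rather than in separate records.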

Page 20: Share: discovery: a focus on papers

A small example of links/entities

Diagram components: National Libraries; VIAF Matching Algorithm; English Wikipedia; German Wikipedia; Other Wikipedias; Wikidata Wikibase; 3rd Party Users

Diagram link labels: VIAF matches Articles / Wikipedia shows matched IDs; Submit VIAF IDs / Show centralized data (shown twice); Read data

Page 21: Share: discovery: a focus on papers
Page 22: Share: discovery: a focus on papers

The scholarly graph?

• Architecture components
  – Author IDs
  – Paper/work IDs
  – Institutions?
• Signals of interest
  – Research analytics
  – Research workflow
• Questions
  – What is the role of libraries/SHARE/….?
  – Vivo?
  – Who will manage entity backbones in linked data world?

Page 23: Share: discovery: a focus on papers

Questions and issues …

Page 24: Share: discovery: a focus on papers

Repository scope

Campus bibliography
*-prints
Digital materials

Tactical ‘structure up’/SEO

More links to entities in records - Identifiers

ORCID, ISNI, VIAF, …
DOI, PubMed ID, …

Schema.org markup
Site maps; ResourceSync
What do hubs want to see? (e.g. Scholar)
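
To show what "more links to entities in records" plus Schema.org markup might look like for a repository landing page, here is a hedged sketch that builds a JSON-LD description of one paper. `ScholarlyArticle`, `Person`, `PropertyValue`, `identifier`, and `sameAs` are Schema.org terms; the helper function, the title, the DOI, the ORCID iD, and the URL are placeholders invented for the example, and what any particular hub actually consumes is not something the slides state.

```python
import json


def scholarly_article_jsonld(title, author_name, author_orcid, doi, landing_url):
    """Build a Schema.org description that carries work and author identifiers."""
    return {
        "@context": "https://schema.org",
        "@type": "ScholarlyArticle",
        "name": title,
        "url": landing_url,
        "identifier": {
            "@type": "PropertyValue",
            "propertyID": "doi",
            "value": doi,
        },
        "sameAs": "https://doi.org/" + doi,
        "author": {
            "@type": "Person",
            "name": author_name,
            "sameAs": "https://orcid.org/" + author_orcid,
        },
    }


# All values below are placeholders, not real identifiers.
markup = scholarly_article_jsonld(
    title="An Example Paper",
    author_name="A. Author",
    author_orcid="0000-0000-0000-0000",
    doi="10.1234/example",
    landing_url="https://repository.example.edu/items/1",
)

# Embed the description in the landing page so crawlers can read it.
script_tag = (
    '<script type="application/ld+json">'
    + json.dumps(markup, indent=2)
    + "</script>"
)
print(script_tag)
```

The point of the sketch is that the identifiers travel as links (a DOI for the work, an ORCID iD for the author), so a downstream aggregator has something firmer than name and title strings to match on.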

Page 25: Share: discovery: a focus on papers

Purposeful syndication

Share data with network/disciplinary hubs

A discovery service?

A discovery destination? The bar is getting higher …

A source of data for others?

Page 26: Share: discovery: a focus on papers

Sourcing and scaling … Workflow, Repository, Disclosure, Discovery, …

Scaling
Rightscaling
Different things done at different scales
Institution, Consortium, ARL, world?

Sourcing
Collaboratively sourced?
Third party?
Existing agency?
Multiple approaches?

Page 27: Share: discovery: a focus on papers

Discovery and SHARE

What is Share’s role in creating and/or maintaining the scholarly graph?

Page 28: Share: discovery: a focus on papers

Credits

Page 29: Share: discovery: a focus on papers

Acknowledging kind advice from …

• Max Klein, Merrilee Proffitt, Karen Smith Yoshimura, Thom Hickey (Wikidata/Wikipedia/VIAF)

• Shenghui Wang, Rob Koopman, Titia van der Werf (clustering and Europeana data)

• Jeff Young, Eric Childress

Page 30: Share: discovery: a focus on papers

©2013 OCLC. This work is licensed under a Creative Commons Attribution 3.0 Unported License. Suggested attribution: “This work uses content from [presentation title] © OCLC, used under a Creative Commons Attribution license: http://creativecommons.org/licenses/by/3.0/”

@LorcanD

