
Linked Data initiatives at OCLC · It was being built using open source technologies such as...

Date post: 17-Aug-2020
DR. AXEL KASCHTE STRATEGY DIRECTOR, OCLC EMEA Linked Data initiatives at OCLC
Transcript
Page 1: Linked Data initiatives at OCLC · It was being built using open source technologies such as MediaWiki, WikiBase, and OpenRefine. See here for more details. ... Dr. Axel Kaschte.

DR. AXEL KASCHTE

STRATEGY DIRECTOR, OCLC EMEA

Linked Data initiatives at OCLC

Page 2

Status of Linked Data initiatives within OCLC and

where we hope to be in the near future ...

Page 3

OCLC Linked Data Research

There are quite a few original OCLC research papers.

Changing resource description workflows

“We believe that linked data will become the de-facto standard for describing things on the internet, including bibliographic objects.”

Page 4

Wait, but why?

Page 5

Why Linked Data?

Better outcomes

New functionality

Cleaner data

Improve internal data management & quality

Improve data integration across domains and regions

Replicate existing library functions more efficiently

Connect library resources with other domains

Greater web visibility

A better user experience

More ways for discovery of library resources

Page 6

Clarify context vs. inventory control

https://upload.wikimedia.org/wikipedia/commons/8/8e/Documents_on_repository_shelving_at_The_National_Archives.jpg

Page 7

Problem statement

The library community’s foundational bibliographic standard is no longer sufficient to take advantage of the tremendous opportunities offered by the web.

Page 8

Areas of inquiry

• Searching across a wide pool of described ‘things’ (entities)

• Improving display of connections and context

• Improving connectivity of inbound data

Building a “context layer”

https://commons.wikimedia.org/wiki/File:Big_Trinity_Lake.jpg

Page 9

work, place, person, event, concept, organization

Aspirations: IFLA - LRM

Page 10

What have we done?

Page 11

Linked Data of OCLC

• Manifestations: 15 billion triples

• Works: 5 billion triples

• VIAF: 2 billion triples

• Persons: 500 million triples

• FAST: 23 million triples

• ISNI: 10-50 million triples

Page 12

Metadata refinery prototype: Transforming records to entities

ANALYZE

• View Fields & Values

• Define Field Profiles

CLEANUP

• Batch updates for metadata strings

RECONCILE

• Match against controlled vocabs

• Get persistent identifiers

• Add identifiers to Metadata

TRANSFORM

• Produce RDF Triples

• Search a Triple Store using SPARQL
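
As a rough illustration of the TRANSFORM step (producing RDF triples from a record whose values have already been reconciled to identifiers), here is a minimal Python sketch. It is not the prototype's code: the record layout, the example.org URIs, the field-to-predicate mapping, and the function name `to_ntriples` are all assumptions made for this example.

```python
# Hypothetical record after the ANALYZE, CLEANUP, and RECONCILE stages.
# All URIs are placeholders on example.org, not real OCLC identifiers.
record = {
    "id": "http://example.org/work/1",
    "title": "Faust",
    "creator": "http://example.org/person/42",  # reconciled identifier
}

# Assumed field-to-predicate mapping, using Schema.org property URIs.
PREDICATES = {
    "title": "http://schema.org/name",
    "creator": "http://schema.org/creator",
}


def to_ntriples(rec):
    """Return one N-Triples line per mapped field present in the record."""
    lines = []
    for field, predicate in PREDICATES.items():
        value = rec.get(field)
        if value is None:
            continue
        if value.startswith(("http://", "https://")):
            obj = f"<{value}>"  # reconciled value: link to another entity
        else:
            escaped = value.replace("\\", "\\\\").replace('"', '\\"')
            obj = f'"{escaped}"'  # unreconciled value: plain string literal
        lines.append(f"<{rec['id']}> <{predicate}> {obj} .")
    return lines


triples = to_ntriples(record)
for line in triples:
    print(line)
```

The point of the sketch is the difference between the two object forms: a reconciled value becomes a link to an entity, while an unreconciled value stays a string.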

Page 13

IIIF prototype / separation of related concerns

• COMMENTARY (Wikipedia): Narrative commentary

• CONTEXT (Wikidata): Context through verifiable statements, entity properties and relationships

• CONTENT (Wikimedia Commons): Digital content, rights, and technical metadata

Page 14

WikiData Pilot Project “Passage”

Phase I Partners (Dec ’17 – Apr ’18)

• Cornell University

• University of California, Davis

Phase II Partners (May ’18 – Sep ’18)

• American University

• Brigham Young University

• Cleveland Public Library

• Harvard University

• Michigan State University

• National Library of Medicine

• North Carolina State University

• Northwestern University

• Princeton University

• Smithsonian Library

• Temple University

• University of Minnesota

• University of New Hampshire

• Yale University

Global Technologies

Research

Global Product Management

Page 15

What was “Passage” about?

• Develop an Entity Ecosystem that facilitates:

o Creation and editing of new entities

o Connecting entities to the Web

• Build a community of users who can:

o Create/Curate data in the ecosystem

o Imagine/propose workflow uses

o Communicate easily with each other and with OCLC to iteratively improve the prototype

• Provide services to:

o Reconcile data

o Explore the data

It was being built using open source technologies such as MediaWiki, WikiBase, and OpenRefine.

See here for more details

Page 16

Results of “Passage”

• Final Report Aug 2019

• The simple prototype described at the beginning matured over time to a robust set of third-party tools based on WikiBase to manage over a million entities.

• Goals achieved:

– Collaboration

– Reconciliation

– Editing

Page 17

Create/Curate data

Page 18

Create/Curate data

Page 19

Reconcile data

• Searching for vocabulary and entities as you type
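
To make the "as you type" lookup concrete, here is a small Python sketch of prefix matching against a controlled vocabulary. It is only an illustration under stated assumptions: the vocabulary entries, the example.org identifiers, and the function name `suggest` are invented for this example and do not describe how the Passage prototype or OpenRefine implement reconciliation.

```python
# Hypothetical controlled vocabulary: preferred label -> persistent identifier.
VOCAB = {
    "Goethe, Johann Wolfgang von": "http://example.org/person/1",
    "Goethe-Institut": "http://example.org/org/2",
    "Gogol, Nikolai": "http://example.org/person/3",
}


def suggest(prefix, vocab=VOCAB, limit=5):
    """Return up to `limit` (label, identifier) pairs whose label starts
    with the typed prefix, ignoring case, in alphabetical label order."""
    needle = prefix.casefold()
    matches = [
        (label, uri)
        for label, uri in sorted(vocab.items())
        if label.casefold().startswith(needle)
    ]
    return matches[:limit]


for label, uri in suggest("goe"):
    print(label, uri)
```

A real service would more likely rank candidates by similarity and entity type instead of matching a strict prefix, but the request and response shape is the same: a typed string in, candidate entities with identifiers out.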

Page 20

Explore the data

Statements

Translations

Identifiers

Page 21

Explore the data

Statements

Translations

Identifiers

Page 22

Explore the data

Statements

Translations

Identifiers

Page 23

WorldCat

WorldCat with Entity Backbone

Controlled Catalog

Exchange Services

→ MARC21 (w/ links)

→ BibFrame

→ Schema.org

→ other…

More info: https://www.oclc.org/research/themes/data-science/linkeddata/linked-data-prototype.html

Workbench Services

Explorer Services
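
One of the output formats listed for the exchange services is Schema.org. As a hedged illustration of what such an export could look like, the Python sketch below turns a single entity into a Schema.org JSON-LD document. The entity structure, the example.org URIs, and the function name `to_schema_org` are assumptions for this example. Only the Schema.org terms (`Book`, `name`, `author`) are real vocabulary.

```python
import json

# Hypothetical entity as it might sit in an entity store.
# The URIs are placeholders on example.org.
entity = {
    "uri": "http://example.org/work/1",
    "type": "Book",
    "name": "Faust",
    "author_uri": "http://example.org/person/42",
}


def to_schema_org(ent):
    """Map the hypothetical entity onto a Schema.org JSON-LD dictionary."""
    return {
        "@context": "https://schema.org",
        "@id": ent["uri"],
        "@type": ent["type"],
        "name": ent["name"],
        # The author is exported as a link to another entity, not a string.
        "author": {"@id": ent["author_uri"]},
    }


doc = to_schema_org(entity)
print(json.dumps(doc, indent=2))
```

The same entity could be passed to a different mapping function for each of the other listed formats, which is the appeal of exporting from one entity backbone.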

Page 24

Thank you

Dr. Axel Kaschte

STRATEGY DIRECTOR, OCLC EMEA

[email protected]

