+ All Categories
Home > Documents > Finnish National Bibliography Fennica as Linked Open Data · 10/23/2018  · Identity management...

Finnish National Bibliography Fennica as Linked Open Data · 10/23/2018  · Identity management...

Date post: 14-Oct-2020
Category:
Upload: others
View: 1 times
Download: 0 times
Share this document with a friend
25
Finnish National Bibliography Fennica as Linked Open Data Osma Suominen HELDIG Summit, 23 October 2018
Transcript
Page 1: Finnish National Bibliography Fennica as Linked Open Data · 10/23/2018  · Identity management Libraries have traditionally managed identities (e.g. persons, works, places, subjects)

Finnish National Bibliography Fennica as Linked Open Data

Osma Suominen

HELDIG Summit, 23 October 2018

Page 2: Finnish National Bibliography Fennica as Linked Open Data · 10/23/2018  · Identity management Libraries have traditionally managed identities (e.g. persons, works, places, subjects)

NATIONAL BIBLIOGRAPHY

with apologies to Scott Adams

Page 3: Finnish National Bibliography Fennica as Linked Open Data · 10/23/2018  · Identity management Libraries have traditionally managed identities (e.g. persons, works, places, subjects)

Why?

1. Making our data more visible, also internationally

2. Improving the quality and interoperability of our metadata

3. Building competency for the future

4. Why not? :)

Page 4: Finnish National Bibliography Fennica as Linked Open Data · 10/23/2018  · Identity management Libraries have traditionally managed identities (e.g. persons, works, places, subjects)

bib record

bib record

bib record

bib record

auth record

auth record

auth record

bib record

bib record

auth record

auth record

auth record

1M bib records 125k person names

40k corporate names

35k subjects (YSA)bib record

bib record

Page 5: Finnish National Bibliography Fennica as Linked Open Data · 10/23/2018  · Identity management Libraries have traditionally managed identities (e.g. persons, works, places, subjects)

bib record

bib record

bib record

bib record

auth record

auth record

auth record

bib record

bib record

auth record

auth record

auth record

Work

Instance

Person

Subject1M bib records 125k person names

40k corporate names

35k subjects (YSA)bib record

bib record

Place

Organization

Page 6: Finnish National Bibliography Fennica as Linked Open Data · 10/23/2018  · Identity management Libraries have traditionally managed identities (e.g. persons, works, places, subjects)

Work

Instance

Person Subject

Image credit: MaryMaking blog

bib record

bib record

bib record

bib record

auth record

auth record

auth record

bib record

bib record

auth record

auth record

auth record

125k person names

40k corporate names

35k subjects (YSA)bib record

bib record

1M bib records

Page 7: Finnish National Bibliography Fennica as Linked Open Data · 10/23/2018  · Identity management Libraries have traditionally managed identities (e.g. persons, works, places, subjects)
Page 8: Finnish National Bibliography Fennica as Linked Open Data · 10/23/2018  · Identity management Libraries have traditionally managed identities (e.g. persons, works, places, subjects)

As seen in:

SWIB16 talk

DCMI webinar

o-bib journal article

“From MARC silos to Linked Data silos”

Page 9: Finnish National Bibliography Fennica as Linked Open Data · 10/23/2018  · Identity management Libraries have traditionally managed identities (e.g. persons, works, places, subjects)

with separate Works and Instances like BIBFRAME,as enabled by the bibliographic extensions

because it allows us to describe our resourcesfrom a common-sense, Web user perspective

(and we get a metadata haircut for free!)

Special thanks to Richard Wallisfor help with applying schema.org!

Page 10: Finnish National Bibliography Fennica as Linked Open Data · 10/23/2018  · Identity management Libraries have traditionally managed identities (e.g. persons, works, places, subjects)

MARC LinkedData?

Page 11: Finnish National Bibliography Fennica as Linked Open Data · 10/23/2018  · Identity management Libraries have traditionally managed identities (e.g. persons, works, places, subjects)

MARCXML

BIBFRAME RDF

Schema.org RDF

Linked to external URIs

MARC / Aleph seq

With deduplicated works

Work keys

With deduplicated agents

Agent keys

Convert &clean usingCatmandu

Convert usingmarc2bibframe2

Convert to Schema.org using SPARQL CONSTRUCT

YSA subjects

YSO subjects

Corporate names

RDA Media, Content, Carrier

Link against controlled vocabularies using SPARQL

Generate work keysfor merging using SPARQL

Merge worksusing SPARQL

Merge agents(person, org)using SPARQL

RDFstore

https://github.com/NatLibFi/bib-rdf-pipeline

Page 12: Finnish National Bibliography Fennica as Linked Open Data · 10/23/2018  · Identity management Libraries have traditionally managed identities (e.g. persons, works, places, subjects)

Data dump downloads

Publishing as Linked Open Datafor human & machine access

RDFHDT

Jena Fuseki

bib-lod-uiFlask app

HTML+JSON-LD

OpenSearch API

Linked Data RDFRDFstore

RDFN-Triples

MARCrecords Linked Data

Fragmentsserver

SPARQL

LDF

Page 13: Finnish National Bibliography Fennica as Linked Open Data · 10/23/2018  · Identity management Libraries have traditionally managed identities (e.g. persons, works, places, subjects)

Demohttp://data.nationallibrary.fi/bib/me/W00009584100

Page 14: Finnish National Bibliography Fennica as Linked Open Data · 10/23/2018  · Identity management Libraries have traditionally managed identities (e.g. persons, works, places, subjects)

Downloadable dumps

Page 15: Finnish National Bibliography Fennica as Linked Open Data · 10/23/2018  · Identity management Libraries have traditionally managed identities (e.g. persons, works, places, subjects)

Data model documentation

Page 16: Finnish National Bibliography Fennica as Linked Open Data · 10/23/2018  · Identity management Libraries have traditionally managed identities (e.g. persons, works, places, subjects)

Challenges

Page 17: Finnish National Bibliography Fennica as Linked Open Data · 10/23/2018  · Identity management Libraries have traditionally managed identities (e.g. persons, works, places, subjects)

Identity management

Libraries have traditionally managed identities (e.g. persons, works, places, subjects) by using authorized names and headings - i.e. strings.

This is a fragile way to assert identity. It would be better to represent things and give them persistent identifiers. This is not yet standard practice in MARC.

We have a relatively large number of duplicate persons and works in the data set:● cannot know for certain if persons with the same name are really the same● extracting works from traditional bibliographic records is a hard problem

Page 18: Finnish National Bibliography Fennica as Linked Open Data · 10/23/2018  · Identity management Libraries have traditionally managed identities (e.g. persons, works, places, subjects)

“Cool URIs don’t change” -- Tim Berners-Lee

...but we rely on conversion of MARC records that change all the time!

Page 19: Finnish National Bibliography Fennica as Linked Open Data · 10/23/2018  · Identity management Libraries have traditionally managed identities (e.g. persons, works, places, subjects)

Linking

Work

Instance

Person

Subject

Place

Organization

LCSH

Finnish Place Name Registry

Wikidata

Page 20: Finnish National Bibliography Fennica as Linked Open Data · 10/23/2018  · Identity management Libraries have traditionally managed identities (e.g. persons, works, places, subjects)

Linking

Work

Instance

Person

Subject

Place

Organization

LCSH

Finnish Place Name Registry

Wikidata

WorldCat

Other nationallibraries

WorldCat Works

LIBRIS XL

ISNI

VIAF ISNI

Wikidata

Page 21: Finnish National Bibliography Fennica as Linked Open Data · 10/23/2018  · Identity management Libraries have traditionally managed identities (e.g. persons, works, places, subjects)

Opportunities

Page 22: Finnish National Bibliography Fennica as Linked Open Data · 10/23/2018  · Identity management Libraries have traditionally managed identities (e.g. persons, works, places, subjects)

SPARQL endpoint http://data.nationallibrary.fi/bib/sparql

Page 23: Finnish National Bibliography Fennica as Linked Open Data · 10/23/2018  · Identity management Libraries have traditionally managed identities (e.g. persons, works, places, subjects)

Persons featured in >100 works

Page 24: Finnish National Bibliography Fennica as Linked Open Data · 10/23/2018  · Identity management Libraries have traditionally managed identities (e.g. persons, works, places, subjects)

What next?1. Enriching and cleaning the RDF data, e.g. using subclasses like Map

2. More links to other Linked Data sets

3. Expanding to new data sets: Viola discography, Arto article database

Page 25: Finnish National Bibliography Fennica as Linked Open Data · 10/23/2018  · Identity management Libraries have traditionally managed identities (e.g. persons, works, places, subjects)

Thank you!Questions?

[email protected] - @OsmaSuominen

http://data.nationallibrary.fi - @NatLibFiData

Code:https://github.com/NatLibFi/bib-rdf-pipeline

https://github.com/NatLibFi/bib-lod-ui

These slides: http://tinyurl.com/fennica-ld-heldig


Recommended