Post on 18-May-2015
transcript
Digital Humanities in a Linked Data world: Semantic Annotations Dov Winer
NLI / EAJC (DM2E/Judaica Europeana)
http://www.makash.org.il/docs/dh_usp_2013.pdf
Digital Humanities:Scholarly Primitives
Exemplos
Transformação do ciclo de trabalho
escolástico
Projetos de ponta e o universo da
Europeana
Dados linkados: o Web como banco
de dados global
Outline
Digital Humanities
Scholarly Primitives Scholarly Primitives: what methods do
humanities researchers have in common, and
how might our tools reflect this?
John Unsworth Humanities Computing: formal methods, experimental
practice
King’s College, London, May 13, 2000
Discovering Annotating
Comparing Referring
Sampling Illustrating
Representing
Unsworth primitive Bamboo theme of scholarly
practice OCLC Scholarly Information Activity
Discovery Gathering / Foraging
Searching (direct searching, chaining, browsing, probing, accessing)
Sampling
Synthesizing / Filtering Comparing
Collecting (gathering, organizing)
Referring
Contextualizing
Searching (chaining, browsing, probing) Collecting (organizing)
Cross-cutting (monitoring)
Illustrating Representing
Comparing
Conceptualizing, Refining and Critiquing
Reading (scanning, assessing, rereading) Cross-cutting (note taking, translating)
Writing (assembling)
Collaborating (consulting)
Representing Documenting methods Writing (disseminating) Cross-cutting (translating)
Discovering Referring Representing
Managing data
Searching (accessing) Collecting (organizing)
Collaborating (coordinating, consulting)
Annotating Annotating / documenting
Writing (assembling) Cross-cutting (note taking)
Illustrating Representing
Modelling / visualizing Cross-cutting (translating) Writing (assembling)
Representing
Overlapping teaching and research
Collaborating (coordinating) Cross-cutting (translating)
Representing Sharing / dissemination / publishing
Writing (disseminating)
Suggested parenthetically Funding No analogue
Common thread Collaborating
Writing (co-authoring) Collaborating (coordinating, networking, consulting)
Referring
Citation, credit, peer-review Reading (assessing) Writing (dissemination)
Collaborating (consulting)
OCLC: Scholarly Information Practices in the Online Environment http://www.oclc.org/content/dam/research/publications/library/2009/2009-02.pdf?urlm=162919
Project Bamboo Scholarly Practice Report https://wikihub.berkeley.edu/display/pbamboo/Project+Bamboo+Scholarly+Practice+Report
Scholarly primitives: Building institutional
infrastructure for humanities e-Science
Tobias Blanke, Mark Hedges
King’s College London, Centre for e-Research
Future Generation Computer Systems 29 (2013) 654-661
Scholarly Information Practices in the Online
Environment
Carole L. Palmer, Lauren C. Teffeau, Carrie M. Pirmannn
2009 OCLC Online Computer Library Center, Inc.
OCLC Online Computer Library Center 2009 http://www.oclc.org/content/dam/research/publications/library/2009/2009-02.pdf?urlm=162919
Scholarly Primitives
Examples
Republic of Letters network visualisation / Oxford
and Stanford
Republic of Letters networks
American Civil War Freebase Documentation
http://www.freebase.com
Freebase: an open linked data database service
Michele Pasin – Enrico Motta
Ontological requirements for annotation and
navigation of philosophical resources
Synthese (2011) 182:235-267
Ontology based annotation for Philosophy texts
A formal model for describing Philosophical ideas
CIDOC-CRM event centered
A formal model for
describing philosophical
ideas:
Argument-entity.
Problem-area.
Problem.
Method.
View: Thesis, Theory,
Philosophical-system,
School of thought.
Rhetorical figure.
Concept.
Distinction .
http://www.visualdataweb.org/relfinder.php
http://relfinder.dbpedia.org/relfinder.html
Shai Ophir (2010). A New Type of Historical Knowledge. Information
Society,, 26: 144-150, 2010,
Transformação do ciclo de
trabalho escolástico
Ciclo de trabalho escolástico
From S.Gradmann and J.C. Meister, Digital document and interpretation: re-thinking “text” and scholarship in electronic
settings . Poiesis & Praxis, V5 N2 (2008)
From S.Gradmann and J.C. Meister, Digital document and interpretation: re-thinking “text” and scholarship in electronic
settings . Poiesis & Praxis, V5 N2 (2008)
Ciclo de trabalho escolástico
Ciclo de trabalho escolástico
From S.Gradmann and J.C. Meister, Digital document and interpretation: re-thinking “text” and scholarship in electronic
settings . Poiesis & Praxis, V5 N2 (2008)
From Gradmann (2008)
http://www.slideshare.net/gradmans/europeana-semantica
Processing source data in the Humanities: aggregation
From Gradmann (2008)
http://www.slideshare.net/gradmans/europeana-semantica
… modeling …
From Gradmann (2008)
http://www.slideshare.net/gradmans/europeana-semantica
… and digital heuristics?
Projetos de Ponta
Scholarly services
Document Mapping;
Concordance;
Collocation/Cloud; Frequency;
Morphological Analysis;
Syntactic Analysis; Named
Entity Identification; Proxied
SEASR Analytics
Europeana Projects
10/25/2013 37
Prof. Stefan Gradmann
Prof. Christian Bizer
LOD
Dados linkados – o Web como
banco de dados global
Dados Linkados Datasets on the Web
http://www.linkeddata.org
http://esw.w3.org/DataSetRDFDump
http://esw.w3.org/TaskForces/CommunityProje
cts/LinkingOpenData/DataSets/Statistics
Linking Open Data
cloud diagram, by
Richard Cyganiak
and Anja Jentzsch.
http://lod-cloud.net/
Over 31.7 billion
RDF triples
(10/2011)
Over 40 billion
on
February 2012
17.10.2012 41 VI Encontro do CEDAP
Preservação do
Patrimônio e
Democratização da
Linked Data:
structured
data on the Web
David Woood
Marsha Zeidman
Luke Ruth
with
Michael Hausenblas
Manning Publications
MEAP 2013
The next following slides were taken from :
Linked Data and the Semantic Web in an Archival Context
Mark A. Matienzo (2012)
http://matienzo.org
http://www.slideshare.net/anarchivist/linked-data-and-the-
semantic-web-in-the-archival-context
Usage of Linked Data Introduction and Application
Scenarios
Barry Norton (2013)
EUCLID
Education Curriculum for the usage of Linked Data
http://euclid-project.eu/
The essence of RDF: the “triple”
Source: “The thirty minute guide to RDF and Linked Data”, by Ian Davis and Tom Heath
subject property
value
VI Encontro do CEDAP
Preservação do
Patrimônio e
Democratização da
Ross Singer
The Linked Library Data Cloud
LOD4LIB 2010
Source: “The thirty minute guide to RDF and Linked Data”, by Ian Davis and Tom Heath
RDB Direct
Mapping
RDF
automatic
Direct Mapping
RDB2RDF 66
Person
ID (pk) NAME AGE
1 Alice 25
2 Bob NULL
67 RDB2RDF
Direct Mapping on Table
ID (pk) NAME AGE
1 Alice 25
2 Bob NULL
Person
68 RDB2RDF
Direct Mapping on Table
ID (pk) NAME AGE
1 Alice 25
2 Bob NULL
Person
<http://www.ex.com/Person/ID=1>
<http://www.ex.com/Person#NAME>
"Alice" .
69 RDB2RDF
Direct Mapping on Table
RDB
RDF
Dump
SPARQL
Extract – Transform – Load (ETL)
70 RDB2RDF
Music Ontology
71
• MusicArtist
– ArtistEvent, member_of
• SignalGroup
‘Album’ as per Release_Group
• Release
– ReleaseEvent
• Record
• Track
• Work
• Composition
http://musicontology.com/
RDB2RDF
Scale
72
• MusicBrainz RDF derived via R2RML:
lb:artist_member a rr:TriplesMap ; rr:logicalTable [rr:sqlQuery """SELECT a1.gid, a2.gid AS band FROM artist a1 INNER JOIN l_artist_artist ON a1.id = l_artist_artist.entity0 INNER JOIN link ON l_artist_artist.link = link.id INNER JOIN link_type ON link_type = link_type.id INNER JOIN artist a2 on l_artist_artist.entity1 = a2.id WHERE link_type.gid='5be4c609-9afa-4ea0-910b-12ffb71e3821'"""] ; rr:subjectMap [rr:template "http://musicbrainz.org/artist/{gid}#_"] ; rr:predicateObjectMap [rr:predicate mo:member_of ; rr:objectMap [rr:template "http://musicbrainz.org/artist/{band}#_" ; rr:termType rr:IRI]] .
300M
Triples
73 RDB2RDF
74 RDB2RDF
75 RDB2RDF
76 RDB2RDF
77 RDB2RDF
Thank you for your attention!
Dov Winer
dov.winer @ gmail.com
http://www.makash.org.il/docs/dh_usp_2013.pdf