
Creating and Consuming Metadata from Transcribed Historical Vital Records for Ingestion in a...

Date post: 14-Apr-2017
Upload: christophe-debruyne
IRL: Irish Record Linkage, 1864-1913. Creating and Consuming Metadata from Transcribed Historical Vital Records for Ingestion in a Long-term Digital Preservation Platform. Dolores Grant (a), Christophe Debruyne (b), Rebecca Grant (a), and Sandra Collins (a). (a) Digital Repository of Ireland, Royal Irish Academy, Dublin, Ireland; (b) ADAPT @ Trinity College Dublin, Dublin, Ireland. October 27, 2015 @ META4eS
Transcript
Page 1: Creating and Consuming Metadata from Transcribed Historical Vital Records for Ingestion in a Long-Term Digital Preservation Platform

IRL: Irish Record Linkage, 1864-1913

Creating and Consuming Metadata from Transcribed Historical Vital Records for Ingestion in a Long-term Digital Preservation Platform

Dolores Grant (a), Christophe Debruyne (b), Rebecca Grant (a), and Sandra Collins (a)

(a) Digital Repository of Ireland, Royal Irish Academy, Dublin, Ireland
(b) ADAPT @ Trinity College Dublin, Dublin, Ireland

October 27, 2015 @ META4eS

Page 2

Developing a platform applying semantic technologies to historical birth, death and marriage certificates. Answering questions such as: “How accurate are historic maternal mortality rates (MMR) and infant mortality rates (IMR) for Dublin?” Team consists of researchers (historians), digital archivists, and knowledge engineers.

[Figure: team roles: Knowledge and Linked Data Engineers, Historians, Digital Archivists.]

Page 3

General Registers Office (GRO)

•  Vital registration data: birth certificates, death certificates and marriage records.

•  Digitised TIFF images of hardcopy indexes and registers.

•  2 TB of data

•  Database describing the digitised records, allowing searches on some fields.

© General Records Office of Ireland 2014

Page 4

In prior work (see [1]), we created a Linked Data platform that allowed Digital Archivists to transcribe register pages, which were then transformed into RDF. That RDF was then used to populate other triplestores to analyze that data. Part of the project, however, was also to investigate the digital long-term preservation of the digitized register pages, and the corresponding RDF.

[Figure: Creation of the IRL Knowledge Base. Diagram labels: Digital Archivist, Relational Database, Transformation, Vital Records Ontology, GRO Triplestore, Separation of Concerns, Historical Events Ontology, IRL Triplestore, Data Analytics, Historian, LOD Cloud.]

Page 5

Related work

•  Related work on the preservation of harvested metadata exists, e.g., in the context of GLAMs.

•  Little work was found in the context of historical (vital) records. It was limited to integration problems and to the problem of record linkage in databases.

•  We also wanted to focus on transcription of historical vital records that is agnostic to any particular research project (separation of concerns).

Page 6

Method: Creating RDF Documents

•  Register pages are identified by a stamp number (e.g. “4646439”). We collect the triples around a page and related records with the following query to create an RDF document.

•  PREFIX rec: <http://purl.org/net/irish-record-linkage/records#>
   DESCRIBE * {
     ?page rec:stampNumber "4646439" ;
           rec:withRecord ?record .
   }

•  We also add a foaf:primaryTopic statement to the document.
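The per-page query on this slide could be generated for any stamp number along the following lines. This is a minimal sketch, not the project's code: the function names are hypothetical, the two example.org IRIs are made up for illustration, and the slides do not say how the foaf:primaryTopic statement was actually produced.

```python
import re

REC_NS = "http://purl.org/net/irish-record-linkage/records#"
FOAF_PRIMARY_TOPIC = "http://xmlns.com/foaf/0.1/primaryTopic"


def build_describe_query(stamp_number: str) -> str:
    """Return the DESCRIBE query from the slide for one register page."""
    # The stamp number on the slide is a digit string; rejecting anything
    # else also keeps arbitrary text out of the query literal.
    if not re.fullmatch(r"[0-9]+", stamp_number):
        raise ValueError(f"unexpected stamp number: {stamp_number!r}")
    return (
        f"PREFIX rec: <{REC_NS}>\n"
        "DESCRIBE * {\n"
        f'  ?page rec:stampNumber "{stamp_number}" ;\n'
        "        rec:withRecord ?record .\n"
        "}"
    )


def primary_topic_triple(document_iri: str, page_iri: str) -> str:
    """Return one N-Triples line linking an RDF document to its page."""
    return f"<{document_iri}> <{FOAF_PRIMARY_TOPIC}> <{page_iri}> ."


if __name__ == "__main__":
    print(build_describe_query("4646439"))
    # Both IRIs below are hypothetical placeholders.
    print(primary_topic_triple(
        "http://example.org/doc/4646439",
        "http://example.org/page/4646439",
    ))
```

The query text would still have to be sent to a SPARQL endpoint or triplestore client; that step is left out because the slides do not name one.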

Page 7

Method: Creating Qualified Dublin Core Metadata

•  Following the guidelines formulated in [2], we used XSPARQL [3] to transform RDF documents into Qualified Dublin Core (QDC) metadata documents. We thus have an RDF file and a QDC file for each register page.

Page 8

[Figure: data model with the Qualified Dublin Core element each field maps to, in square brackets.]

Register Page
•  District/Union/County [SPATIAL COVERAGE]
•  Superintendent registrar's district
•  Date certified as true copy by superintendent registrar [ISSUED]
•  Date certified by registrar [CREATED]
•  Forename/surname registrar on page
•  Forename/surname superintendent registrar [CREATOR]
•  Page number/Volume/Quarter
•  Stamp number [IDENTIFIER / used in TITLE]
•  Year registered [TEMPORAL COVERAGE]

Record
•  Date of registration
•  Title/forename/surname registrar
•  Amendments
•  Number in register

Certificate
•  Forename/surname (of subject) [PART OF DESCRIPTION]
•  Address (of subject)
•  Sex (of subject) [PART OF DESCRIPTION]
•  Forename/surname informant
•  Qualification of informant
•  Relationship of informant
•  Residence of informant

Death Record
•  Forename/surname of registrar
•  Date of death [PART OF DESCRIPTION]
•  Cause of death and duration of illness
•  Condition
•  Age last birthday
•  Place of residence
•  Rank, profession or occupation

Multiplicities shown in the diagram: 1 and 0..10.
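To make the bracketed mapping concrete, the sketch below writes one register page as a Qualified Dublin Core XML document. It is an illustration under stated assumptions, not the authors' XSPARQL transformation: plain Python with the standard library stands in for XSPARQL, the root element name `qualifieddc`, the dictionary keys, and the title wording are hypothetical, and the sample values in the usage are invented.

```python
import xml.etree.ElementTree as ET

DC = "http://purl.org/dc/elements/1.1/"
DCTERMS = "http://purl.org/dc/terms/"
ET.register_namespace("dc", DC)
ET.register_namespace("dcterms", DCTERMS)


def page_to_qdc(page: dict) -> str:
    """Serialise one register page (a plain dict) as QDC XML."""
    root = ET.Element("qualifieddc")  # assumed root element name

    def add(namespace: str, name: str, value) -> None:
        # Skip fields that were not transcribed for this page.
        if value:
            ET.SubElement(root, f"{{{namespace}}}{name}").text = str(value)

    stamp = page["stamp_number"]
    add(DC, "title", f"Register page {stamp}")          # stamp number used in TITLE
    add(DC, "identifier", stamp)                        # IDENTIFIER
    add(DC, "creator", page.get("superintendent_registrar"))    # CREATOR
    add(DCTERMS, "spatial", page.get("district"))       # SPATIAL COVERAGE
    add(DCTERMS, "temporal", page.get("year_registered"))       # TEMPORAL COVERAGE
    add(DCTERMS, "created", page.get("date_certified_by_registrar"))  # CREATED
    add(DCTERMS, "issued", page.get("date_certified_true_copy"))      # ISSUED
    # Subject name, sex and date of death are PART OF DESCRIPTION.
    for text in page.get("descriptions", []):
        add(DC, "description", text)
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    # Invented sample values, for illustration only.
    print(page_to_qdc({
        "stamp_number": "4646439",
        "district": "Example District",
        "year_registered": 1864,
        "descriptions": ["Example subject, female, died 1864-01-01"],
    }))
```

Keeping the mapping in one function mirrors the slide: each bracketed tag corresponds to exactly one `add` call, so a change to the mapping touches a single line.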

Page 9

Page 10

[Figure: Part of the IRL Platform. Diagram labels: Digital Archivist, Relational Database, Transformation, Vital Records Ontology, GRO Triplestore, RDF File 1 to n, transform, Qualified Dublin Core XML 1 to n, Register Page 1 to n, ingestion, Digital long-term preservation platform.]

Page 11

Method: Bulk Ingestion into a Digital Long-Term Repository

•  We adopted the Digital Repository of Ireland: http://repository.dri.ie/

•  It provides item-by-item ingestion, or bulk ingestion via command-line tools.

•  Files (digitized register pages, RDF and QDC) are named according to a convention that relates each QDC file to the digitized asset and RDF transcription.
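A pre-ingestion check based on such a naming convention might look like the sketch below. The slides do not spell out the convention, so this assumes that the three files for a page share a stem (for example the stamp number) and differ only by extension; the extensions, role names, and function names are all hypothetical, and the repository's own command-line tools are not modelled.

```python
from collections import defaultdict
from pathlib import PurePath

# Assumed convention: one stem per register page, one extension per role.
ROLE_BY_SUFFIX = {".tif": "asset", ".rdf": "transcription", ".xml": "metadata"}


def group_by_page(filenames):
    """Group file names by stem, keeping only the three expected roles."""
    groups = defaultdict(dict)
    for name in filenames:
        path = PurePath(name)
        role = ROLE_BY_SUFFIX.get(path.suffix.lower())
        if role is not None:
            groups[path.stem][role] = name
    return dict(groups)


def incomplete_pages(groups):
    """Return, per stem, the roles that are still missing."""
    expected = set(ROLE_BY_SUFFIX.values())
    return {
        stem: sorted(expected - set(found))
        for stem, found in groups.items()
        if expected - set(found)
    }


if __name__ == "__main__":
    # "4646440" is an invented stem used to show an incomplete page.
    files = ["4646439.tif", "4646439.rdf", "4646439.xml", "4646440.tif"]
    print(incomplete_pages(group_by_page(files)))
```

Running a check like this before a bulk upload would catch a page whose asset lacks its RDF transcription or QDC file.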

Page 12

Page 13

Conclusions and Future Work

•  We created an automated process for creating and uploading assets, RDF transcriptions and associated metadata to a long-term preservation platform.

•  Evaluation is limited by the data sharing agreements, both in terms of discoverability on the repository via faceted search and in terms of suitability of the metadata via expert feedback.

•  A comparison of Qualified Dublin Core with Encoded Archival Description (EAD) is to be conducted as well.

Page 14

References

1.  Debruyne, C., Beyan, O.D., Grant, R., Collins, S., Decker, S.: On a Linked Data Platform for Irish Historical Vital Records. TPDL 2015: 99-110

2.  Bustillo, M., Collins, S., Gallagher, D., Grant, R., Harrower, N., Kenny, S., Ní Cholla, R., O’Carroll, A., Redmond, S., Webb, S.: Qualified Dublin Core and the Digital Repository of Ireland (Grant, R. ed.). Tech. rep., Maynooth: Maynooth University; Dublin: Trinity College Dublin; Dublin: Royal Irish Academy; Galway: National University of Ireland, Galway (2015)

3.  Dell’Aglio, D., Polleres, A., Lopes, N., Bischof, S.: Querying the Web of Data with XSPARQL 1.1. In: Verborgh, R., Mannens, E. (eds.) Proceedings of the ISWC Developers Workshop 2014, co-located with the 13th International Semantic Web Conference (ISWC 2014), Riva del Garda, Italy, October 19, 2014. CEUR Workshop Proceedings, vol. 1268, pp. 113–118. CEUR-WS.org (2014)

Page 15

Questions?

More information

•  Twitter: @IRL_Project

•  Project website: http://irishrecordlinkage.wordpress.com/

