Post on 14-Aug-2020
transcript
Dela bilder på webben med IIIF
Fallstudie: Riksarkivet
Mats Berggren, Riksarkivet, IT-enheten, 2019-03-11
• Digitization at the Swedish National Archives
• Using IIIF and Universal Viewer
• Current development
IIIF case study: Swedish National Archives
IIIF case study: Swedish National Archives
• Digitization at the Swedish National Archives
• Using IIIF and Universal Viewer
• Current development
• The Swedish Archival Agency founded in 1618
Main office in Stockholm
Seven regional state archives
Department for digitization (DIT) located in Fränsta and Ramsele
Currently about 500 employees in total
• Collections
Paper documents in total about 750 shelf kilometers
Audio and video, analogue and digital
Born digital data delivered from agencies
The Swedish National Archives
• Scanning of documents and microfilm
Scanning of documents (Fränsta) and microfilm (Ramsele)
Microfilm scanning by FamilySearch in Salt Lake City, USA.
• Digitized collections to date:
About 512000 volumes (boxes) of archival material
About 210 million digital images
About 66.6 million images available for public access on Internet
Digitization of documents
• Image file formats for long term preservation.
TIFF/IT (TIFF 6.0), Black/White, Group4, BitsPerSample=1, 400 dpi
TIFF/IT (TIFF 6.0), Grayscale, BitsPerSample=8, 300dpi
TIFF/IT (TIFF 6.0), Colour RGB, BitsPerSample=8x3, 300 dpi
• Image file formats for public access.
DjVU, Used for presentation and public access by the National Archives and Lantmäteriet (The Swedish Mapping, Cadastral and Land Registration Authority). Produced through conversion from TIFF
JPEG, Used by a few projects. Accepted as delivery format from agencies
Image formats
Digitization of audiovisual media
• Digitization of analogue audio/video media (project DIANA)
Digitization mainly done in house by the National Archives
Digitization also done by the Royal Library for the National Archives
Project started 2015, digitization started 2017
• Analogue audio/video information:
Approximately 20000 hours of audio and about 10000 hours of video
Corresponds to 40 TB of digital audio and 500 TB of digital video
• Audio files:
WAV-format. All audio samples digitized from analogue media should be digitized with a minimum sampling frequency of 48 kHz (96 kHz recommeded), 24 bit.
• Video files:
Matroska file format with FFV1 video encoding. Specification as follows:
Wrapper: Matroska (.mkv)
Codec: FFV1
ChromaSub: 4:2:2
Bit depth: 10-bit
Colorspace: YUV
Video standard: PAL
Framerate: 25
Audio: (L)PCM
Audio: >= 48 kHz
Audio bit depth: >= 16 bit
Audiovisual formats for long term storage
• Audio files:
MPEG-1 Layer 3 (Mp3),*.mp3, 2 ch Stereo, 48kHz, >=CBR 128 kbps
• Video files:
MPEG-4, Advanced Video Coding (Part 10) (H.264/AVC)
Wrapper: mp4
Bitrate: 2000-3000kbps
Codec: avc1
Frame: 720*576 pixels
Frame rate: 25 fps
Display aspect ratio: 4:3
Standard: PAL
Color space: YUV
Scan type: Progressive
Bit depth: 8 bits
Audio: AAC, ca 160 Kbps stream bitrate, 2 Ch, 44 alt. 48 kHz, 16 bit
Audiovisual formats for public access
• Digitization at the Swedish National Archives
• Using IIIF and Universal Viewer
• Current development
IIIF case study: Swedish National Archives
• The Search service (https://sok.riksarkivet.se) provides access to:
The National Archives archival information system (ARKIS)
The Swedish National Archival Database (NAD). Fonds and inventories from 180 swedish archival institutions
Various special databases: Swedish historical censuses (1880, 1890, 1900, 1910 and 1930), inventory of estates, property records, medieval charters, historical maps, etc etc
• Data:
Data from 224000 fonds
Archival inventory objects (fonds, series, items) in total: 13 million
Special database objects: 31 million records
SOLR objects in total: About 45 million
The National Archives on line search service
• A new image viewing application was developed 2015-2016. The reasons were:
No support for the DjVU-format on many platforms (mobiles, tablets)
A desire to have deep zoom (like in DjVU!)
A desire to make the image viewing application independent of image formats
A desire to incorporate more metadata about the images in the presentation
• The new application:
Universal Viewer: the National Archives participates in the development
The IIIF-server was implemented in house by the National Archives based on ”Level 1” of the specifications ”IIIF Image API 2.0” and ”IIIF Presentation API 2.0”
All image files for access are stored in an object store (Hitachi Content Platform)
Metadata in the IIIF manifests are generated on demand from the ARKIS-database and from other databases providing indexing information for images
New image viewing application 2016
Architecture for image access
Object Store (HCP)
iiif.riksarkivet.se
IIIF-server
sok.riksarkivet.se
Search Service
ResearcherResearcher
The Swedish National Archival Database
(MSSQL Server)
Special Databases (Census data etc)(MSSQL Server)
SOLR CloudEnterprise search platform
Digital Images
PDFDocuments
Load balancing cluster
IIIF Pool
External userInternal user
Object Store (HCP)
Digital Images
PDF-Documents
SpecialDatabases
ARKIS-DB
Internet
Internal
lbiiif.riksarkivet.seVirtual iiif-server
Image access with a load balanced IIIF-server pool
Access to JPEG- and DjVU-files through IIIF
IIIF-Server
Object Store (HCP)
DigitalImages
Application DPSS from Cuminas
On demand conversion of DjVU to JPEG when files are requested by the IIIF-server
JPEG-files
DjVU-files
• Digitization at the Swedish National Archives
• Using IIIF and Universal Viewer
• Current development
IIIF case study: Swedish National Archives
Using IIIF and UV for playback of audio/video
• Current development
During 2018 the National Archives has implemented playback of
audio/video as a part of the Universal Viewer based application
One manifest for each digitized analogue media (tape or video etc)
Manifests will contain references to audio or video files and to
documentation files in PDF-format
The IIIF-server application has been modified to support audio/video
and the “IIIF Presentation API 3.0” (https://iiif.io/api/presentation/3.0)
Topgrafiskt register på Riksarkivet (TORA) – Historical villages and settlements
• Suecia images from Royal Library
• Map layers from Lantmäteriet
TORA – map application (test version)
• Attribute data from LOD-API
• Map images from IIIF-API
IIIF case study: Example applications
• Examples of IIIF-based applications at the Swedish National Archives
Search service with image viewer: https://sok.riksarkivet.se
Example of Audio access:
https://demosok.riksarkivet.se/?Sokord=SE/RA/326418
Example of Video access:
https://demosok.riksarkivet.se/?Sokord=SE/KRA/0335
TORA map application: https://toramaptest.riksarkivet.se