IIPC General AssemblyWashington, DC, May 1 2012
Herbert Van de SompelRobert Sanderson
Los Alamos National LaboratoryResearch Library
IIPC General AssemblyWashington, DC, May 1 2012
Well, almost …
New IIPC Member: Los Alamos National Laboratory, Research Library
2
IIPC General AssemblyWashington, DC, May 1 2012
• aDORe repository for long term storage & access to the Research Library literature collection
• mod_oai plugin for Apache servers (Tools for a Preservation Ready Web – With Old Dominion U.)
• Increased obsession with (lack of) Time and the Web, cf. OAI-ORE Aggregations
• Memento “Time Travel for the Web”
Digital Preservation Interest Exemplified
3
http://www.ctwatch.org/quarterly/multimedia/11 /ORE_prototype-demo/
IIPC General AssemblyWashington, DC, May 1 2012
Memento wants to make it easy to access to Web of the Past
4
Digital Preservation Award 2010
Memento is funded by The Library of Congress
IIPC General AssemblyWashington, DC, May 1 2012 5
Original Resources and Mementos
IIPC General AssemblyWashington, DC, May 1 2012 6
Bridge from Present to Past
IIPC General AssemblyWashington, DC, May 1 2012 7
Bridge from Past to Present
IIPC General AssemblyWashington, DC, May 1 2012 8
Memento Framework
IIPC General AssemblyWashington, DC, May 1 2012
• Collaboration with IA on improved Memento integration in Wayback
• Collaboration with Old Dominion University on improved MementoFox FireFox add-on
• Linked Data archives: DBpedia & Live DBpedia
• Work with WikiPedia aimed at native Memento support
• Collaboration with IIPC on Memento Aggregator
Memento Progress
9
http://mementoweb.org/depot/
IIPC General AssemblyWashington, DC, May 1 2012
Transactional Web Archive:Another Approach toArchiving the Web
10
IIPC General AssemblyWashington, DC, May 1 2012
For example: Heritrix crawler for Internet Archive
11
Crawl-Based Web Archiving
IIPC General AssemblyWashington, DC, May 1 2012
For example: TTApache, PageVault, Vignette Web Capture
12
Server-Side Transactional Web Archiving
IIPC General AssemblyWashington, DC, May 1 2012
• Open source release upcoming• 1+ year development• Tested in various environments• Memento compliant• Deduplication optimizations• Offload to WARC files
• Of interest to institutions that want to pro-actively archive their Web presence
• Of interest regarding archiving dynamic Web content (see Thursday’s Workshop)
LANL Transactional Web Archive Software
13