LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
1
Memento, Open Annotation, SharedCanvas
Robert Sanderson [email protected] Los Alamos National Laboratory
Herbert Van de Sompel [email protected] Los Alamos National Laboratory
With thanks to our collaborators: • Benjamin Albritton, at Stanford • Tim Cole, at UIUC • Jane Hunter, at Queensland • Michael L Nelson, at Old Dominion • Many others!
This research is funded in part by the Andrew W. Mellon Foundation and the Library of Congress.
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
2
Overview … for the Web
• Memento • Time Travel for the Web • Uses
• Open Annotation • Interoperable Annotation for the Web • Use Case: SharedCanvas
• SharedCanvas • Distributed Digital Facsimiles for the Web • Uses
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
3
Memento: Time Travel for the Web
Memento wants to make it Easy
to Navigate the Web of the Past
Website: http://www.mementoweb.org/
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
4
Tate Online Today
Select Date March 16 2008
Tate Online March 16 2008
From UK National Archives
4
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
5
Old Versions of Web Pages Exist …
http://www.cnn.com/ at 2001-09-11: From the Internet Archive
http://en.wikipedia.org/wiki/September_11_attacks at 2001-12-20: From the Wikipedia history
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
6
… But are not Integrated with the Web
• We note three issues:
• Cannot talk about a resource as it used to exist
• Cannot navigate to a prior version of a resource, knowing its current version
• Cannot navigate to the current version of a resource, knowing a prior version
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
7
Memento Framework
• Memento:
• Introduces a uniform version mechanism to the Web: Time
• Does not build new archives, but leverages all existing ones
• Is based on the Architecture of the World Wide Web: resources, representations and links
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
8 8
Original Resources and Mementos
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
9 9
TimeGate: A Bridge from the Present to the Past
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
10 10
Memento Framework in Practice
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
11 11
Memento Uses: Browsing with MementoFox
Get it from: http://bit.ly/memfox
Firefox 3.6+ 4000 downloads 400 daily users
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
12 12
Memento Uses: Time Series Analysis
Historical data retrieved via Memento from DBPedia archive
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
13 13
Memento Uses: Archiving Extent for Scholarship
• 72% in archives and/or still exist
• High proportion of archived URLs, possibly due to academic level and general disciplines
• 78% in archives and/or still exist
• 45% still exist, but not archived! Possibly due to high value, but very discipline specific references
UNT
arXiv
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
14
Web-Centric Annotation: No Persistence
Google Sidewiki Annotation on http://news.bbc.co.uk/ as of 2010-06-14
1
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
15
Web-Centric Annotation: No Annotations
Archived page from: http://www.dracos.co.uk/work/bbc-news-archive/2010/03/08/07.05.html
1
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
16
Web-Centric Annotation: Cross-Linking via Memento
1
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
17
Scholarly Communication and Annotation
Scholarly Communication is increasingly: • Online • Open • Distributed • Collaborative • Data-Oriented
Annotation is a scholarly primitive, spanning discipline and level. Need to ensure that Digital Annotations fall under these headings!
• Apply the standards and architecture of the World Wide Web to the Annotation use case. • Even if scholar doesn’t share annotations with others, she will want to access them from different tools and environments.
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
18
Open Annotation
• Website: http://www.openannotation.org/
• Focus on interoperable sharing of annotations • Web-centric and open, not locked down silos • Create, consume and interact in different environments • Build from a simple model for simple cases, to more detailed for complex scholarly annotation requirements
• Status: Beta, with 9 ongoing funded experiments to inform 1.0
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
19
Basic Model
The basic model has three resources: • Annotation (an RDF document) • Body (the ‘comment’ of the annotation) • Target (the resource the Body is ‘about’)
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
20
Basic Model Example
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
21
Segments of Resources
Most annotations are about part of a resource
Different segments for different media types:
• Text: paragraph, arbitrary span of words • Image: rectangular or arbitrary shaped area • Audio: start and end time points, track name/number • Video: area and time points • Other: slice of a data set, volume in a 3d object, …
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
22
Constraints
• Introduce a Constraint that describes the segment of interest • And a ConstrainedTarget that identifies the segment of interest • Constraints are resources, so can be expressive and detailed
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
23
Constraint Example
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
24
Annotation Protocols
Protocol: publish, subscribe, consume tied together
Unlike previous systems, Open Annotation does not mandate a protocol.
No reliance on a client/server combination gives the client autonomy.
Instead we promote a publish/subscribe methodology, where annotations may be stored and consumed from anywhere.
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
25
Publish/Subscribe Method
publish
We don’t specify how this transfer should occur
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
26
publish subscribe
Publish/Subscribe Method
Nor this.
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
27
publish subscribe consume
Publish/Subscribe Method
Nor this.
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
28
Open Annotation Usage: SharedCanvas
http://www.shared-canvas.org/
Digital surrogates for digitized documents: • Initial use case: Medieval Manuscripts
A collaborative future: • Rich landscape of interconnected
repositories, with seamless user interfaces
We need: • Shared description to act as input to
shared viewing environments
BNF f.fr 113, folio 1 recto
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
29
Naïve Approach: Transcribe Images Directly
Align multiple images, pages without images, fragments of pages… ?!
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
30
Canvas Paradigm
A Canvas is an empty space in which to build up a display • HTML5, SVG, PDF, … even Powerpoint! • Can "paint" many different resources, including text, images and audio, on to a Canvas
We can use a Canvas to represent a page of a manuscript.
Distributed nature is fundamental in the requirements • Idea: Use Annotations to paint resources and comments • Annotations target the Canvas instead of the Images
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
31
Canvas to Page Relationship
The Canvas's top left and bottom right corners correspond to the corners of the page
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
32
Annotate Image to Canvas
We can paint the canvas by annotating it with resources.
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
33
Annotate Text to Canvas Segment
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
34
Annotate Choice of Image
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
35
Annotate Many Texts
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
36
SharedCanvas: Implementation
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
37
Annotate Audio Resources
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
38
SharedCanvas: Implementation
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
39
SharedCanvas: Distributed Transcription …
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
40
… of Lab Notebooks and Dissertations?
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
41
Summary
SharedCanvas brings primary research objects to the desktop in a powerful, extensible and interoperable fashion
Memento wants to make it Easy to Navigate the Web of the Past
Open Annotation enables sharing Annotations between users, tools and environments.
LANL Proto Team: Memento, Open Annotation, SharedCanvas NLLC 2011, LANL, New Mexico, 6th October 2011
42
Thank You!
Me: Robert Sanderson [email protected] [email protected] @azaroth42
Memento: Web: http://www.mementoweb.org/
Open Annotation: Web: http://www.openannotation.org/
SharedCanvas: Web: http://www.shared-canvas.org/
These Slides: http://slidesha.re/…