Making Web Annotations Persistent over Time

Post on 11-May-2015

2,314 views 1 download

Tags:

description

JCDL 2010 presentation about using Memento to reconstruct the state for web resources involved in annotation.

transcript

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

RobertSandersonrsanderson@lanl.govazaroth42@gmail.com

HerbertVandeSompelherbertv@lanl.govhvdsomp@gmail.com

DigitalLibraryResearchandPrototypingTeam

LosAlamosNa@onalLaboratory,USA

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:2

Overview

•  BackgroundandResearchQues@on

•  OpenAnnota@onCollabora@on

•  Memento

•  Experiments

•  Conclusions

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:3

10+YearsofAnnota@onResearch

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:4

Our2020Vision

web

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:5

ScholarshipisMovingtotheWeb

•  Fromsilosofhighlycuratedcontenttoopenworldmodels

•  Publishingofscholarlycommunica@ononline:•  FirstJournals(DLib,PLoS,…)•  ThenUsers(Vivo,ORCID,…)•  NowData(NVO,OREChem,…)

•  NexttheScien@ficProcess?(MyExperiment,Annota@on,…)

•  Withthewebcomesnewchallenges…

•  Persistence

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:6

WebResourcesChange…

Google Sidewiki Annotation on http://news.bbc.co.uk/ as of 2010-06-14

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:7

ArchivedCopy,ButNoAnnota@ons

Archived page from: http://www.dracos.co.uk/work/bbc-news-archive/2010/03/08/07.05.html

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:8

CanWeFixThis?Automa@cally?

The desired outcome: Display the correct representation of the Web Resource with the Annotation.

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:9

PreviousAnnota@onPersistenceMethods

•  Migrateannota@onsfromoneversiontothenext:•  Seektodiscovernewloca@onofoldtargetsegment

•  Otherwisediscardtheannota@onasnolongerrelevant•  TreatstheAnnota@onasofsecondaryimportance

•  Focusedonheuris@cs:•  Crossformat,crossloca@on

•  Editedtextinsamedocument

•  Dynamicallyscalingtargetareas,marksofannota@on•  …

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:10

PersistentandWeb‐Centric?

•  OAC:DescribeAnnota@onsinaWeb‐centricModel

+

•  Memento:MakeNaviga@ngthePastWebEasy

=

•  GivenanAnnota@on,displayappropriatearchivedWebResource?•  GivenanarchivedWebResource,displayappropriateAnnota@ons?

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:11

OpenAnnota@onCollabora@on

•  Focus: Interoperabilitybetweensystemstoenablesharing

•  Founda@on: ArchitectureoftheWorldWideWeb

•  Framework: LinkedDataGuidelines

•  Funding: MellonFounda@onfor18months

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:12

OACDataModel:Basics

•  Anoac:Annota@onisanore:Aggrega@onoftwoormoreresources,suchthatone(oac:Body)annotatesatleastoneother(oac:Target)

•  WegetOAI‐OREen@@esforfree(ore:ResourceMap,ore:Proxy)

•  Allresourcesareregularwebresources

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:13

OACDataModel:Basics

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:14

OACDataModel:Segments

•  MostAnnota@onsareaboutpartofaresource

•  Resourcesareatomic,intermsofiden@fica@on(byaURI)

•  SegmentsoftheresourceapplyinthecontextoftheAnnota@on

•  Solu@on:aaachaDescrip@onoftheSegmentofinteresttoanOREProxyfortheresource

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:15

OACDataModel:Segments

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:16

OACDataModel:Time

•  Asregularwebresources,ResourceMap,BodyandTargethaverepresenta@onsthatcanchangeover@me

•  TheResourceMap,BodyandTargetcanchangeindependentlyofeachother

•  IfanAnnota@oninvolvesresourcesastheyexistedatapar@cularpointin@me,thisneedstoberecorded

•  ThreedifferentTimemodelsarepossible…

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:17

TimelessAnnota@ons

•  TheAnnota@onisalwaysapplicable,regardlessoftherepresenta@onservedfromtheURIsoftheBodyandTargets.

•  Example:"ThisisthehomepageofCNN"

•  TimelessAnnota@onsdonotneedaspecial@mestamp.

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:18

UniformTimeAnnota@ons

•  TheAnnota@onisnotalwaysapplicable,butpertainstothestateoftheBodyandTargetatasinglemomentin@me.

•  Example:Tweetisaboutcontemporarystateofawebpage.

•  Addmem:whenpropertytoAnnota@on

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:19

VariedTimeAnnota@ons

•  TheAnnota@onisnotalwaysapplicable,butpertainstothestateoftheBodyandTargetatdifferentmomentsin@me.

•  Example:Blogpostisaboutpreviousday'sstateofawebpage•  Addmem:whenpropertytoProxiesforresources

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:20

Memento

•  Weknow:theURIoftheTargetandthe@meitwasannotated

•  Wedon'tknow:theURIwhereitisarchived•  Weneed:Memento

•  MementowantstomakeitEasytoNavigatetheWeb’sPast

•  Basicidea:UseHTTPtogetfromthecurrentresourcetothearchivedcopyofthatresourceatagiven@me,ratherthansearching

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:21

IssueswiththePastWeb

•  Newnamesforarchivedresources

• Whatwascnn.com,becomesarchive.org/web/20010120…•  …Andlotsofothernames

• Withnowaytodiscoverthemwithoutsearchingbyhand

•  Peopledonotliketosearch•  Especiallywhenacomputercoulddoit.

•  Naviga@onisinconsistent•  Stuckinwebarchivecontentsilo(URIsrewriaen)•  Orendupbackinpresent(URIsnotrewriaen)

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:22

TheWebwithoutTimeDimension

eg: http://www.cnn.com/ eg: http://web.archive.org/web/ 20020209001709rn_1/www.cnn.com/?

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:23

TheWebwithTimeDimensionfromMemento

eg: http://www.cnn.com/ eg: http://web.archive.org/web/ 20020209001709rn_1/www.cnn.com/?

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:24

current

Apr 10 2001, 21:39:30 UTC

Aug 15 2004, 08:45:27 UTC

Aug 15 2007, 19:21:58 UTC

www.cnn.com web.archive.org

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:25

current

Apr 10 2001, 21:39:30 UTC

Aug 15 2004, 08:45:27 UTC

Aug 15 2007, 19:21:58 UTC

www.cnn.com web.archive.org

Original Resource

Mementos ?

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:26

current

Apr 10 2001, 21:39:30 UTC

Aug 15 2004, 08:45:27 UTC

Aug 15 2007, 19:21:58 UTC

www.cnn.com web.archive.org

Original Resource

Mementos ?

TimeGate

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:27

current

Apr 10 2001, 21:39:30 UTC

Aug 15 2004, 08:45:27 UTC

Aug 15 2007, 19:21:58 UTC

www.cnn.com web.archive.org

Original Resource

Mementos ?

TimeGate

Conneg with TimeGate to Mementos Link Headers

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:28

Experiments

1.  Givenanannota@on,findtheappropriaterepresenta@ons•  Createannota@ononresourceknowntochange•  Canweusetheinforma@onfromtheannota@ontofaithfullyrecreatetheenvironmentthroughMemento?

2.  Givenanarchivedresource,findtheappropriateannota@ons•  Createannota@onsatdifferent@mesonresourceknowntobearchived

•  Canweusetheinforma@onfromMementotofindtheappropriateannota@ons?

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:29

Experiment1

Retrieve

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:30

Experiment1

Reconstruct

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:31

Experiment1:CreateAnnota@on

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:32

Experiment1:TestwithoutMemento

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:33

Experiment1:TestwithMemento

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:34

Experiment2

•  Need to find Original URI, start, end time of representation •  Need searchable collection of annotations

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:35

Experiment2:CreateAnnota@ons

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:36

Experiment2:CreateAnnota@ons

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:37

Experiment2:CreateAnnota@ons

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:38

Experiment2:TestwithoutMemento

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:39

Experiment2:TestwithMemento

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:40

Conclusions

•  Annota@on,asacorescholarlyprac@ce,isincreasinglyweb‐based

•  WeproposeusingOACandMementotoprovideasolu@onforpersistenceofannota@ons,bydisplayingannota@onsintheiroriginalcontextanddisplayingrelevantannota@onsforarchivedresources

•  Archivingofannotatedandannota@ngresourcesimportant

•  Interes@ngresearchques@onofwhetherannota@onspansmul@plearchivedresources

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:41

ThankYou

•  Authors:•  azaroth42@gmail.com/rsanderson@lanl.gov

•  hvdsomp@gmail.com/herbertv@lanl.gov

•  OAC:•  hap://www.openannota@on.org/•  hap://groups.google.com/group/oac‐discuss

•  Memento:•  hap://www.mementoweb.org/

•  hap://groups.google.com/group/memento‐dev

•  ThanksTo:•  ScoaAinsworth,LudaBalakireva,TimCole,AnnaGerber,BernhardHaslhofer,EricHetzner,JaneHunter,CliffLynch,MichaelNelson,DougReside,HariharShankar

Persistent Web Annotations Rob Sanderson, Herbert Van de Sompel

JCDL 2010, June 21-25, Surfers Paradise, Australia

Slide:42

MementoHTTPFlowHEAD R, (Accept-Datetime)

LinkG

302M, Vary, TCN, LinkR,B,M

200, Content-Datetime, LinkR,B,M

GET G, Accept-Datetime

GET M, (Accept-Datetime)