COLLATE – Collaboratory for Annotation, Indexing and Retrieval of Digitized Historical Archive Material
DELOS International Cooperation Workshop, May 30, 2003
Ingo Frommholz, Fraunhofer IPSI, Darmstadt
[email protected]
http://ipsi.fraunhofer.de/
DELOS International Cooperation Workshop Prague, May 30, 2003
Digital Libraries in Cultural Heritage
- Valuable historic document collections exist, but are scattered in national archives
  - Sources mostly not available online
  - Difficult-to-use database & referencing systems
  - Lack of content-based indexing & access
- Valuable expert domain knowledge exists, but mostly inaccessible to externals
  - Tacit knowledge, insufficiently documented
  - Professional communities lack technology support for collaborative knowledge working
The COLLATE Project (IST-1999-20882)
Constructing a “Collaborative Information Space”
- Preserve historic documents in a distributed multimedia repository
  - European historic film documentation (1920s and 1930s)
    - Historic film censorship (legal docs, applications & decisions, correspondence, etc.)
    - Press material (articles)
    - Photos (stills, portraits) & film posters
    - Digital film/video fragments
  - XML metadata (cataloguing & content indexing)
- Ensure accessibility
  - Work environment for content indexing & annotation
  - Content- and context-based retrieval
- Evaluate acceptability
  - Preservation case studies by film experts
  - Empirical studies of real-life user behavior
Partners
Content providers / pilot users
- Deutsches Filminstitut – DIF, Frankfurt, Germany
- Filmarchiv Austria, Vienna, Austria
- Národní Filmový Archiv, Prague, Czechia
Technology developers
- Fraunhofer IPSI, Darmstadt, Germany
- University of Bari, LACAM Lab, Bari, Italy
- Sword ICT S.r.l., Bari, Italy
Evaluation partner
- Risø National Laboratory, Systems Analysis Dept, Denmark
Why a Cultural Collaboratory?
- Support existing work processes in cultural sciences
  - Interpretative content analysis of documents
  - Reconstruct “unity” of cultural phenomena, interlinking scattered knowledge sources
- Offer new knowledge working environment
  - Organize collaborative work
  - Bring together divergent user communities & roles
- Create enhanced cultural information services
  - Raise awareness & visibility of cultural archives
Censorship / Registration Cards
Conceptual Integration: COLLATE Ontology
[Figure: layered COLLATE ontology]
Levels:
- Generic Level: ABC Model
- Cultural Heritage Domain Level: CIDOC CRM, FRBR
- Film Archive Subdomain Level: LC TGM II, FIAF Classification
- COLLATE Application Level: Collate Keywords
Concepts shown: Collate Entity; Form, Genre; Physical Characteristics; Abstraction; Film and Censorship Topic; Work; Manifestation; Temporality; Actuality; Location; Situation; Event; Action; Agent; Moving Image; Film Agent; Film Event; Film Activity; Film Situation; Censorship Document; Film Censorship Agent; Film Censorship Activity; Film Censorship Event
Model of the Concept “Film Life Cycle”
[Figure: graph model of the film life cycle]
Nodes: film creation; original version; censorship; shortened version; Directing; censorship decision; Work; Film copy A; Film copy B
Relation labels: precedes; follows; hasAction; hasParticipant; hasResult; realizesWork; involves
System Architecture (OAIS)
[Figure: OAIS-based system architecture; the connections between components are labelled SOAP]
Components shown:
- Ingest
- Scanned Documents
- Document Pre-Processing
  - Digital Watermarking
  - Image & Video Analysis
  - Document Processing & Classification (WISDOM++)
- Descriptive Information (DigiProt)
- Archival Storage (Distributed Data Repository)
- Data Management: XML Content Manager, Indexing Service
- Access (Retrieval Service Provider)
- Administration
- Collaboration
- User-generated Information: Cataloguing, Indexing, Annotation, Interlinking
Collaboration in COLLATE
[Figure: forms of collaboration in COLLATE]
- Collaboration is about: cataloguing, annotation, interlinking, indexing, terminology development
- external
  - traditional: meetings, phone, mail, email
  - computer-supported/online: discussion forum
- internal (COLLATE system)
  - implicit
  - explicit: specified relation types, communication (e.g. requests)
Discourse Structures
“Discourses represent extended communication between two or more participants in a shared context.” (Rich & Sidner, 1998)
- Establishing a discourse context
- Modeling discourse as interrelated nested annotations
- Annotation thread reflects scientific discourse
- Typed links (discourse structure relations, DSR) between
  - Document and annotation
  - Annotation of annotations
[Figure: annotation thread with nested annotations Annotation1 to Annotation5]
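A minimal sketch of how such a thread of nested annotations with typed links could be represented. The class names, field names, document identifier and the shape of the example thread are assumptions made for illustration; they are not taken from the COLLATE system, and the annotation texts are invented.

```python
from dataclasses import dataclass, field
from typing import Iterator, List, Tuple


@dataclass
class Annotation:
    """A node in an annotation thread (hypothetical structure)."""
    ident: str
    text: str
    relation: str  # typed link (DSR) to the annotated object
    replies: List["Annotation"] = field(default_factory=list)

    def annotate(self, ident: str, text: str, relation: str) -> "Annotation":
        """Attach an annotation of this annotation and return it."""
        child = Annotation(ident, text, relation)
        self.replies.append(child)
        return child


@dataclass
class Document:
    """A digitized document that is the root of an annotation thread."""
    ident: str
    annotations: List[Annotation] = field(default_factory=list)

    def annotate(self, ident: str, text: str, relation: str) -> Annotation:
        """Attach an annotation to the document and return it."""
        child = Annotation(ident, text, relation)
        self.annotations.append(child)
        return child


def walk(doc: Document) -> Iterator[Tuple[int, str, str]]:
    """Yield (depth, relation, annotation id) in depth-first order."""
    stack = [(1, a) for a in reversed(doc.annotations)]
    while stack:
        depth, node = stack.pop()
        yield depth, node.relation, node.ident
        stack.extend((depth + 1, r) for r in reversed(node.replies))


# Invented example thread; the real figure's link structure is not known.
doc = Document("censorship-card")
a1 = doc.annotate("Annotation1", "A political background is the main reason.", "interpretation")
a1.annotate("Annotation2", "Similar decisions used the same argumentation.", "counterargument")
doc.annotate("Annotation3", "More detail on the assessors.", "elaboration")

for depth, relation, ident in walk(doc):
    print("  " * depth + f"{ident} [{relation}]")
```

Storing the relation type on the annotation itself keeps every link typed, whether its target is the document or another annotation.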
Communication Acts: Discourse Structure Relations
(diagram axis label: interpersonal)
elaboration
background information
argumentation
comparison
cause
interpretation
analogy
difference
support argument
counterargument
Semantic Web Integration – COLLATE RDF(S)
Document Retrieval in COLLATE
For a query q, a ranking of documents is returned. To this end, a retrieval weight r is calculated for each document.
Documents are ranked by descending retrieval weight.
The retrieval is based on the document’s metadata (given by film scientists or extracted from the digitized documents) and on the annotation thread.
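The ranking step can be sketched as follows. The slides do not say how the retrieval weight r is computed, so the term-overlap function below is only a placeholder; the function names, document identifiers and sample texts are likewise assumptions for illustration.

```python
from typing import Dict, List, Tuple


def retrieval_weight(query: str, texts: List[str]) -> float:
    """Placeholder weight: fraction of query terms found in any of the texts."""
    terms = set(query.lower().split())
    if not terms:
        return 0.0
    words = set(" ".join(texts).lower().split())
    return len(terms & words) / len(terms)


def rank(query: str, collection: Dict[str, Dict[str, List[str]]]) -> List[Tuple[str, float]]:
    """Return (document id, weight) pairs sorted by descending weight."""
    scored = [
        # Evidence comes from both the metadata and the annotation thread.
        (doc_id, retrieval_weight(query, parts["metadata"] + parts["annotations"]))
        for doc_id, parts in collection.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Invented sample collection.
collection = {
    "doc-a": {
        "metadata": ["censorship decision obscene actions"],
        "annotations": ["political background"],
    },
    "doc-b": {"metadata": ["film poster"], "annotations": []},
}
ranking = rank("political censorship", collection)
for doc_id, weight in ranking:
    print(doc_id, weight)
```

In this toy collection, doc-a ranks first only because of its annotation text, which is the effect of using the annotation thread as an additional source of evidence.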
Context-based Retrieval in COLLATE
In COLLATE, we deal with the discourse context.
A document is seen in the light of its interpretations.
We also consider at which point of the discourse a statement is made and what relation exists between the statement and the entity it refers to.
Example: Consider the query for “all censorship decisions made for political reasons”.
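One way to picture the role of the discourse context is a toy scoring function with three modes: metadata only, metadata plus annotations, and annotations scaled by the relation type of their replies. Everything in this sketch is an assumption made for illustration: the scaling factors, the equal weighting of metadata and annotations, and the term-matching function are not COLLATE's retrieval model, and the values it produces are not results from the project.

```python
from dataclasses import dataclass, field
from typing import List

# Assumed factors, not taken from COLLATE: how a reply of a given relation
# type scales the evidence of the annotation it refers to.
RELATION_FACTOR = {"support argument": 1.2, "counterargument": 0.5}


@dataclass
class Annotation:
    text: str
    relation: str  # relation to the annotated object
    replies: List["Annotation"] = field(default_factory=list)


def match(query: str, text: str) -> float:
    """Toy matching score: fraction of query terms occurring in the text."""
    terms = set(query.lower().split())
    words = set(text.lower().replace(".", " ").replace(",", " ").split())
    return len(terms & words) / len(terms) if terms else 0.0


def annotation_evidence(query: str, note: Annotation, use_relations: bool) -> float:
    """Evidence from one annotation, optionally scaled by its direct replies."""
    evidence = match(query, note.text)
    if use_relations:
        for reply in note.replies:
            evidence *= RELATION_FACTOR.get(reply.relation, 1.0)
    return min(evidence, 1.0)


def retrieval_weight(query: str, metadata: str, thread: List[Annotation],
                     use_annotations: bool, use_relations: bool) -> float:
    """Combine metadata and annotation evidence with assumed equal weights."""
    meta = match(query, metadata)
    best = 0.0
    if use_annotations and thread:
        best = max(annotation_evidence(query, a, use_relations) for a in thread)
    return 0.5 * meta + 0.5 * best


query = "censorship decisions for political reasons"
metadata = "Kuhle Wampe obscene actions"
interpretation = Annotation(
    "I think the reasons mentioned here are not the real reasons. "
    "I see a political background as the main reason.",
    "interpretation",
    [Annotation("I disagree.", "counterargument")],
)
thread = [interpretation]

w_meta = retrieval_weight(query, metadata, thread, False, False)
w_interp = retrieval_weight(query, metadata, thread, True, False)
w_dsr = retrieval_weight(query, metadata, thread, True, True)
print(w_meta, w_interp, w_dsr)
```

With these assumed factors, the interpretation raises the weight above the metadata-only value, and the counterargument attached to it lowers the weight again without removing the evidence entirely.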
Query: “censorship decisions for political reasons”: Metadata Only
Document (cataloguing metadata):
<filmtitle>Kuhle Wampe</filmtitle>
...
<assessors_chairman>Oberregierungsrat Dr Becker Beisitzer: Justizrat Dr. Rosenthal...</assessors_chairman>
...
<controlled_keyword>obscene actions</controlled_keyword>
...
Interpretation: “I think the reasons mentioned here are not the real reasons. I see a political background as the main reason.”
Counterargument: “I disagree. There were a lot of similar decisions with the same argumentation. Of course, there might be a political background, but I think this is not the main reason in this case. ...”
Retrieval weight: 0.01
Query: “censorship decisions for political reasons”: Metadata + Interpretation
Document (cataloguing metadata):
<filmtitle>Kuhle Wampe</filmtitle>
...
<assessors_chairman>Oberregierungsrat Dr Becker Beisitzer: Justizrat Dr. Rosenthal...</assessors_chairman>
...
<controlled_keyword>obscene actions</controlled_keyword>
...
Interpretation: “I think the reasons mentioned here are not the real reasons. I see a political background as the main reason.”
Counterargument: “I disagree. There were a lot of similar decisions with the same argumentation. Of course, there might be a political background, but I think this is not the main reason in this case. ...”
Retrieval weight: 0.32
Query: “censorship decisions for political reasons”: Analysis of Discourse Structure Relations
Document (cataloguing metadata):
<filmtitle>Kuhle Wampe</filmtitle>
...
<assessors_chairman>Oberregierungsrat Dr Becker Beisitzer: Justizrat Dr. Rosenthal...</assessors_chairman>
...
<controlled_keyword>obscene actions</controlled_keyword>
...
Interpretation: “I think the reasons mentioned here are not the real reasons. I see a political background as the main reason.”
Counterargument: “I disagree. There were a lot of similar decisions with the same argumentation. Of course, there might be a political background, but I think this is not the main reason in this case. ...”
Retrieval weight: 0.19
COLLATE – User Interface
Current State
- A first prototype has been delivered to the archives and is in use there.
- A second prototype will be delivered soon, introducing discourse structure relations and advanced collaboration features to the users.
- A third prototype will contain context-based retrieval.
Outlook
Evaluate collaborative approach and context-based retrieval
Apply COLLATE technology in other domains?