Linking Folksonomies to Knowledge Organization Systems
Jakob Voß (VZG)
Metadata and Semantic Research (MTSR), November 30th, 2012
Social Tagging and Folksonomies
Wikipedia and Stack Exchange
Linking Folksonomies to KOS
Results
Section 1
Social Tagging and Folksonomies
Social Tagging
- Keywords/tags manually assigned to documents by members of a distributed community of volunteers
- Tags are freely chosen (or new ones are easy to create)
- Outcome of tagging activity in a tagging system: folksonomy
Properties of Social Tagging
- Tagging systems are very dynamic, especially compared to conventional KOS
- Different types of tagging systems exist, so don’t compare apples and oranges! They differ in:
  - source of resources
  - tagging rights, tagging support, tag management
  - tag aggregation
  - tag connectivity
Tag aggregation: Bag-Model
Tag aggregation: Set-Model
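The difference between the two aggregation models can be sketched in a few lines of Python. This is only an illustration: the tagging events, user names, and function names below are hypothetical and not taken from any real tagging system. In the bag-model every user's assignment is counted, so a tag can occur several times per resource; in the set-model a resource has one shared set of tags.

```python
from collections import Counter

# Hypothetical tagging events: (user, resource, tag)
events = [
    ("alice", "doc1", "skos"),
    ("bob", "doc1", "skos"),
    ("bob", "doc1", "rdf"),
    ("carol", "doc1", "skos"),
]


def aggregate_bag(events, resource):
    """Bag-model: each assignment counts, so tags carry frequencies."""
    return Counter(tag for _, res, tag in events if res == resource)


def aggregate_set(events, resource):
    """Set-model: one shared tag set per resource, repeats collapse."""
    return {tag for _, res, tag in events if res == resource}


bag = aggregate_bag(events, "doc1")
tags = aggregate_set(events, "doc1")
```

The set-model result is the kind of folksonomy discussed in the rest of the talk: a single, directly visible set of tags per resource.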
Folksonomies in set-model tagging systems
Dynamic knowledge organization systems created by communities of distributed volunteers
- Directly given as snapshot of community consensus
- Limited awareness among community members
- Two popular examples:
  - Categories in Wikipedia
  - Tags in Stack Exchange
Section 2
Wikipedia and Stack Exchange
Categories in Wikipedia
- The folksonomy is a thesaurus
Tags in Stack Exchange
- Tags in a question
- Anyone can edit tags (after a while)
- The folksonomy is a flat file of keywords (with some synonyms)
Section 3
Linking Folksonomies to KOS
Knowledge Organization Systems (KOS)
- Classifications, thesauri, taxonomies, authority files...
- Common model of KOS for exchange and interlinking: Simple Knowledge Organization System (SKOS):
@prefix skos: <http://www.w3.org/2004/02/skos/core#> .

<http://example.org/a> a skos:Concept ;
    skos:broader <http://example.org/b> ;
    skos:narrower <http://example.org/c> ;
    skos:related <http://example.org/d> ;
    skos:exactMatch <http://example.com/A> ;
    skos:closeMatch <http://example.com/aa> .
Harvest and SKOSify folksonomy
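A minimal sketch of the SKOSify step, assuming the harvested tags are already available as simple records with a name and an optional list of synonyms. The record shape and the function names are assumptions made for illustration; they are not the format used by the actual harvesting scripts.

```python
from urllib.parse import quote

SKOS_PREFIX = "@prefix skos: <http://www.w3.org/2004/02/skos/core#> ."


def turtle_literal(text, lang="en"):
    """Quote a string as a language-tagged Turtle literal."""
    escaped = text.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"@{lang}'


def skosify(base, tags):
    """Serialize hypothetical tag records as SKOS concepts in Turtle."""
    lines = [SKOS_PREFIX, ""]
    for tag in tags:
        statements = [
            "a skos:Concept",
            f"skos:prefLabel {turtle_literal(tag['name'])}",
        ]
        # Tag synonyms become alternative labels of the same concept
        for synonym in tag.get("synonyms", []):
            statements.append(f"skos:altLabel {turtle_literal(synonym)}")
        uri = base + quote(tag["name"], safe="")
        lines.append(f"<{uri}> " + " ;\n    ".join(statements) + " .")
    return "\n".join(lines)


# Illustrative input, not harvested data
turtle = skosify(
    "http://libraries.stackexchange.com/tags/",
    [{"name": "ils", "synonyms": ["integrated-library-system"]}],
)
```

Modelling synonyms as skos:altLabel is one possible choice; hierarchical links between tags would be added as skos:broader and skos:narrower statements in the same way.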
Additional links from tag names
- Reuse of Computing Research Repository (CoRR) notations
Mapping to CoRR classification
@prefix skos: <http://www.w3.org/2004/02/skos/core#> .
@prefix cst: <http://cstheory.stackexchange.com/tags/> .

[ skos:notation "LO" ;
  skos:prefLabel "Logic in Computer Science"@en ]
    skos:closeMatch cst:lo.logic .

[ skos:notation "DS" ;
  skos:prefLabel "Data Structures and Algorithms"@en ]
    skos:narrowMatch
        cst:ds.algorithms ,
        cst:ds.data-structures .
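How links can be derived from tag names may be sketched as follows. The sketch assumes that relevant tag names start with a two-letter lowercase CoRR notation followed by a dot, as in the tags shown above. The function names are hypothetical, and the rule for choosing between skos:closeMatch and skos:narrowMatch is only a heuristic that mirrors the example; suggested mappings would still need review.

```python
import re

# Assumed pattern: "<two-letter CoRR notation>.<topic>", e.g. "lo.logic"
CORR_TAG = re.compile(r"^([a-z]{2})\.(.+)$")


def corr_notation(tag_name):
    """Return the upper-cased CoRR notation of a tag name, or None."""
    match = CORR_TAG.match(tag_name)
    return match.group(1).upper() if match else None


def group_by_notation(tag_names):
    """Collect tag names under the CoRR notation they start with."""
    groups = {}
    for name in tag_names:
        notation = corr_notation(name)
        if notation is not None:
            groups.setdefault(notation, []).append(name)
    return groups


def suggest_relation(tags):
    """Heuristic: one tag suggests closeMatch, several suggest narrowMatch."""
    return "skos:closeMatch" if len(tags) == 1 else "skos:narrowMatch"


groups = group_by_notation(
    ["lo.logic", "ds.algorithms", "ds.data-structures", "big-list"]
)
relations = {notation: suggest_relation(tags) for notation, tags in groups.items()}
```

Tags without a notation prefix, such as the last one in the sample input, are simply left unmapped.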
Feedback of enrichment and links into the community
Enrichment of tag description in Wikipedia
@prefix skos: <http://www.w3.org/2004/02/skos/core#> .

<http://de.dbpedia.org/resource/Kategorie:H%C3%B6rspiel>
    a skos:Concept ; skos:prefLabel "Hörspiel"@de ;
    skos:narrowMatch
        <http://data.bib.uni-mannheim.de/data/rvk/AP_36320> ,
        <http://data.bib.uni-mannheim.de/data/rvk/EC_7980> ;
    skos:closeMatch
        <http://dewey.info/class/791.447/> ,  # DDC
        <http://d-nb.info/gnd/4025435-5> .    # GND
Enrichment of tag description in Stack Exchange
@prefix skos: <http://www.w3.org/2004/02/skos/core#> .
@prefix libse: <http://libraries.stackexchange.com/tags/> .

libse:ils a skos:Concept ;
    skos:broader
        libse:software ;
    skos:narrower
        libse:circulation ,
        libse:collection-management ,
        libse:cataloging ,
        libse:opac ;
    skos:closeMatch
        <http://dbpedia.org/resource/Integrated_library_system> ,
        <http://id.loc.gov/authorities/subjects/sh95003216> ,
        <http://eprints.rclis.org/handle/10760/3775> ,
        <http://d-nb.info/gnd/4583297-3> .
Feedback of enrichment and links into the community
- Feedback with additional benefit: find related documents
- One must be able to link directly to a rich collection of documents indexed with the concept notation/identifier of each KOS
Section 4
Results
hierarchical links
Mappings to other KOS
- Published in SKOS/RDF as shown
- Simple mappings in the form of BEACON files
#PREFIX: http://libraries.stackexchange.com/tags/
#RELATION: http://www.w3.org/2004/02/skos/core#closeMatch
ils|http://dbpedia.org/resource/Integrated_library_system
ils|http://id.loc.gov/authorities/subjects/sh95003216
ils|http://eprints.rclis.org/handle/10760/3775
ils|http://d-nb.info/gnd/4583297-3
archives|http://eprints.rclis.org/handle/10760/3660
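A BEACON file of this shape can be expanded back into full links with a few lines of Python. The parser below is a minimal sketch that only handles what is shown above: the PREFIX and RELATION header fields and two-field link lines. Other BEACON features, such as annotations and target prefixes, are ignored, and the function name is hypothetical.

```python
def parse_beacon(text):
    """Expand two-field BEACON link lines into (source, relation, target)."""
    meta = {}
    links = []
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        if line.startswith("#"):
            # Meta line such as "#PREFIX: http://..."; split at first colon
            key, _, value = line[1:].partition(":")
            meta[key.strip().upper()] = value.strip()
            continue
        source, _, target = line.partition("|")
        links.append((source.strip(), target.strip()))
    prefix = meta.get("PREFIX", "")
    relation = meta.get("RELATION", "")
    return [(prefix + source, relation, target) for source, target in links]


sample = """#PREFIX: http://libraries.stackexchange.com/tags/
#RELATION: http://www.w3.org/2004/02/skos/core#closeMatch
ils|http://dbpedia.org/resource/Integrated_library_system
archives|http://eprints.rclis.org/handle/10760/3660
"""

triples = parse_beacon(sample)
```

Each resulting tuple corresponds to one skos:closeMatch statement of the SKOS/RDF publication.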
Outcome and Benefits
- Show related documents: guide people from communities to collections
- Catalog enrichment: guide people from collections to communities
- Collection analysis
Compare collections by measuring them against one KOS
Collection of papers at arXiv.org compared to questions at Theoretical Computer Science Stack Exchange:
- complexity theory is asked about much more than it is published on
- information theory is published on much more than it is asked about
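One simple way to make such a comparison is to map both collections to the same KOS and compare the share of each concept. The sketch below assumes that each collection has already been reduced to a count of documents per CoRR notation. The counts are made up for illustration only and are not the figures behind the comparison above; the function names are hypothetical.

```python
def shares(counts):
    """Turn absolute counts per concept into shares of the collection."""
    total = sum(counts.values())
    if total == 0:
        return {}
    return {concept: n / total for concept, n in counts.items()}


def compare(collection_a, collection_b):
    """Difference in share per concept: positive means stronger in A."""
    a, b = shares(collection_a), shares(collection_b)
    return {c: a.get(c, 0.0) - b.get(c, 0.0) for c in sorted(set(a) | set(b))}


# Made-up counts per CoRR notation, for illustration only
questions = {"CC": 60, "IT": 5, "DS": 35}
papers = {"CC": 20, "IT": 50, "DS": 30}

diff = compare(questions, papers)
```

With these invented numbers, CC (complexity) has a positive difference and IT (information theory) a negative one, which is the pattern described in the bullets above.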
References
- Scripts and harvested data available at http://github.com/nichtich/se2skos
- Source of paper and slides available at http://github.com/jakobib/mtsr2012
- Both Wikipedia and Stack Exchange data are available under CC-BY-SA and accessible via open APIs:
  - http://stackapps.com/
  - http://en.wikipedia.org/w/api.php
- Icons CC-BY, based on picol.org