+ All Categories
Home > Environment > Environmental Thesauri Under the Lens of Reusability (EGOVIS 2014)

Environmental Thesauri Under the Lens of Reusability (EGOVIS 2014)

Date post: 27-Nov-2014
Category:
Upload: riccardo-albertoni
View: 156 times
Download: 0 times
Share this document with a friend
Description:
The development of a Spatial Data Infrastructure (SDI) at European level is strategic to answer the needs of environmental management requested by the European, national and local policies. Several European projects and initiatives aim to share, integrate and make accessible large amount of environmental data in order to overcome cross- border/language/cultural barriers. To this purpose, environmental thesauri are used as shared nomenclatures in metadata compilation and information discovery, and they are increasingly made available on the web. This paper provides a methodological approach for creating a catalogue of the environmental thesauri available on the web and assessing their reusability with respect to domain independent criteria. It highlights critical issues providing some recommendations for improving thesauri reusability.
20
Environmental Thesauri Under the Lens of Reusability R. Albertoni, M. De Martino, P. Podestà Istituto di Matematica Applicata e Tecnologie Informatiche "Enrico Magenes”, CONSIGLIO NAZIONALE DELLE RICERCHE (CNR-IMATI) EGOVIS 2014 Munich, Germany, September 1-5 2014
Transcript
Page 1: Environmental Thesauri Under the Lens of Reusability (EGOVIS 2014)

Environmental Thesauri Under the Lens of

ReusabilityR. Albertoni, M. De Martino, P. Podestà

Istituto di Matematica Applicata e Tecnologie Informatiche "Enrico Magenes”,

CONSIGLIO NAZIONALE DELLE RICERCHE (CNR-IMATI)

EGOVIS 2014Munich, Germany, September 1-5 2014

Page 2: Environmental Thesauri Under the Lens of Reusability (EGOVIS 2014)

Summary

OverviewObjectivesMotivation

Methodological ApproachTerminological Resources Cataloguing Reusability Criteria Identification Evaluation of the catalogue

Conclusions Consideration and RecommendationForeseen Future Activity

EGOVIS 2014Munich, Germany, September 1-5

20142

Page 3: Environmental Thesauri Under the Lens of Reusability (EGOVIS 2014)

3

Overview

EGOVIS 2014Munich, Germany, September 1-5

2014

Page 4: Environmental Thesauri Under the Lens of Reusability (EGOVIS 2014)

General ObjectiveTo provide a state of play of the environment

thesauri available on the Web and to assess their reusability.

4

OverviewObjective

Reusability «Easiness to access and to exploit Thesaurus content”

Licence Type

• Openness of licence

LD Complianc

e

• 5 star LD• Stressing dereferenceable HTTP

URIs as identifiers for resourcesEGOVIS 2014

Munich, Germany, September 1-5 2014

Page 5: Environmental Thesauri Under the Lens of Reusability (EGOVIS 2014)

Why Thesauri ?Thesauri are employed as solution to the multilingual and multicultural issues in the environmental data sharing

5

OverviewINSPIRE SDI vs thesauri

Information discovery across applications and platforms

Uniformity in Data description

MetadataMetadata

Metadata

INSPIRE Implementation rulesrecommend the adoption of (multilingual) thesauri when compiling metadata for data/services

EGOVIS 2014Munich, Germany, September 1-5

2014

Page 6: Environmental Thesauri Under the Lens of Reusability (EGOVIS 2014)

Different thesauri have been developed, and may be deployed for cataloguing the geographical, e.g.,

Id1

Id2

Id3

Id6

skos:broder

skos:broder

skos:broder

DMEER/Treats Biodiversity By

Biogeographical Regions

Id1

Id2

Id3

Id6

skos:broderskos:broder

skos:related

IUCN Classification

Id3

Id6

Skos:RelatedMatch

Id2

????

Id1???

Id1

????

Id4

Id5Id6

EARTh GEMET

GEMET Published by EEA

According to Linked Data Best Practice

Id1

Id4

Id3

skos:broder

skos:broder

Skos:ExactMatch

Skos:ExactMatch

Skos:ExactMatch

…THiST

Thesauri heterogeneity wrt thematic coverage, multilingualism, granularities, popularity in certain communities

Heterogeneity is precious!!!

OverviewINSPIRE SDI vs thesauri

Need of common thesaurus framework to exploit thesauri heterogeneity

INSPIRE2014Aalborg, June 16-20 2014

Page 7: Environmental Thesauri Under the Lens of Reusability (EGOVIS 2014)

INSPIRE2014Aalborg, June 16-20 2014 7

Not only one thesaurus … But

OverviewMotivation: EU projects NatureSDI and

eENVplus integration of different available thesauri

cross walking from a thesaurus to another Thesaurus Framework(TF)

Design Principle

Simple Knowledge Organization System(SKOS) to encode the thesaurus content

Linked Data best practices to publish the thesaurus in machine understandable

format

ModularityTo add new KOS as a new module plugged in the set of thesauri in the TF

OpennessTo easily extendable each KOS keeping separated the original oneInterlinking

Linking among the terms referringto the same concepts in more thenone thesaurus in order to harmonize their usage.

ExploitabilityTo encode in a standard and flexible formatin order to encourage the adoption and its enrichment from third party system

LusTRE: Linked Thesaurus fRamework for Environment http://linkeddata.ge.imati.cnr.it:2020/

Page 8: Environmental Thesauri Under the Lens of Reusability (EGOVIS 2014)

Methodological

Approach

8EGOVIS 2014

Munich, Germany, September 1-5 2014

Page 9: Environmental Thesauri Under the Lens of Reusability (EGOVIS 2014)

Methodological Approach

EGOVIS 2014Munich, Germany, September 1-5

2014

Page 10: Environmental Thesauri Under the Lens of Reusability (EGOVIS 2014)

ApproachTerminological Resources Cataloguing

Literature review Scientific international journals (i.e. SWJ)Data hub (http://datahub.io/) Resource associated to the keywords "thesaurus

skos". Thesauri for Environment, Geology, GI LOD Cloud resources in the data hub and included in the

LOD Cloud datasets (2007-2011)

State of Play Analysis

Thesaurus Expert Users

Others

Questionnaire# Answers (54-100%)

EGOVIS 2014Munich, Germany, September 1-5

2014

Page 11: Environmental Thesauri Under the Lens of Reusability (EGOVIS 2014)

ApproachSynthesis of Resources Catalogue

# of Total Resources: 62 Not only thesauri, but

different kinds of artefact The presence of the same

terminological resources in LOD Cloud, SWJ dataset section, or data hub provides a thumb rule for reusability and for dataset popularity in the Linked Data community

# of Thesauri: 24

Other KOS, dataset, ontologies

EGOVIS 2014Munich, Germany, September 1-5

2014

Considered in our

analysis

Page 12: Environmental Thesauri Under the Lens of Reusability (EGOVIS 2014)

LD Compliance

5 stars classification

Tim Berners-Lee

• Basic criteria for LD compliance: “Dereferenceable URI “

Licence

• Basic criteria “Openness of licence”

ApproachPhase II: reusability criteria

EGOVIS 2014Munich, Germany, September 1-5

2014

Page 13: Environmental Thesauri Under the Lens of Reusability (EGOVIS 2014)

5 Stars classification of LD by Tim Berners-Lee HTTP dereferenceability of the URI mandatory LD

prerequisite to check authoritativeness of information associated to thesaurus concepts to exploit mappings among thesauri concepts in order to discover further

information in a follow-your-nose fashion

13

ApproachReusability: LD Criteria definition

1 star resources available on the web (whatever format)

2 stars resources available as machine-readable structured data (e.g., Excel)

3 stars as 2 stars plus non-proprietary format (e.g., CSV instead of Excel)

3,5 stars

resources available as RDF dump without dereferenceable HTTP URI

3,9 stars

resources provided as RDFa (RDF embedded in XHTML) or SPARQL end point which are very close to be LD ready but without dereferenceable HTTP URI

4 stars all the above plus, use open standards from W3C (RDF and SPARQL)and HTTP dereferenceable URI to identify things, so that people can pointat published resources

5 stars all the above, plus interlinks to other data to provide context

EGOVIS 2014Munich, Germany, September 1-5

2014

Page 14: Environmental Thesauri Under the Lens of Reusability (EGOVIS 2014)

Categories based on some existing and well-known type of licences (i.e., Creative Commons framework) Inspired by “Rodriguez-Doncel, V., Gomez-Perez, A., Mihindukulasooriya, N.:

Rights declaration in linked data. In: 4th Int. Work. on Consuming Linked Data (2013)”

Level of reusability: 1=low reusability … 5= high reusability

14

ApproachReusability: Licence definition

Open licences, without severe restrictions: complete reuse, including commercial transformation and publication of a resource

EGOVIS 2014, Munich, Germany, September 1-5 2014

Page 15: Environmental Thesauri Under the Lens of Reusability (EGOVIS 2014)

15

ApproachPhase III: LD Thesauri Evaluation

LD analysis of thesauri in the reference catalogue Identification of three Macro Categories of LD Thesauri

LD ready

• LD stars>=4• thesauri published according to the LD best

practices and exposing dereferenceable concept URIs returning the proper RDF/XML fragments.

RDF ready

• 3< LD stars <4 • thesauri provided in RDF document but without

exposing HTTP dereferenceable URI for their concepts

Other

• LD stars<=3 • thesauri available in other format than RDF

EGOVIS 2014Munich, Germany, September 1-5

2014

Page 16: Environmental Thesauri Under the Lens of Reusability (EGOVIS 2014)

16

ApproachPhase III: Licence Thesauri Evaluation

Licence analysis of thesauri in the reference catalogue Identification of three Licence Macro Categories

Open

Licenced

Thesauri

• Licence evaluation>=4• highly reusable thesauri released under public

domain, attribution or share-alike licences. They can be modified and extended and deployed in commercial and non-commercial context

Partially Ope

n Licenced

• Licence evaluation =3.5• thesauri licenced with some further restrictions in

reusability.

Closed

Licenced

Thesauri

• Licence evaluation<3.5• It considers thesauri in which licence forbids the free

reuse or for which a licence is not provided yet

EGOVIS 2014Munich, Germany, September 1-5

2014

Page 17: Environmental Thesauri Under the Lens of Reusability (EGOVIS 2014)

17

ApproachPhase III: Overall Thesauri Evaluation

Analysis of the thesauri respect to the macro-categories identified for LD stars and licence

Results 11 (c.a. 46%) Thesauri are LD ready (6 are interlinked with third party

thesauri) 8 (33%) have the SKOS deployed and are RDF ready Thesauri are equally distributed among Licence categories,

=> only the 33% of thesauri are truly open Licenced

EGOVIS 2014Munich, Germany, September 1-5

2014

Page 18: Environmental Thesauri Under the Lens of Reusability (EGOVIS 2014)

18

Considerations The Thesaurus Catalogue provides good level of reusability

58% of Thesauri are LD/RDF ready and Open/Partial Open Licence LD seems quite popular in the community of Environmental

Thesaurus providers c.a. 46% already exposed as linked data

Conclusions Consideration and recommendation

Recommendations to improve reusability More attention to HTTP dereferenceability of Concept URIs

54% of thesauri fail providing HTTP dereferenceable URIs!!! Licence should be more carefully stated

Thesauri are available in more then one sources but rarely licence is stated in all the sources ( e.g. thesaurus’s portal, datahub)

Sometimes it is missing an explicit web link to the licenceEGOVIS 2014

Munich, Germany, September 1-5 2014

Page 19: Environmental Thesauri Under the Lens of Reusability (EGOVIS 2014)

Outcomes Reference catalogue of thesauri on the web and their evaluation

with respect to licence and LD compliance. Investigation approach and stress of reusability criteria domain

independent Recommendations to improve reusability

Future work Analysis refinement

Evaluation of multilingualism SKOS quality (e.g. QSKOS) Quality of interlinking:

How enabling are interlinkings in a joint exploitation of the thesauri? A web portal to expose the whole catalogue / the reusability

evaluation. LusTRE … A new release end of year

19

Conclusions & Future Work

EGOVIS 2014Munich, Germany, September 1-5

2014

Page 20: Environmental Thesauri Under the Lens of Reusability (EGOVIS 2014)

EGOVIS 2014Munich, Germany,

September 1-5 2014 20

Thanks for your attention!

Contacts:[email protected]@[email protected]


Recommended