Introducing a content integration process for a federation of agricultural institutional...

Post on 30-Nov-2014

description

Presentation titled "Introducing a content integration process for a federation of agricultural institutional repositories". MTSR 2011, Izmir, Turkey, 12/10/2011

transcript

Introducing a content integration process for a federation of agricultural institutional repositories

V. Protonotarios1, L. Gavrilut1, I. Athanasiadis1, Hatzakis1, M-A. Sicilia2

1Greek Research & Technology Network (GRNET)

2University of Alcala, Computer Science Department

4th International Workshop on Metadata and Semantics for Agriculture, Food and Environment

MTSR 2011, October 12th, 2011, Izmir

Part I: About VOA3R

About VOA3R Project

What is VOA3R?
◦ Virtual Open Access Agriculture & Aquaculture Repository
◦ 36-month CIP-ICT-PSP EU project

VOA3R aims to:
◦ Improve access to EU agriculture & aquaculture open access research results

About VOA3R Project

What is VOA3R about?

Sharing scientific and scholarly research related to agriculture, food & environment, using (among others):
◦ A federated repository fed with scholarly content
◦ A social platform which makes use of ….
◦ A set of domain ontologies
◦ and other integrated components…

About VOA3R Project

What is VOA3R going to develop?
Among others, the VOA3R federated repository, which will harvest scholarly content from institutional repositories.

How is this going to happen?
VOA3R will develop an application profile (AP) based on the requirements of the project's content providers

Where to find VOA3R?

1. Website: http://www.voa3r.eu

2. Social Platform: currently in beta

3. VOA3R Repository Tool (Confolio)

VOA3R Web 2.0 tools

1. Facebook group: www.facebook.com/groups/voa3r.project/

2. Twitter account: @VOA3R

3. Flickr: http://www.flickr.com/photos/voa3r/

4. Blogs:

Part II: Content

What about the content?

Scholarly content from institutional repositories on agriculture and aquaculture will be aggregated into the VOA3R repository

= metadata descriptions

VOA3R Content Providers currently use a wide variety of metadata standards (e.g. AGRIS, Dublin Core)

What about the content?

The issue:
How to align all these different metadata APs

The solution:
To work on a common AP (VOA3R AP), based on the requirements of the VOA3R content providers
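Aligning records from different metadata standards onto one common AP can be pictured as a crosswalk: a per-schema table that renames source fields to the shared ones. The sketch below is illustrative only. The target field names and the `provider_b` schema are hypothetical and are not taken from the actual VOA3R AP; only the `dc:` element names come from Dublin Core.

```python
# Hypothetical crosswalks: source field name -> common-profile field name.
# The target names ("title", "author", "subject") and the "provider_b"
# schema are assumptions for illustration, not the real VOA3R AP.
CROSSWALKS = {
    "dublin_core": {
        "dc:title": "title",
        "dc:creator": "author",
        "dc:subject": "subject",
    },
    "provider_b": {
        "Title": "title",
        "PersonalAuthor": "author",
        "Keyword": "subject",
    },
}


def to_common_profile(record: dict, schema: str) -> dict:
    """Rename the fields of `record` using the crosswalk for `schema`.

    Fields without a mapping are kept under "unmapped" so that nothing
    is silently dropped during alignment.
    """
    crosswalk = CROSSWALKS[schema]
    common = {}
    unmapped = {}
    for field, value in record.items():
        target = crosswalk.get(field)
        if target is None:
            unmapped[field] = value
        else:
            common[target] = value
    if unmapped:
        common["unmapped"] = unmapped
    return common


dc_record = {"dc:title": "Soil health", "dc:creator": "A. Author"}
other_record = {"Title": "Soil health", "PersonalAuthor": "A. Author", "Local": "x"}
aligned_dc = to_common_profile(dc_record, "dublin_core")
aligned_other = to_common_profile(other_record, "provider_b")
```

Keeping unmapped fields visible is a design choice: it shows each content provider which of its fields the common profile does not yet cover.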

VOA3R Content providers (1/3)

Epsilon repository

OceanDocs

Organic Eprints

VOA3R Content providers (2/3)

ProdINRA

U-GOV

ARI Repository

VOA3R Content providers (3/3)

PLUS:

An additional number of content providers, not using a digital repository at this time

Part III: Content population process

Content Population Methodology

Controlled Testing phase (7-9/2011)
◦ Enrichment of test metadata records using Confolio

Phase 1 (10-12/2011)
◦ Integration of repositories using OAI-PMH

Phase 2 (1-8/2012)
◦ Integration of repositories with no OAI-PMH support

Phase 3 (9/2012 – 5/2013)
◦ Content population with content from external collaborators
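For the repositories integrated over OAI-PMH, harvesting comes down to issuing `ListRecords` requests and following the `resumptionToken` until the list is exhausted. The sketch below shows the request URL and the parsing of one response page, using only the Python standard library. The base URL and the sample response are made up for illustration, and no network call is made; how the VOA3R harvester is actually implemented is not described in the slides.

```python
from typing import List, Optional, Tuple
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# Namespaces defined by OAI-PMH 2.0 and Dublin Core.
OAI_NS = "{http://www.openarchives.org/OAI/2.0/}"
DC_NS = "{http://purl.org/dc/elements/1.1/}"


def list_records_url(base_url: str, metadata_prefix: str = "oai_dc",
                     resumption_token: Optional[str] = None) -> str:
    """Build an OAI-PMH ListRecords request URL.

    resumptionToken is an exclusive argument in OAI-PMH, so when it is
    present it is sent without metadataPrefix.
    """
    if resumption_token:
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    return f"{base_url}?{urlencode(params)}"


def parse_page(xml_text: str) -> Tuple[List[str], Optional[str]]:
    """Return the dc:title values on one page and the next token, if any."""
    root = ET.fromstring(xml_text)
    titles = [el.text for el in root.iter(f"{DC_NS}title")]
    token_el = root.find(f".//{OAI_NS}resumptionToken")
    token = token_el.text if token_el is not None and token_el.text else None
    return titles, token


# Made-up response page, shaped like an oai_dc ListRecords reply.
SAMPLE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Example record</dc:title>
        </oai_dc:dc>
      </metadata>
    </record>
    <resumptionToken>page-2</resumptionToken>
  </ListRecords>
</OAI-PMH>"""

first_url = list_records_url("http://example.org/oai")  # hypothetical endpoint
titles, next_token = parse_page(SAMPLE)
next_url = list_records_url("http://example.org/oai", resumption_token=next_token)
```

A real harvester would fetch each URL, repeat until no token is returned, and handle OAI-PMH error responses, which this sketch leaves out.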

Overview of the Process

1. Uploading/Integration

Pre-Check against Core Criteria

1. Accessibility under the specified technical criteria.
The provider confirms that the resource can be opened or accessed through the provided URL (link). [yes / no]

2. Appropriateness against violence, pornography, racism, etc.
The provider confirms that the resource does not contain any violent, pornographic or racist content/information. [yes / no]

3. Relation of the metadata/content to Agriculture & Aquaculture.
The provider confirms that the resource is relevant to agriculture or aquaculture. [yes / no]

4. The IPR (intellectual property rights) rules do not prohibit that the resource is promoted through the VOA3R network.
The provider confirms that the resource is free of any IPR restrictions that are against its promotion/description within the VOA3R network. [yes / no]
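The pre-check can be read as a four-item checklist that a provider completes per resource. A minimal sketch follows, assuming that a resource proceeds only when all four criteria are answered "yes"; the slides list the criteria but do not state this rule explicitly, and the class and field names are hypothetical.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class PreCheck:
    """Provider's yes/no answers to the four core criteria (names are illustrative)."""
    accessible: bool    # 1. resource opens through the provided URL
    appropriate: bool   # 2. no violent, pornographic or racist content
    relevant: bool      # 3. relevant to agriculture or aquaculture
    ipr_cleared: bool   # 4. no IPR restriction on promotion in the network

    def failed(self) -> List[str]:
        """Names of the criteria answered 'no', in checklist order."""
        return [name for name, ok in vars(self).items() if not ok]

    def passed(self) -> bool:
        """Assumed rule: every criterion must be answered 'yes'."""
        return not self.failed()


ok_resource = PreCheck(True, True, True, True)
off_topic = PreCheck(True, True, False, True)
```

Reporting which criteria failed, rather than a bare pass/fail, gives the provider something concrete to fix before resubmitting.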

Overview of the Process

2. Enrichment

Overview of the Process

3. Validation

Overview of the Process

4. Quality Review/Assessment

Scenario of Use: Testing Phase

Confolio was used by the VOA3R content providers as a controlled environment for creating the metadata records of their resources:

Scenario of Use: Testing Phase

Uploading/Integration:

Scenario of Use: Testing Phase

Enrichment:

Scenario of Use: Testing Phase

Validation:

Pre-Check against Core Criteria

1. Accessibility under the specified technical criteria.
The provider confirms that the resource can be opened or accessed through the provided URL (link). [yes / no]

2. Appropriateness against violence, pornography, racism, etc.
The provider confirms that the resource does not contain any violent, pornographic or racist content/information. [yes / no]

3. Relation of the metadata/content to Agriculture & Aquaculture.
The provider confirms that the resource is relevant to agriculture or aquaculture. [yes / no]

4. The IPR (intellectual property rights) rules do not prohibit that the resource is promoted through the VOA3R network.
The provider confirms that the resource is free of any IPR restrictions that are against its promotion/description within the VOA3R network. [yes / no]

Scenario of Use: Testing Phase

Quality Review/Assessment:

Grid for VOA3R Subject Experts (each criterion is marked 1 2 3 4 5)

1 Clarity & Relevance: Is the content clear and relevant to the agricultural environment?
1 (not clear & relevant) to 5 (absolutely clear & relevant)

2 Quality: Does the content have a high quality in terms of balanced presentation of ideas and appropriate level of detail?
1 (no) to 5 (yes)

3 Appropriateness: Does the resource use appropriate vocabulary, language and concepts for the target age of the people it is addressing?
1 (no, the resource uses inappropriate vocabulary) to 5 (yes, the resource uses appropriate vocabulary)

4 Motivation: Does the content motivate a target group of people to start reading more about the subject it presents?
1 (the content is not motivating) to 5 (the content is motivating)

5 Veracity & accuracy: Is the content true and accurate regarding the agricultural environment?
1 (no) to 5 (yes)

6 Updated: Is the content up to date, or are the data and information presented outdated?
1 (information is outdated) to 5 (information is up to date)

7 Accessibility: How accessible is the content to the target group of people?
1 (poorly accessible) to 5 (fully accessible)

8 Reusability: Can the content be used again in another environment and be understood by people with different backgrounds?
1 (the content cannot be reused) to 5 (the content can be reused)

Final recommendation: Please give your final mark and a short comment to justify it.
◦ accept without modification
◦ accept with modification
◦ reject

Comment to the submitter:

Comment to the VOA3R federation:
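A completed grid can be checked and summarised mechanically before the expert writes the final recommendation. The sketch below validates that all eight criteria carry a mark from 1 to 5 and reports the mean and the weakest criteria. Averaging is an assumption made here for illustration; the slides do not say how the marks are combined, and the recommendation itself stays with the subject expert. The criterion keys are hypothetical identifiers.

```python
from statistics import mean

# Hypothetical keys for the eight criteria of the grid, in grid order.
CRITERIA = (
    "clarity_relevance",
    "quality",
    "appropriateness",
    "motivation",
    "veracity_accuracy",
    "updated",
    "accessibility",
    "reusability",
)


def summarize_review(ratings: dict) -> dict:
    """Validate one expert's marks and return the mean and weakest criteria."""
    missing = [c for c in CRITERIA if c not in ratings]
    if missing:
        raise ValueError(f"missing marks for: {missing}")
    for criterion in CRITERIA:
        if ratings[criterion] not in (1, 2, 3, 4, 5):
            raise ValueError(f"{criterion}: mark must be an integer from 1 to 5")
    scores = [ratings[c] for c in CRITERIA]
    lowest = min(scores)
    return {
        "mean": mean(scores),  # assumed aggregation, not from the slides
        "weakest": [c for c in CRITERIA if ratings[c] == lowest],
    }


marks = {c: 4 for c in CRITERIA}
marks["reusability"] = 2
summary = summarize_review(marks)
```

Listing the weakest criteria gives the expert a starting point for the "Comment to the submitter" field.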

Conclusions

Despite the wealth of scholarly content found in institutional repositories, the use of different metadata APs raises an issue

Agreeing on a common metadata format is a challenge but the VOA3R AP aims to achieve this goal

The design and implementation of a well-defined content population/integration process is a crucial component in populating a repository

For more information please visit

www.voa3r.eu

Thank you for your attention!