Building the CIARD Framework for Data and Information Sharing
Praha, July 12 - Johannes Keizer
agriXchange workshop at EFITA 2011, Praha July
Dr. Johannes Keizer
Office of Knowledge Exchange, Research and Extension
Food and Agriculture Organization of the UN
CIARD and agINFRA - creating a global framework for information sharing in agricultural research and innovation
We will promote research for food and agriculture, including research to
adapt to, and mitigate climate change, and access to research results and
technologies at national, regional and international levels.
We will reinvigorate national research systems and will share information
and best practices. We will improve access to knowledge.
World Food Summit 2009
http://aims.fao.org
http://www.ciard.net
2nd IISAST Consultation
CIARD Initiative launched
(15 founding partners)
Regional Consultations
70 countries, 150 information professionals
1st IISAST Consultation
TASK FORCES
CIARD endorsed (GCARD and FARA)
+112 partners and growing…
2005 | 2007 | 2008 | 2009
Coherence in Information for Agricultural Research for Development
A new global movement to provide a platform for coherence between information-related initiatives
to make public domain agricultural research information and knowledge truly accessible to all
e-Consultation & Beijing Consultation
+ Regional Workshops
GCARD 2012
2010 | 2011 | 2012
Information Infrastructure for Agricultural Research and Innovation
Distributed Repositories
• stats
• gene banks
• GIS data
• blogs
• journals
• open archives
• raw data
• technologies
• learning objects
• ………..
The solution: agINFRA
Produce linked open data from all datasets
Use common reference vocabularies to interlink
Don’t wait! Wrap the legacy
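As a minimal sketch of what "wrapping the legacy" could look like, the snippet below turns one flat legacy record into N-Triples lines and links its keywords to a shared vocabulary by URI. The record fields, the example.org URIs and the KEYWORD_TO_CONCEPT lookup are hypothetical placeholders; only the Dublin Core terms namespace is a real one.

```python
# Sketch only: wrap a flat legacy record as linked-data triples (N-Triples).
# All example.org URIs and the keyword lookup are hypothetical placeholders.

DCT = "http://purl.org/dc/terms/"

# Hypothetical mapping from free-text keywords to shared vocabulary URIs.
KEYWORD_TO_CONCEPT = {
    "rubber": "http://example.org/vocab/c_rubber",
    "oil palm": "http://example.org/vocab/c_oil_palm",
}


def escape_literal(text):
    """Escape the characters that N-Triples requires inside a quoted literal."""
    return (text.replace("\\", "\\\\")
                .replace('"', '\\"')
                .replace("\n", "\\n")
                .replace("\r", "\\r"))


def wrap_record(record, base="http://example.org/record/"):
    """Return a list of N-Triples lines describing one legacy record."""
    subject = "<%s%s>" % (base, record["id"])
    triples = ['%s <%stitle> "%s" .' % (subject, DCT, escape_literal(record["title"]))]
    for keyword in record.get("keywords", []):
        concept = KEYWORD_TO_CONCEPT.get(keyword.lower())
        if concept:  # keywords without a known concept are skipped, not guessed
            triples.append("%s <%ssubject> <%s> ." % (subject, DCT, concept))
    return triples


legacy = {"id": "42", "title": "Circular técnica", "keywords": ["Rubber", "soil"]}
for line in wrap_record(legacy):
    print(line)
```

Because the keyword is replaced by a concept URI from a common vocabulary, two datasets that use different free-text spellings end up pointing at the same node, which is what makes interlinking possible.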
The Infrastructure elements
RING: routemap to information nodes and gateways
Tools: LOD-enabled software
VocBench: vocabulary server (concepts and entities triples)
LOD Generator: triplifier, concept and entity identifier
Data Services: Webservices + APIs to triple stores
Cloud: storage for RDF triples
LOD Generator: process
LOD Generator: triplifier, concept and entity identifier
Data Services process
agINFRA – the Project
FAO and the Chinese Academy of Agricultural Sciences are Senior Users in the Project
4 million euros of funding, but shared among 11 partners
Project starts on November 1 and runs for 3 years
CIARD partners can post their requirements
Under Construction
VocBench
AGROVOC Linked Open Data
AgroTagger
Triplifying AGRIS
Serendipity linking
Drupal front ends for triple stores
The CIARD R.I.N.G
The VocBench
VocBench: concepts and entities triples
VocBench Features
Domain independent
Structure independent (e.g. thesauri, glossaries, etc.)
Supports RDF (SKOS, SKOS-XL), OWL
Supports collaborative editing
Supports editorial workflow, with user roles
Simple and advanced search
Supports data export: SKOS, Relational format (MySQL)
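To illustrate the difference between the two RDF label models named in the list above, here is a small sketch that writes the same preferred label once as plain SKOS and once as SKOS-XL, in Turtle. It is not VocBench's export code; the concept and label URIs under example.org are hypothetical, and only the SKOS and SKOS-XL namespaces are real. Labels are assumed to contain no quotes or backslashes.

```python
# Sketch: the same preferred label written as plain SKOS and as SKOS-XL.
# The concept and label URIs under example.org are hypothetical.

PREFIXES = (
    "@prefix skos: <http://www.w3.org/2004/02/skos/core#> .\n"
    "@prefix skosxl: <http://www.w3.org/2008/05/skos-xl#> .\n"
)


def skos_label(concept_uri, label, lang):
    """Plain SKOS: the label is a literal attached directly to the concept."""
    return '<%s> a skos:Concept ;\n    skos:prefLabel "%s"@%s .\n' % (
        concept_uri, label, lang)


def skosxl_label(concept_uri, label_uri, label, lang):
    """SKOS-XL: the label is a resource of its own, so it can carry metadata."""
    return (
        "<%s> a skos:Concept ;\n    skosxl:prefLabel <%s> .\n"
        '<%s> a skosxl:Label ;\n    skosxl:literalForm "%s"@%s .\n'
        % (concept_uri, label_uri, label_uri, label, lang)
    )


concept = "http://example.org/vocab/c_1"
print(PREFIXES + skos_label(concept, "maize", "en"))
print(PREFIXES + skosxl_label(concept, "http://example.org/vocab/xl_en_1", "maize", "en"))
```

In the SKOS-XL form the label has its own URI, so statements such as an editorial status or a creation date can be attached to the label itself, which matters for a collaborative editing workflow.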
AgroTagger
• Identifies concepts in unstructured texts
• Uses AGROVOC as a controlled vocabulary
• Prototype under testing with excellent results (entire ICARDA repository indexed)
• In future, will produce structured RDF files that can be used to link data, like “OpenCalais”
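The idea of concept identification against a controlled vocabulary can be sketched with naive dictionary matching, as below. This is an illustration only and not AgroTagger's actual algorithm; the vocabulary entries and their example.org URIs are hypothetical stand-ins for AGROVOC concepts.

```python
import re

# Hypothetical controlled vocabulary: label -> concept URI (not real AGROVOC IDs).
VOCABULARY = {
    "rubber": "http://example.org/vocab/c_rubber",
    "oil palm": "http://example.org/vocab/c_oil_palm",
    "soil fertility": "http://example.org/vocab/c_soil_fertility",
}


def tag_text(text, vocabulary=VOCABULARY):
    """Return {concept URI: number of occurrences} for labels found in text."""
    # Normalise to lower-case words separated by single spaces.
    normalised = " ".join(re.findall(r"\w+", text.lower()))
    counts = {}
    for label, uri in vocabulary.items():
        # Word boundaries keep "rubber" from matching inside "rubberised".
        hits = re.findall(r"\b%s\b" % re.escape(label), normalised)
        if hits:
            counts[uri] = len(hits)
    return counts


sample = "Soil fertility under oil palm and rubber; rubber yields in Brazil."
for uri, count in sorted(tag_text(sample).items()):
    print(count, uri)
```

The output is a set of concept URIs rather than strings, so the tagged document can be linked to any other dataset that uses the same vocabulary.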
Triplifying AGRIS (small example)
<?xml version="1.0" encoding="utf-8"?>
<rdf:RDF xmlns:ags="http://purl.org/agmes/1.1/"
         xmlns:foaf="http://xmlns.com/foaf/0.1/"
         xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"
         xmlns:bibo="http://purl.org/ontology/bibo/"
         xmlns:dc="http://purl.org/dc/elements/1.1/"
         xmlns:dct="http://purl.org/dc/terms/">
  <bibo:Journal rdf:about="http://aims.fao.org/aos/journal/c_b6e4ca85">
    <bibo:ISSN>0101-9066</bibo:ISSN>
    <dct:title><![CDATA[Circular técnica]]></dct:title>
    <dct:alternative><![CDATA[Circular técnica (Centro Nacional de Pesquisa de Seringueira e Dendê)]]></dct:alternative>
    <dct:alternative><![CDATA[Circular Tecnica - Centro Nacional de Pesquisa da Seringueira e Dende]]></dct:alternative>
    <dct:alternative><![CDATA[Circular técnica - CNPSD]]></dct:alternative>
    <dct:alternative><![CDATA[Circ. téc.]]></dct:alternative>
    <ags:publisherPlace rdf:resource="http://aims.fao.org/aos/geopolitical.owl#Brazil"/>
    <dct:publisher><![CDATA[Empresa Brasileira de Pesquisa Agropecuária, Centro Nacional de Pesquisa de Seringueira e Dendê]]></dct:publisher>
    <dct:language>por</dct:language>
    <dct:date>1980</dct:date>
    <dct:subject rdf:resource="http://aims.fao.org/aos/agrovoc/c_10795"/>
    <dct:subject rdf:resource="http://aims.fao.org/aos/agrovoc/c_4650"/>
    <dct:subject rdf:resource="http://aims.fao.org/aos/agrovoc/c_32372"/>
    <dct:subject rdf:resource="http://aims.fao.org/aos/agrovoc/c_332"/>
    <dct:subject rdf:resource="http://aims.fao.org/aos/agrovoc/c_3589"/>
    <dct:subject rdf:resource="http://aims.fao.org/aos/agrovoc/c_5556"/>
  </bibo:Journal>
</rdf:RDF>
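A record like the one above can be read back with nothing more than the Python standard library. The sketch below uses a shortened copy of the journal record and treats it as plain XML; it is not a full RDF parser, and the function name journal_summary is made up for this example.

```python
import xml.etree.ElementTree as ET

RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
NS = {
    "rdf": RDF_NS,
    "bibo": "http://purl.org/ontology/bibo/",
    "dct": "http://purl.org/dc/terms/",
}

# Shortened copy of the journal record from the slide.
RECORD = """<?xml version="1.0" encoding="utf-8"?>
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns:bibo="http://purl.org/ontology/bibo/"
         xmlns:dct="http://purl.org/dc/terms/">
  <bibo:Journal rdf:about="http://aims.fao.org/aos/journal/c_b6e4ca85">
    <bibo:ISSN>0101-9066</bibo:ISSN>
    <dct:title><![CDATA[Circular técnica]]></dct:title>
    <dct:subject rdf:resource="http://aims.fao.org/aos/agrovoc/c_10795"/>
    <dct:subject rdf:resource="http://aims.fao.org/aos/agrovoc/c_4650"/>
  </bibo:Journal>
</rdf:RDF>"""


def journal_summary(xml_bytes):
    """Extract the journal URI, ISSN, title and subject URIs from RDF/XML bytes."""
    root = ET.fromstring(xml_bytes)
    journal = root.find("bibo:Journal", NS)
    return {
        "uri": journal.get("{%s}about" % RDF_NS),
        "issn": journal.findtext("bibo:ISSN", namespaces=NS),
        "title": journal.findtext("dct:title", namespaces=NS),
        # Each subject is a link to a vocabulary concept, not a text keyword.
        "subjects": [s.get("{%s}resource" % RDF_NS)
                     for s in journal.findall("dct:subject", NS)],
    }


summary = journal_summary(RECORD.encode("utf-8"))
print(summary["title"], summary["issn"])
for subject in summary["subjects"]:
    print(subject)
```

The subjects come out as AGROVOC concept URIs, which is the point of the triplification: the journal is described by links into a shared vocabulary instead of by local strings.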
Journal disambiguation: results
2,644,818 AGRIS records
2,171,113 records are journal records (82.09%)
1,788,083 journal records have been covered by the disambiguation process (82.35%)
14,658 journals have been correctly disambiguated
~20,000 strings still have to be examined: they refer to journal titles
Triples have been generated:
“Serendipity Linking”
With four predefined queries we try to find further information related to a record in Google:
• Search by Title, to find the full text of the document if it is available online
• Search by Author(s) + AGROVOC keywords, to find not only information about the author of the document but also other publications by the author on the same subjects
• Search by Journal Title
• Search by Conference
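The four predefined queries can be sketched as simple string templates over a bibliographic record. The record field names, the choice to quote titles and names as exact phrases, and the search URL are assumptions made for this illustration; the slides do not specify how the queries are built.

```python
from urllib.parse import urlencode

SEARCH_URL = "https://www.google.com/search"  # assumed endpoint for illustration


def build_queries(record):
    """Return the predefined query strings that apply to one record."""
    queries = {}
    if record.get("title"):
        queries["title"] = '"%s"' % record["title"]
    if record.get("authors"):
        # Author names as exact phrases, followed by the subject keywords.
        terms = ['"%s"' % a for a in record["authors"]]
        terms += list(record.get("keywords", []))
        queries["author_keywords"] = " ".join(terms)
    if record.get("journal"):
        queries["journal"] = '"%s"' % record["journal"]
    if record.get("conference"):
        queries["conference"] = '"%s"' % record["conference"]
    return queries


def search_urls(record):
    """Turn each query string into a URL with a properly encoded q parameter."""
    return {name: SEARCH_URL + "?" + urlencode({"q": query})
            for name, query in build_queries(record).items()}


record = {
    "title": "Circular técnica",
    "authors": ["A. Author"],       # hypothetical author
    "keywords": ["rubber"],         # hypothetical keyword label
    "journal": "Circular técnica",
}
for name, url in search_urls(record).items():
    print(name, url)
```

Queries are only generated for fields that are present, so a record without conference information simply yields three links instead of four.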
The Biotech Glossary:
Using Drupal as a Triple Store Browser
Data in VocBench
Triple Store
OWL ART API
Drupal
RING - Charts and numbers
http://ring.ciard.net
RING – Numbers
Number of documents potentially reachable through the services registered in the RING.
Types of service considered: document repositories and bibliographic databases.
http://ring.ciard.net/totals