agrixchange workshop at efita 2011, praha july

33
Building the CIARD Framework for Data and Information Sharing Praha, July 12, - johannes keizer agriXchange workshop at EFITA 2011, Praha July Dr. Johannes Keizer Office of Knowledge Exchange, Research and Extension Food and Agriculture Organization of the UN CIARD and agINFRA - creating a global framework for information sharing in agricultural research and innovation

Upload: chiku

Post on 16-Jan-2016

18 views

Category:

Documents


0 download

DESCRIPTION

CIARD and agINFRA - creating a global framework for information sharing in agricultural research and innovation. Dr. Johannes Keizer Office of Knowledge Exchange, Research and Extension Food and Agriculture Organization of the UN. agriXchange workshop at EFITA 2011, Praha July. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: agriXchange workshop at  EFITA 2011,  Praha   July

Building the CIARD Framework for Data and Information SharingPraha, July 12, - johannes keizer

agriXchange workshop at EFITA 2011, Praha July

Dr. Johannes KeizerOffice of Knowledge Exchange, Research and ExtensionFood and Agriculture Organization of the UN

CIARD and agINFRA - creating a global framework for information sharing in agricultural research and innovation

Page 2: agriXchange workshop at  EFITA 2011,  Praha   July

Building the CIARD Framework for Data and Information SharingPraha, July 12, - johannes keizer

We will promote research for food and agriculture, including research to

adapt to, and mitigate climate change, and access to research results and

technologies at national, regional and international levels.

We will reinvigorate national research systems and will share information

and best practices. We will improve access to knowledge.

world food summit 2009

Page 3: agriXchange workshop at  EFITA 2011,  Praha   July

Building the CIARD Framework for Data and Information SharingPraha, July 12, - johannes keizer

http://aims.fao.org

Page 4: agriXchange workshop at  EFITA 2011,  Praha   July

Building the CIARD Framework for Data and Information SharingPraha, July 12, - johannes keizer

http://www.ciard.net

Page 5: agriXchange workshop at  EFITA 2011,  Praha   July

Building the CIARD Framework for Data and Information SharingPraha, July 12, - johannes keizer

2nd IISAST Consultation

CIARD Initiative launched

(15 founding partners)

Regional Consultations

70 countries 150 info prof.

1 st IISAST Consultation

TASK FORCES

CIARD endorsed (GCARD and FARA)

+112 partners and growing…

20092007 20082005

Coherence in Information for Agricultural Research for Development

A new global movement to provide a platform for coherence between information-related initiatives

to make public domain agricultural research information and knowledge truly accessible to all

e-Consultation & Beijing Consultation

+ Regional Workshops

GCARD 2012

2010 20122011

Page 6: agriXchange workshop at  EFITA 2011,  Praha   July

Building the CIARD Framework for Data and Information SharingPraha, July 12, - johannes keizer

Information Infrastructure for Agricultural Research and Innovation

Page 7: agriXchange workshop at  EFITA 2011,  Praha   July

Building the CIARD Framework for Data and Information SharingPraha, July 12, - johannes keizer

Distributed Repositories

• stats• gene banks• gis data• blogs, • journals• open archives• raw data• technologies• learning objects• ………..

Page 8: agriXchange workshop at  EFITA 2011,  Praha   July

Building the CIARD Framework for Data and Information SharingPraha, July 12, - johannes keizer

The solution: agINFRA

Produce linked open data from all datasets

Use common reference vocabularies to interlink

Don’t wait ! Wrap the Legacy

Page 9: agriXchange workshop at  EFITA 2011,  Praha   July

Building the CIARD Framework for Data and Information SharingPraha, July 12, - johannes keizer

RING

routemap to information nodes and gateways

ToolsLOD

enabled software

VocBenchvocabulary server

concepts and entities triples

LOD Generator

triplifier, concept and entity

identifier

Data Services

Webservices + APIs to triple stores

Cloud

storage for RDF triples

The Infrastructure elements

Page 10: agriXchange workshop at  EFITA 2011,  Praha   July

Building the CIARD Framework for Data and Information SharingPraha, July 12, - johannes keizer

Lod Generator: processLOD Generator

triplifier, concept and entity

identifier

Page 11: agriXchange workshop at  EFITA 2011,  Praha   July

Building the CIARD Framework for Data and Information SharingPraha, July 12, - johannes keizer

Data Services process

Page 12: agriXchange workshop at  EFITA 2011,  Praha   July

Building the CIARD Framework for Data and Information SharingPraha, July 12, - johannes keizer

agINFRA – the Project

FAO and the Chinese Academy of Agricultural Science are Senior Users in the Project

4 Million Euros funding, but for 11 partners

Project starts on November 1 for 3 years

CIARD partners can post their requirements

Page 13: agriXchange workshop at  EFITA 2011,  Praha   July

Building the CIARD Framework for Data and Information SharingPraha, July 12, - johannes keizer

Under Construction

VocBench

AGROVOC Linked Open Data

AgroTagger

Triplifying AGRIS

Serendipity linking

Drupal front ends for triple stores

The CIARD R.I.N.G

Page 14: agriXchange workshop at  EFITA 2011,  Praha   July

Building the CIARD Framework for Data and Information SharingPraha, July 12, - johannes keizer

RING

routemap to information nodes and gateways

ToolsLOD

enabled software

VocBenchvocabulary server

concepts and entities triples

LOD Generator

triplifier, concept and entity

identifier

Data Services

Webservices + APIs to triple stores

Cloud

storage for RDF triples

The Infrastructure elements

Page 15: agriXchange workshop at  EFITA 2011,  Praha   July

Building the CIARD Framework for Data and Information SharingPraha, July 12, - johannes keizer

The VocBench VocBench

concepts and entities triples

Page 16: agriXchange workshop at  EFITA 2011,  Praha   July

Building the CIARD Framework for Data and Information SharingPraha, July 12, - johannes keizer

VocBench Features

Domain independent

Structure independent (i.e. thesauri, Glossaries, etc)

Supports RDF (SKOS, SKOS-XL), OWL

Supports collaborative editing

Supports editorial workflow, with user roles

Simple and advanced search

Supports data export: SKOS, Relational format (MySQL)

Page 17: agriXchange workshop at  EFITA 2011,  Praha   July

Building the CIARD Framework for Data and Information SharingPraha, July 12, - johannes keizer

Page 18: agriXchange workshop at  EFITA 2011,  Praha   July

Building the CIARD Framework for Data and Information SharingPraha, July 12, - johannes keizer

Page 19: agriXchange workshop at  EFITA 2011,  Praha   July

Building the CIARD Framework for Data and Information SharingPraha, July 12, - johannes keizer

Page 20: agriXchange workshop at  EFITA 2011,  Praha   July

Building the CIARD Framework for Data and Information SharingPraha, July 12, - johannes keizer

Page 21: agriXchange workshop at  EFITA 2011,  Praha   July

Building the CIARD Framework for Data and Information SharingPraha, July 12, - johannes keizer

RING

routemap to information nodes and gateways

ToolsLOD

enabled software

VocBenchvocabulary server

concepts and entities triples

LOD Generator

triplifier, concept and entity

identifier

Data Services

Webservices + APIs to triple stores

Cloud

storage for RDF triples

The Infrastructure elements

Page 22: agriXchange workshop at  EFITA 2011,  Praha   July

Building the CIARD Framework for Data and Information SharingPraha, July 12, - johannes keizer

• Does Concept identification in unstructured texts

• Uses Agrovoc as a controlled vocabulary

• Prototype under testing with excellent results (entire repository of ICARDA indexed)

• Will produce in future Structured RDF files that can be used to link data like “open Calais”

AgroTagger

Page 23: agriXchange workshop at  EFITA 2011,  Praha   July

Building the CIARD Framework for Data and Information SharingPraha, July 12, - johannes keizer

Page 24: agriXchange workshop at  EFITA 2011,  Praha   July

Building the CIARD Framework for Data and Information SharingPraha, July 12, - johannes keizer

Page 25: agriXchange workshop at  EFITA 2011,  Praha   July

Building the CIARD Framework for Data and Information SharingPraha, July 12, - johannes keizer

Triplifying AGRIS (small exemple)

<?xml version="1.0" encoding="utf-8"?><rdf:RDF xmlns:ags="http://purl.org/agmes/1.1/" xmlns:foaf="http://xmlns.com/foaf/0.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#" xmlns:bibo="http://purl.org/ontology/bibo/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:dct="http://purl.org/dc/terms/"><bibo:Journal rdf:about="http://aims.fao.org/aos/journal/c_b6e4ca85">

<bibo:ISSN>0101-9066</bibo:ISSN><bibo:ISSN>0101-9066</bibo:ISSN><dct:title><![CDATA[Circular técnica]]></dct:title><dct:alternative><![CDATA[Circular técnica (Centro Nacional de Pesquisa de Seringueira e Dendê)]]></dct:alternative><dct:alternative><![CDATA[Circular Tecnica - Centro Nacional de Pesquisa da Seringueira e Dende]]></dct:alternative><dct:alternative><![CDATA[Circular técnica - CNPSD]]></dct:alternative><dct:alternative><![CDATA[Circ. téc.]]></dct:alternative><ags:publisherPlace rdf:resource="http://aims.fao.org/aos/geopolitical.owl#Brazil"/><dct:publisher><![CDATA[Empresa Brasileira de Pesquisa Agropecuária, Centro Nacional de Pesquisa de Seringueira e

Dendê]]></dct:publisher><dct:language>por</dct:language><dct:date>1980</dct:date><dct:subject rdf:resource="http://aims.fao.org/aos/agrovoc/c_10795"/><dct:subject rdf:resource="http://aims.fao.org/aos/agrovoc/c_4650"/><dct:subject rdf:resource="http://aims.fao.org/aos/agrovoc/c_32372"/><dct:subject rdf:resource="http://aims.fao.org/aos/agrovoc/c_332"/><dct:subject rdf:resource="http://aims.fao.org/aos/agrovoc/c_3589"/><dct:subject rdf:resource="http://aims.fao.org/aos/agrovoc/c_5556"/>

</bibo:Journal>

Page 26: agriXchange workshop at  EFITA 2011,  Praha   July

Building the CIARD Framework for Data and Information SharingPraha, July 12, - johannes keizer

Journal disambiguation: results

2.644.818 AGRIS records

2.171.113 records are journal records (82.09%)

1.788.083 journal records have been covered by the disambiguation process (82.35%)

14.658 journals have been correctly disambiguated

~20.000 strings must be examined yet: they refer to journal’s titles

Triples have been generated:

Page 27: agriXchange workshop at  EFITA 2011,  Praha   July

Building the CIARD Framework for Data and Information SharingPraha, July 12, - johannes keizer

RING

routemap to information nodes and gateways

ToolsLOD

enabled software

VocBenchvocabulary server

concepts and entities triples

LOD Generator

triplifier, concept and entity

identifier

Data Services

Webservices + APIs to triple stores

Cloud

storage for RDF triples

The Infrastructure elements

Page 28: agriXchange workshop at  EFITA 2011,  Praha   July

Building the CIARD Framework for Data and Information SharingPraha, July 12, - johannes keizer

“Serendipity Linking”

With four predefined queries we try to find in Google further information record related:

• Search by Title, to find the full text of the document if it is available on line

• Search by Author(s)+Agrovoc keywords, to find not only information about the author of the document but also other author’s publications about the same subjects

• Search by Jounal Title

• Search by Conference

Page 29: agriXchange workshop at  EFITA 2011,  Praha   July

Building the CIARD Framework for Data and Information SharingPraha, July 12, - johannes keizer

Page 30: agriXchange workshop at  EFITA 2011,  Praha   July

Building the CIARD Framework for Data and Information SharingPraha, July 12, - johannes keizer

The Biotech Glossary:

Using Drupalas a Triple Store Browser

Data in VocBench

Triple Store

OWL ART API

Drupal

Page 31: agriXchange workshop at  EFITA 2011,  Praha   July

Building the CIARD Framework for Data and Information SharingPraha, July 12, - johannes keizer

RING

routemap to information nodes and gateways

ToolsLOD

enabled software

VocBenchvocabulary server

concepts and entities triples

LOD Generator

triplifier, concept and entity

identifier

Data Services

Webservices + APIs to triple stores

Cloud

storage for RDF triples

The Infrastructure elements

Page 32: agriXchange workshop at  EFITA 2011,  Praha   July

Building the CIARD Framework for Data and Information SharingPraha, July 12, - johannes keizer

RING - Charts and numbers

http://ring.ciard.net

Page 33: agriXchange workshop at  EFITA 2011,  Praha   July

Building the CIARD Framework for Data and Information SharingPraha, July 12, - johannes keizer

RING – Numbers

Number of documents potentially reachable through the services registered in the RING.

Types of service considered: document repositories and bibliographic databases.

http://ring.ciard.net/totals