SlideShare ist ein Scribd-Unternehmen logo
1 von 32
AGRIS
From a bibliographical database to a linked
open data application extending
knowledge mining to the world wide web
Fabrizio Celli and Johannes Keizer – 04/11/2015
fabrizio celli johannes keizerhttp://aims.fao.org
Outline
 What is AGRIS?
 (S)Mash-up!
 Mining and indexing the web
2
WHAT IS AGRIS?
fabrizio celli johannes keizerhttp://aims.fao.org
AGRIS
 The International System for Agricultural
Science and Technology
 A collection of more than 8 million
multilingual bibliographic resources
 A network of more than 150 institutions
from 65 countries
 A Web portal (http://agris.fao.org/)
4
fabrizio celli johannes keizerhttp://aims.fao.org 5
fabrizio celli johannes keizerhttp://aims.fao.org
AGRIS 2001
fabrizio celli johannes keizerhttp://aims.fao.org
AGRIS 2001
7
johannes keizerhttp://aims.fao.org
AGRIS 2015
fabrizio celli johannes keizerhttp://aims.fao.org
AGRIS users
• Researchers, professors, graduated
students looking for bibliographies
• Librarians, cataloguers
• Small journal publishers, professional
associations, conference organizers
• Government officers asking for reports on a
specific topic
9
fabrizio celli johannes keizerhttp://aims.fao.org
Impact
10
It supports both developed and developing countries
Accessed from more than 200 countries and territories
Google Analytics
October 2015
fabrizio celli johannes keizerhttp://aims.fao.org
Statistics
1
8,142,755 multilingual bibliographic records
~ 400,000 from Latin America
~ 150,000 from Africa
~ 760,000 from Asia + 400,000 links to CASDD
(China)
253,286,038 triples
(S)Mash-up!
12
fabrizio celli johannes keizerhttp://aims.fao.org
LOD infrastructure
 Since December 2013 AGRIS moved to the
RDF world
 Generation of mashup pages
• users looking for specific topics can access a
publication from the AGRIS database, combined with
other related resources extracted from other
preselected datasets
• external resources are not only bibliographic
metadata, but also distribution maps, statistics,
germplasm accessions, and so on.
13
fabrizio celli johannes keizerhttp://aims.fao.org
The RDF-ization process
 Translation of the AGRIS AP XML database
to RDF
• Selection of existing vocabularies
• Data cleaning and normalization
• Index all records with the AGROVOC thesaurus
• Run the conversion and publish RDF data!
 Selection of external datasets we want to
interlink to AGRIS
14
fabrizio celli johannes keizerhttp://aims.fao.org
AGRIS RDF
1
bibo:Article
bibo:abstract
bibo:doi
bibo:isbn
bibo:presentedAt ->
bibo:Conference -> dct:title
bibo:uri
dct:alternative
dct:creator ->
foaf:organization ->
foaf:name
dct:creator ->
foaf:Person -> foaf:name
dct:dateSubmitted
dct:description
dct:extent
dct:identifier
dct:language
dct:isPartOf
dct:issued
dct:publisher ->
foaf:Organization ->
foaf:name
dct:source
dct:subject
dct:title
dct:type
dct:rights
fabrizio celli johannes keizerhttp://aims.fao.org
AGROVOC
 The FAO multilingual vocabulary containing
around 32 000 concepts in up to 21
languages
 Backbone: the magic that allows the
interlinking to external datasets
 Two ways to implement the interlinking:
• Using AGROVOC formal aligments to other thesauri
• Querying external WebServices with scientific names
16
johannes keizerhttp://aims.fao.org
Relationships, Relationships
http://aims.fao.org/aos/agrovoc/c_1474.html
johannes keizerhttp://aims.fao.org
johannes keizerhttp://aims.fao.org
http://agris.fao.org
http://agris.fao.org/agris-
search/search.do?recordID=PH2011000084
http://agris.fao.org/agris-
search/search.do?recordID=PL200300203
6
Mashup
20
fabrizio celli johannes keizerhttp://aims.fao.org
From AGRIS to DBPedia
AGRIS
URI
AGROV
OC URI
dcterms:subject
DBPedia
URI
skos:closeMatch
skos:exactMatch
DBPedia
Abstract
Wikipedia
URL
DBPedia
Picture
foaf:isPrimaryTopicOfdbpedia-owl:abstract
foaf:depiction
Entry
point!
AGROVOC
is the
backbone
fabrizio celli johannes keizerhttp://aims.fao.org
SPARQL in action!
1. From an AGRIS URI, get the list of the AGROVOC URIs
(dcterms:subject)
PREFIX dct: <http://purl.org/dc/terms/>
SELECT ?agr
WHERE {
<AGRIS_Uri> dct:subject ?agr .
}
2. For each AGROVOC URI
2.1. Get skos:closeMatch and skos:exactMatch (formal alignments to other
thesauri)
PREFIX skos: <http://www.w3.org/2004/02/skos/core#>
SELECT?em ?cm {
OPTIONAL { <AGROVOC_Uri> skos:exactMatch ?em } .
OPTIONAL { <AGROVOC_Uri> skos:closeMatch ?cm } .
}
fabrizio celli johannes keizerhttp://aims.fao.org
Get DBPedia
2.2. The JAVA code filters DBPedia URIs, to avoid adding a new FILTER in
the SPARQL query (it’s heavy…)
2.3. For each DBPedia URI, query the DBPedia SPARQL endpoint to get
information to display in an AGRIS widget
SELECT ?abs ?img ?wiki
WHERE {
OPTIONAL {<DBP_Uri> dbpedia-owl:abstract ?abs} .
OPTIONAL {<DBP_Uri> foaf:depiction ?img} .
OPTIONAL {<DBP_Uri> foaf:isPrimaryTopicOf ?wiki} .
FILTER ( (lang(?abs ) ="en") || (!bound(?abs)) )
}
fabrizio celli johannes keizerhttp://aims.fao.org
Bibliography
2
«Migrating bibliographic datasets to the Semantic Web:
The AGRIS case». Stefano Anibaldi, Yves Jaques,
Fabrizio Celli, Armando Stellato, Johannes Keizer.
Semantic Web journal
«OpenAGRIS: using bibliographical data for linking into the
agricultural knowledge web». Fabrizio Celli, Stefano
Anibaldi, Maria Folch, Yves Jaques, Johannes Keizer.
AOS 2011
25
Mining and indexing the web
fabrizio celli johannes keizerhttp://aims.fao.org
The context
 Scientists and researchers publish their
results not only in journals or at conferences,
but also via web 2.0 tools and other media
 Corpora of ongoing research activities,
unpublished material, grey literature, quick
discussions, and experiments with negative
results and ideas
 This information is usually unstructured and
not exposed using web services
26
fabrizio celli johannes keizerhttp://aims.fao.org
Goal
 Crawl the web (manually preselected
websites)
 Machine learning algorithms to index
discovered web resources using AGROVOC
 Select relevant resources using a
recommender system
 Interlink to AGRIS!
27
fabrizio celli johannes keizerhttp://aims.fao.org
Crawling and indexing
28
https://github.com/fcproj/agrotagger
fabrizio celli johannes keizerhttp://aims.fao.org
Recommender system
29
• A JAVA component that computes meaningful
intersections between the Crawler Database
and the AGRIS database
• Offline process, recommendations are stored in
a triplestore
fabrizio celli johannes keizerhttp://aims.fao.org
Interlinking
30
https://github.com/fcproj/recommender
fabrizio celli johannes keizerhttp://aims.fao.org 31
fabrizio celli johannes keizerhttp://aims.fao.org
Bibliography
32
Discovering, Indexing and Interlinking Information
Resources Fabrizio Celli, Johannes Keizer, Yves
Jaques, Stasinos Konstantopoulos, Dušan Vudragović.
F1000 Research
Version 2 under revision

Weitere ähnliche Inhalte

Was ist angesagt?

Visualizing Open Data Using Neo4J (Full)
Visualizing Open Data Using Neo4J (Full)Visualizing Open Data Using Neo4J (Full)
Visualizing Open Data Using Neo4J (Full)Scott Sosna
 
ORCID & other Person iDs
ORCID & other Person iDsORCID & other Person iDs
ORCID & other Person iDsSimeon Warner
 
GBIF-Norway at NMBU, January 2015
GBIF-Norway at NMBU, January 2015GBIF-Norway at NMBU, January 2015
GBIF-Norway at NMBU, January 2015Dag Endresen
 
GBIF BIFA mentoring, Day 1 GBIF intro, July 2016
GBIF BIFA mentoring, Day 1 GBIF intro, July 2016GBIF BIFA mentoring, Day 1 GBIF intro, July 2016
GBIF BIFA mentoring, Day 1 GBIF intro, July 2016Dag Endresen
 
Open Geodata, DIT, December 2010
Open Geodata, DIT, December 2010Open Geodata, DIT, December 2010
Open Geodata, DIT, December 2010Richard Cantwell
 
#HepaticaWeek April 2016, GBIF data publishing
#HepaticaWeek April 2016, GBIF data publishing#HepaticaWeek April 2016, GBIF data publishing
#HepaticaWeek April 2016, GBIF data publishingDag Endresen
 
Rich Data? Poor Data? Depends on...
Rich Data? Poor Data? Depends on...Rich Data? Poor Data? Depends on...
Rich Data? Poor Data? Depends on...Lars G. Svensson
 
Federated Query Formulation and Processing through BioFed
Federated Query Formulation and Processing through BioFedFederated Query Formulation and Processing through BioFed
Federated Query Formulation and Processing through BioFedSyed Muhammad Ali Hasnain
 
KESW2012 Hackathon St Petersburg
KESW2012 Hackathon St PetersburgKESW2012 Hackathon St Petersburg
KESW2012 Hackathon St PetersburgAI4BD GmbH
 
2016-10-12 MUSIT & GBIF - Dataset portals
2016-10-12 MUSIT & GBIF - Dataset portals2016-10-12 MUSIT & GBIF - Dataset portals
2016-10-12 MUSIT & GBIF - Dataset portalsDag Endresen
 
ORCID Implementation - Wikipedia (A. Mabbett)
ORCID Implementation - Wikipedia (A. Mabbett)ORCID Implementation - Wikipedia (A. Mabbett)
ORCID Implementation - Wikipedia (A. Mabbett)ORCID, Inc
 
Enabling information interoperability with identifiers (L. Haak)
Enabling information interoperability with identifiers  (L. Haak)Enabling information interoperability with identifiers  (L. Haak)
Enabling information interoperability with identifiers (L. Haak)ORCID, Inc
 

Was ist angesagt? (14)

Visualizing Open Data Using Neo4J (Full)
Visualizing Open Data Using Neo4J (Full)Visualizing Open Data Using Neo4J (Full)
Visualizing Open Data Using Neo4J (Full)
 
Release of AGRIS 2.0: Searching agricultural bibliografic data
Release of AGRIS 2.0: Searching agricultural bibliografic dataRelease of AGRIS 2.0: Searching agricultural bibliografic data
Release of AGRIS 2.0: Searching agricultural bibliografic data
 
ORCID & other Person iDs
ORCID & other Person iDsORCID & other Person iDs
ORCID & other Person iDs
 
GBIF-Norway at NMBU, January 2015
GBIF-Norway at NMBU, January 2015GBIF-Norway at NMBU, January 2015
GBIF-Norway at NMBU, January 2015
 
GBIF BIFA mentoring, Day 1 GBIF intro, July 2016
GBIF BIFA mentoring, Day 1 GBIF intro, July 2016GBIF BIFA mentoring, Day 1 GBIF intro, July 2016
GBIF BIFA mentoring, Day 1 GBIF intro, July 2016
 
Open Geodata, DIT, December 2010
Open Geodata, DIT, December 2010Open Geodata, DIT, December 2010
Open Geodata, DIT, December 2010
 
Open Data: a view from the trenches
Open Data: a view from the trenchesOpen Data: a view from the trenches
Open Data: a view from the trenches
 
#HepaticaWeek April 2016, GBIF data publishing
#HepaticaWeek April 2016, GBIF data publishing#HepaticaWeek April 2016, GBIF data publishing
#HepaticaWeek April 2016, GBIF data publishing
 
Rich Data? Poor Data? Depends on...
Rich Data? Poor Data? Depends on...Rich Data? Poor Data? Depends on...
Rich Data? Poor Data? Depends on...
 
Federated Query Formulation and Processing through BioFed
Federated Query Formulation and Processing through BioFedFederated Query Formulation and Processing through BioFed
Federated Query Formulation and Processing through BioFed
 
KESW2012 Hackathon St Petersburg
KESW2012 Hackathon St PetersburgKESW2012 Hackathon St Petersburg
KESW2012 Hackathon St Petersburg
 
2016-10-12 MUSIT & GBIF - Dataset portals
2016-10-12 MUSIT & GBIF - Dataset portals2016-10-12 MUSIT & GBIF - Dataset portals
2016-10-12 MUSIT & GBIF - Dataset portals
 
ORCID Implementation - Wikipedia (A. Mabbett)
ORCID Implementation - Wikipedia (A. Mabbett)ORCID Implementation - Wikipedia (A. Mabbett)
ORCID Implementation - Wikipedia (A. Mabbett)
 
Enabling information interoperability with identifiers (L. Haak)
Enabling information interoperability with identifiers  (L. Haak)Enabling information interoperability with identifiers  (L. Haak)
Enabling information interoperability with identifiers (L. Haak)
 

Andere mochten auch

2007 08 26 Dc Keynote Keizer
2007 08 26 Dc Keynote Keizer2007 08 26 Dc Keynote Keizer
2007 08 26 Dc Keynote KeizerJohannes Keizer
 
AGRIS - From a bibliographical database to a Web data service on agricultural...
AGRIS - From a bibliographical database to a Web data service on agricultural...AGRIS - From a bibliographical database to a Web data service on agricultural...
AGRIS - From a bibliographical database to a Web data service on agricultural...IAALD Community
 
Cluster Settore Agroalimentare “Contaminazioni: formaggi freschi al gusto di ...
Cluster Settore Agroalimentare “Contaminazioni: formaggi freschi al gusto di ...Cluster Settore Agroalimentare “Contaminazioni: formaggi freschi al gusto di ...
Cluster Settore Agroalimentare “Contaminazioni: formaggi freschi al gusto di ...Sardegna Ricerche
 

Andere mochten auch (20)

(Meta)data
(Meta)data(Meta)data
(Meta)data
 
AGROVOC, AGRIS and the CIARD RING, using RDF vocabularies and technologies f...
AGROVOC, AGRIS and the CIARD RING,  using RDF vocabularies and technologies f...AGROVOC, AGRIS and the CIARD RING,  using RDF vocabularies and technologies f...
AGROVOC, AGRIS and the CIARD RING, using RDF vocabularies and technologies f...
 
2005 09 Dc Keynote
2005 09 Dc Keynote2005 09 Dc Keynote
2005 09 Dc Keynote
 
AGRIS is an important network for access to technical and scientific publicat...
AGRIS is an important network for access to technical and scientific publicat...AGRIS is an important network for access to technical and scientific publicat...
AGRIS is an important network for access to technical and scientific publicat...
 
Thai AGROVOC ontology base for agricultural information retrieval
Thai AGROVOC ontology base for agricultural information retrievalThai AGROVOC ontology base for agricultural information retrieval
Thai AGROVOC ontology base for agricultural information retrieval
 
2007 08 26 Dc Keynote Keizer
2007 08 26 Dc Keynote Keizer2007 08 26 Dc Keynote Keizer
2007 08 26 Dc Keynote Keizer
 
Agricultural Ontology Service AgStandards-AIMS
Agricultural Ontology Service AgStandards-AIMSAgricultural Ontology Service AgStandards-AIMS
Agricultural Ontology Service AgStandards-AIMS
 
OpenAGRIS: using bibliographical data for linking into the agricultural knowl...
OpenAGRIS: using bibliographical data for linking into the agricultural knowl...OpenAGRIS: using bibliographical data for linking into the agricultural knowl...
OpenAGRIS: using bibliographical data for linking into the agricultural knowl...
 
Open Access in Agricultural Research for Development : a Global Movement
Open Access in Agricultural Research for Development : a Global MovementOpen Access in Agricultural Research for Development : a Global Movement
Open Access in Agricultural Research for Development : a Global Movement
 
Webinar@ASIRA: Introduction to Using TEEAL to Access Agricultural Journals
Webinar@ASIRA: Introduction to Using TEEAL to Access Agricultural Journals Webinar@ASIRA: Introduction to Using TEEAL to Access Agricultural Journals
Webinar@ASIRA: Introduction to Using TEEAL to Access Agricultural Journals
 
AGRIS - From a bibliographical database to a Web data service on agricultural...
AGRIS - From a bibliographical database to a Web data service on agricultural...AGRIS - From a bibliographical database to a Web data service on agricultural...
AGRIS - From a bibliographical database to a Web data service on agricultural...
 
Webinar@ASIRA: AGRIS: Providing Access to Agricultural Research and Technolog...
Webinar@ASIRA: AGRIS: Providing Access to Agricultural Research and Technolog...Webinar@ASIRA: AGRIS: Providing Access to Agricultural Research and Technolog...
Webinar@ASIRA: AGRIS: Providing Access to Agricultural Research and Technolog...
 
Webinar@ASIRA: Emerging Themes in Agricultural Research Publishing
Webinar@ASIRA: Emerging Themes in Agricultural Research PublishingWebinar@ASIRA: Emerging Themes in Agricultural Research Publishing
Webinar@ASIRA: Emerging Themes in Agricultural Research Publishing
 
Cluster Settore Agroalimentare “Contaminazioni: formaggi freschi al gusto di ...
Cluster Settore Agroalimentare “Contaminazioni: formaggi freschi al gusto di ...Cluster Settore Agroalimentare “Contaminazioni: formaggi freschi al gusto di ...
Cluster Settore Agroalimentare “Contaminazioni: formaggi freschi al gusto di ...
 
Webinar@ASIRA: Access to Global Online Research in Agriculture (AGORA)
Webinar@ASIRA: Access to Global Online Research in Agriculture (AGORA) Webinar@ASIRA: Access to Global Online Research in Agriculture (AGORA)
Webinar@ASIRA: Access to Global Online Research in Agriculture (AGORA)
 
Webinar@ASIRA: AuthorAID: Supporting Developing Country Researchers in Publis...
Webinar@ASIRA: AuthorAID: Supporting Developing Country Researchers in Publis...Webinar@ASIRA: AuthorAID: Supporting Developing Country Researchers in Publis...
Webinar@ASIRA: AuthorAID: Supporting Developing Country Researchers in Publis...
 
Using AGRIS as a portal of choice to access agricultural research and technol...
Using AGRIS as a portal of choice to access agricultural research and technol...Using AGRIS as a portal of choice to access agricultural research and technol...
Using AGRIS as a portal of choice to access agricultural research and technol...
 
Wifi Security
Wifi SecurityWifi Security
Wifi Security
 
Agris
AgrisAgris
Agris
 
Agris
AgrisAgris
Agris
 

Ähnlich wie 2015 11 agris-medes

Presentation at the ISTIC workshop on Knowleddge Organization
Presentation at the ISTIC workshop on Knowleddge OrganizationPresentation at the ISTIC workshop on Knowleddge Organization
Presentation at the ISTIC workshop on Knowleddge OrganizationJohannes Keizer
 
2012 07 ictk-johanneskeizer
2012 07 ictk-johanneskeizer2012 07 ictk-johanneskeizer
2012 07 ictk-johanneskeizerJohannes Keizer
 
Presentation at the VIVO 2011 conference
Presentation at the VIVO 2011 conferencePresentation at the VIVO 2011 conference
Presentation at the VIVO 2011 conferenceJohannes Keizer
 
AgriVIVO: A Global Ontology-Driven RDF Store Based on a Distributed Architect...
AgriVIVO: A Global Ontology-Driven RDF Store Based on a Distributed Architect...AgriVIVO: A Global Ontology-Driven RDF Store Based on a Distributed Architect...
AgriVIVO: A Global Ontology-Driven RDF Store Based on a Distributed Architect...Valeria Pesce
 
Berlin8 keizer 2010-10-25
Berlin8 keizer 2010-10-25Berlin8 keizer 2010-10-25
Berlin8 keizer 2010-10-25Johannes Keizer
 
Libraries and Linked open Data
Libraries and Linked open DataLibraries and Linked open Data
Libraries and Linked open DataJohannes Keizer
 
Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and ReuseMendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and ReuseAnita de Waard
 
Wikidata: Verifiable, Linked Open Knowledge That Anyone Can Edit
Wikidata: Verifiable, Linked Open Knowledge That Anyone Can EditWikidata: Verifiable, Linked Open Knowledge That Anyone Can Edit
Wikidata: Verifiable, Linked Open Knowledge That Anyone Can EditDario Taraborelli
 
Accessing Digital Collections Data Sources for Research: A Tour of iDigBio Da...
Accessing Digital Collections Data Sources for Research: A Tour of iDigBio Da...Accessing Digital Collections Data Sources for Research: A Tour of iDigBio Da...
Accessing Digital Collections Data Sources for Research: A Tour of iDigBio Da...Matthew J Collins
 
Global Information Systems for Plant Genetic Resources (2009)
Global Information Systems for Plant Genetic Resources (2009)Global Information Systems for Plant Genetic Resources (2009)
Global Information Systems for Plant Genetic Resources (2009)Dag Endresen
 
The agINFRA Linked Data layer
The agINFRA Linked Data layerThe agINFRA Linked Data layer
The agINFRA Linked Data layerValeria Pesce
 
The agINFRA Linked Data layer by Valeria Pesce, Giovanni l'Abate, Luca Mattei...
The agINFRA Linked Data layer by Valeria Pesce, Giovanni l'Abate, Luca Mattei...The agINFRA Linked Data layer by Valeria Pesce, Giovanni l'Abate, Luca Mattei...
The agINFRA Linked Data layer by Valeria Pesce, Giovanni l'Abate, Luca Mattei...CIARD Movement
 
Building Federated FAIR Data Spaces, Yann Le Franc, EOSC-Pillar
Building Federated FAIR Data Spaces, Yann Le Franc, EOSC-PillarBuilding Federated FAIR Data Spaces, Yann Le Franc, EOSC-Pillar
Building Federated FAIR Data Spaces, Yann Le Franc, EOSC-PillarEOSC-Pillar European Project
 

Ähnlich wie 2015 11 agris-medes (20)

Linked open Vocabularies for Linked Open Data - the role of AGROVOC
Linked open Vocabularies for Linked Open Data - the role of AGROVOCLinked open Vocabularies for Linked Open Data - the role of AGROVOC
Linked open Vocabularies for Linked Open Data - the role of AGROVOC
 
Presentation at the ISTIC workshop on Knowleddge Organization
Presentation at the ISTIC workshop on Knowleddge OrganizationPresentation at the ISTIC workshop on Knowleddge Organization
Presentation at the ISTIC workshop on Knowleddge Organization
 
2012 07 ictk-johanneskeizer
2012 07 ictk-johanneskeizer2012 07 ictk-johanneskeizer
2012 07 ictk-johanneskeizer
 
Open Access International Tendences
Open Access International TendencesOpen Access International Tendences
Open Access International Tendences
 
open access in Science
open access in Scienceopen access in Science
open access in Science
 
Presentation at the VIVO 2011 conference
Presentation at the VIVO 2011 conferencePresentation at the VIVO 2011 conference
Presentation at the VIVO 2011 conference
 
AgriVIVO: A Global Ontology-Driven RDF Store Based on a Distributed Architect...
AgriVIVO: A Global Ontology-Driven RDF Store Based on a Distributed Architect...AgriVIVO: A Global Ontology-Driven RDF Store Based on a Distributed Architect...
AgriVIVO: A Global Ontology-Driven RDF Store Based on a Distributed Architect...
 
World bank 2011-05
World bank 2011-05World bank 2011-05
World bank 2011-05
 
Berlin8 keizer 2010-10-25
Berlin8 keizer 2010-10-25Berlin8 keizer 2010-10-25
Berlin8 keizer 2010-10-25
 
Libraries and Linked open Data
Libraries and Linked open DataLibraries and Linked open Data
Libraries and Linked open Data
 
Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and ReuseMendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
 
Wikidata: Verifiable, Linked Open Knowledge That Anyone Can Edit
Wikidata: Verifiable, Linked Open Knowledge That Anyone Can EditWikidata: Verifiable, Linked Open Knowledge That Anyone Can Edit
Wikidata: Verifiable, Linked Open Knowledge That Anyone Can Edit
 
Accessing Digital Collections Data Sources for Research: A Tour of iDigBio Da...
Accessing Digital Collections Data Sources for Research: A Tour of iDigBio Da...Accessing Digital Collections Data Sources for Research: A Tour of iDigBio Da...
Accessing Digital Collections Data Sources for Research: A Tour of iDigBio Da...
 
Global Information Systems for Plant Genetic Resources (2009)
Global Information Systems for Plant Genetic Resources (2009)Global Information Systems for Plant Genetic Resources (2009)
Global Information Systems for Plant Genetic Resources (2009)
 
2014 04 semic
2014 04 semic2014 04 semic
2014 04 semic
 
The agINFRA Linked Data layer
The agINFRA Linked Data layerThe agINFRA Linked Data layer
The agINFRA Linked Data layer
 
The agINFRA Linked Data layer by Valeria Pesce, Giovanni l'Abate, Luca Mattei...
The agINFRA Linked Data layer by Valeria Pesce, Giovanni l'Abate, Luca Mattei...The agINFRA Linked Data layer by Valeria Pesce, Giovanni l'Abate, Luca Mattei...
The agINFRA Linked Data layer by Valeria Pesce, Giovanni l'Abate, Luca Mattei...
 
Building Federated FAIR Data Spaces, Yann Le Franc, EOSC-Pillar
Building Federated FAIR Data Spaces, Yann Le Franc, EOSC-PillarBuilding Federated FAIR Data Spaces, Yann Le Franc, EOSC-Pillar
Building Federated FAIR Data Spaces, Yann Le Franc, EOSC-Pillar
 
The Agricultural Ontology Service and its Vision
The Agricultural Ontology Service and its VisionThe Agricultural Ontology Service and its Vision
The Agricultural Ontology Service and its Vision
 
The agricultural ontology service and its vision
The agricultural ontology service and its visionThe agricultural ontology service and its vision
The agricultural ontology service and its vision
 

Mehr von Johannes Keizer (20)

Presentation CABI Beijing 2019 11-04
Presentation CABI Beijing  2019 11-04Presentation CABI Beijing  2019 11-04
Presentation CABI Beijing 2019 11-04
 
eROSA presentation at CAAS, September 2018
eROSA presentation at CAAS, September 2018eROSA presentation at CAAS, September 2018
eROSA presentation at CAAS, September 2018
 
2018 03 apan
2018 03 apan2018 03 apan
2018 03 apan
 
2017 11-15 macs
2017 11-15 macs2017 11-15 macs
2017 11-15 macs
 
2016 10 caas-ats
2016 10 caas-ats2016 10 caas-ats
2016 10 caas-ats
 
2016 08 gxaas
2016 08 gxaas2016 08 gxaas
2016 08 gxaas
 
2016 06 chengdu
2016 06 chengdu2016 06 chengdu
2016 06 chengdu
 
2017 08 apan
2017 08 apan2017 08 apan
2017 08 apan
 
2017 09 caas
2017 09 caas2017 09 caas
2017 09 caas
 
2017 11 wageningen-keizer
2017 11 wageningen-keizer2017 11 wageningen-keizer
2017 11 wageningen-keizer
 
2017 11 eosc-keizer
2017 11 eosc-keizer2017 11 eosc-keizer
2017 11 eosc-keizer
 
2017 11 cascd
2017 11 cascd2017 11 cascd
2017 11 cascd
 
2017 04 igad-jk
2017 04 igad-jk2017 04 igad-jk
2017 04 igad-jk
 
2017 02 apan
2017 02 apan2017 02 apan
2017 02 apan
 
2017 06 itpgrfa
2017 06 itpgrfa2017 06 itpgrfa
2017 06 itpgrfa
 
2017 03 brussels
2017 03 brussels2017 03 brussels
2017 03 brussels
 
2017 076 efita-sponsor-godan
2017 076 efita-sponsor-godan2017 076 efita-sponsor-godan
2017 076 efita-sponsor-godan
 
2017 07 montpellier-keizer
2017 07 montpellier-keizer2017 07 montpellier-keizer
2017 07 montpellier-keizer
 
2017 04 embl
2017 04 embl2017 04 embl
2017 04 embl
 
The FAIR principle in the Big Data World
The FAIR principle in the Big Data WorldThe FAIR principle in the Big Data World
The FAIR principle in the Big Data World
 

Kürzlich hochgeladen

Call Girls South Delhi Delhi reach out to us at ☎ 9711199012
Call Girls South Delhi Delhi reach out to us at ☎ 9711199012Call Girls South Delhi Delhi reach out to us at ☎ 9711199012
Call Girls South Delhi Delhi reach out to us at ☎ 9711199012rehmti665
 
Top 10 Interactive Website Design Trends in 2024.pptx
Top 10 Interactive Website Design Trends in 2024.pptxTop 10 Interactive Website Design Trends in 2024.pptx
Top 10 Interactive Website Design Trends in 2024.pptxDyna Gilbert
 
Font Performance - NYC WebPerf Meetup April '24
Font Performance - NYC WebPerf Meetup April '24Font Performance - NYC WebPerf Meetup April '24
Font Performance - NYC WebPerf Meetup April '24Paul Calvano
 
Git and Github workshop GDSC MLRITM
Git and Github  workshop GDSC MLRITMGit and Github  workshop GDSC MLRITM
Git and Github workshop GDSC MLRITMgdsc13
 
Packaging the Monolith - PHP Tek 2024 (Breaking it down one bite at a time)
Packaging the Monolith - PHP Tek 2024 (Breaking it down one bite at a time)Packaging the Monolith - PHP Tek 2024 (Breaking it down one bite at a time)
Packaging the Monolith - PHP Tek 2024 (Breaking it down one bite at a time)Dana Luther
 
PHP-based rendering of TYPO3 Documentation
PHP-based rendering of TYPO3 DocumentationPHP-based rendering of TYPO3 Documentation
PHP-based rendering of TYPO3 DocumentationLinaWolf1
 
Magic exist by Marta Loveguard - presentation.pptx
Magic exist by Marta Loveguard - presentation.pptxMagic exist by Marta Loveguard - presentation.pptx
Magic exist by Marta Loveguard - presentation.pptxMartaLoveguard
 
Call Girls in Uttam Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Uttam Nagar Delhi 💯Call Us 🔝8264348440🔝Call Girls in Uttam Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Uttam Nagar Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Call Girls In The Ocean Pearl Retreat Hotel New Delhi 9873777170
Call Girls In The Ocean Pearl Retreat Hotel New Delhi 9873777170Call Girls In The Ocean Pearl Retreat Hotel New Delhi 9873777170
Call Girls In The Ocean Pearl Retreat Hotel New Delhi 9873777170Sonam Pathan
 
Call Girls Service Adil Nagar 7001305949 Need escorts Service Pooja Vip
Call Girls Service Adil Nagar 7001305949 Need escorts Service Pooja VipCall Girls Service Adil Nagar 7001305949 Need escorts Service Pooja Vip
Call Girls Service Adil Nagar 7001305949 Need escorts Service Pooja VipCall Girls Lucknow
 
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一Fs
 
定制(UAL学位证)英国伦敦艺术大学毕业证成绩单原版一比一
定制(UAL学位证)英国伦敦艺术大学毕业证成绩单原版一比一定制(UAL学位证)英国伦敦艺术大学毕业证成绩单原版一比一
定制(UAL学位证)英国伦敦艺术大学毕业证成绩单原版一比一Fs
 
Film cover research (1).pptxsdasdasdasdasdasa
Film cover research (1).pptxsdasdasdasdasdasaFilm cover research (1).pptxsdasdasdasdasdasa
Film cover research (1).pptxsdasdasdasdasdasa494f574xmv
 
定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一
定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一
定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一Fs
 
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一Fs
 
Contact Rya Baby for Call Girls New Delhi
Contact Rya Baby for Call Girls New DelhiContact Rya Baby for Call Girls New Delhi
Contact Rya Baby for Call Girls New Delhimiss dipika
 
Blepharitis inflammation of eyelid symptoms cause everything included along w...
Blepharitis inflammation of eyelid symptoms cause everything included along w...Blepharitis inflammation of eyelid symptoms cause everything included along w...
Blepharitis inflammation of eyelid symptoms cause everything included along w...Excelmac1
 

Kürzlich hochgeladen (20)

Call Girls South Delhi Delhi reach out to us at ☎ 9711199012
Call Girls South Delhi Delhi reach out to us at ☎ 9711199012Call Girls South Delhi Delhi reach out to us at ☎ 9711199012
Call Girls South Delhi Delhi reach out to us at ☎ 9711199012
 
young call girls in Uttam Nagar🔝 9953056974 🔝 Delhi escort Service
young call girls in Uttam Nagar🔝 9953056974 🔝 Delhi escort Serviceyoung call girls in Uttam Nagar🔝 9953056974 🔝 Delhi escort Service
young call girls in Uttam Nagar🔝 9953056974 🔝 Delhi escort Service
 
Top 10 Interactive Website Design Trends in 2024.pptx
Top 10 Interactive Website Design Trends in 2024.pptxTop 10 Interactive Website Design Trends in 2024.pptx
Top 10 Interactive Website Design Trends in 2024.pptx
 
Font Performance - NYC WebPerf Meetup April '24
Font Performance - NYC WebPerf Meetup April '24Font Performance - NYC WebPerf Meetup April '24
Font Performance - NYC WebPerf Meetup April '24
 
Git and Github workshop GDSC MLRITM
Git and Github  workshop GDSC MLRITMGit and Github  workshop GDSC MLRITM
Git and Github workshop GDSC MLRITM
 
Hot Sexy call girls in Rk Puram 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in  Rk Puram 🔝 9953056974 🔝 Delhi escort ServiceHot Sexy call girls in  Rk Puram 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Rk Puram 🔝 9953056974 🔝 Delhi escort Service
 
Packaging the Monolith - PHP Tek 2024 (Breaking it down one bite at a time)
Packaging the Monolith - PHP Tek 2024 (Breaking it down one bite at a time)Packaging the Monolith - PHP Tek 2024 (Breaking it down one bite at a time)
Packaging the Monolith - PHP Tek 2024 (Breaking it down one bite at a time)
 
PHP-based rendering of TYPO3 Documentation
PHP-based rendering of TYPO3 DocumentationPHP-based rendering of TYPO3 Documentation
PHP-based rendering of TYPO3 Documentation
 
Magic exist by Marta Loveguard - presentation.pptx
Magic exist by Marta Loveguard - presentation.pptxMagic exist by Marta Loveguard - presentation.pptx
Magic exist by Marta Loveguard - presentation.pptx
 
Model Call Girl in Jamuna Vihar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in  Jamuna Vihar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in  Jamuna Vihar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Jamuna Vihar Delhi reach out to us at 🔝9953056974🔝
 
Call Girls in Uttam Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Uttam Nagar Delhi 💯Call Us 🔝8264348440🔝Call Girls in Uttam Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Uttam Nagar Delhi 💯Call Us 🔝8264348440🔝
 
Call Girls In The Ocean Pearl Retreat Hotel New Delhi 9873777170
Call Girls In The Ocean Pearl Retreat Hotel New Delhi 9873777170Call Girls In The Ocean Pearl Retreat Hotel New Delhi 9873777170
Call Girls In The Ocean Pearl Retreat Hotel New Delhi 9873777170
 
Call Girls Service Adil Nagar 7001305949 Need escorts Service Pooja Vip
Call Girls Service Adil Nagar 7001305949 Need escorts Service Pooja VipCall Girls Service Adil Nagar 7001305949 Need escorts Service Pooja Vip
Call Girls Service Adil Nagar 7001305949 Need escorts Service Pooja Vip
 
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一
 
定制(UAL学位证)英国伦敦艺术大学毕业证成绩单原版一比一
定制(UAL学位证)英国伦敦艺术大学毕业证成绩单原版一比一定制(UAL学位证)英国伦敦艺术大学毕业证成绩单原版一比一
定制(UAL学位证)英国伦敦艺术大学毕业证成绩单原版一比一
 
Film cover research (1).pptxsdasdasdasdasdasa
Film cover research (1).pptxsdasdasdasdasdasaFilm cover research (1).pptxsdasdasdasdasdasa
Film cover research (1).pptxsdasdasdasdasdasa
 
定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一
定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一
定制(AUT毕业证书)新西兰奥克兰理工大学毕业证成绩单原版一比一
 
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一
定制(Management毕业证书)新加坡管理大学毕业证成绩单原版一比一
 
Contact Rya Baby for Call Girls New Delhi
Contact Rya Baby for Call Girls New DelhiContact Rya Baby for Call Girls New Delhi
Contact Rya Baby for Call Girls New Delhi
 
Blepharitis inflammation of eyelid symptoms cause everything included along w...
Blepharitis inflammation of eyelid symptoms cause everything included along w...Blepharitis inflammation of eyelid symptoms cause everything included along w...
Blepharitis inflammation of eyelid symptoms cause everything included along w...
 

2015 11 agris-medes

  • 1. AGRIS From a bibliographical database to a linked open data application extending knowledge mining to the world wide web Fabrizio Celli and Johannes Keizer – 04/11/2015
  • 2. fabrizio celli johannes keizerhttp://aims.fao.org Outline  What is AGRIS?  (S)Mash-up!  Mining and indexing the web 2
  • 4. fabrizio celli johannes keizerhttp://aims.fao.org AGRIS  The International System for Agricultural Science and Technology  A collection of more than 8 million multilingual bibliographic resources  A network of more than 150 institutions from 65 countries  A Web portal (http://agris.fao.org/) 4
  • 5. fabrizio celli johannes keizerhttp://aims.fao.org 5
  • 6. fabrizio celli johannes keizerhttp://aims.fao.org AGRIS 2001
  • 7. fabrizio celli johannes keizerhttp://aims.fao.org AGRIS 2001 7
  • 9. fabrizio celli johannes keizerhttp://aims.fao.org AGRIS users • Researchers, professors, graduated students looking for bibliographies • Librarians, cataloguers • Small journal publishers, professional associations, conference organizers • Government officers asking for reports on a specific topic 9
  • 10. fabrizio celli johannes keizerhttp://aims.fao.org Impact 10 It supports both developed and developing countries Accessed from more than 200 countries and territories Google Analytics October 2015
  • 11. fabrizio celli johannes keizerhttp://aims.fao.org Statistics 1 8,142,755 multilingual bibliographic records ~ 400,000 from Latin America ~ 150,000 from Africa ~ 760,000 from Asia + 400,000 links to CASDD (China) 253,286,038 triples
  • 13. fabrizio celli johannes keizerhttp://aims.fao.org LOD infrastructure  Since December 2013 AGRIS moved to the RDF world  Generation of mashup pages • users looking for specific topics can access a publication from the AGRIS database, combined with other related resources extracted from other preselected datasets • external resources are not only bibliographic metadata, but also distribution maps, statistics, germplasm accessions, and so on. 13
  • 14. fabrizio celli johannes keizerhttp://aims.fao.org The RDF-ization process  Translation of the AGRIS AP XML database to RDF • Selection of existing vocabularies • Data cleaning and normalization • Index all records with the AGROVOC thesaurus • Run the conversion and publish RDF data!  Selection of external datasets we want to interlink to AGRIS 14
  • 15. fabrizio celli johannes keizerhttp://aims.fao.org AGRIS RDF 1 bibo:Article bibo:abstract bibo:doi bibo:isbn bibo:presentedAt -> bibo:Conference -> dct:title bibo:uri dct:alternative dct:creator -> foaf:organization -> foaf:name dct:creator -> foaf:Person -> foaf:name dct:dateSubmitted dct:description dct:extent dct:identifier dct:language dct:isPartOf dct:issued dct:publisher -> foaf:Organization -> foaf:name dct:source dct:subject dct:title dct:type dct:rights
  • 16. fabrizio celli johannes keizerhttp://aims.fao.org AGROVOC  The FAO multilingual vocabulary containing around 32 000 concepts in up to 21 languages  Backbone: the magic that allows the interlinking to external datasets  Two ways to implement the interlinking: • Using AGROVOC formal aligments to other thesauri • Querying external WebServices with scientific names 16
  • 21. fabrizio celli johannes keizerhttp://aims.fao.org From AGRIS to DBPedia AGRIS URI AGROV OC URI dcterms:subject DBPedia URI skos:closeMatch skos:exactMatch DBPedia Abstract Wikipedia URL DBPedia Picture foaf:isPrimaryTopicOfdbpedia-owl:abstract foaf:depiction Entry point! AGROVOC is the backbone
  • 22. fabrizio celli johannes keizerhttp://aims.fao.org SPARQL in action! 1. From an AGRIS URI, get the list of the AGROVOC URIs (dcterms:subject) PREFIX dct: <http://purl.org/dc/terms/> SELECT ?agr WHERE { <AGRIS_Uri> dct:subject ?agr . } 2. For each AGROVOC URI 2.1. Get skos:closeMatch and skos:exactMatch (formal alignments to other thesauri) PREFIX skos: <http://www.w3.org/2004/02/skos/core#> SELECT?em ?cm { OPTIONAL { <AGROVOC_Uri> skos:exactMatch ?em } . OPTIONAL { <AGROVOC_Uri> skos:closeMatch ?cm } . }
  • 23. fabrizio celli johannes keizerhttp://aims.fao.org Get DBPedia 2.2. The JAVA code filters DBPedia URIs, to avoid adding a new FILTER in the SPARQL query (it’s heavy…) 2.3. For each DBPedia URI, query the DBPedia SPARQL endpoint to get information to display in an AGRIS widget SELECT ?abs ?img ?wiki WHERE { OPTIONAL {<DBP_Uri> dbpedia-owl:abstract ?abs} . OPTIONAL {<DBP_Uri> foaf:depiction ?img} . OPTIONAL {<DBP_Uri> foaf:isPrimaryTopicOf ?wiki} . FILTER ( (lang(?abs ) ="en") || (!bound(?abs)) ) }
  • 24. fabrizio celli johannes keizerhttp://aims.fao.org Bibliography 2 «Migrating bibliographic datasets to the Semantic Web: The AGRIS case». Stefano Anibaldi, Yves Jaques, Fabrizio Celli, Armando Stellato, Johannes Keizer. Semantic Web journal «OpenAGRIS: using bibliographical data for linking into the agricultural knowledge web». Fabrizio Celli, Stefano Anibaldi, Maria Folch, Yves Jaques, Johannes Keizer. AOS 2011
  • 26. fabrizio celli johannes keizerhttp://aims.fao.org The context  Scientists and researchers publish their results not only in journals or at conferences, but also via web 2.0 tools and other media  Corpora of ongoing research activities, unpublished material, grey literature, quick discussions, and experiments with negative results and ideas  This information is usually unstructured and not exposed using web services 26
  • 27. fabrizio celli johannes keizerhttp://aims.fao.org Goal  Crawl the web (manually preselected websites)  Machine learning algorithms to index discovered web resources using AGROVOC  Select relevant resources using a recommender system  Interlink to AGRIS! 27
  • 28. fabrizio celli johannes keizerhttp://aims.fao.org Crawling and indexing 28 https://github.com/fcproj/agrotagger
  • 29. fabrizio celli johannes keizerhttp://aims.fao.org Recommender system 29 • A JAVA component that computes meaningful intersections between the Crawler Database and the AGRIS database • Offline process, recommendations are stored in a triplestore
  • 30. fabrizio celli johannes keizerhttp://aims.fao.org Interlinking 30 https://github.com/fcproj/recommender
  • 31. fabrizio celli johannes keizerhttp://aims.fao.org 31
  • 32. fabrizio celli johannes keizerhttp://aims.fao.org Bibliography 32 Discovering, Indexing and Interlinking Information Resources Fabrizio Celli, Johannes Keizer, Yves Jaques, Stasinos Konstantopoulos, Dušan Vudragović. F1000 Research Version 2 under revision

Hinweis der Redaktion

  1. Chinese Agricultural Sci-tech Documents Database (CASDD) CGRIS germplasm database The World Bank Nature OpenSearch Europena FAO Geopolitical Ontology Global Biodiversity Information Facility Bioversity International FAO Fisheries and Aquaculture DBPedia