SEMLIB Final Conference | DERI presentation

•

1 gefällt mir•710 views

SemLib Project

SLRD, the SEMLIB Linked Data Recommendation Engine

Technologie

Our participation — and motivation — in the
project involved the research &
development of a recommendation engine
that...
● leveraged the ubiquitousness and richness of
linked data from the Web of Data
● would produce new linked data as a result of
those recommendations. In addition, this would
provide data interlinking

In general, we were concerned with...
●How to perform recommendation computations
with the linked data? Furthermore, how to do this
scalably?
● How to input linked data into such a system?
●How to output linked data from those
recommendations?

For recommendation types, we focused on
implementing the primary types:

Collaborative filtering &
Content-based
With an array of algorithms including — Cosine Similarity, Pearson
Correlation, Jaccard Distance, Co-occurrence, etc.

●An initial direction for the computation of
recommendations ✓
Challenge: adapting these algorithms for linked
●

data ✓

SPARQL for the Input of Linked Data

SELECT ?s ?nationality ?influences WHERE {
?s dbpedia-ontology:occupation dbpedia-
resource:Poet.
?s dbpedia-property:influences ?influences.
?s dbpedia-ontology:nationality ?nationality.
}

●Declarative and expressive method for data materialisation ✓
●SPARQL endpoint communication ✓

Output computed recommendations as linked
RDF data.

⟨http://www.grouplens.org/user/1⟩
semlibproject:hasRecommendation
_:node175.
_:node175 ⟨semlibproject:recommends⟩
⟨http://www.grouplens.org/movie/2858⟩.
_:node175 ⟨semlibproject:hasScore⟩ 240.0.

RDF creation and interlinking ✓
●

Sometimes...

Linked data → Big data

Therefore, we went in the direction of a distributed and
parallel framework — MapReduce

Overview of Results
● SPARQL execution, RDF materialisation and
output → design the system using established
tools and libraries
● The adaptation of the recommendation
algorithms for RDF → formalisations
presented in a paper [ECAI 2012]
● Scalability with the possibly large amount of
data that can be input → a parallel and
distributed framework

Implementation
Our Framework
SPARQL Query Extraction/Communication

Machine Learning/Recommendation Algorithms

As well as other technologies and libraries

Deployment and Use
To get SLDR running, a JSP web server, such
as Tomcat or Jetty is required.
SLDR is deployed as a web application (WAR).
From there, the recommendation engine is
fully accessible from your web browser to start
creating and running jobs.

The Recommendation Job Control Panel

Saved Jobs

Active Status Output

A Recommendation Job

Algorithm Selection
SPARQL Endpoints

Query Configuration

Retrieving Recommendations
Users have the option of viewing computed
recommendations through either SPARQL and
the output triplestore or through a REST API
implemented into the systems backend.
The REST API can be utilised for better
integration into already existing systems (e.g.
HTML, JavaScript, etc.)

Summary

● Ongoing improvement and development
● Have tested sucessfully with some of the
SME's
● More information available at
http://sldr.deri.ie

Empfohlen

Visual Ontology Modeling for Domain Experts and Business Users with metaphactoryPeter Haase

SHACL-based data life cycle managementConnected Data World

Hybrid Enterprise Knowledge GraphsPeter Haase

Quality aware subgraph matching over inconsistent probabilistic graph databasesieeechennai

Micro-Servicing Linked DataopenCypher

RDF Data Quality Assessment - connecting the piecesConnected Data World

Semantika IntroductionJosef Hardi

Connected data meetup group - introduction & scopeConnected Data World

Empfohlen

Visual Ontology Modeling for Domain Experts and Business Users with metaphactoryPeter Haase

SHACL-based data life cycle managementConnected Data World

Hybrid Enterprise Knowledge GraphsPeter Haase

Quality aware subgraph matching over inconsistent probabilistic graph databasesieeechennai

Micro-Servicing Linked DataopenCypher

RDF Data Quality Assessment - connecting the piecesConnected Data World

Semantika IntroductionJosef Hardi

Connected data meetup group - introduction & scopeConnected Data World

When We Spark and When We Don’t: Developing Data and ML PipelinesStitch Fix Algorithms

Toronto OpenRefine MeetUp Nov 2015Martin Magdinier

schema.org, Linked Data's Gateway DrugConnected Data World

Stephen Buxton | Data Integration - a Multi-Model Approach - Documents and Tr...semanticsconference

Tracking data lineage at Stitch FixStitch Fix Algorithms

iLastic: Linked Data Generation Workflow and User Interface for iMinds Schola...andimou

GraphDB Connectors – Powering Complex SPARQL QueriesMarin Dimitrov

Let your data shine... with OpenRefineOpen Knowledge Belgium

Amundsen at Brex and Looker integrationmarkgrover

It’s All About The Cards: Sharing on Social Media Encouraged HTML Metadata G...Shawn Jones

Smarter content with a Dynamic Semantic Publishing PlatformOntotext

Robert Isele | eccenca CorporateMemory - Semantically integrated Enterprise D...semanticsconference

2017-01-08-scaling tribalknowledgeChristopher Williams

High quality Linked Data generation for librariansandimou

GraphDB Cloud: Enterprise Ready RDF Database on DemandOntotext

Joseph Bradley, Software Engineer, Databricks Inc. at MLconf SEA - 5/01/15MLconf

20181123 dn2018 graph_analytics_k_patengeKarin Patenge

Iterative data discovery and transformation with open refineMartin Magdinier

Enabling Low-cost Open Data Publishing and ReuseMarin Dimitrov

On-Demand RDF Graph Databases in the CloudMarin Dimitrov

Pundit - SemLib Annotation ToolSemLib Project

Pundit: Semantically Structured Annotations for Web Contents and Digital Libr...SemLib Project

Weitere ähnliche Inhalte

Was ist angesagt?

When We Spark and When We Don’t: Developing Data and ML PipelinesStitch Fix Algorithms

Toronto OpenRefine MeetUp Nov 2015Martin Magdinier

schema.org, Linked Data's Gateway DrugConnected Data World

Stephen Buxton | Data Integration - a Multi-Model Approach - Documents and Tr...semanticsconference

Tracking data lineage at Stitch FixStitch Fix Algorithms

iLastic: Linked Data Generation Workflow and User Interface for iMinds Schola...andimou

GraphDB Connectors – Powering Complex SPARQL QueriesMarin Dimitrov

Let your data shine... with OpenRefineOpen Knowledge Belgium

Amundsen at Brex and Looker integrationmarkgrover

It’s All About The Cards: Sharing on Social Media Encouraged HTML Metadata G...Shawn Jones

Smarter content with a Dynamic Semantic Publishing PlatformOntotext

Robert Isele | eccenca CorporateMemory - Semantically integrated Enterprise D...semanticsconference

2017-01-08-scaling tribalknowledgeChristopher Williams

High quality Linked Data generation for librariansandimou

GraphDB Cloud: Enterprise Ready RDF Database on DemandOntotext

Joseph Bradley, Software Engineer, Databricks Inc. at MLconf SEA - 5/01/15MLconf

20181123 dn2018 graph_analytics_k_patengeKarin Patenge

Iterative data discovery and transformation with open refineMartin Magdinier

Enabling Low-cost Open Data Publishing and ReuseMarin Dimitrov

On-Demand RDF Graph Databases in the CloudMarin Dimitrov

Was ist angesagt? (20)

When We Spark and When We Don’t: Developing Data and ML Pipelines

Toronto OpenRefine MeetUp Nov 2015

schema.org, Linked Data's Gateway Drug

Stephen Buxton | Data Integration - a Multi-Model Approach - Documents and Tr...

Tracking data lineage at Stitch Fix

iLastic: Linked Data Generation Workflow and User Interface for iMinds Schola...

GraphDB Connectors – Powering Complex SPARQL Queries

Let your data shine... with OpenRefine

Amundsen at Brex and Looker integration

It’s All About The Cards: Sharing on Social Media Encouraged HTML Metadata G...

Smarter content with a Dynamic Semantic Publishing Platform

Robert Isele | eccenca CorporateMemory - Semantically integrated Enterprise D...

2017-01-08-scaling tribalknowledge

High quality Linked Data generation for librarians

GraphDB Cloud: Enterprise Ready RDF Database on Demand

Joseph Bradley, Software Engineer, Databricks Inc. at MLconf SEA - 5/01/15

20181123 dn2018 graph_analytics_k_patenge

Iterative data discovery and transformation with open refine

Enabling Low-cost Open Data Publishing and Reuse

On-Demand RDF Graph Databases in the Cloud

Andere mochten auch

Pundit - SemLib Annotation ToolSemLib Project

Pundit: Semantically Structured Annotations for Web Contents and Digital Libr...SemLib Project

SEMLIB Final Conference | IN2 presentationSemLib Project

SEMLIB Final Conference | Net7 presentationSemLib Project

SEMLIB Final Conference | Liberologico presentationSemLib Project

SEMLIB Final Conference | UNIVPM presentationSemLib Project

Almacenamiento e información en las culturas prehispanicasjoseelguapote

LOS INVENTOS DE LOS MAYASSanti Gil

Andere mochten auch (8)

Pundit - SemLib Annotation Tool

Pundit: Semantically Structured Annotations for Web Contents and Digital Libr...

SEMLIB Final Conference | IN2 presentation

SEMLIB Final Conference | Net7 presentation

SEMLIB Final Conference | Liberologico presentation

SEMLIB Final Conference | UNIVPM presentation

Almacenamiento e información en las culturas prehispanicas

LOS INVENTOS DE LOS MAYAS

Ähnlich wie SEMLIB Final Conference | DERI presentation

Continuous delivery for machine learningRajesh Muppalla

Deploying Data Science Engines to ProductionMostafa Majidpour

AnzoGraph DB: Driving AI and Machine Insights with Knowledge Graphs in a Conn...Cambridge Semantics

The Best of Both Worlds: Unlocking the Power of (big) Knowledge Graphs with S...Gezim Sejdiu

LD4KD 2015 - Demos and toolsVrije Universiteit Amsterdam

Data processing with spark in r & pythonMaloy Manna, PMP®

Scaling Analytics with Apache SparkQuantUniversity

RDF-Gen: Generating RDF from streaming and archival dataGiorgos Santipantakis

A BASILar Approach for Building Web APIs on top of SPARQL EndpointsEnrico Daga

[DSC Europe 23] Djordje Grozdic - Transforming Business Process Automation wi...DataScienceConferenc1

UnifiedViews: Towards ETL Tool for Simple yet Powerful RDF Data Management.tomasknap

Spark WorkshopNavid Kalaei

Deploying Python Machine Learning Models with Apache Spark with Brandon Hamri...Databricks

MongoDB for Spatio-Behavioral Data Analysis and VisualizationMongoDB

Machine Learning by Example - Apache SparkMeeraj Kunnumpurath

Wed roman tut_open_datapubeswcsummerschool

Open core summit: Observability for data pipelines with OpenLineageJulien Le Dem

Apache Spark 101 - Demi Ben-AriDemi Ben-Ari

Discovering Emerging Tech through Graph Analysis - Henry Hwangbo @ GraphConne...Neo4j

Data Science Salon: A Journey of Deploying a Data Science Engine to ProductionFormulatedby

Ähnlich wie SEMLIB Final Conference | DERI presentation (20)

Continuous delivery for machine learning

Deploying Data Science Engines to Production

AnzoGraph DB: Driving AI and Machine Insights with Knowledge Graphs in a Conn...

The Best of Both Worlds: Unlocking the Power of (big) Knowledge Graphs with S...

LD4KD 2015 - Demos and tools

Data processing with spark in r & python

Scaling Analytics with Apache Spark

RDF-Gen: Generating RDF from streaming and archival data

A BASILar Approach for Building Web APIs on top of SPARQL Endpoints

[DSC Europe 23] Djordje Grozdic - Transforming Business Process Automation wi...

UnifiedViews: Towards ETL Tool for Simple yet Powerful RDF Data Management.

Spark Workshop

Deploying Python Machine Learning Models with Apache Spark with Brandon Hamri...

MongoDB for Spatio-Behavioral Data Analysis and Visualization

Machine Learning by Example - Apache Spark

Wed roman tut_open_datapub

Open core summit: Observability for data pipelines with OpenLineage

Apache Spark 101 - Demi Ben-Ari

Discovering Emerging Tech through Graph Analysis - Henry Hwangbo @ GraphConne...

Data Science Salon: A Journey of Deploying a Data Science Engine to Production

Kürzlich hochgeladen

Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik

08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls

2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong

Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko

[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745

Partners Life - Insurer Innovation Award 2024The Digital Insurer

08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo

Histor y of HAM Radio presentation slidevu2urc

Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia

How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes

Developing An App To Navigate The Roads of BrazilV3cube

Salesforce Community Group Quito, Salesforce 101Paola De la Torre

TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc

The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge

The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad

Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun

Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge

Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2

🐬 The future of MySQL is Postgres 🐘RTylerCroy

Kürzlich hochgeladen (20)

Injustice - Developers Among Us (SciFiDevCon 2024)

08448380779 Call Girls In Civil Lines Women Seeking Men

2024: Domino Containers - The Next Step. News from the Domino Container commu...

Handwritten Text Recognition for manuscripts and early printed texts

[2024]Digital Global Overview Report 2024 Meltwater.pdf

Partners Life - Insurer Innovation Award 2024

08448380779 Call Girls In Friends Colony Women Seeking Men

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...

Histor y of HAM Radio presentation slide

Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...

How to Troubleshoot Apps for the Modern Connected Worker

Developing An App To Navigate The Roads of Brazil

Salesforce Community Group Quito, Salesforce 101

TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments

The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf

The Codex of Business Writing Software for Real-World Solutions 2.pptx

Data Cloud, More than a CDP by Matt Robison

Driving Behavioral Change for Information Management through Data-Driven Gree...

Exploring the Future Potential of AI-Enabled Smartphone Processors

🐬 The future of MySQL is Postgres 🐘

SEMLIB Final Conference | DERI presentation

1. The SemLib Linked Data Recommendation Engine

2. Our participation — and motivation — in the project involved the research & development of a recommendation engine that... ● leveraged the ubiquitousness and richness of linked data from the Web of Data ● would produce new linked data as a result of those recommendations. In addition, this would provide data interlinking

3. In general, we were concerned with... ●How to perform recommendation computations with the linked data? Furthermore, how to do this scalably? ● How to input linked data into such a system? ●How to output linked data from those recommendations?

4. For recommendation types, we focused on implementing the primary types: Collaborative filtering & Content-based With an array of algorithms including — Cosine Similarity, Pearson Correlation, Jaccard Distance, Co-occurrence, etc. ●An initial direction for the computation of recommendations ✓ Challenge: adapting these algorithms for linked ● data ✓

5. SPARQL for the Input of Linked Data SELECT ?s ?nationality ?influences WHERE { ?s dbpedia-ontology:occupation dbpedia- resource:Poet. ?s dbpedia-property:influences ?influences. ?s dbpedia-ontology:nationality ?nationality. } ●Declarative and expressive method for data materialisation ✓ ●SPARQL endpoint communication ✓

6. Output computed recommendations as linked RDF data. ⟨http://www.grouplens.org/user/1⟩ semlibproject:hasRecommendation _:node175. _:node175 ⟨semlibproject:recommends⟩ ⟨http://www.grouplens.org/movie/2858⟩. _:node175 ⟨semlibproject:hasScore⟩ 240.0. RDF creation and interlinking ✓ ●

7. Sometimes... Linked data → Big data Therefore, we went in the direction of a distributed and parallel framework — MapReduce

8. Overview of Results ● SPARQL execution, RDF materialisation and output → design the system using established tools and libraries ● The adaptation of the recommendation algorithms for RDF → formalisations presented in a paper [ECAI 2012] ● Scalability with the possibly large amount of data that can be input → a parallel and distributed framework

9. Implementation Our Framework SPARQL Query Extraction/Communication Machine Learning/Recommendation Algorithms As well as other technologies and libraries

10. Deployment and Use To get SLDR running, a JSP web server, such as Tomcat or Jetty is required. SLDR is deployed as a web application (WAR). From there, the recommendation engine is fully accessible from your web browser to start creating and running jobs.

11. The Recommendation Job Control Panel Saved Jobs Active Status Output

12. A Recommendation Job Algorithm Selection SPARQL Endpoints Query Configuration

13. The Backend System Workflow

14. Retrieving Recommendations Users have the option of viewing computed recommendations through either SPARQL and the output triplestore or through a REST API implemented into the systems backend. The REST API can be utilised for better integration into already existing systems (e.g. HTML, JavaScript, etc.)

15. Summary ● Ongoing improvement and development ● Have tested sucessfully with some of the SME's ● More information available at http://sldr.deri.ie