SlideShare ist ein Scribd-Unternehmen logo
1 von 80
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery hub - a discovery engine on the top of DBpedia
Nicolas Marie, Fabien Gandon, Damien Legrand, Myriam Ribière
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
2
CONTEXT
RESEARCH QUESTION
RESEARCH
- Proposition
- Implementation
- Operational prototypes
- Users evaluations
PUBLICATION
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
3
CONTEXT
RESEARCH QUESTION
RESEARCH
- Proposition
- Implementation
- Operational prototypes
- Users evaluations
PUBLICATION
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
Search is only a partially
solved problem
[ White, 2006]
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
Gary Marchioninni, 2006
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
Exploration/discovery todayLookup today
« Claude Monet » + impressionism« Claude Monet » + birthday
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
Search is only a partially
solved problem, White 2006
The degree of structure of the web
content is the determining factor
for the type of functionality that
search engines can provide,
Bizer and al., 2012
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
SEMANTIC WEB
• “The Semantic Web is a mesh of information linked up in such a way as to
be easily processable by machines, on a global scale. You can think of it
as being an efficient way of representing data on the World Wide Web, or
as a globally linked database.” Marianna Sigala, Luisa Mich, Jamie Murphy
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
Tim Berners-Lee, WWW1994
[Stankovic, 2012]
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
3 000+ words, 13p. 191 triples
http://en.wikipedia.org/wiki/Claude_Monet
http://dbpedia.org/resource/Claude_Monet
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
•Accessible through:
- Browsers
- Dumps
- SPARQL endpoint
Select * where {
<http://dbpedia.org/resource/Claude_Monet>
<http://dbpedia.org/property/influencedBy>
?x
}
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
DBpedia
Google
Knowledge
Graph
Linked Open
Data cloud3.77 millions things
270 millions facts
500 millions things
3.5 billions facts
31+ billions facts
Close OpenOpen
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
Google knowledge graph, 2012
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
Since 1995 Since 2007
2001 2007
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
• Linked-data based exploratory
search systems
User interest, start point
Interactive result space
Results choice
Ranking
Sorting/categorization
Explanation
dbpedia:
Claude Monet
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub State of the art: Seevl, 2010
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub State of the art: Yovisto, 2010
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub State of the art: LED, 2010
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub State of the art: MORE, 2012
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub State of the art: Aemoo, 2011
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub State of the art: Kaminskas et al., 2011
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub State of the art: Google Knowledge Panel, 2012
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
25
CONTEXT
RESEARCH QUESTION
RESEARCH
- Proposition
- Implementation
- Operational prototypes
- Users evaluations
PUBLICATION
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
•Challenge: enable on-the-fly linked data processing
for exploratory search
•3 major benefits:
- Results freshness
- Composite exploration enablement
- Fine-grained querying capabilities
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub Freshness
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
• 3.77 millions resources
• 2 resources possible combinations: 14.212.900.000.000
• 3 resources possible combinations: 53.582.633.000.000.000.000
&
Composite interest exploration: knowing my interest
for X and Y what can I discover/learn which is related
to all these resources?
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub Fine-grained querying capabilities
• Artists_from_Paris
• French_painters --
• Impressionist_painters ++
painted ++
influenced by ++
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub Fine-grained querying
• 1960s_science_fiction_films
• American_epic_films
• Films_set_on_the_Moon
• Artificial_intelligence_in_fiction
• Space_adventure_films, …
directed by
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
31
CONTEXT
RESEARCH QUESTION
RESEARCH
- Proposition
- Implementation
- Operational prototypes
- Users evaluations
PUBLICATION
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
Refer to publications for
the complete algorithm
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
Spreading activation basis –
monocentric
Claude_Monet
…
…
…
…
…
…
Iteration 0 Iteration 1 Iteration 2
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
Claude_Monet
Musée d’Orsay
Musée de l’Orangerie
…
……
Vincent Van Gogh
Spreading activation basis –
polycentric
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
Wheeler_School
Art Institute of Chicago Gustave_Courbet
Cadmium_sulfide
Farmington_Mountain
DBO:Museum
DBO:ChemicalSubstance
DBO:Mountain
DBO:Artist
cat:Impressionist_p…
cat:Alumni_of_beaux…
2
0 0 0
3 +2
Propagation domain: artist, book,
film, museum, river, television show,
university, writer,…
cat:Impressionist painters
cat:Alumni_of_Beaux_Arts
DBO:School
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
• How to be fast ?
How to execute it fast
On very a large graph
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub Very large graph
Locate the processing
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
1.sparql endpoint = http://xxx/sparql
2.seed(s) = xxx_Beatles
3. compute the propagation domain (w(i,o))
(4. find a path between the seeds)
5. import path nodes & their neighbors
6. for(i=1; i<=maxPulse; i++){
7. pulse
8. if(sampleSize <= maxSampleSize){
9. extend the sample
10. }
11.}
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
select distinct ?x ?y where {
service <sparqlEndpoint>
{
select * where {
?a(<…wikiPageWikiLink>|
^<…wikiPageWikiLink>){0,X} :: $path ?b
filter (?a=<resource1> &&?b=<resource2>)
}
}
graph $path {?x ?p ?y}
filter(?x!=<resource1> && ?x!=<resource2>)
}
Path query using Kgram for
polycentric SA
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
41
CONTEXT
RESEARCH QUESTION
RESEARCH
- Proposition
- Implementation
- Operational prototypes
- Users evaluations
PUBLICATION
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
• Analysis method
Analysis performed on a set of 100.000 queries
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
0
500
1000
1500
2000
2500
3000
3500
4000
4500
0 5000 10000 15000 20000
KT
Ms
Triples loading limit
Similarity of top 100 results (Kendall-Tau)
from one loading limit to another
maxSampleSize
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Similarity of top 100 results (shared results, KT)
from one iteration to another
maxPulse
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
0
10
20
30
40
50
60
70
80
90
100
1 2 3 4 5 6 7 8 9 10
Kendall-Tau
Sharedresults
Iterations
KT
shared results
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
0
2000
4000
6000
8000
10000
12000
14000
16000
18000
20000
Milliseconds
Queries response time histogram
Response time histogram
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
Algorithm visualization, available @
http://www.youtube.com/user/wearediscoveryhub
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
Polycentric query propagation
visualization, iteration 0
• In red: Claude Monet
• In blue : Musée d’Orsay
• In purple: Recovery
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
Polycentric query propagation
visualization, iteration 6
• In red: Claude Monet
• In blue : Musée d’Orsay
• In purple: Recovery
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Semantic spreading activation
Distance A Distance B Gap A - B Max
distance A
Max
distance B
poly1 - top 10 1.53 1.68 0.34 / /
poly2 - top 10 1.52 1.66 0.33 / /
poly1 - top 100 1.90 2.12 0.49 2.60 2.60
poly2 - top 100 1.88 2.11 0.48 2.58 2.58
Polycentric
Polycentric queries, average distances
of top results from each seed.
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
• We studied the convergence of the algorithm according to various graph
metrics. For this purpose we generated many graphs thanks to the
Graphstream graph library, conclusion : the diameter is crucial.
Discovery Hub Influence of graph diameter
on algorithm convergence
http://graphstream-project.org/doc/Generators/
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
1 2 3 4 5 6 7 8 9 1011121314151617181920
Resultssimilarity
Iterations
diamètre 4.14
diamètre 6.72
diamètre 9.94
diamètre 13.28
diamètre 15.43
diamètre 19.59
diamètre 22.03
diamètre 24.87
diamètre 28.85
Influence of graph diameter
on algorithm convergence
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
0
10
20
30
40
50
60
70
80
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
Averagerankvariation
Iterations
diamètre 4.14
diamètre 6.72
diamètre 9.94
diamètre 13.28
diamètre 15.43
diamètre 19.59
diamètre 22.03
diamètre 24.87
diamètre 28.85
Influence of graph diameter
on algorithm convergence
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
• Discovery hub is an exploratory search engine which helps you to
discover things you might like or be interested in. It widens your
cultural and knowledge horizons by revealing and explaining
unattended information.
• It allows performing queries in an innovative way and helps you to
navigate rich results. As a hub, it proposes redirections to others
platforms to make you benefit from your discoveries (Youtube,
Deezer and more). The results are explained in depth thanks to 3
explanatory features.
• Discovery Hub supports simple and composite explorations i.e.
starting from one or several items of interest. It proposes and is able
to combine advanced exploration modes such as serendipitous,
multi-lingual, and fine-grained ones.
Discovery Hub powered
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
54
CONTEXT
RESEARCH QUESTION
RESEARCH
- Proposition
- Implementation
- Operational prototypes
- Users evaluations
PUBLICATION
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
1. Start from what you
like or are interested in
2.
Explore, discover, under
stand
3. Be redirected on great
platforms to experience
your discoveries
powered
Book
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub V1
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub V2
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
• 3 features to understand the results: common properties
Discovery Hub
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
• 3 features to understand the results: Wikipedia crossed references
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
• 3 features to understand the results: explanatory graph
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub Internationalization
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub Serendipitous mode
?
?
?
?
Claude_Monet
…
?
…
?
?
…
?
…
?
…
?
…
Iteration 0 Iteration 1 Iteration 2
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub Multi-lingual mode
dbpedia:Claude_Monet sameAs fr.dbpedia:Claude_Monet
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub Fine search mode
1960s_science_fiction_films
Films_set_on_the_Moon
Artificial_intelligence_in_fiction
Space_adventure_films
Top 4 films
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
Directed by Stanley Kubrick
Top 4 films
Fine search mode
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
1960s_science_fiction_films
Films_set_on_the_Moon
Artificial_intelligence_in_fiction
Space_adventure_films
Directed by Stanley Kubrick
Top 4 films
Fine search mode
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub Multi-criterias mode
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
« Surprise mode »
Multi-lingual
Fine-search
Multi-criterias mode
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
Demo videos, available @
http://www.youtube.com/user/wearediscoveryhub
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
70
CONTEXT
RESEARCH QUESTION
RESEARCH
- Proposition
- Implementation
- Operational prototypes
- Users evaluations
PUBLICATION
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
• Mono-centric queries: evaluated positively on movie domain against
another algorithm: the sSVM implemented in MORE movie recommender
Very
interesting
Not
interesting
at all
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Scores for partial lists
Discovery Hub
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub - Films_about_criticism_and_refusal_of_work
- Anti-modernist_films
- Fiction_with_unreliable_narrators
0
0.2
0.4
0.6
0.8
1
1.2
1.4
1.6
1.8
2
Neutral Personalized
Interesting
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub • Poly-centric queries: evaluated positively
0
0.5
1
1.5
2
2.5
3
0
0.5
1
1.5
2
2.5
3
Relevance
Discovery
Very
interesting
Not interesting
at all
Very surprizing
Not suprizing
at all
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
•Recovery between relevance and unexpectedness:
- 61.6% of results were rated as strongly relevant or relevant
by the participants.
- 65% of results were rated as strongly unexpected or
unexpected.
- 35.42% of results were rated both as strongly relevant or
relevant and strongly unexpected or unexpected.
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub •Explanation features
0
0.5
1
1.5
2
2.5
3
In
Common
Wikipedia Graph Overall
Monocentric Polycentric
Common
prop.
Wiki-based Graph-based Overall
Very
Helpful
Not helpful
At all
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Publications:
• Nicolas Marie, Fabien Gandon, Myriam Ribière, Florentin Rodio, Discovery Hub:
on-the-fly linked data exploratory search. I-Semantics 2013, Graz, 4 – 6
september (paper).
• Nicolas Marie, Fabien Gandon, Damien Legrand, Myriam Ribière, Exploratory
Search on the top of DBpedia chapters with the Discovery Hub Application.
ESWC2013, Montpellier, 26 – 30 may (demo+poster).
• Nicolas Marie, Olivier Corby, Fabien Gandon, Myriam Ribière, Composite interests
exploration thanks to on-the-fly linked data spreading activation, Hypertext 2013,
1-3 may, Paris (paper).
• Clare J. Hooper, Nicolas Marie, Evangelos Kalampokis, Dissecting the Butterfly:
Representation of Disciplines Publishing at the Web Science Conference Series,
Web Science 2012, Northeastern university, Evanston, United States, 22-24 june
(paper).
• Nicolas Marie, Fabien Gandon. Advanced social objects recommendation in
multidimensional social networks. Social Object Workshop 2011, MIT, Boston, USA
(paper).
Discovery Hub
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
78
CONTEXT
RESEARCH QUESTION
RESEARCH
- Proposition
- Implementation
- Operational prototypes
- Users evaluation
PUBLICATION
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED.
Discovery Hub
ncmarie3&@gmail.com
http://ncmarie.tumblr.com
http://discoveryhub.co
werarediscoveryhub@gmail.com
Thank you !

Weitere ähnliche Inhalte

Ähnlich wie DBpedia Discovery Hub Search Engine

Using Open Wonderland Preview 5 for Education
Using Open Wonderland Preview 5 for EducationUsing Open Wonderland Preview 5 for Education
Using Open Wonderland Preview 5 for EducationNicole Yankelovich
 
Frt 110523.3 - Vision Meeting - Intrapeneurship - Alcatel-Lucent - presentatie
Frt   110523.3 - Vision Meeting - Intrapeneurship - Alcatel-Lucent - presentatieFrt   110523.3 - Vision Meeting - Intrapeneurship - Alcatel-Lucent - presentatie
Frt 110523.3 - Vision Meeting - Intrapeneurship - Alcatel-Lucent - presentatieFlevum
 
Managing Massive data of the IoT through cooperative semantic nodes
Managing Massive data of the IoT through cooperative semantic nodesManaging Massive data of the IoT through cooperative semantic nodes
Managing Massive data of the IoT through cooperative semantic nodesBenoit Christophe
 
Introduction to Rapid, Real-Time High Resolution Site Characterization
Introduction to Rapid, Real-Time High Resolution Site CharacterizationIntroduction to Rapid, Real-Time High Resolution Site Characterization
Introduction to Rapid, Real-Time High Resolution Site CharacterizationJohn Sohl
 
Time Series Data Storage in MongoDB
Time Series Data Storage in MongoDBTime Series Data Storage in MongoDB
Time Series Data Storage in MongoDBsky_jackson
 
Towards Open Pervasive Displays (Keynote at UbiSummit, Helsinki, May 2011)
Towards Open Pervasive Displays (Keynote at UbiSummit, Helsinki, May 2011)Towards Open Pervasive Displays (Keynote at UbiSummit, Helsinki, May 2011)
Towards Open Pervasive Displays (Keynote at UbiSummit, Helsinki, May 2011)Adrian Friday
 
Towards Open Pervasive Displays (Keynote at Tekes UbiSummit, May 2011)
Towards Open Pervasive Displays (Keynote at Tekes UbiSummit, May 2011)Towards Open Pervasive Displays (Keynote at Tekes UbiSummit, May 2011)
Towards Open Pervasive Displays (Keynote at Tekes UbiSummit, May 2011)Adrian Friday
 
Keynote de Mike Milinkovich
Keynote de Mike MilinkovichKeynote de Mike Milinkovich
Keynote de Mike MilinkovichEclipseDayParis
 
Open Cultuur Data: initiatieven, producten en lessons learned
Open Cultuur Data: initiatieven, producten en lessons learnedOpen Cultuur Data: initiatieven, producten en lessons learned
Open Cultuur Data: initiatieven, producten en lessons learneddatable_be
 
User Generated Art: the Artist as Manipulator
User Generated Art: the Artist as ManipulatorUser Generated Art: the Artist as Manipulator
User Generated Art: the Artist as ManipulatorC4Media
 
Fun and education with the PolarSys Rover and PolarSys Solutions
Fun and education with the PolarSys Rover and PolarSys SolutionsFun and education with the PolarSys Rover and PolarSys Solutions
Fun and education with the PolarSys Rover and PolarSys Solutions Gaël Blondelle
 
Roland harwood UK-IRC Summit 2011 Presentation
Roland harwood   UK-IRC Summit 2011 PresentationRoland harwood   UK-IRC Summit 2011 Presentation
Roland harwood UK-IRC Summit 2011 Presentation100%Open
 
The OpenMDM Working Group @ Eclipse: Open collaboration as a service, Philipp...
The OpenMDM Working Group @ Eclipse: Open collaboration as a service, Philipp...The OpenMDM Working Group @ Eclipse: Open collaboration as a service, Philipp...
The OpenMDM Working Group @ Eclipse: Open collaboration as a service, Philipp...OW2
 
From Scarcity to Abundance of learning resources: Ariadne and the Snowflake E...
From Scarcity to Abundance of learning resources: Ariadne and the Snowflake E...From Scarcity to Abundance of learning resources: Ariadne and the Snowflake E...
From Scarcity to Abundance of learning resources: Ariadne and the Snowflake E...Erik Duval
 
Research and Development at Sound and Vision
Research and Development at Sound and Vision Research and Development at Sound and Vision
Research and Development at Sound and Vision Victor de Boer
 
B5 present ppopp - mcv - public
B5   present ppopp - mcv - publicB5   present ppopp - mcv - public
B5 present ppopp - mcv - publicluigi spiga
 

Ähnlich wie DBpedia Discovery Hub Search Engine (20)

Using Open Wonderland Preview 5 for Education
Using Open Wonderland Preview 5 for EducationUsing Open Wonderland Preview 5 for Education
Using Open Wonderland Preview 5 for Education
 
Frt 110523.3 - Vision Meeting - Intrapeneurship - Alcatel-Lucent - presentatie
Frt   110523.3 - Vision Meeting - Intrapeneurship - Alcatel-Lucent - presentatieFrt   110523.3 - Vision Meeting - Intrapeneurship - Alcatel-Lucent - presentatie
Frt 110523.3 - Vision Meeting - Intrapeneurship - Alcatel-Lucent - presentatie
 
Managing Massive data of the IoT through cooperative semantic nodes
Managing Massive data of the IoT through cooperative semantic nodesManaging Massive data of the IoT through cooperative semantic nodes
Managing Massive data of the IoT through cooperative semantic nodes
 
Open for learning workshop score in july 2011
Open for learning workshop score in july 2011Open for learning workshop score in july 2011
Open for learning workshop score in july 2011
 
Introduction to Rapid, Real-Time High Resolution Site Characterization
Introduction to Rapid, Real-Time High Resolution Site CharacterizationIntroduction to Rapid, Real-Time High Resolution Site Characterization
Introduction to Rapid, Real-Time High Resolution Site Characterization
 
Time Series Data Storage in MongoDB
Time Series Data Storage in MongoDBTime Series Data Storage in MongoDB
Time Series Data Storage in MongoDB
 
Towards Open Pervasive Displays (Keynote at UbiSummit, Helsinki, May 2011)
Towards Open Pervasive Displays (Keynote at UbiSummit, Helsinki, May 2011)Towards Open Pervasive Displays (Keynote at UbiSummit, Helsinki, May 2011)
Towards Open Pervasive Displays (Keynote at UbiSummit, Helsinki, May 2011)
 
Towards Open Pervasive Displays (Keynote at Tekes UbiSummit, May 2011)
Towards Open Pervasive Displays (Keynote at Tekes UbiSummit, May 2011)Towards Open Pervasive Displays (Keynote at Tekes UbiSummit, May 2011)
Towards Open Pervasive Displays (Keynote at Tekes UbiSummit, May 2011)
 
Keynote de Mike Milinkovich
Keynote de Mike MilinkovichKeynote de Mike Milinkovich
Keynote de Mike Milinkovich
 
Open Cultuur Data: initiatieven, producten en lessons learned
Open Cultuur Data: initiatieven, producten en lessons learnedOpen Cultuur Data: initiatieven, producten en lessons learned
Open Cultuur Data: initiatieven, producten en lessons learned
 
User Generated Art: the Artist as Manipulator
User Generated Art: the Artist as ManipulatorUser Generated Art: the Artist as Manipulator
User Generated Art: the Artist as Manipulator
 
Fun and education with the PolarSys Rover and PolarSys Solutions
Fun and education with the PolarSys Rover and PolarSys SolutionsFun and education with the PolarSys Rover and PolarSys Solutions
Fun and education with the PolarSys Rover and PolarSys Solutions
 
Roland harwood UK-IRC Summit 2011 Presentation
Roland harwood   UK-IRC Summit 2011 PresentationRoland harwood   UK-IRC Summit 2011 Presentation
Roland harwood UK-IRC Summit 2011 Presentation
 
The OpenMDM Working Group @ Eclipse: Open collaboration as a service, Philipp...
The OpenMDM Working Group @ Eclipse: Open collaboration as a service, Philipp...The OpenMDM Working Group @ Eclipse: Open collaboration as a service, Philipp...
The OpenMDM Working Group @ Eclipse: Open collaboration as a service, Philipp...
 
From Scarcity to Abundance of learning resources: Ariadne and the Snowflake E...
From Scarcity to Abundance of learning resources: Ariadne and the Snowflake E...From Scarcity to Abundance of learning resources: Ariadne and the Snowflake E...
From Scarcity to Abundance of learning resources: Ariadne and the Snowflake E...
 
The aDORe Federation Architecture
The aDORe Federation ArchitectureThe aDORe Federation Architecture
The aDORe Federation Architecture
 
Vu quangcuong resume
Vu quangcuong resumeVu quangcuong resume
Vu quangcuong resume
 
Research and Development at Sound and Vision
Research and Development at Sound and Vision Research and Development at Sound and Vision
Research and Development at Sound and Vision
 
B5 present ppopp - mcv - public
B5   present ppopp - mcv - publicB5   present ppopp - mcv - public
B5 present ppopp - mcv - public
 
Denver - ARTstor Digital Library
Denver - ARTstor Digital LibraryDenver - ARTstor Digital Library
Denver - ARTstor Digital Library
 

Kürzlich hochgeladen

The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfSeasiaInfotech2
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 

Kürzlich hochgeladen (20)

The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdf
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 

DBpedia Discovery Hub Search Engine

  • 1. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery hub - a discovery engine on the top of DBpedia Nicolas Marie, Fabien Gandon, Damien Legrand, Myriam Ribière
  • 2. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. 2 CONTEXT RESEARCH QUESTION RESEARCH - Proposition - Implementation - Operational prototypes - Users evaluations PUBLICATION
  • 3. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. 3 CONTEXT RESEARCH QUESTION RESEARCH - Proposition - Implementation - Operational prototypes - Users evaluations PUBLICATION
  • 4. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub Search is only a partially solved problem [ White, 2006]
  • 5. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub Gary Marchioninni, 2006
  • 6. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub Exploration/discovery todayLookup today « Claude Monet » + impressionism« Claude Monet » + birthday
  • 7. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub
  • 8. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub
  • 9. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub Search is only a partially solved problem, White 2006 The degree of structure of the web content is the determining factor for the type of functionality that search engines can provide, Bizer and al., 2012
  • 10. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub SEMANTIC WEB • “The Semantic Web is a mesh of information linked up in such a way as to be easily processable by machines, on a global scale. You can think of it as being an efficient way of representing data on the World Wide Web, or as a globally linked database.” Marianna Sigala, Luisa Mich, Jamie Murphy
  • 11. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub Tim Berners-Lee, WWW1994 [Stankovic, 2012]
  • 12. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub 3 000+ words, 13p. 191 triples http://en.wikipedia.org/wiki/Claude_Monet http://dbpedia.org/resource/Claude_Monet
  • 13. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub •Accessible through: - Browsers - Dumps - SPARQL endpoint Select * where { <http://dbpedia.org/resource/Claude_Monet> <http://dbpedia.org/property/influencedBy> ?x }
  • 14. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub DBpedia Google Knowledge Graph Linked Open Data cloud3.77 millions things 270 millions facts 500 millions things 3.5 billions facts 31+ billions facts Close OpenOpen
  • 15. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub Google knowledge graph, 2012
  • 16. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub Since 1995 Since 2007 2001 2007
  • 17. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub • Linked-data based exploratory search systems User interest, start point Interactive result space Results choice Ranking Sorting/categorization Explanation dbpedia: Claude Monet
  • 18. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub State of the art: Seevl, 2010
  • 19. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub State of the art: Yovisto, 2010
  • 20. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub State of the art: LED, 2010
  • 21. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub State of the art: MORE, 2012
  • 22. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub State of the art: Aemoo, 2011
  • 23. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub State of the art: Kaminskas et al., 2011
  • 24. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub State of the art: Google Knowledge Panel, 2012
  • 25. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. 25 CONTEXT RESEARCH QUESTION RESEARCH - Proposition - Implementation - Operational prototypes - Users evaluations PUBLICATION
  • 26. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub •Challenge: enable on-the-fly linked data processing for exploratory search •3 major benefits: - Results freshness - Composite exploration enablement - Fine-grained querying capabilities
  • 27. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub Freshness
  • 28. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub • 3.77 millions resources • 2 resources possible combinations: 14.212.900.000.000 • 3 resources possible combinations: 53.582.633.000.000.000.000 & Composite interest exploration: knowing my interest for X and Y what can I discover/learn which is related to all these resources?
  • 29. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub Fine-grained querying capabilities • Artists_from_Paris • French_painters -- • Impressionist_painters ++ painted ++ influenced by ++
  • 30. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub Fine-grained querying • 1960s_science_fiction_films • American_epic_films • Films_set_on_the_Moon • Artificial_intelligence_in_fiction • Space_adventure_films, … directed by
  • 31. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. 31 CONTEXT RESEARCH QUESTION RESEARCH - Proposition - Implementation - Operational prototypes - Users evaluations PUBLICATION
  • 32. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub Refer to publications for the complete algorithm
  • 33. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub Spreading activation basis – monocentric Claude_Monet … … … … … … Iteration 0 Iteration 1 Iteration 2
  • 34. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub Claude_Monet Musée d’Orsay Musée de l’Orangerie … …… Vincent Van Gogh Spreading activation basis – polycentric
  • 35. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub Wheeler_School Art Institute of Chicago Gustave_Courbet Cadmium_sulfide Farmington_Mountain DBO:Museum DBO:ChemicalSubstance DBO:Mountain DBO:Artist cat:Impressionist_p… cat:Alumni_of_beaux… 2 0 0 0 3 +2 Propagation domain: artist, book, film, museum, river, television show, university, writer,… cat:Impressionist painters cat:Alumni_of_Beaux_Arts DBO:School
  • 36. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub • How to be fast ? How to execute it fast On very a large graph
  • 37. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub Very large graph Locate the processing
  • 38. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub
  • 39. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub 1.sparql endpoint = http://xxx/sparql 2.seed(s) = xxx_Beatles 3. compute the propagation domain (w(i,o)) (4. find a path between the seeds) 5. import path nodes & their neighbors 6. for(i=1; i<=maxPulse; i++){ 7. pulse 8. if(sampleSize <= maxSampleSize){ 9. extend the sample 10. } 11.}
  • 40. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub select distinct ?x ?y where { service <sparqlEndpoint> { select * where { ?a(<…wikiPageWikiLink>| ^<…wikiPageWikiLink>){0,X} :: $path ?b filter (?a=<resource1> &&?b=<resource2>) } } graph $path {?x ?p ?y} filter(?x!=<resource1> && ?x!=<resource2>) } Path query using Kgram for polycentric SA
  • 41. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. 41 CONTEXT RESEARCH QUESTION RESEARCH - Proposition - Implementation - Operational prototypes - Users evaluations PUBLICATION
  • 42. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub • Analysis method Analysis performed on a set of 100.000 queries
  • 43. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0 500 1000 1500 2000 2500 3000 3500 4000 4500 0 5000 10000 15000 20000 KT Ms Triples loading limit Similarity of top 100 results (Kendall-Tau) from one loading limit to another maxSampleSize
  • 44. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Similarity of top 100 results (shared results, KT) from one iteration to another maxPulse 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0 10 20 30 40 50 60 70 80 90 100 1 2 3 4 5 6 7 8 9 10 Kendall-Tau Sharedresults Iterations KT shared results
  • 45. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub 0 2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 Milliseconds Queries response time histogram Response time histogram
  • 46. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub Algorithm visualization, available @ http://www.youtube.com/user/wearediscoveryhub
  • 47. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub Polycentric query propagation visualization, iteration 0 • In red: Claude Monet • In blue : Musée d’Orsay • In purple: Recovery
  • 48. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub Polycentric query propagation visualization, iteration 6 • In red: Claude Monet • In blue : Musée d’Orsay • In purple: Recovery
  • 49. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Semantic spreading activation Distance A Distance B Gap A - B Max distance A Max distance B poly1 - top 10 1.53 1.68 0.34 / / poly2 - top 10 1.52 1.66 0.33 / / poly1 - top 100 1.90 2.12 0.49 2.60 2.60 poly2 - top 100 1.88 2.11 0.48 2.58 2.58 Polycentric Polycentric queries, average distances of top results from each seed.
  • 50. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. • We studied the convergence of the algorithm according to various graph metrics. For this purpose we generated many graphs thanks to the Graphstream graph library, conclusion : the diameter is crucial. Discovery Hub Influence of graph diameter on algorithm convergence http://graphstream-project.org/doc/Generators/
  • 51. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 1 2 3 4 5 6 7 8 9 1011121314151617181920 Resultssimilarity Iterations diamètre 4.14 diamètre 6.72 diamètre 9.94 diamètre 13.28 diamètre 15.43 diamètre 19.59 diamètre 22.03 diamètre 24.87 diamètre 28.85 Influence of graph diameter on algorithm convergence
  • 52. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub 0 10 20 30 40 50 60 70 80 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 Averagerankvariation Iterations diamètre 4.14 diamètre 6.72 diamètre 9.94 diamètre 13.28 diamètre 15.43 diamètre 19.59 diamètre 22.03 diamètre 24.87 diamètre 28.85 Influence of graph diameter on algorithm convergence
  • 53. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. • Discovery hub is an exploratory search engine which helps you to discover things you might like or be interested in. It widens your cultural and knowledge horizons by revealing and explaining unattended information. • It allows performing queries in an innovative way and helps you to navigate rich results. As a hub, it proposes redirections to others platforms to make you benefit from your discoveries (Youtube, Deezer and more). The results are explained in depth thanks to 3 explanatory features. • Discovery Hub supports simple and composite explorations i.e. starting from one or several items of interest. It proposes and is able to combine advanced exploration modes such as serendipitous, multi-lingual, and fine-grained ones. Discovery Hub powered
  • 54. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. 54 CONTEXT RESEARCH QUESTION RESEARCH - Proposition - Implementation - Operational prototypes - Users evaluations PUBLICATION
  • 55. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub 1. Start from what you like or are interested in 2. Explore, discover, under stand 3. Be redirected on great platforms to experience your discoveries powered Book
  • 56. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub V1
  • 57. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub V2
  • 58. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. • 3 features to understand the results: common properties Discovery Hub
  • 59. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub • 3 features to understand the results: Wikipedia crossed references
  • 60. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub • 3 features to understand the results: explanatory graph
  • 61. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub Internationalization
  • 62. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub Serendipitous mode ? ? ? ? Claude_Monet … ? … ? ? … ? … ? … ? … Iteration 0 Iteration 1 Iteration 2
  • 63. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub Multi-lingual mode dbpedia:Claude_Monet sameAs fr.dbpedia:Claude_Monet
  • 64. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub Fine search mode 1960s_science_fiction_films Films_set_on_the_Moon Artificial_intelligence_in_fiction Space_adventure_films Top 4 films
  • 65. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub Directed by Stanley Kubrick Top 4 films Fine search mode
  • 66. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub 1960s_science_fiction_films Films_set_on_the_Moon Artificial_intelligence_in_fiction Space_adventure_films Directed by Stanley Kubrick Top 4 films Fine search mode
  • 67. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub Multi-criterias mode
  • 68. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub « Surprise mode » Multi-lingual Fine-search Multi-criterias mode
  • 69. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub Demo videos, available @ http://www.youtube.com/user/wearediscoveryhub
  • 70. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. 70 CONTEXT RESEARCH QUESTION RESEARCH - Proposition - Implementation - Operational prototypes - Users evaluations PUBLICATION
  • 71. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub • Mono-centric queries: evaluated positively on movie domain against another algorithm: the sSVM implemented in MORE movie recommender Very interesting Not interesting at all
  • 72. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Scores for partial lists Discovery Hub
  • 73. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub - Films_about_criticism_and_refusal_of_work - Anti-modernist_films - Fiction_with_unreliable_narrators 0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2 Neutral Personalized Interesting
  • 74. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub • Poly-centric queries: evaluated positively 0 0.5 1 1.5 2 2.5 3 0 0.5 1 1.5 2 2.5 3 Relevance Discovery Very interesting Not interesting at all Very surprizing Not suprizing at all
  • 75. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub •Recovery between relevance and unexpectedness: - 61.6% of results were rated as strongly relevant or relevant by the participants. - 65% of results were rated as strongly unexpected or unexpected. - 35.42% of results were rated both as strongly relevant or relevant and strongly unexpected or unexpected.
  • 76. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub •Explanation features 0 0.5 1 1.5 2 2.5 3 In Common Wikipedia Graph Overall Monocentric Polycentric Common prop. Wiki-based Graph-based Overall Very Helpful Not helpful At all
  • 77. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Publications: • Nicolas Marie, Fabien Gandon, Myriam Ribière, Florentin Rodio, Discovery Hub: on-the-fly linked data exploratory search. I-Semantics 2013, Graz, 4 – 6 september (paper). • Nicolas Marie, Fabien Gandon, Damien Legrand, Myriam Ribière, Exploratory Search on the top of DBpedia chapters with the Discovery Hub Application. ESWC2013, Montpellier, 26 – 30 may (demo+poster). • Nicolas Marie, Olivier Corby, Fabien Gandon, Myriam Ribière, Composite interests exploration thanks to on-the-fly linked data spreading activation, Hypertext 2013, 1-3 may, Paris (paper). • Clare J. Hooper, Nicolas Marie, Evangelos Kalampokis, Dissecting the Butterfly: Representation of Disciplines Publishing at the Web Science Conference Series, Web Science 2012, Northeastern university, Evanston, United States, 22-24 june (paper). • Nicolas Marie, Fabien Gandon. Advanced social objects recommendation in multidimensional social networks. Social Object Workshop 2011, MIT, Boston, USA (paper). Discovery Hub
  • 78. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. 78 CONTEXT RESEARCH QUESTION RESEARCH - Proposition - Implementation - Operational prototypes - Users evaluation PUBLICATION
  • 79. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub
  • 80. COPYRIGHT © 2011 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Discovery Hub ncmarie3&@gmail.com http://ncmarie.tumblr.com http://discoveryhub.co werarediscoveryhub@gmail.com Thank you !