How To Make Linked Data More than Data. Semantic Technology Conference 2010, June 23, 2010, San Francisco. Prateek Jain, Pascal Hitzler, Amit Sheth (Kno.e.sis: Ohio Center of Excellence on Knowledge-enabled Computing, Wright State University, Dayton, OH, http://www.knoesis.org); Peter Z. Yeh, Kunal Verma (Accenture Technology Labs, San Jose, CA).
What is Semantic Web Semantics? Semantic Web semantics is shareable (independent of your particular software), declarative (not dependent on imperative algorithms), and computable (otherwise we don’t gain much) meaning. You can do mashups without Semantic Web semantics. You can do information integration without Semantic Web semantics. You can do most things without Semantic Web semantics. But then it will be one-off, less scalable, and less reusable.
What Is Semantic Web Semantics? The Semantic Web requires a shareable, declarative, and computable semantics. That is, the semantics must be a formal entity that is clearly defined and automatically computable. Ontology languages provide this by means of their formal semantics. Semantic Web semantics is given by a relation: the logical consequence relation. Note: This is considerably more than saying that the semantics of an ontology is the set of its logical consequences!
In other words: We capture the meaning of information not by specifying its meaning directly (which is impossible) but by specifying, precisely, how information interacts with other information. We describe the meaning indirectly through its effects. An example (from LoD) of the unintended errors that arise when adequate semantics is not used: LinkedMDB links to the DBpedia URI for Hollywood as the value for country.
Linked Open Data: Where is the semantics?
Example: GeoNames. Where is the semantics?
Example: GovTrack. “Nancy Pelosi voted in favor of the Health Care Bill.” Where is the semantics? [Figure: RDF graph with the nodes Vote: 2009-887, Votes:2009-887/+, people/P000197, and Bills:h3962; the properties vote:hasOption, vote:vote, vote:votedBy, vote:hasAction, rdfs:label, dc:title, and name; and the literals “Aye”, “Nancy Pelosi”, “H.R. 3962: Affordable Health Care for America Act”, and “On Passage: H R 3962 Affordable Health Care for America Act”.]
Don’t get us wrong Linked Open Data is great, useful, cool, and a very important step. But if we stay semantics-free, Linked Open Data will be of limited usefulness!
The Semantic Data Web Layer Cake. To leverage LoD, we require schema knowledge that is application-type driven (reusable for the same kind of application), less messy than LoD (as required by the application), and overarching several LoD datasets (as required by the application). [Figure: layer cake, from bottom to top: traditional Web content (human eyes only), Linked Open Data (messy), schemas (less messy), applications.]
Schema on top of the LoD cloud
Schema on top of the LOD Cloud. The obvious solution is to create an ontology capturing the relationships on top of the LOD schema datasets. Perform a matching of the LOD schemas using state-of-the-art ontology matching tools. The datasets can then be mapped to an upper-level ontology that captures the relationships. Considering the size, heterogeneity, and complexity of LOD, the aim is at least to obtain results that can be curated by a human being.
LOD Schema Alignment using state of the art tools
LOD Schema Alignment
 Nation = Menstruation, Confidence=0.9 
Most current systems excel on the Ontology Alignment Evaluation Initiative benchmark. They are tuned to perform on the established benchmarks, but do not seem to work well in more unconstrained/preselected cases.
LOD schemas are of a very different nature.
They are created by the community, for the community.
LOD has so far emphasized the number of instances, not the number of meaningful relationships.
This requires solutions beyond syntactic and structural matching.
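The following minimal sketch illustrates why purely syntactic matching can mislead. It uses Python's standard `difflib.SequenceMatcher` as a stand-in string-similarity measure; the slides do not say which tool or metric produced the "Nation = Menstruation" correspondence, so the choice of measure and the helper name `string_similarity` are assumptions made only for illustration.

```python
from difflib import SequenceMatcher


def string_similarity(a: str, b: str) -> float:
    """Case-insensitive character-level similarity in the range [0, 1]."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


# A purely syntactic measure rewards shared character runs such as "ation",
# regardless of what the words mean.
unrelated = string_similarity("Nation", "Menstruation")
related = string_similarity("Nation", "Country")

print(f"Nation vs Menstruation: {unrelated:.2f}")
print(f"Nation vs Country:      {related:.2f}")
```

Under this measure the unrelated pair scores higher than the semantically close pair, which is the kind of failure that motivates going beyond syntactic matching.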
Step 1: Enrich Schemas. BLOOMS: Bootstrapping-based Linked Open Data Ontology Matching Systems.
Step 1: Semantic Enrichment. At the highest level of abstraction, the BLOOMS approach takes in two different ontologies and tries to match them using the following steps: (1) using the Alignment API to identify direct correspondences; (2) using the categorization of concepts in Wikipedia; (3) running a reasoner on the results found in step (2) and directly on the ontologies.
Creation of the Wikipedia Category Hierarchy. BLOOMS uses the Wikipedia Web service to identify the matching concepts. For the term Conductor, for example, the following senses are obtained: Electrical conductor, Conducting, Conductor (album), Conductor (architecture), Mr. Conductor, Conductor (ring theory). These terms correspond to Wikipedia articles for the concepts in the ontology.
Build Category Tree. The next step uses the Web service to identify Wikipedia categories and build the Wikipedia category tree. [Figure: the term Conductor with the senses Electrical conductor, Conducting, and Conductor (album), and the categories cat:Occupations_in_music, cat:Musical_Terminology, cat:Musical_Notation, and cat:Music performance.]
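The tree-building step can be sketched as a bounded breadth-first walk up the category graph. This is a minimal illustration, not the BLOOMS implementation: the in-memory `PARENT_CATEGORIES` map stands in for calls to the Wikipedia Web service, its edges are hypothetical, and the `max_depth` cut-off is an assumed parameter.

```python
from collections import deque

# Hypothetical excerpt of the category graph: node -> parent categories.
# In practice these links would come from the Wikipedia Web service.
PARENT_CATEGORIES = {
    "Conducting": ["cat:Occupations_in_music", "cat:Music_performance"],
    "cat:Occupations_in_music": ["cat:Arts_occupations"],
}


def build_category_tree(article, parents, max_depth=4):
    """Return {category: depth} for every category reachable from `article`
    within `max_depth` steps, walking breadth-first through `parents`."""
    depths = {}
    queue = deque([(article, 0)])
    while queue:
        node, depth = queue.popleft()
        if depth == max_depth:
            continue  # do not expand beyond the depth limit
        for category in parents.get(node, []):
            if category not in depths:  # keep the shallowest depth only
                depths[category] = depth + 1
                queue.append((category, depth + 1))
    return depths


tree = build_category_tree("Conducting", PARENT_CATEGORIES)
for category, depth in sorted(tree.items(), key=lambda item: item[1]):
    print(depth, category)
```

Recording the depth of each category keeps the positional information that the later comparison of trees relies on.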
For each sense of concept c, match it against the possible senses of concept c’. [Figure: category trees for the concepts Artist and Conductor (sense Conducting), with the categories cat:Arts_occupations, cat:Occupations_in_music, and cat:Music performance.]
Connected Classes. The relationships are identified using the position of the categories. [Figure: the nodes Conductor, Conducting, and Artist with an is-a edge, and the categories cat:Music performance, cat:Occupations_in_music, and cat:Arts_occupations; Ponzetto & Strube, 2007.] This helps in identifying, approximately, the relationship between the various concepts.
Disconnected Classes. Some senses do not relate to each other. [Figure: the nodes Conductor, Artist, and Conductor_(transportation), with the categories cat:Occupations_in_music, cat:Bus_Transport, cat:Transportation occupations, cat:Arts_occupations, and cat:Transportation.] This helps in identifying disconnected relationships.
Equivalent Classes. Some senses are identical to each other. [Figure: category trees for Lady_Finger and Okra, with cat:Abelmoschus and cat:Hibisceae appearing in both trees, and cat:Malvoideae.] This helps in identifying equivalence relationships.
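The three cases above (connected, disconnected, equivalent) can be sketched as a comparison of category sets. This is a deliberately simplified illustration under stated assumptions: the category sets are hypothetical, the function name `classify` is invented here, and BLOOMS's actual scoring of tree overlap is not reproduced.

```python
# Hypothetical category sets for a few senses, loosely based on the slide figures.
CONDUCTING = {"cat:Occupations_in_music", "cat:Music_performance", "cat:Arts_occupations"}
ARTIST = {"cat:Arts_occupations"}
CONDUCTOR_TRANSPORT = {"cat:Bus_Transport", "cat:Transportation_occupations", "cat:Transportation"}
LADY_FINGER = {"cat:Abelmoschus", "cat:Hibisceae", "cat:Malvoideae"}
OKRA = {"cat:Abelmoschus", "cat:Hibisceae", "cat:Malvoideae"}


def classify(tree_a: set, tree_b: set) -> str:
    """Classify two senses by how their category sets overlap."""
    shared = tree_a & tree_b
    if not shared:
        return "disconnected"   # no common category at all
    if tree_a == tree_b:
        return "equivalent"     # identical category sets
    return "connected"          # partial overlap: candidate is-a relationship


print(classify(CONDUCTING, ARTIST))           # connected
print(classify(ARTIST, CONDUCTOR_TRANSPORT))  # disconnected
print(classify(LADY_FINGER, OKRA))            # equivalent
```

A connected result only flags a candidate relationship; deciding its direction would additionally need the positions of the shared categories in each tree.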
LOD Schema Alignment using BLOOMS. Testing was done on 10 different pairs of LOD schemas.
Linked Schemas. [Figure: diagram of linked schemas, with the labels DBpedia Ontology, Music Ontology, Schema, Jamendo, Music Brainz, DBTunes, Geonames, SWC, Pisa, IEEE, BBC Program, ACM, FOAF, SIOC, and AKT Portal Ontology.]
Observations. Heavy connections at the instance level do not translate to the schema level. Case in point: Geonames and DBpedia; only SpatialThing in Geonames matches DBpedia concepts. Case in point: DBpedia and the AKT Reference Ontology have over 100 relationships between concepts. There is a possibility to create links at the instance level. Example: the DBpedia “Scientist” class can contain “Computer Scientist”. Example triples: dbpedia:Hollywood rdf:type dbpedia:Country; dbpedia:Country disjointWith uscensus:Community; uscensus:Hollywood rdf:type uscensus:Community.
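To show how schema-level knowledge can expose an error such as the Hollywood case, here is a minimal sketch of a disjointness check over the triples listed above. It is not a real reasoner. The `owl:sameAs` link between the two Hollywood URIs is an assumption added for illustration (it is not on the slide), and the function name `find_disjointness_clashes` is hypothetical.

```python
from itertools import combinations

RDF_TYPE = "rdf:type"
DISJOINT = "owl:disjointWith"
SAME_AS = "owl:sameAs"

TRIPLES = [
    ("dbpedia:Hollywood", RDF_TYPE, "dbpedia:Country"),
    ("dbpedia:Country", DISJOINT, "uscensus:Community"),
    ("uscensus:Hollywood", RDF_TYPE, "uscensus:Community"),
    # Assumption (not on the slide): both URIs denote the same individual.
    ("dbpedia:Hollywood", SAME_AS, "uscensus:Hollywood"),
]


def find_disjointness_clashes(triples):
    """Return (individual, class_a, class_b) for every individual that is
    typed with two classes declared disjoint."""
    parent = {}

    def find(x):
        # Follow sameAs links to a canonical representative.
        parent.setdefault(x, x)
        while parent[x] != x:
            x = parent[x]
        return x

    for s, p, o in triples:
        if p == SAME_AS:
            a, b = sorted((find(s), find(o)))
            parent[b] = a

    types, disjoint = {}, set()
    for s, p, o in triples:
        if p == RDF_TYPE:
            types.setdefault(find(s), set()).add(o)
        elif p == DISJOINT:
            disjoint.add(frozenset((s, o)))

    clashes = []
    for individual in sorted(types):
        for a, b in combinations(sorted(types[individual]), 2):
            if frozenset((a, b)) in disjoint:
                clashes.append((individual, a, b))
    return clashes


print(find_disjointness_clashes(TRIPLES))
```

Without the disjointness axiom the same data would raise no flag, which is the sense in which instance links alone carry little semantics.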
Step 2: Integrated Access/Federated Querying. LOQUS: Linked Open Data SPARQL Querying System.
Federated Querying. Transform a query and broadcast it, with the appropriate syntax, to a group of disparate but relevant datasets. Merge the results collected from the datasets. Present them succinctly in a unified format with the least duplication. Automatically sort the merged result set.
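The four steps above (transform and broadcast, merge, de-duplicate, sort) can be sketched with in-memory stand-ins for the datasets. Everything here is hypothetical: the two toy sources, their differing "syntax", and the function name `federated_query` exist only to illustrate the pattern.

```python
from typing import Callable, Dict, List, Tuple

Row = Tuple[str, str]  # (artist, place)

# Two toy sources that store the same kind of information differently.
SOURCE_A = {"punk": [("Artist A", "City X")]}
SOURCE_B = [
    {"genre": "PUNK", "artist": "Artist A", "place": "City X"},
    {"genre": "PUNK", "artist": "Artist B", "place": "City Y"},
]


def query_source_a(genre: str) -> List[Row]:
    # Source A expects a lower-case key.
    return SOURCE_A.get(genre.lower(), [])


def query_source_b(genre: str) -> List[Row]:
    # Source B expects an upper-case genre and returns records, not tuples.
    return [(r["artist"], r["place"]) for r in SOURCE_B if r["genre"] == genre.upper()]


def federated_query(query: str, sources: Dict[str, Callable[[str], List[Row]]]) -> List[Row]:
    """Broadcast `query` to every source, merge without duplicates, and sort."""
    merged = set()
    for run in sources.values():
        merged.update(run(query))  # each adapter translates to its own syntax
    return sorted(merged)


results = federated_query("Punk", {"a": query_source_a, "b": query_source_b})
print(results)
```

Keeping the per-source translation inside small adapter functions means the merging logic never needs to know how a given dataset is structured.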
Federated Querying Challenges. The user is required to have intimate knowledge of the domain of the datasets. The user needs to understand the exact structure of the datasets. For each relevant dataset, the user needs to form a separate query. Entity disambiguation has to be performed on similar entities. Retrieved results have to be processed and merged.
Querying Federated Sources. Identify artists whose albums have been tagged as punk, and the population of the places they are based near.
Relevant Datasets: Geonames data, Music Ontology, census data.
Querying the Datasets. Music Ontology: give me artists with punk as genre, and their locations. Geonames data: give me the identifier used by the Census Bureau for geographic locations. Census data: give me population figures of geographical entities.
LOQUS: Linked Open Data SPARQL Querying System. The user can pose federated queries without having to know the exact structure of, and the links between, the different datasets. LOQUS automatically maps the user’s query to the relevant datasets using a mapping repository created using BLOOMS. It executes the individual queries and merges the results into a single, complete answer.
Traditionally, to Retrieve Results the User Has to: perform disambiguation, perform union and join, and process the results across the music data, geographic data, and census data.
LOQUS Architecture. A single source of reference consisting of mappings to the specific LOD datasets. A module to identify the concepts contained in the query and translate them to the LOD cloud datasets. A module to split the query, once mapped to LOD dataset concepts, into sub-queries corresponding to the different datasets. A module to execute the queries remotely, process the results, and deliver the final result to the user.
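The mapping and splitting modules described above can be sketched as a lookup followed by a grouping step. This is a toy model, not the LOQUS code: the repository contents, the dataset names, and every term such as `music:Artist` or `census:population` are hypothetical placeholders rather than real vocabulary terms.

```python
from collections import defaultdict

# Hypothetical mapping repository: upper-level concept -> (dataset, dataset term).
MAPPING_REPOSITORY = {
    "Artist": ("music", "music:Artist"),
    "Genre": ("music", "music:genre"),
    "Location": ("geographic", "geo:Place"),
    "CensusIdentifier": ("geographic", "geo:censusId"),
    "Population": ("census", "census:population"),
}


def decompose(concepts, repository=MAPPING_REPOSITORY):
    """Translate upper-level concepts to dataset terms and group them into
    one sub-query (here: a list of terms) per dataset."""
    sub_queries = defaultdict(list)
    for concept in concepts:
        dataset, term = repository[concept]  # KeyError if the concept is unmapped
        sub_queries[dataset].append(term)
    return dict(sub_queries)


plan = decompose(["Artist", "Genre", "Location", "Population"])
for dataset, terms in plan.items():
    print(dataset, "->", terms)
```

An unmapped concept raises `KeyError` here, which mirrors the dependence of query federation on the mappings being available.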
Querying using LOQUS. The user looks up the mapping repository to identify concepts of interest and formulates the query: “Identify artists whose albums have been tagged as punk, and the population of the places they are based near.” LOQUS decomposes the query into sub-queries, and each sub-query is routed to the appropriate dataset: music data (“Give me artists with punk as genre and their locations”), geographic data (“Give me the identifier used by the Census Bureau for geographic locations”), and census data (“Give me population figures of geographical entities”).
Querying Using LOQUS. Results for the sub-queries are returned to LOQUS from the music data, geographic data, and census data.
LOQUS Processes Partial Results. Partial results are processed for union, join, and disambiguation by LOQUS.
Results are Returned to User. LOQUS combines the results and presents them back to the user.
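Processing partial results can be sketched as a join across the three result sets, with a simple normalisation step standing in for disambiguation. The sample rows, identifiers, and population figure are made-up toy values, and the function names are hypothetical; real disambiguation would be considerably more involved.

```python
def normalize(name: str) -> str:
    """Crude disambiguation: compare place names case- and space-insensitively."""
    return " ".join(name.lower().split())


def join_partial_results(music, geographic, census):
    """Join (artist, place) rows with (place, census id) rows and a
    {census id: population} lookup; drop rows that cannot be joined."""
    geo_index = {normalize(place): census_id for place, census_id in geographic}
    rows = set()  # a set gives the union of rows without duplicates
    for artist, place in music:
        census_id = geo_index.get(normalize(place))
        if census_id in census:
            rows.add((artist, place.strip(), census[census_id]))
    return sorted(rows)


# Toy partial results (all values are invented for illustration).
music_rows = [("Artist A", "Springfield"), ("Artist B", "springfield "), ("Artist C", "Nowhere")]
geographic_rows = [("Springfield", "id-1")]
census_lookup = {"id-1": 1000}

print(join_partial_results(music_rows, geographic_rows, census_lookup))
```

Rows whose place cannot be resolved are dropped rather than guessed, so the combined answer contains only fully joined results.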

How To Make Linked Data More than Data

  • 1. How To Make Linked Data More than Data Semantic Technology Conference 2010, June 23, 2010, San Francisco Prateek Jain, Pascal Hitzler, Amit Sheth Kno.e.sis: Ohio Center of Excellence onKnowledge-enabled Computing Wright State University, Dayton, OH http://www.knoesis.org Peter Z. Yeh, KunalVerma Accenture Technology Labs San Jose, CA
  • 2. What is Semantic Web Semantics? Semantic Web Semantics:shareable (independent of your particular software)declarative (not dependent on imperative algorithms)computable (otherwise we don’t gain much) meaning You can do Mashups without Semantic Web semantics. You can do information integration without Semantic Web semantics. You can do most things without Semantic Web semantics. But then it will be one-off, less scalable, less reusable.
  • 3. What Is Semantic Web Semantics? Semantic Web requires a shareable, declarative and computablesemantics. I.e., the semantics must be a formal entity which is clearly defined and automatically computable. Ontology languages provide this by means of their formal semantics. Semantic Web Semantics is given by a relation – the logical consequence relation. Note: This is considerably more than saying that the semantics of an ontology is the set of its logical consequences!
  • 4. In other words We capture the meaning of information not by specifying its meaning directly (which is impossible) but by specifying, precisely, how information interacts with other information. We describe the meaning indirectly through its effects. - An example (from LoD) of unintended errors when adequate semantics is not used: Linked MDB links to Dbpedia URI for Hollywood for country 
  • 5. Linked Open Data Where is the semantics?
  • 6. Example: GeoNames Where is the semantics?
  • 7. Example: GovTrack “Nancy Pelosi voted in favor of the Health Care Bill.” Vote: 2009-887 vote:hasOption Votes:2009-887/+ vote:vote vote:votedBy rdfs:label Aye vote:hasAction people/P000197 Where is the semantics? H.R. 3962: Affordable Health Care for America Act dc:title name On Passage: H R 3962 Affordable Health Care for America Act Nancy Pelosi dc:title Bills:h3962
  • 8. Don’t get us wrong Linked Open Data is great, useful, cool, and a very important step. But if we stay semantics-free, Linked Open Data will be of limited usefulness!
  • 9. The Semantic Data Web Layer Cake To leverage LoD, we require schema knowledge application-type driven (reusable for same kind of application) less messy than LoD(as required by application) overarching several LoD datasets (as required by application) ... Application Application Application Application Application Application Application Application Application Application Application Application Application Application Application Application ... Schema Schema Schema Schema less messy Linked Open Data messy humaneyes only Traditional Web content
  • 10. Schema on top of the LoD cloud
  • 11. Schema on top of the LOD Cloud Obvious solution to create an ontology capturing the relationships on top of the LOD Schema datasets. Perform a matching of the LOD Schemas using state of the art ontology matching tools. The datasets can be mapped to an upper level ontology which can capture the relationships. Considering the size, heterogeneity and complexity of LOD, at least have results which can be curated by a human being.
  • 12. LOD Schema Alignment using state of the art tools
  • 13.
  • 14. Nation = Menstruation, Confidence=0.9 
  • 15. They are tuned to perform on the established benchmarks, but do not seem to work well in more unconstrained/preselected cases. Most current systems excel on Ontology Alignment Evaluation Initiative Benchmark.
  • 16. LOD Schemas are of very different nature
  • 17. Created by community for community.
  • 18. LOD has so far emphasized number of instances, not number of meaningful relationships.
  • 19.
  • 20. Step 1: Enrich SchemasBLOOMS – Bootstrapping based Linked Open Data Ontology Matching Systems.
  • 21. Step 1: Semantic Enrichment BLOOMS – Bootstrapping based Linked Open Data Ontology Matching Systems. At the highest level of abstraction our approach takes in two different ontologies and tries to match them using the following steps (1) Using Alignment API to identify direct correspondences. (2) Using the categorization of concepts using Wikipedia. (3) Running a reasoner on the results found using step (2) and directly on the ontologies.
  • 22. Creation Wikipedia Category Hierarchy Utilizes the Wikipedia Web service to identify the matching concepts. Thus for the term Conductor the following definitions are obtained Electrical Conductor Conducting Conductor_(album) Conductor (architecture) Mr. Conductor Conductor (ring theory) These terms correspond to articles on Wikipedia for the concepts in the ontology.
  • 23. Build Category Tree Next step utilize the Web service for identifying Wikipedia categories for building the Wikipedia category tree. Conductor Electrical conductor Conducting Conductor (album) cat:Occupations_in_music cat:Musical_Terminology cat:Musical_Notation cat:Music performance
  • 24. For each different sense of concept c, match it with the different possible senses of the c’. Artist Conductor cat: Arts occupations Conducting cat:Occupations_in_music cat:Music performance cat: Arts_occupations
  • 25. Connected Classes Using the position of the categories identify the relationships. Conductor Is-a Conducting Artist cat:Music performance cat:Occupations_in_music cat: Arts_occupations Ponzetto & Strube, 2007 Thus this helps in identifying approximately the relationship between the various concepts.
  • 26. Disconnected Classes Some senses do not relate to each other Conductor Artist Conductor_(transportation) cat:Occupations_in_music cat:Bus_Transport cat: :Transportation occupations cat: Arts_occupations cat: Transportation Thus this helps in identifying disconnected relationships.
  • 27. Equivalent Classes Some senses are identical to each other Lady_Finger Okra cat: Abelmoschus Okra cat: Hibisceae cat: Abelmoschus cat: Hibisceae cat: Malvoideae Thus this helps in identifyingequivalence relationships.
  • 28. LOD Schema Alignment using BLOOMS Testing done on 10 different pairs of LOD schemas
  • 29. Linked Schema’s DBpedia Ontology Music Ontology Schema Jamendo Music Brainz DBTunes Geonames SWC Pisa IEEE BBC Program ACM FOAF SIOC AKT Portal Ontology
  • 30.
  • 31. Step 2: Integrated Access/Federated QueryingLOQUS: Linked Open Data SPARQL Querying System (LOQUS)
  • 32. Federated Querying Transform a query and broadcast it to a group of disparate and relevant datasets with the appropriate syntax. Merging the results collected from the datasets. Presenting them succinctly and unified format with least duplication. Automatically sort the merged result set.
  • 33. Federated Querying Challenges User is required to have intimate knowledge about the domain of datasets. User needs to understand the exact structure of datasets. For each relevant dataset user needs to form separate queries. Entity disambiguation has to be performed on similar entities. Retrieved results have to be processed and merged.
  • 34. Querying Federated Sources Identify artists, whose albums have been tagged as punk and the population of the places they are based near.
  • 35. Relevant Datasets Geonames Data Music Ontology Census Data
  • 36. Querying the Datasets Music Ontology Give me artists with punk as genre and their locations? Geonames Data Give me the identifier used by Census Bureau for geographic locations? Census Data Give me population figures of geographical entities?
  • 37. LOQUS Linked Open Data SPARQL Querying System. User can pose federated queries without having to know the exact structure and links between the different datasets. Automatically maps user’s query to the relevant datasets using mapping repository created using BLOOMS. Executes individual queries and merges the results into a single, complete answer.
  • 38. Traditionally to Retrieve Results User has to …. Music Data Geographic Data Census Data Perform disambiguation Perform Union and Join Process Results
  • 39. LOQUS Architecture A single source of reference consisting of mapping to the specific LOD datasets. Module to identify concepts contained in the query and perform the translations to the LOD cloud datasets. Module to split the query mapped to LOD datasets concepts into sub-queries corresponding to different datasets. Module to execute the queries remotely and process the results and deliver the final result to the user.
  • 40. Querying using LOQUS User query: “Identify artists, whose albums have been tagged as punk and the population of the places they are based near.” User looks up mapping repository to identify concepts of interest and formulates query. Query is decomposed into sub-queries. Query is routed to the appropriate dataset. Music Data: “Give me artists with punk as genre and their locations?” Geographic Data: “Give me the identifier used by Census Bureau for geographic locations?” Census Data: “Give me population figures of geographical entities?”
  • 41. Querying Using LOQUS Results are returned for the sub-queries. (Diagram: Music Data, Geographic Data, Census Data, LOQUS.)
  • 42. LOQUS Processes Partial Results Partial results are processed for union, join and disambiguation by LOQUS.
  • 43. Results are Returned to User LOQUS combines the results and presents them back to the user.
  • 44. Technology Stack Proprietary software: LOQUS, BLOOMS. Open source technologies: Jena/ARQ, SPARQL, RDF, Linked Open Data cloud, Java.
  • 45. LOQUS Advantage LOQUS expects just the query from the user and does the rest of the work.
  • 46. Pre-requisites LOQUS requires an upper level ontology for query federation
  • 47.
  • 48. Conclusions (continued) BLOOMS is one approach for semi-automatically linking different ontologies. It is a new approach for ontology mapping that leverages knowledge in DBpedia. A more semantic LOD cloud can enable more intelligent applications such as open question answering. LOQUS shows how enriched schemas can enable automatic federated queries, making LOD significantly more useful.
  • 49. References Prateek Jain, Pascal Hitzler, Peter Z. Yeh, Kunal Verma, Amit P. Sheth, “Linked Data is Merely More Data”, AAAI Spring Symposium “Linked Data Meets Artificial Intelligence”, March 22-24, 2010. Prateek Jain, Kunal Verma, Pascal Hitzler, Peter Z. Yeh, Amit P. Sheth, “LOQUS: Linked Open Data SPARQL Querying System”.
  • 50. Thanks! This work is funded primarily by NSF Award IIS-0842129, titled “III-SGER: Spatio-Temporal-Thematic Queries of Semantic Web Data: a Study of Expressivity and Efficiency”. More at Kno.e.sis, Ohio Center of Excellence on Knowledge-enabled Computing: http://knoesis.org
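
The federated flow described in slides 32 to 43 (decompose the query, route the sub-queries, then join, deduplicate and sort the partial results) can be sketched in a few lines of Python. This is a toy illustration under stated assumptions, not the LOQUS implementation: the three in-memory tables stand in for the music, geographic and census datasets, and every artist name, place, identifier and population figure below is invented for the example.

```python
# Toy stand-ins for three remote datasets; all rows are invented for illustration.
MUSIC = [  # (artist, genre, place)
    ("Artist A", "punk", "Springfield"),
    ("Artist B", "punk", "Shelbyville"),
    ("Artist C", "jazz", "Springfield"),
    ("Artist A", "punk", "Springfield"),  # duplicate row on purpose
]
GEO = {"Springfield": "census:001", "Shelbyville": "census:002"}  # place -> census id
CENSUS = {"census:001": 30000, "census:002": 12000}  # census id -> population


def artists_by_genre(genre):
    """Sub-query 1 (music data): artists with the given genre and their locations."""
    return [(artist, place) for artist, g, place in MUSIC if g == genre]


def census_id(place):
    """Sub-query 2 (geographic data): identifier the census data uses for a place."""
    return GEO.get(place)


def population(cid):
    """Sub-query 3 (census data): population for a census identifier."""
    return CENSUS.get(cid)


def federated_query(genre):
    """Run the sub-queries, join on the shared keys, deduplicate and sort."""
    merged = set()  # a set removes duplicate answers
    for artist, place in artists_by_genre(genre):
        cid = census_id(place)
        if cid is None:
            continue  # no link between the datasets for this place
        pop = population(cid)
        if pop is None:
            continue  # the census data has no figure for this identifier
        merged.add((artist, place, pop))
    return sorted(merged)


result = federated_query("punk")
```

The user supplies only the genre; the joins on place name and census identifier play the role that the mapping repository plays in the slides, which is where a real system would need entity disambiguation rather than exact key matches.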

Editor's notes

  1. For each concept in the ontology, do a text search using the Wikipedia web service and use the results to identify the articles related to these terms. Once these terms are identified, build their category trees. The category trees are built up to level 4, since beyond that the categories become too abstract to be useful for ontology matching.
  2. Take the category of each of these senses and compare them. For example for Conductor, its different senses would be Conducting, Conducting_Album and so on. Try to compare each of these senses to each other. Thus the sense Conducting is being matched here to the term Artist.
  3. Wikipedia categorization has been demonstrated as a taxonomy in the work of Ponzetto, S.P., Strube, M.: Deriving a large scale taxonomy from Wikipedia. In: AAAI’07: Proceedings of the 22nd national conference on Artificial intelligence, AAAI Press (2007) 1440–1445. The overlap of the two categorization trees helps us determine the relationship between the trees. The overlap is compared against a numerical threshold which can be specified by the user. The choice of threshold depends on rough heuristics: (1) If the two ontologies to be matched are of similar domains, such as the AKT Reference Ontology and the Semantic Web Ontology (publication domain), use a higher threshold, since the terms require a tighter integration. (2) If an upper level ontology is used, the terms will be abstract, so use a lower threshold. The choice also depends on the kind of results the user wants to obtain: for high precision and low recall, choose a high threshold; for low precision and high recall, choose a low threshold.
  4. Some senses do not relate to each other at all. They do not share any common categories or instances.
  5. Because Wikipedia is rich in language and terms, it can help in identifying things that cannot be matched using ordinary syntactic tools.
  6. System-1: Alignment API. System-2: OMViaUO. Our approach actually outperforms 5 different state-of-the-art systems published in the recent past.
  7. (1) The Linked Open Data Cloud isn’t complete in terms of its linkage. (2) It is possible to add many more meaningful connections which are motivated from the direction of schema to instance (common sense) rather than the other way round. Unfortunately, as of now the other way round dominates. (3) Using common reasoning, made possible through distributed and approximate reasoning, it is possible to identify and clean up the LOD Cloud. A lot of the messiness can be thrown away.
  8. Animation causes a mess in the textbox.
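
Notes 1 to 3 describe building category trees up to level 4 and comparing their overlap against a user-chosen threshold. A minimal Python sketch of that idea follows. It is an illustration only and makes several assumptions: the parent-category graph is a small hypothetical table rather than data fetched from Wikipedia, and the overlap measure (shared categories divided by the size of the smaller tree) is an assumed choice, since the notes do not say how the overlap is computed.

```python
from collections import deque

# Hypothetical parent-category graph; real data would come from Wikipedia.
PARENTS = {
    "Conducting": ["Occupations in music", "Musical terminology"],
    "Artist": ["Arts occupations", "People by occupation"],
    "Occupations in music": ["Arts occupations", "Music"],
    "Musical terminology": ["Music"],
    "Arts occupations": ["Occupations", "Arts"],
    "Music": ["Arts"],
}


def category_tree(article, max_depth=4):
    """Collect the categories reachable from an article within max_depth levels."""
    seen = set()
    queue = deque([(article, 0)])
    while queue:
        node, depth = queue.popleft()
        if depth == max_depth:
            continue  # do not expand beyond the depth limit
        for parent in PARENTS.get(node, []):
            if parent not in seen:
                seen.add(parent)
                queue.append((parent, depth + 1))
    return seen


def overlap(tree_a, tree_b):
    """Share of the smaller tree that also appears in the other tree (assumed measure)."""
    if not tree_a or not tree_b:
        return 0.0
    return len(tree_a & tree_b) / min(len(tree_a), len(tree_b))


def is_match(sense_a, sense_b, threshold):
    """Report a match when the category-tree overlap reaches the threshold."""
    return overlap(category_tree(sense_a), category_tree(sense_b)) >= threshold
```

With this toy graph, `is_match("Conducting", "Artist", 0.5)` accepts the pair while a threshold of 0.9 rejects it, which mirrors the precision and recall trade-off described in note 3.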