SlideShare ist ein Scribd-Unternehmen logo
1 von 29
Downloaden Sie, um offline zu lesen
Mapping,	
  Interlinking	
  and	
  
Exposing	
  MusicBrainz	
  as	
  
Linked	
  Data	
  
1st	
  Interna*onal	
  Workshop	
  on	
  	
  
Seman*c	
  Music	
  and	
  Media	
  (SMAM2013)	
  

Sydney,	
  Oct	
  21,	
  2013	
  

Peter	
  Haase	
  
What	
  this	
  talk	
  is	
  about	
  
A	
  Linked	
  Data	
  Perspec=ve	
  
worksOn
publishedTo

affiliation
affiliation (previous)

isAbout
builtWith

participatesIn

participatesIn
EUCLID:	
  EdUca=onal	
  Curriculum	
  for	
  the	
  
usage	
  of	
  LinkedData	
  
	
  
http://www.euclid-project.eu

Course

eBook

Other channels

@euclid_project

euclidproject

euclidproject
Analysis	
  &	
  
Mining	
  Module	
  

Visualiza*on	
  
Module	
  

RDFa	
  

Data acquisition

LD Dataset

Access

Application

EUCLID	
  Scenario	
  

SPARQL
Endpoint

Vocabulary	
  
Mapping	
  

Publishing

Interlinking	
  

Physical	
  Wrapper	
  

Streaming providers

Downloads

Musical Content

Cleansing	
  

LD	
  Wrapper	
  

R2R	
  Transf.	
  

Integrated	
  
Dataset	
  

LD	
  Wrapper	
  

RDF/	
  
XML	
  
Metadata

Other content
MusicBrainz	
  
•  MusicBrainz	
  is	
  an	
  open	
  music	
  encyclopedia	
  that	
  collects	
  
music	
  metadata	
  and	
  makes	
  it	
  available	
  to	
  the	
  public.	
  
•  MusicBrainz	
  aims	
  to	
  be:	
  
• 	
  The	
  ul=mate	
  source	
  of	
  music	
  informa=on	
  by	
  allowing	
  anyone	
  to	
  
contribute	
  and	
  releasing	
  the	
  data	
  under	
  open	
  licenses.	
  
• 	
  The	
  universal	
  lingua	
  franca	
  for	
  music	
  by	
  providing	
  a	
  reliable	
  and	
  
unambiguous	
  form	
  of	
  
music	
  iden*fica*on,	
  enabling	
  both	
  people	
  and	
  machines	
  to	
  have	
  meaningful	
  
conversa*ons	
  about	
  music.	
  

•  Like	
  Wikipedia,	
  MusicBrainz	
  is	
  maintained	
  by	
  a	
  global	
  
community	
  of	
  users	
  and	
  we	
  want	
  everyone	
  —	
  including	
  
you	
  —	
  to	
  par*cipate	
  and	
  contribute.	
  
•  MusicBrainz	
  is	
  operated	
  by	
  the	
  
MetaBrainz	
  Founda*on,	
  dedicated	
  to	
  keeping	
  
MusicBrainz	
  free	
  and	
  open	
  source.	
  
LD	
  Dataset	
  

Access	
  

Publishing	
  Rela=onal	
  Databases	
  as	
  RDF:	
  
W3C	
  RDB2RDF	
  
SPARQL	
  
Endpoint	
  

Publishing	
  

Integrated	
  
Data	
  in	
  
Triplestore	
  

Vocabulary	
  
Mapping	
  

Interlinking	
  

R2RML	
  
Engine	
  

Cleansing	
  

Task:	
  Publish	
  data	
  from	
  
rela*onal	
  DBMS	
  as	
  	
  
Linked	
  Data	
  
	
  
Approach:	
  map	
  from	
  
rela*onal	
  schema	
  to	
  
seman*c	
  vocabulary	
  with	
  
R2RML	
  
	
  
Publishing:	
  two	
  alterna*ves	
  –	
  

Data	
  acquisi*on	
  

• 
• 
Rela*onal	
  
DBMS	
  

Translate	
  SPARQL	
  into	
  SQL	
  on	
  
the	
  fly	
  
Batch	
  transform	
  data	
  into	
  
RDF,	
  infer,	
  index	
  ,	
  integrate	
  
and	
  provide	
  SPARQL	
  access	
  in	
  
a	
  triplestore	
  
Publishing	
  MusicBrainz	
  
h"ps://wiki.musicbrainz.org/Next_Genera;on_Schema	
  

MusicBrainz	
  DB	
  	
  

	
  h"p://musicontology.com	
  

Music	
  
Ontology	
  

R2RML	
  

Concrete	
  Example	
  Mapping	
  
Table	
  Recording(gid,	
  length)	
  

R2RML	
  Mapping	
  

Ontology	
  concept	
  mo:recording	
  	
  
MusicBrainz	
  Next	
  Gen	
  Schema	
  
ar=st	
  
	
  As	
  pre-­‐NGS,	
  but	
  	
  	
  
	
  	
  	
  	
  further	
  a`ributes	
  

ar=st_credit	
  
	
  Allows	
  joint	
  credit	
  

release_group	
  
	
  Cf.	
  ‘album’	
  	
  
	
  	
  	
  	
  versus:	
  

•  work	
  
release	
   •  track	
  
medium	
  	
   •  tracklist	
   •  recording	
  
https://wiki.musicbrainz.org/Next_Generation_Schema
Music	
  Ontology	
  
OWL	
  ontology	
  with	
  following	
  core	
  concepts	
  (classes)	
  and	
  
rela*onships	
  (proper*es):	
  

Source: http://musicontology.com
R2RML	
  Class	
  Mapping	
  
Mapping	
  tables	
  to	
  classes	
  is	
  ‘easy’:	
  
	
  
lb:Artist	
  a	
  rr:TriplesMap	
  ;	
  
	
  	
  rr:logicalTable	
  [rr:tableName	
  "artist"]	
  ;	
  
	
  	
  rr:subjectMap	
  	
  
	
  	
  	
  	
  [rr:class	
  mo:MusicArtist	
  ;	
  
	
  	
  	
  	
  	
  rr:template	
  	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  "http://musicbrainz.org/artist/{gid}#_"]	
  ;
	
  
	
  	
  rr:predicateObjectMap	
  	
  
	
  	
  	
  	
  [rr:predicate	
  mo:musicbrainz_guid	
  ;	
  
	
  	
  	
  	
  	
  rr:objectMap	
  [rr:column	
  "gid"	
  ;	
  	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  rr:datatype	
  xsd:string]]	
  .	
  

	
  
R2RML	
  Property	
  Mapping	
  
Mapping	
  columns	
  to	
  proper*es	
  can	
  be	
  easy:	
  
	
  
lb:artist_name	
  a	
  rr:TriplesMap	
  ;	
  
	
  	
  rr:logicalTable	
  [rr:sqlQuery	
  	
  
	
  	
  	
  	
  """SELECT	
  artist.gid,	
  artist_name.name	
  	
  	
  
	
  	
  	
  	
  	
  	
  	
  FROM	
  artist	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  INNER	
  JOIN	
  artist_name	
  ON	
  artist.name	
  =	
  
artist_name.id"""]	
  ;	
  
	
  	
  rr:subjectMap	
  [rr:template	
  	
  	
  	
  	
  	
  	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  "http://musicbrainz.org/artist/{gid}#_"]	
  ;	
  
	
  	
  rr:predicateObjectMap	
  	
  
	
  	
  	
  	
  [rr:predicate	
  foaf:name	
  ;	
  
	
  	
  	
  	
  	
  rr:objectMap	
  [rr:column	
  "name"]]	
  .	
  
NGS	
  Advanced	
  Rela=ons	
  
Major	
  en**es	
  (Ar*st,	
  Release	
  Group,	
  Track,	
  etc.)	
  plus	
  URL	
  
are	
  paired	
  
	
  (l_ar*st_ar*st)	
  
Each	
  pairing	
  
	
  of	
  instances	
  
	
  refers	
  to	
  a	
  Link	
  
Links	
  have	
  types	
  	
  
	
  (cf.	
  RDF	
  proper*es)	
  
	
  and	
  a`ributes	
  
	
  
	
  	
  
http://wiki.musicbrainz.org/Advanced_Relationship
R2RML	
  Mapping	
  Editor	
  
R2RML: Expose data from
relational DBMS as RDF /
via SPARQL Endpoint
Problem: R2RML
Mappings are
hard to create

R2RML	
  
Engine	
  

R2RML	
  
Mappings	
  

R2RML	
  Edi*ng	
  Made	
  Easy!
	
  
Hides	
  vocabulary	
  intricacies	
  from	
  end-­‐user	
  
Access	
  to	
  metadata	
  about	
  rela*onal	
  databases	
  
Preview	
  of	
  generated	
  triples	
  and	
  SQL	
  queries	
  
Very	
  expressive	
  (Supports	
  most	
  of	
  R2RML)	
  

SPARQL	
  Endpoint	
  

Rela*onal	
  
Database	
  

See our R2RML Mapping Editor in the ISWC Demo Session on Wednesday!
Scale	
  
MusicBrainz	
  RDF	
  derived	
  via	
  R2RML:	
  

150M
Triples

lb:artist_member	
  a	
  rr:TriplesMap	
  ;	
  
	
  	
  rr:logicalTable	
  [rr:sqlQuery	
  
	
  	
  	
  	
  """SELECT	
  a1.gid,	
  a2.gid	
  AS	
  band	
  
	
  	
  	
  	
  	
  	
  	
  FROM	
  artist	
  a1	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  INNER	
  JOIN	
  l_artist_artist	
  ON	
  a1.id	
  =	
  
l_artist_artist.entity0	
  	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  INNER	
  JOIN	
  link	
  ON	
  l_artist_artist.link	
  =	
  link.id	
  	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  INNER	
  JOIN	
  link_type	
  ON	
  link_type	
  =	
  link_type.id	
  	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  INNER	
  JOIN	
  artist	
  a2	
  on	
  l_artist_artist.entity1	
  =	
  a2.id	
  	
  
	
  	
  	
  	
  	
  	
  	
  WHERE	
  
link_type.gid='5be4c609-­‐9afa-­‐4ea0-­‐910b-­‐12ffb71e3821'"""]	
  ;	
  
	
  	
  rr:subjectMap	
  [rr:template	
  "http://musicbrainz.org/artist/{gid}
#_"]	
  ;	
  
	
  	
  rr:predicateObjectMap	
  	
  
	
  	
  	
  	
  [rr:predicate	
  mo:member_of	
  ;	
  
	
  	
  	
  	
  	
  rr:objectMap	
  [rr:template	
  "http://musicbrainz.org/artist/{band}
#_"	
  ;	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  rr:termType	
  rr:IRI]]	
  .	
  
Some	
  Sta=s=cs	
  –	
  RDF	
  Dump	
  
(Lead) Table
area
artist
dbpedia
label
medium
recording
release_group
release
track
work

Triples
59798
36868228
172017
201832
18069143
11400354
3050818
9764887
75506495
1728955
156822527

Time (s)
2
423
13
3
163
209
31
151
794
20
1809
Informa=on	
  Workbench	
  
PlaGorm	
  for	
  Linked	
  Data	
  Applica=ons	
  
§  Seman*cs-­‐	
  &	
  Linked	
  Data-­‐based	
  
integra=on	
  of	
  private	
  and	
  public	
  
data	
  sources	
  based	
  on	
  data	
  
providers	
  
• 
• 
• 

Generic	
  and	
  specific	
  providers	
  for	
  
various	
  data	
  formats	
  and	
  sources	
  
Supports	
  established	
  mapping	
  
frameworks	
  (e.g.	
  R2RML,	
  SILK,	
  …)	
  
Named	
  graphs	
  for	
  managing	
  
contexts	
  and	
  provenance	
  

§  Intelligent	
  Data	
  Access	
  and	
  Analy=cs	
  
• 
• 
• 

Flexible	
  self-­‐service	
  UI	
  
Visualiza*on,	
  explora*on,	
  
dashboarding	
  and	
  repor*ng	
  
Seman*c	
  search	
  

§  Collabora=on	
  and	
  knowledge	
  
management	
  
• 
• 

Cura*on	
  &	
  authoring	
  
Collabora*ve	
  workflows	
  

§ 

	
  

Open	
  standards	
  and	
  technologies	
  

• 
• 
• 

Seman*c	
  Wiki	
  based	
  frontend	
  	
  
(Using	
  SMW	
  Syntax)	
  	
  
Suppor*ng	
  W3C	
  standards	
  (OWL,	
  RDF,	
  
SPARQL,,	
  …)	
  
Community	
  Edi*on	
  (Open	
  Source)	
  +	
  
Enterprise	
  Edi*on	
  (Commercial)	
  
Realiza=on	
  within	
  the	
  	
  
Informa=on	
  Workbench	
  Architecture	
  
Customized	
  applica*on	
  
solu*ons	
  
Reusable	
  UI	
  and	
  data	
  
integra*on	
  components	
  	
  
Data	
  storage	
  and	
  
management	
  plajorm	
  
External	
  resources	
  to	
  reuse	
  
data	
  and	
  create	
  mashups	
  
The	
  “MusicBrainz	
  Explorer”	
  Applica=on	
  
Music Ontology
Ontology

Data

R2RML
Data Providers

Templates

Widgets
Ontology	
  as	
  a	
  “Structural	
  Backbone”	
  
Resource	
  page	
  
	
  
	
  
	
  

Defining	
  
UI	
  
structure
	
  

Resource	
  page	
  
	
  
	
  
	
  

mo:Track	
  
mo:Ar=st	
  

Defining	
  
data	
  
structure
	
  

rdf:type	
  

Yesterday	
  

UI	
  templates	
  
Template:	
  …	
  
	
  
Template:mo:Track	
  
	
   	
  
Template:mo:Ar=st	
  
	
   	
  
	
  
	
  
	
  
	
  

Ontology	
  
(RDFS/OWL)	
  

rdf:type	
  
The_Beatles	
  

RDF	
  Data	
  
Graph	
  
Information	
  Workbench:	
  	
  
Browsing	
  a	
  Music	
  Artist	
  
Information	
  Workbench:	
  	
  
Visualization	
  techniques	
  
Naviga=on	
  Through	
  the	
  Data	
  

Source: http://musicbrainz.fluidops.net/resource/Analytical5
SPARQL	
  visualization	
  
Top ten The Beatles releases according to the sum of
track durations in minutes
SPARQL	
  Query	
  	
  
SELECT	
  ?release	
  	
  
	
  	
  	
  	
  	
  	
  	
  ((SUM(xsd:double(?duration/60000)))	
  AS	
  ?avg)	
  	
  
WHERE	
  {	
  	
  
	
  <http://dbpedia.org/resource/The_Beatles>	
  	
  	
  
	
  	
  	
  	
  	
  	
  	
  foaf:made	
  ?release	
  .	
  
	
  ?release	
  mo:record	
  ?record	
  .	
  
	
  ?record	
  mo:track	
  ?track	
  .	
  
	
  ?track	
  mo:duration	
  ?duration	
  .}	
  	
  
GROUP	
  BY	
  ?release	
  
ORDER	
  BY	
  DESC(?avg)	
  
LIMIT	
  10	
  

Result	
  set	
  
SPARQL	
  visualization	
  
Top ten The Beatles releases according to the sum of track durations
in minutes
Widget	
  
{{#widget:	
  BarChart	
  |	
  
query	
  ='SELECT	
  (COUNT(?Release)	
  AS	
  ?COUNT)	
  ?
label	
  WHERE	
  {	
  	
  	
  	
  
<http://musicbrainz.org/artist/8538e728-­‐ca0b-­‐4321-­‐b7e5-­‐
cff6565dd4c0#_>	
  foaf:made	
  ?Release.	
  	
  

	
  ?Release	
  rdf:type	
  mo:Release	
  .	
  
	
  ?Release	
  dc:title	
  ?label	
  .}	
  
GROUP	
  BY	
  ?label	
  
ORDER	
  BY	
  DESC(?COUNT)	
  
LIMIT	
  20'	
  
|	
  settings	
  =	
  'Settings:barvertical_mb'	
  	
  
|	
  asynch	
  =	
  'true'	
  
|	
  input	
  =	
  'label'	
  
|	
  output	
  =	
  'COUNT'	
  
|	
  height	
  =	
  '300’}}	
  

Visualization:	
  Bar	
  chart	
  
Information	
  Workbench:	
  	
  
SPARQL	
  visualization	
  
Top ten The Beatles releases according to the sum of track
durations in minutes
Other	
  visualiza*ons	
  of	
  the	
  same	
  result	
  set	
  …	
  

Line	
  chart:	
  

Pie	
  chart:	
  
Automated	
  Widget	
  Suggestion	
  

1	
  

Table
Pivot
view
Bar chart
Line chart
Pie chart

2	
   Select a suggested visualization

3	
  

Visualization
automatically
built
Try	
  it	
  out!	
  
R2RML	
  Mappings	
  
• 

h`ps://github.com/LinkedBrainz/MusicBrainz-­‐R2RML	
  

MusicBrainz	
  RDF	
  Dump	
  
• 

h`p://mbsandbox.org/~barry/	
  

MusicBrainz	
  Linked	
  Data	
  Demo	
  system	
  
•  h`p://musicbrainz.fluidops.net/	
  
Informa*on	
  Workbench	
  
• 

h`p://www.fluidops.com/informa*on-­‐workbench/	
  

Euclid	
  Project	
  
• 

	
  
	
  

h`p://euclid-­‐project.eu/	
  
Acknowledgements	
  
The	
  Euclid	
  Project	
  
Barry	
  Norton	
  	
  
Michael	
  Meier	
  
Andriy	
  Nikolov	
  
Yves	
  Raimond	
  
Kurt	
  Jacobson	
  
Thomas	
  Gaengler	
  
Juan	
  Sequeda	
  
Simon	
  Dixon	
  
	
  
(in	
  no	
  par;cular	
  order)	
  

	
  
Thank	
  you!	
  
Contact	
  
	
  
Peter	
  Haase	
  
fluid	
  Opera*ons	
  AG	
  
Altro`str.	
  31	
  
Walldorf	
  
Germany	
  
	
  
+49	
  (0)	
  6227	
  358087-­‐0	
  
www.fluidops.com	
  
peter.haase@fluidOps.com	
  
	
  

Weitere ähnliche Inhalte

Was ist angesagt?

WISE2017 - Factorization Machines Leveraging Lightweight Linked Open Data-ena...
WISE2017 - Factorization Machines Leveraging Lightweight Linked Open Data-ena...WISE2017 - Factorization Machines Leveraging Lightweight Linked Open Data-ena...
WISE2017 - Factorization Machines Leveraging Lightweight Linked Open Data-ena...GUANGYUAN PIAO
 
Apache Spark's Built-in File Sources in Depth
Apache Spark's Built-in File Sources in DepthApache Spark's Built-in File Sources in Depth
Apache Spark's Built-in File Sources in DepthDatabricks
 
FedX - Optimization Techniques for Federated Query Processing on Linked Data
FedX - Optimization Techniques for Federated Query Processing on Linked DataFedX - Optimization Techniques for Federated Query Processing on Linked Data
FedX - Optimization Techniques for Federated Query Processing on Linked Dataaschwarte
 
Introduction to DBIx::Class
Introduction to DBIx::ClassIntroduction to DBIx::Class
Introduction to DBIx::ClassDoran Barton
 
GDG Meets U event - Big data & Wikidata - no lies codelab
GDG Meets U event - Big data & Wikidata -  no lies codelabGDG Meets U event - Big data & Wikidata -  no lies codelab
GDG Meets U event - Big data & Wikidata - no lies codelabCAMELIA BOBAN
 
Maximising (Re)Usability of Library metadata using Linked Data
Maximising (Re)Usability of Library metadata using Linked Data Maximising (Re)Usability of Library metadata using Linked Data
Maximising (Re)Usability of Library metadata using Linked Data Asuncion Gomez-Perez
 
HiBISCuS: Hypergraph-Based Source Selection for SPARQL Endpoint Federation
HiBISCuS: Hypergraph-Based Source Selection for SPARQL Endpoint FederationHiBISCuS: Hypergraph-Based Source Selection for SPARQL Endpoint Federation
HiBISCuS: Hypergraph-Based Source Selection for SPARQL Endpoint FederationMuhammad Saleem
 
Session 19 - MapReduce
Session 19  - MapReduce Session 19  - MapReduce
Session 19 - MapReduce AnandMHadoop
 
Introduction to R and R Studio
Introduction to R and R StudioIntroduction to R and R Studio
Introduction to R and R StudioRupak Roy
 
Semantic web and Drupal: an introduction
Semantic web and Drupal: an introductionSemantic web and Drupal: an introduction
Semantic web and Drupal: an introductionKristof Van Tomme
 
Federated SPARQL query processing over the Web of Data
Federated SPARQL query processing over the Web of DataFederated SPARQL query processing over the Web of Data
Federated SPARQL query processing over the Web of DataMuhammad Saleem
 
Flagis linked open_data_stijn_goedertier
Flagis linked open_data_stijn_goedertierFlagis linked open_data_stijn_goedertier
Flagis linked open_data_stijn_goedertierFlagis VZW
 
E-ARK-iPRES2016-Bern-October-2016
E-ARK-iPRES2016-Bern-October-2016E-ARK-iPRES2016-Bern-October-2016
E-ARK-iPRES2016-Bern-October-2016Sven Schlarb
 
MapReduce Design Patterns
MapReduce Design PatternsMapReduce Design Patterns
MapReduce Design PatternsDonald Miner
 

Was ist angesagt? (20)

SWT Lecture Session 3 - SPARQL
SWT Lecture Session 3 - SPARQLSWT Lecture Session 3 - SPARQL
SWT Lecture Session 3 - SPARQL
 
WISE2017 - Factorization Machines Leveraging Lightweight Linked Open Data-ena...
WISE2017 - Factorization Machines Leveraging Lightweight Linked Open Data-ena...WISE2017 - Factorization Machines Leveraging Lightweight Linked Open Data-ena...
WISE2017 - Factorization Machines Leveraging Lightweight Linked Open Data-ena...
 
Apache Spark's Built-in File Sources in Depth
Apache Spark's Built-in File Sources in DepthApache Spark's Built-in File Sources in Depth
Apache Spark's Built-in File Sources in Depth
 
FedX - Optimization Techniques for Federated Query Processing on Linked Data
FedX - Optimization Techniques for Federated Query Processing on Linked DataFedX - Optimization Techniques for Federated Query Processing on Linked Data
FedX - Optimization Techniques for Federated Query Processing on Linked Data
 
Introduction to DBIx::Class
Introduction to DBIx::ClassIntroduction to DBIx::Class
Introduction to DBIx::Class
 
HUG France - Apache Drill
HUG France - Apache DrillHUG France - Apache Drill
HUG France - Apache Drill
 
GDG Meets U event - Big data & Wikidata - no lies codelab
GDG Meets U event - Big data & Wikidata -  no lies codelabGDG Meets U event - Big data & Wikidata -  no lies codelab
GDG Meets U event - Big data & Wikidata - no lies codelab
 
Maximising (Re)Usability of Library metadata using Linked Data
Maximising (Re)Usability of Library metadata using Linked Data Maximising (Re)Usability of Library metadata using Linked Data
Maximising (Re)Usability of Library metadata using Linked Data
 
HiBISCuS: Hypergraph-Based Source Selection for SPARQL Endpoint Federation
HiBISCuS: Hypergraph-Based Source Selection for SPARQL Endpoint FederationHiBISCuS: Hypergraph-Based Source Selection for SPARQL Endpoint Federation
HiBISCuS: Hypergraph-Based Source Selection for SPARQL Endpoint Federation
 
GitHubGraph
GitHubGraphGitHubGraph
GitHubGraph
 
Session 19 - MapReduce
Session 19  - MapReduce Session 19  - MapReduce
Session 19 - MapReduce
 
Introduction to R and R Studio
Introduction to R and R StudioIntroduction to R and R Studio
Introduction to R and R Studio
 
Semantic web and Drupal: an introduction
Semantic web and Drupal: an introductionSemantic web and Drupal: an introduction
Semantic web and Drupal: an introduction
 
Subsetting
SubsettingSubsetting
Subsetting
 
Federated SPARQL query processing over the Web of Data
Federated SPARQL query processing over the Web of DataFederated SPARQL query processing over the Web of Data
Federated SPARQL query processing over the Web of Data
 
Flagis linked open_data_stijn_goedertier
Flagis linked open_data_stijn_goedertierFlagis linked open_data_stijn_goedertier
Flagis linked open_data_stijn_goedertier
 
Apache PIG
Apache PIGApache PIG
Apache PIG
 
E-ARK-iPRES2016-Bern-October-2016
E-ARK-iPRES2016-Bern-October-2016E-ARK-iPRES2016-Bern-October-2016
E-ARK-iPRES2016-Bern-October-2016
 
Hive(ppt)
Hive(ppt)Hive(ppt)
Hive(ppt)
 
MapReduce Design Patterns
MapReduce Design PatternsMapReduce Design Patterns
MapReduce Design Patterns
 

Andere mochten auch

Using puppet, foreman and git to develop and operate a large scale internet s...
Using puppet, foreman and git to develop and operate a large scale internet s...Using puppet, foreman and git to develop and operate a large scale internet s...
Using puppet, foreman and git to develop and operate a large scale internet s...techblog
 
Continuously-Integrated Puppet in a Dynamic Environment
Continuously-Integrated Puppet in a Dynamic EnvironmentContinuously-Integrated Puppet in a Dynamic Environment
Continuously-Integrated Puppet in a Dynamic EnvironmentPuppet
 
Better encryption & security with MariaDB 10.1 & MySQL 5.7
Better encryption & security with MariaDB 10.1 & MySQL 5.7Better encryption & security with MariaDB 10.1 & MySQL 5.7
Better encryption & security with MariaDB 10.1 & MySQL 5.7Colin Charles
 
Ruby application based on http
Ruby application based on httpRuby application based on http
Ruby application based on httpRichard Huang
 
vSphere APIs for performance monitoring
vSphere APIs for performance monitoringvSphere APIs for performance monitoring
vSphere APIs for performance monitoringAlan Renouf
 
Dlsecyx pgroammr (Dyslexic Programmer - cool stuff for scaling)
Dlsecyx pgroammr (Dyslexic Programmer - cool stuff for scaling)Dlsecyx pgroammr (Dyslexic Programmer - cool stuff for scaling)
Dlsecyx pgroammr (Dyslexic Programmer - cool stuff for scaling)Gleicon Moraes
 
PostgreSQL Materialized Views with Active Record
PostgreSQL Materialized Views with Active RecordPostgreSQL Materialized Views with Active Record
PostgreSQL Materialized Views with Active RecordDavid Roberts
 
The Complete MariaDB Server Tutorial - Percona Live 2015
The Complete MariaDB Server Tutorial - Percona Live 2015The Complete MariaDB Server Tutorial - Percona Live 2015
The Complete MariaDB Server Tutorial - Percona Live 2015Colin Charles
 
Redis — The AK-47 of Post-relational Databases
Redis — The AK-47 of Post-relational DatabasesRedis — The AK-47 of Post-relational Databases
Redis — The AK-47 of Post-relational DatabasesKarel Minarik
 
Taking Control of Chaos with Docker and Puppet
Taking Control of Chaos with Docker and PuppetTaking Control of Chaos with Docker and Puppet
Taking Control of Chaos with Docker and PuppetPuppet
 
Detecting headless browsers
Detecting headless browsersDetecting headless browsers
Detecting headless browsersSergey Shekyan
 
Monitoring in an Infrastructure as Code Age
Monitoring in an Infrastructure as Code AgeMonitoring in an Infrastructure as Code Age
Monitoring in an Infrastructure as Code AgePuppet
 
How to make keynote like presentation with markdown
How to make keynote like presentation with markdownHow to make keynote like presentation with markdown
How to make keynote like presentation with markdownHiroaki NAKADA
 
Continuous Development with Jenkins - Stephen Connolly at PuppetCamp Dublin '12
Continuous Development with Jenkins - Stephen Connolly at PuppetCamp Dublin '12Continuous Development with Jenkins - Stephen Connolly at PuppetCamp Dublin '12
Continuous Development with Jenkins - Stephen Connolly at PuppetCamp Dublin '12Puppet
 
Lessons I Learned While Scaling to 5000 Puppet Agents
Lessons I Learned While Scaling to 5000 Puppet AgentsLessons I Learned While Scaling to 5000 Puppet Agents
Lessons I Learned While Scaling to 5000 Puppet AgentsPuppet
 
Orchestrating Docker containers at scale
Orchestrating Docker containers at scaleOrchestrating Docker containers at scale
Orchestrating Docker containers at scaleMaciej Lasyk
 

Andere mochten auch (18)

Using puppet, foreman and git to develop and operate a large scale internet s...
Using puppet, foreman and git to develop and operate a large scale internet s...Using puppet, foreman and git to develop and operate a large scale internet s...
Using puppet, foreman and git to develop and operate a large scale internet s...
 
Continuously-Integrated Puppet in a Dynamic Environment
Continuously-Integrated Puppet in a Dynamic EnvironmentContinuously-Integrated Puppet in a Dynamic Environment
Continuously-Integrated Puppet in a Dynamic Environment
 
JSON and the APInauts
JSON and the APInautsJSON and the APInauts
JSON and the APInauts
 
Better encryption & security with MariaDB 10.1 & MySQL 5.7
Better encryption & security with MariaDB 10.1 & MySQL 5.7Better encryption & security with MariaDB 10.1 & MySQL 5.7
Better encryption & security with MariaDB 10.1 & MySQL 5.7
 
Sensu
SensuSensu
Sensu
 
Ruby application based on http
Ruby application based on httpRuby application based on http
Ruby application based on http
 
vSphere APIs for performance monitoring
vSphere APIs for performance monitoringvSphere APIs for performance monitoring
vSphere APIs for performance monitoring
 
Dlsecyx pgroammr (Dyslexic Programmer - cool stuff for scaling)
Dlsecyx pgroammr (Dyslexic Programmer - cool stuff for scaling)Dlsecyx pgroammr (Dyslexic Programmer - cool stuff for scaling)
Dlsecyx pgroammr (Dyslexic Programmer - cool stuff for scaling)
 
PostgreSQL Materialized Views with Active Record
PostgreSQL Materialized Views with Active RecordPostgreSQL Materialized Views with Active Record
PostgreSQL Materialized Views with Active Record
 
The Complete MariaDB Server Tutorial - Percona Live 2015
The Complete MariaDB Server Tutorial - Percona Live 2015The Complete MariaDB Server Tutorial - Percona Live 2015
The Complete MariaDB Server Tutorial - Percona Live 2015
 
Redis — The AK-47 of Post-relational Databases
Redis — The AK-47 of Post-relational DatabasesRedis — The AK-47 of Post-relational Databases
Redis — The AK-47 of Post-relational Databases
 
Taking Control of Chaos with Docker and Puppet
Taking Control of Chaos with Docker and PuppetTaking Control of Chaos with Docker and Puppet
Taking Control of Chaos with Docker and Puppet
 
Detecting headless browsers
Detecting headless browsersDetecting headless browsers
Detecting headless browsers
 
Monitoring in an Infrastructure as Code Age
Monitoring in an Infrastructure as Code AgeMonitoring in an Infrastructure as Code Age
Monitoring in an Infrastructure as Code Age
 
How to make keynote like presentation with markdown
How to make keynote like presentation with markdownHow to make keynote like presentation with markdown
How to make keynote like presentation with markdown
 
Continuous Development with Jenkins - Stephen Connolly at PuppetCamp Dublin '12
Continuous Development with Jenkins - Stephen Connolly at PuppetCamp Dublin '12Continuous Development with Jenkins - Stephen Connolly at PuppetCamp Dublin '12
Continuous Development with Jenkins - Stephen Connolly at PuppetCamp Dublin '12
 
Lessons I Learned While Scaling to 5000 Puppet Agents
Lessons I Learned While Scaling to 5000 Puppet AgentsLessons I Learned While Scaling to 5000 Puppet Agents
Lessons I Learned While Scaling to 5000 Puppet Agents
 
Orchestrating Docker containers at scale
Orchestrating Docker containers at scaleOrchestrating Docker containers at scale
Orchestrating Docker containers at scale
 

Ähnlich wie Mapping, Interlinking and Exposing MusicBrainz as Linked Data

Applying large scale text analytics with graph databases
Applying large scale text analytics with graph databasesApplying large scale text analytics with graph databases
Applying large scale text analytics with graph databasesData Ninja API
 
Analyze one year of radio station songs aired with Spark SQL, Spotify, and Da...
Analyze one year of radio station songs aired with Spark SQL, Spotify, and Da...Analyze one year of radio station songs aired with Spark SQL, Spotify, and Da...
Analyze one year of radio station songs aired with Spark SQL, Spotify, and Da...Paul Leclercq
 
Intro to Linked, Dutch Ships and Sailors and SPARQL handson
Intro to Linked, Dutch Ships and Sailors and SPARQL handson Intro to Linked, Dutch Ships and Sailors and SPARQL handson
Intro to Linked, Dutch Ships and Sailors and SPARQL handson Victor de Boer
 
Knowledge graph construction with a façade - The SPARQL Anything Project
Knowledge graph construction with a façade - The SPARQL Anything ProjectKnowledge graph construction with a façade - The SPARQL Anything Project
Knowledge graph construction with a façade - The SPARQL Anything ProjectEnrico Daga
 
Consuming linked data by machines
Consuming linked data by machinesConsuming linked data by machines
Consuming linked data by machinesPatrick Sinclair
 
A hands on overview of the semantic web
A hands on overview of the semantic webA hands on overview of the semantic web
A hands on overview of the semantic webMarakana Inc.
 
Wi2015 - Clustering of Linked Open Data - the LODeX tool
Wi2015 - Clustering of Linked Open Data - the LODeX toolWi2015 - Clustering of Linked Open Data - the LODeX tool
Wi2015 - Clustering of Linked Open Data - the LODeX toolLaura Po
 
Data integration with a façade. The case of knowledge graph construction.
Data integration with a façade. The case of knowledge graph construction.Data integration with a façade. The case of knowledge graph construction.
Data integration with a façade. The case of knowledge graph construction.Enrico Daga
 
A Little SPARQL in your Analytics
A Little SPARQL in your AnalyticsA Little SPARQL in your Analytics
A Little SPARQL in your AnalyticsDr. Neil Brittliff
 
Exposing Bibliographic Information as Linked Open Data using Standards-based ...
Exposing Bibliographic Information as Linked Open Data using Standards-based ...Exposing Bibliographic Information as Linked Open Data using Standards-based ...
Exposing Bibliographic Information as Linked Open Data using Standards-based ...Nikolaos Konstantinou
 
RDA data, linked data, and benefits for users / Gordon Dunsire
RDA data, linked data, and benefits for users / Gordon DunsireRDA data, linked data, and benefits for users / Gordon Dunsire
RDA data, linked data, and benefits for users / Gordon DunsireCIGScotland
 
Triplewave: a step towards RDF Stream Processing on the Web
Triplewave: a step towards RDF Stream Processing on the WebTriplewave: a step towards RDF Stream Processing on the Web
Triplewave: a step towards RDF Stream Processing on the WebDaniele Dell'Aglio
 
Multimedia Data Navigation and the Semantic Web (SemTech 2006)
Multimedia Data Navigation and the Semantic Web (SemTech 2006)Multimedia Data Navigation and the Semantic Web (SemTech 2006)
Multimedia Data Navigation and the Semantic Web (SemTech 2006)Bradley Allen
 
State of the Semantic Web
State of the Semantic WebState of the Semantic Web
State of the Semantic WebIvan Herman
 
Triplestore and SPARQL
Triplestore and SPARQLTriplestore and SPARQL
Triplestore and SPARQLLino Valdivia
 
Introduction to Data Mining with R and Data Import/Export in R
Introduction to Data Mining with R and Data Import/Export in RIntroduction to Data Mining with R and Data Import/Export in R
Introduction to Data Mining with R and Data Import/Export in RYanchang Zhao
 
Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data...
Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data...Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data...
Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data...andimou
 
Triplificating and linking XBRL financial data
Triplificating and linking XBRL financial dataTriplificating and linking XBRL financial data
Triplificating and linking XBRL financial dataRoberto García
 
Transient and persistent RDF views over relational databases in the context o...
Transient and persistent RDF views over relational databases in the context o...Transient and persistent RDF views over relational databases in the context o...
Transient and persistent RDF views over relational databases in the context o...Nikolaos Konstantinou
 

Ähnlich wie Mapping, Interlinking and Exposing MusicBrainz as Linked Data (20)

Applying large scale text analytics with graph databases
Applying large scale text analytics with graph databasesApplying large scale text analytics with graph databases
Applying large scale text analytics with graph databases
 
Analyze one year of radio station songs aired with Spark SQL, Spotify, and Da...
Analyze one year of radio station songs aired with Spark SQL, Spotify, and Da...Analyze one year of radio station songs aired with Spark SQL, Spotify, and Da...
Analyze one year of radio station songs aired with Spark SQL, Spotify, and Da...
 
Intro to Linked, Dutch Ships and Sailors and SPARQL handson
Intro to Linked, Dutch Ships and Sailors and SPARQL handson Intro to Linked, Dutch Ships and Sailors and SPARQL handson
Intro to Linked, Dutch Ships and Sailors and SPARQL handson
 
Knowledge graph construction with a façade - The SPARQL Anything Project
Knowledge graph construction with a façade - The SPARQL Anything ProjectKnowledge graph construction with a façade - The SPARQL Anything Project
Knowledge graph construction with a façade - The SPARQL Anything Project
 
Consuming linked data by machines
Consuming linked data by machinesConsuming linked data by machines
Consuming linked data by machines
 
A hands on overview of the semantic web
A hands on overview of the semantic webA hands on overview of the semantic web
A hands on overview of the semantic web
 
Wi2015 - Clustering of Linked Open Data - the LODeX tool
Wi2015 - Clustering of Linked Open Data - the LODeX toolWi2015 - Clustering of Linked Open Data - the LODeX tool
Wi2015 - Clustering of Linked Open Data - the LODeX tool
 
Data integration with a façade. The case of knowledge graph construction.
Data integration with a façade. The case of knowledge graph construction.Data integration with a façade. The case of knowledge graph construction.
Data integration with a façade. The case of knowledge graph construction.
 
A Little SPARQL in your Analytics
A Little SPARQL in your AnalyticsA Little SPARQL in your Analytics
A Little SPARQL in your Analytics
 
Exposing Bibliographic Information as Linked Open Data using Standards-based ...
Exposing Bibliographic Information as Linked Open Data using Standards-based ...Exposing Bibliographic Information as Linked Open Data using Standards-based ...
Exposing Bibliographic Information as Linked Open Data using Standards-based ...
 
RDA data, linked data, and benefits for users / Gordon Dunsire
RDA data, linked data, and benefits for users / Gordon DunsireRDA data, linked data, and benefits for users / Gordon Dunsire
RDA data, linked data, and benefits for users / Gordon Dunsire
 
Triplewave: a step towards RDF Stream Processing on the Web
Triplewave: a step towards RDF Stream Processing on the WebTriplewave: a step towards RDF Stream Processing on the Web
Triplewave: a step towards RDF Stream Processing on the Web
 
Multimedia Data Navigation and the Semantic Web (SemTech 2006)
Multimedia Data Navigation and the Semantic Web (SemTech 2006)Multimedia Data Navigation and the Semantic Web (SemTech 2006)
Multimedia Data Navigation and the Semantic Web (SemTech 2006)
 
State of the Semantic Web
State of the Semantic WebState of the Semantic Web
State of the Semantic Web
 
Triplestore and SPARQL
Triplestore and SPARQLTriplestore and SPARQL
Triplestore and SPARQL
 
Introduction to Data Mining with R and Data Import/Export in R
Introduction to Data Mining with R and Data Import/Export in RIntroduction to Data Mining with R and Data Import/Export in R
Introduction to Data Mining with R and Data Import/Export in R
 
Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data...
Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data...Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data...
Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data...
 
Triplificating and linking XBRL financial data
Triplificating and linking XBRL financial dataTriplificating and linking XBRL financial data
Triplificating and linking XBRL financial data
 
Transient and persistent RDF views over relational databases in the context o...
Transient and persistent RDF views over relational databases in the context o...Transient and persistent RDF views over relational databases in the context o...
Transient and persistent RDF views over relational databases in the context o...
 
Semantic Web talk TEMPLATE
Semantic Web talk TEMPLATESemantic Web talk TEMPLATE
Semantic Web talk TEMPLATE
 

Mehr von Peter Haase

Visual Ontology Modeling for Domain Experts and Business Users with metaphactory
Visual Ontology Modeling for Domain Experts and Business Users with metaphactoryVisual Ontology Modeling for Domain Experts and Business Users with metaphactory
Visual Ontology Modeling for Domain Experts and Business Users with metaphactoryPeter Haase
 
Hybrid Enterprise Knowledge Graphs
Hybrid Enterprise Knowledge GraphsHybrid Enterprise Knowledge Graphs
Hybrid Enterprise Knowledge GraphsPeter Haase
 
Ephedra: efficiently combining RDF data and services using SPARQL federation
Ephedra: efficiently combining RDF data and services using SPARQL federationEphedra: efficiently combining RDF data and services using SPARQL federation
Ephedra: efficiently combining RDF data and services using SPARQL federationPeter Haase
 
Building Enterprise-Ready Knowledge Graph Applications in the Cloud
Building Enterprise-Ready Knowledge Graph Applications in the CloudBuilding Enterprise-Ready Knowledge Graph Applications in the Cloud
Building Enterprise-Ready Knowledge Graph Applications in the CloudPeter Haase
 
ESWC 2017 Tutorial Knowledge Graphs
ESWC 2017 Tutorial Knowledge GraphsESWC 2017 Tutorial Knowledge Graphs
ESWC 2017 Tutorial Knowledge GraphsPeter Haase
 
Getting Started with Knowledge Graphs
Getting Started with Knowledge GraphsGetting Started with Knowledge Graphs
Getting Started with Knowledge GraphsPeter Haase
 
Smart Data Applications powered by the Wikidata Knowledge Graph
Smart Data Applications powered by the Wikidata Knowledge GraphSmart Data Applications powered by the Wikidata Knowledge Graph
Smart Data Applications powered by the Wikidata Knowledge GraphPeter Haase
 
Discovering Related Data Sources in Data Portals
Discovering Related Data Sources in Data PortalsDiscovering Related Data Sources in Data Portals
Discovering Related Data Sources in Data PortalsPeter Haase
 
The Information Workbench - Linked Data and Semantic Wikis in the Enterprise
The Information Workbench - Linked Data and Semantic Wikis in the EnterpriseThe Information Workbench - Linked Data and Semantic Wikis in the Enterprise
The Information Workbench - Linked Data and Semantic Wikis in the EnterprisePeter Haase
 
On demand access to Big Data through Semantic Technologies
 On demand access to Big Data through Semantic Technologies On demand access to Big Data through Semantic Technologies
On demand access to Big Data through Semantic TechnologiesPeter Haase
 
Linked Data as a Service
Linked Data as a ServiceLinked Data as a Service
Linked Data as a ServicePeter Haase
 
Fedbench - A Benchmark Suite for Federated Semantic Data Processing
Fedbench - A Benchmark Suite for Federated Semantic Data ProcessingFedbench - A Benchmark Suite for Federated Semantic Data Processing
Fedbench - A Benchmark Suite for Federated Semantic Data ProcessingPeter Haase
 
Everything Self-Service:Linked Data Applications with the Information Workbench
Everything Self-Service:Linked Data Applications with the Information WorkbenchEverything Self-Service:Linked Data Applications with the Information Workbench
Everything Self-Service:Linked Data Applications with the Information WorkbenchPeter Haase
 
The Information Workbench as a Self-Service Platform for Linked Data Applicat...
The Information Workbench as a Self-Service Platform for Linked Data Applicat...The Information Workbench as a Self-Service Platform for Linked Data Applicat...
The Information Workbench as a Self-Service Platform for Linked Data Applicat...Peter Haase
 
Cloud-based Linked Data Management for Self-service Application Development
Cloud-based Linked Data Management for Self-service Application DevelopmentCloud-based Linked Data Management for Self-service Application Development
Cloud-based Linked Data Management for Self-service Application DevelopmentPeter Haase
 
Semantic Technologies for Enterprise Cloud Management
Semantic Technologies for Enterprise Cloud ManagementSemantic Technologies for Enterprise Cloud Management
Semantic Technologies for Enterprise Cloud ManagementPeter Haase
 

Mehr von Peter Haase (16)

Visual Ontology Modeling for Domain Experts and Business Users with metaphactory
Visual Ontology Modeling for Domain Experts and Business Users with metaphactoryVisual Ontology Modeling for Domain Experts and Business Users with metaphactory
Visual Ontology Modeling for Domain Experts and Business Users with metaphactory
 
Hybrid Enterprise Knowledge Graphs
Hybrid Enterprise Knowledge GraphsHybrid Enterprise Knowledge Graphs
Hybrid Enterprise Knowledge Graphs
 
Ephedra: efficiently combining RDF data and services using SPARQL federation
Ephedra: efficiently combining RDF data and services using SPARQL federationEphedra: efficiently combining RDF data and services using SPARQL federation
Ephedra: efficiently combining RDF data and services using SPARQL federation
 
Building Enterprise-Ready Knowledge Graph Applications in the Cloud
Building Enterprise-Ready Knowledge Graph Applications in the CloudBuilding Enterprise-Ready Knowledge Graph Applications in the Cloud
Building Enterprise-Ready Knowledge Graph Applications in the Cloud
 
ESWC 2017 Tutorial Knowledge Graphs
ESWC 2017 Tutorial Knowledge GraphsESWC 2017 Tutorial Knowledge Graphs
ESWC 2017 Tutorial Knowledge Graphs
 
Getting Started with Knowledge Graphs
Getting Started with Knowledge GraphsGetting Started with Knowledge Graphs
Getting Started with Knowledge Graphs
 
Smart Data Applications powered by the Wikidata Knowledge Graph
Smart Data Applications powered by the Wikidata Knowledge GraphSmart Data Applications powered by the Wikidata Knowledge Graph
Smart Data Applications powered by the Wikidata Knowledge Graph
 
Discovering Related Data Sources in Data Portals
Discovering Related Data Sources in Data PortalsDiscovering Related Data Sources in Data Portals
Discovering Related Data Sources in Data Portals
 
The Information Workbench - Linked Data and Semantic Wikis in the Enterprise
The Information Workbench - Linked Data and Semantic Wikis in the EnterpriseThe Information Workbench - Linked Data and Semantic Wikis in the Enterprise
The Information Workbench - Linked Data and Semantic Wikis in the Enterprise
 
On demand access to Big Data through Semantic Technologies
 On demand access to Big Data through Semantic Technologies On demand access to Big Data through Semantic Technologies
On demand access to Big Data through Semantic Technologies
 
Linked Data as a Service
Linked Data as a ServiceLinked Data as a Service
Linked Data as a Service
 
Fedbench - A Benchmark Suite for Federated Semantic Data Processing
Fedbench - A Benchmark Suite for Federated Semantic Data ProcessingFedbench - A Benchmark Suite for Federated Semantic Data Processing
Fedbench - A Benchmark Suite for Federated Semantic Data Processing
 
Everything Self-Service:Linked Data Applications with the Information Workbench
Everything Self-Service:Linked Data Applications with the Information WorkbenchEverything Self-Service:Linked Data Applications with the Information Workbench
Everything Self-Service:Linked Data Applications with the Information Workbench
 
The Information Workbench as a Self-Service Platform for Linked Data Applicat...
The Information Workbench as a Self-Service Platform for Linked Data Applicat...The Information Workbench as a Self-Service Platform for Linked Data Applicat...
The Information Workbench as a Self-Service Platform for Linked Data Applicat...
 
Cloud-based Linked Data Management for Self-service Application Development
Cloud-based Linked Data Management for Self-service Application DevelopmentCloud-based Linked Data Management for Self-service Application Development
Cloud-based Linked Data Management for Self-service Application Development
 
Semantic Technologies for Enterprise Cloud Management
Semantic Technologies for Enterprise Cloud ManagementSemantic Technologies for Enterprise Cloud Management
Semantic Technologies for Enterprise Cloud Management
 

Kürzlich hochgeladen

Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 

Kürzlich hochgeladen (20)

Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 

Mapping, Interlinking and Exposing MusicBrainz as Linked Data

  • 1. Mapping,  Interlinking  and   Exposing  MusicBrainz  as   Linked  Data   1st  Interna*onal  Workshop  on     Seman*c  Music  and  Media  (SMAM2013)   Sydney,  Oct  21,  2013   Peter  Haase  
  • 2. What  this  talk  is  about   A  Linked  Data  Perspec=ve   worksOn publishedTo affiliation affiliation (previous) isAbout builtWith participatesIn participatesIn
  • 3. EUCLID:  EdUca=onal  Curriculum  for  the   usage  of  LinkedData     http://www.euclid-project.eu Course eBook Other channels @euclid_project euclidproject euclidproject
  • 4. Analysis  &   Mining  Module   Visualiza*on   Module   RDFa   Data acquisition LD Dataset Access Application EUCLID  Scenario   SPARQL Endpoint Vocabulary   Mapping   Publishing Interlinking   Physical  Wrapper   Streaming providers Downloads Musical Content Cleansing   LD  Wrapper   R2R  Transf.   Integrated   Dataset   LD  Wrapper   RDF/   XML   Metadata Other content
  • 5. MusicBrainz   •  MusicBrainz  is  an  open  music  encyclopedia  that  collects   music  metadata  and  makes  it  available  to  the  public.   •  MusicBrainz  aims  to  be:   •   The  ul=mate  source  of  music  informa=on  by  allowing  anyone  to   contribute  and  releasing  the  data  under  open  licenses.   •   The  universal  lingua  franca  for  music  by  providing  a  reliable  and   unambiguous  form  of   music  iden*fica*on,  enabling  both  people  and  machines  to  have  meaningful   conversa*ons  about  music.   •  Like  Wikipedia,  MusicBrainz  is  maintained  by  a  global   community  of  users  and  we  want  everyone  —  including   you  —  to  par*cipate  and  contribute.   •  MusicBrainz  is  operated  by  the   MetaBrainz  Founda*on,  dedicated  to  keeping   MusicBrainz  free  and  open  source.  
  • 6. LD  Dataset   Access   Publishing  Rela=onal  Databases  as  RDF:   W3C  RDB2RDF   SPARQL   Endpoint   Publishing   Integrated   Data  in   Triplestore   Vocabulary   Mapping   Interlinking   R2RML   Engine   Cleansing   Task:  Publish  data  from   rela*onal  DBMS  as     Linked  Data     Approach:  map  from   rela*onal  schema  to   seman*c  vocabulary  with   R2RML     Publishing:  two  alterna*ves  –   Data  acquisi*on   •  •  Rela*onal   DBMS   Translate  SPARQL  into  SQL  on   the  fly   Batch  transform  data  into   RDF,  infer,  index  ,  integrate   and  provide  SPARQL  access  in   a  triplestore  
  • 7. Publishing  MusicBrainz   h"ps://wiki.musicbrainz.org/Next_Genera;on_Schema   MusicBrainz  DB      h"p://musicontology.com   Music   Ontology   R2RML   Concrete  Example  Mapping   Table  Recording(gid,  length)   R2RML  Mapping   Ontology  concept  mo:recording    
  • 8. MusicBrainz  Next  Gen  Schema   ar=st    As  pre-­‐NGS,  but              further  a`ributes   ar=st_credit    Allows  joint  credit   release_group    Cf.  ‘album’            versus:   •  work   release   •  track   medium     •  tracklist   •  recording   https://wiki.musicbrainz.org/Next_Generation_Schema
  • 9. Music  Ontology   OWL  ontology  with  following  core  concepts  (classes)  and   rela*onships  (proper*es):   Source: http://musicontology.com
  • 10. R2RML  Class  Mapping   Mapping  tables  to  classes  is  ‘easy’:     lb:Artist  a  rr:TriplesMap  ;      rr:logicalTable  [rr:tableName  "artist"]  ;      rr:subjectMap            [rr:class  mo:MusicArtist  ;            rr:template                        "http://musicbrainz.org/artist/{gid}#_"]  ;      rr:predicateObjectMap            [rr:predicate  mo:musicbrainz_guid  ;            rr:objectMap  [rr:column  "gid"  ;                                          rr:datatype  xsd:string]]  .    
  • 11. R2RML  Property  Mapping   Mapping  columns  to  proper*es  can  be  easy:     lb:artist_name  a  rr:TriplesMap  ;      rr:logicalTable  [rr:sqlQuery            """SELECT  artist.gid,  artist_name.name                    FROM  artist                    INNER  JOIN  artist_name  ON  artist.name  =   artist_name.id"""]  ;      rr:subjectMap  [rr:template                                            "http://musicbrainz.org/artist/{gid}#_"]  ;      rr:predicateObjectMap            [rr:predicate  foaf:name  ;            rr:objectMap  [rr:column  "name"]]  .  
  • 12. NGS  Advanced  Rela=ons   Major  en**es  (Ar*st,  Release  Group,  Track,  etc.)  plus  URL   are  paired    (l_ar*st_ar*st)   Each  pairing    of  instances    refers  to  a  Link   Links  have  types      (cf.  RDF  proper*es)    and  a`ributes         http://wiki.musicbrainz.org/Advanced_Relationship
  • 13. R2RML  Mapping  Editor   R2RML: Expose data from relational DBMS as RDF / via SPARQL Endpoint Problem: R2RML Mappings are hard to create R2RML   Engine   R2RML   Mappings   R2RML  Edi*ng  Made  Easy!   Hides  vocabulary  intricacies  from  end-­‐user   Access  to  metadata  about  rela*onal  databases   Preview  of  generated  triples  and  SQL  queries   Very  expressive  (Supports  most  of  R2RML)   SPARQL  Endpoint   Rela*onal   Database   See our R2RML Mapping Editor in the ISWC Demo Session on Wednesday!
  • 14. Scale   MusicBrainz  RDF  derived  via  R2RML:   150M Triples lb:artist_member  a  rr:TriplesMap  ;      rr:logicalTable  [rr:sqlQuery          """SELECT  a1.gid,  a2.gid  AS  band                FROM  artist  a1                    INNER  JOIN  l_artist_artist  ON  a1.id  =   l_artist_artist.entity0                      INNER  JOIN  link  ON  l_artist_artist.link  =  link.id                      INNER  JOIN  link_type  ON  link_type  =  link_type.id                      INNER  JOIN  artist  a2  on  l_artist_artist.entity1  =  a2.id                  WHERE   link_type.gid='5be4c609-­‐9afa-­‐4ea0-­‐910b-­‐12ffb71e3821'"""]  ;      rr:subjectMap  [rr:template  "http://musicbrainz.org/artist/{gid} #_"]  ;      rr:predicateObjectMap            [rr:predicate  mo:member_of  ;            rr:objectMap  [rr:template  "http://musicbrainz.org/artist/{band} #_"  ;                                        rr:termType  rr:IRI]]  .  
  • 15. Some  Sta=s=cs  –  RDF  Dump   (Lead) Table area artist dbpedia label medium recording release_group release track work Triples 59798 36868228 172017 201832 18069143 11400354 3050818 9764887 75506495 1728955 156822527 Time (s) 2 423 13 3 163 209 31 151 794 20 1809
  • 16. Informa=on  Workbench   PlaGorm  for  Linked  Data  Applica=ons   §  Seman*cs-­‐  &  Linked  Data-­‐based   integra=on  of  private  and  public   data  sources  based  on  data   providers   •  •  •  Generic  and  specific  providers  for   various  data  formats  and  sources   Supports  established  mapping   frameworks  (e.g.  R2RML,  SILK,  …)   Named  graphs  for  managing   contexts  and  provenance   §  Intelligent  Data  Access  and  Analy=cs   •  •  •  Flexible  self-­‐service  UI   Visualiza*on,  explora*on,   dashboarding  and  repor*ng   Seman*c  search   §  Collabora=on  and  knowledge   management   •  •  Cura*on  &  authoring   Collabora*ve  workflows   §    Open  standards  and  technologies   •  •  •  Seman*c  Wiki  based  frontend     (Using  SMW  Syntax)     Suppor*ng  W3C  standards  (OWL,  RDF,   SPARQL,,  …)   Community  Edi*on  (Open  Source)  +   Enterprise  Edi*on  (Commercial)  
  • 17. Realiza=on  within  the     Informa=on  Workbench  Architecture   Customized  applica*on   solu*ons   Reusable  UI  and  data   integra*on  components     Data  storage  and   management  plajorm   External  resources  to  reuse   data  and  create  mashups  
  • 18. The  “MusicBrainz  Explorer”  Applica=on   Music Ontology Ontology Data R2RML Data Providers Templates Widgets
  • 19. Ontology  as  a  “Structural  Backbone”   Resource  page         Defining   UI   structure   Resource  page         mo:Track   mo:Ar=st   Defining   data   structure   rdf:type   Yesterday   UI  templates   Template:  …     Template:mo:Track       Template:mo:Ar=st               Ontology   (RDFS/OWL)   rdf:type   The_Beatles   RDF  Data   Graph  
  • 20. Information  Workbench:     Browsing  a  Music  Artist  
  • 21. Information  Workbench:     Visualization  techniques  
  • 22. Naviga=on  Through  the  Data   Source: http://musicbrainz.fluidops.net/resource/Analytical5
  • 23. SPARQL  visualization   Top ten The Beatles releases according to the sum of track durations in minutes SPARQL  Query     SELECT  ?release                  ((SUM(xsd:double(?duration/60000)))  AS  ?avg)     WHERE  {      <http://dbpedia.org/resource/The_Beatles>                    foaf:made  ?release  .    ?release  mo:record  ?record  .    ?record  mo:track  ?track  .    ?track  mo:duration  ?duration  .}     GROUP  BY  ?release   ORDER  BY  DESC(?avg)   LIMIT  10   Result  set  
  • 24. SPARQL  visualization   Top ten The Beatles releases according to the sum of track durations in minutes Widget   {{#widget:  BarChart  |   query  ='SELECT  (COUNT(?Release)  AS  ?COUNT)  ? label  WHERE  {         <http://musicbrainz.org/artist/8538e728-­‐ca0b-­‐4321-­‐b7e5-­‐ cff6565dd4c0#_>  foaf:made  ?Release.      ?Release  rdf:type  mo:Release  .    ?Release  dc:title  ?label  .}   GROUP  BY  ?label   ORDER  BY  DESC(?COUNT)   LIMIT  20'   |  settings  =  'Settings:barvertical_mb'     |  asynch  =  'true'   |  input  =  'label'   |  output  =  'COUNT'   |  height  =  '300’}}   Visualization:  Bar  chart  
  • 25. Information  Workbench:     SPARQL  visualization   Top ten The Beatles releases according to the sum of track durations in minutes Other  visualiza*ons  of  the  same  result  set  …   Line  chart:   Pie  chart:  
  • 26. Automated  Widget  Suggestion   1   Table Pivot view Bar chart Line chart Pie chart 2   Select a suggested visualization 3   Visualization automatically built
  • 27. Try  it  out!   R2RML  Mappings   •  h`ps://github.com/LinkedBrainz/MusicBrainz-­‐R2RML   MusicBrainz  RDF  Dump   •  h`p://mbsandbox.org/~barry/   MusicBrainz  Linked  Data  Demo  system   •  h`p://musicbrainz.fluidops.net/   Informa*on  Workbench   •  h`p://www.fluidops.com/informa*on-­‐workbench/   Euclid  Project   •      h`p://euclid-­‐project.eu/  
  • 28. Acknowledgements   The  Euclid  Project   Barry  Norton     Michael  Meier   Andriy  Nikolov   Yves  Raimond   Kurt  Jacobson   Thomas  Gaengler   Juan  Sequeda   Simon  Dixon     (in  no  par;cular  order)    
  • 29. Thank  you!   Contact     Peter  Haase   fluid  Opera*ons  AG   Altro`str.  31   Walldorf   Germany     +49  (0)  6227  358087-­‐0   www.fluidops.com   peter.haase@fluidOps.com