This document summarizes the origins and development of Schema.org. The story starts with Tim Berners-Lee's 1989 proposal for the World Wide Web, followed by the Semantic Web in 2001 and linked open data in 2009. Schema.org launched in 2011 as a joint effort by Google, Bing, and Yahoo, with Yandex joining later that year, to create a common set of schemas for structured data on web pages. It has since grown significantly: its markup appears on over 12 million websites and was found on 30% of pages in a 10 billion page sample from 2015, and the vocabulary defines over 500 types and 800 properties. Communities such as libraries have also influenced Schema.org through extensions like bib.schema.org and standards like LRMI.
1. Schema.org
Where did that come from!
Richard Wallis
Evangelist and Founder
Data Liberate
richard.wallis@dataliberate.com
@rjw
DC-2016
Copenhagen
October 13, 2016
9. Independent Consultant, Evangelist & Founder
Working With:
• Google – Schema.org vocabulary, site, extensions, documentation and community
• OCLC - Global library cooperative
• FIBO – Financial Industry Business Ontology
• Various Clients – Implementing/understanding Schema.org, e.g. Singapore National Library Board, Europeana
W3C Community Groups:
• Schema Bib Extend (Chair)
• Schema.org for bibliographic data
• bib.schema.org
• Schema Architypes (Chair)
• Financial Industry Business Ontology – fibo.schema.org
• Tourism Structured Web Data (Co-Chair)
• Schema Course Extension
richard.wallis@dataliberate.com — @rjw
25+ Years – Library systems technology
10+ Years – Semantic Web & Linked Data
42. Knowledge Graph
Related Entities in a Graph
• Bart Simpson – Played By → Nancy Cartwright
• Nancy Cartwright – Born In → Dayton Ohio
• Dayton Ohio – Place of Interest → Dayton Aviation Heritage National Park
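The related entities on this slide can be sketched as subject / relationship / object triples. This is only an illustrative Python sketch: the direction of each edge is an assumed reading of the slide, and the names `TRIPLES`, `neighbours`, and `walk` are hypothetical helpers, not part of any Knowledge Graph API.

```python
# Entities and relationships from the slide as (subject, predicate, object) triples.
# The edge directions are an assumption about how the slide's diagram reads.
TRIPLES = [
    ("Bart Simpson", "Played By", "Nancy Cartwright"),
    ("Nancy Cartwright", "Born In", "Dayton Ohio"),
    ("Dayton Ohio", "Place of Interest", "Dayton Aviation Heritage National Park"),
]


def neighbours(entity, triples=TRIPLES):
    """Return the (predicate, object) pairs for edges leaving the given entity."""
    return [(p, o) for s, p, o in triples if s == entity]


def walk(start, triples=TRIPLES):
    """Follow outgoing edges breadth-first from start, collecting each triple visited."""
    seen = set()
    frontier = [start]
    visited_edges = []
    while frontier:
        node = frontier.pop(0)
        if node in seen:
            continue
        seen.add(node)
        for predicate, obj in neighbours(node, triples):
            visited_edges.append((node, predicate, obj))
            frontier.append(obj)
    return visited_edges


if __name__ == "__main__":
    for subject, predicate, obj in walk("Bart Simpson"):
        print(f"{subject} --{predicate}--> {obj}")
```

Starting from one entity and following its links is what lets a graph answer questions that no single record contains, such as which place of interest is connected to a cartoon character.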
59. Using Schema.org
• Data embedded in website HTML
– Microdata / RDFa / JSON-LD
• Harvested during normal web crawls
• Under control of the [site] publisher
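As a rough illustration of both halves of this slide, the Python sketch below embeds a Schema.org description in a page as JSON-LD and then pulls it back out the way a crawler might. The book data, the `jsonld_script` helper, and the `JsonLdExtractor` class are hypothetical and use only the standard library; real crawlers are considerably more robust than this.

```python
import json
from html.parser import HTMLParser


def jsonld_script(data):
    """Wrap a dict as a JSON-LD <script> element ready to embed in a page."""
    return '<script type="application/ld+json">' + json.dumps(data) + "</script>"


class JsonLdExtractor(HTMLParser):
    """Collect the parsed contents of every JSON-LD script element in a page."""

    def __init__(self):
        super().__init__()
        self.items = []
        self._in_jsonld = False
        self._buffer = []

    def handle_starttag(self, tag, attrs):
        if tag == "script" and dict(attrs).get("type") == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self.items.append(json.loads("".join(self._buffer)))
            self._in_jsonld = False


# Publisher side: a made-up description, embedded in the page the publisher controls.
book = {
    "@context": "https://schema.org",
    "@type": "Book",
    "name": "An Example Book",
    "author": {"@type": "Person", "name": "A. N. Author"},
}
page = (
    "<html><head>" + jsonld_script(book) + "</head>"
    "<body><h1>An Example Book</h1></body></html>"
)

# Crawler side: harvest the structured data while parsing the page as usual.
extractor = JsonLdExtractor()
extractor.feed(page)
extractor.close()
harvested = extractor.items
```

The publisher decides what goes into the description, and the harvester needs nothing beyond the page it was already fetching.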
68. Schema.org
A de facto vocabulary for
structured data on the web
• 12+ Million Web Sites
• Found On 30% of Pages*
* In a 10 billion page sample - 2015
So, what does it look like …
75. Reintroducing Schema.org
What is Schema.org
• [Linked Data] Vocabulary
• RDF (triples)
• URIs / string values
• Types / Properties / Enumerations
• “Not strongly typed”
• rangeIncludes / domainIncludes
• Three serializations
• Microdata, RDFa, JSON-LD
• A web vocabulary to describe stuff!
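To make the three serializations concrete, here is a minimal Python sketch that renders one flat description in each syntax. The function names and the sample item are hypothetical, and only simple string-valued properties are handled; nested items and URI values would need more work.

```python
import json
from html import escape

SCHEMA = "https://schema.org/"


def to_jsonld(item_type, props):
    """Serialize as JSON-LD: a JSON object with @context and @type keys."""
    return json.dumps({"@context": "https://schema.org", "@type": item_type, **props}, indent=2)


def to_microdata(item_type, props):
    """Serialize as Microdata: itemscope / itemtype / itemprop attributes on HTML elements."""
    spans = "".join(
        f'<span itemprop="{escape(name)}">{escape(value)}</span>'
        for name, value in props.items()
    )
    return f'<div itemscope itemtype="{SCHEMA}{item_type}">{spans}</div>'


def to_rdfa(item_type, props):
    """Serialize as RDFa: vocab / typeof / property attributes on HTML elements."""
    spans = "".join(
        f'<span property="{escape(name)}">{escape(value)}</span>'
        for name, value in props.items()
    )
    return f'<div vocab="{SCHEMA}" typeof="{item_type}">{spans}</div>'


if __name__ == "__main__":
    sample = {"name": "An Example Book", "inLanguage": "en"}
    for render in (to_jsonld, to_microdata, to_rdfa):
        print(render("Book", sample))
```

All three outputs carry the same type and property names from the vocabulary; only the way they are attached to the page differs.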
76. A story from the library
How to make Schema.org work for you
The story behind Schema Bib Extend
• Librarians …
– Really understand their domain and their data
– Good at cooperation
• Early Schema.org – Useful but not good enough
• Form a W3C Community Group – Schema Bib Extend
• Identified public data sharing use cases – Knowledge Graph
• What can't you do with Schema as it is?
• Fill in the gaps
• Create real examples
• Proposed enhancements to core vocabulary
• Proposed more focused extension – bib.schema.org
• A few iterations – 2 years