10. #pubcon
@bill_slawski
On the DC Metroline, you connect to:
• 91 Stations in Md, Va, & DC
• National Zoo
• 19 Smithsonian Museums
• National Gallery of Art
• Capital One Arena
• FedExField
• Pentagon City Shopping Mall
12. #pubcon
Knowing how Google uses context and
semantically related phrases can improve the
content you create and how well you optimize
pages for particular queries.
13. #pubcon
Keywords & Context Vectors
“For example, a horse to a rancher is an animal. A horse
to a carpenter is an implement of work. A horse to a
gymnast is an implement on which to perform certain
exercises.”
User-context-based search engine
18. #pubcon
Map Keywords to Pages, then…
• Make sure you add words that indicate context
• Look up the top pages that rank for those keywords
• Find phrases that co-occur for that meaning
• See: Improving semantic topic clustering for search
queries with word co-occurrence and bigraph
co-clustering
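The steps on this slide can be sketched in a few lines of Python. This is only an illustration, assuming you have already collected the text of the top-ranking pages yourself; the function name `cooccurring_terms` and the small stopword list are hypothetical, and counting single words per page is a simplification of phrase co-occurrence.

```python
from collections import Counter
import re

# Tiny illustrative stopword list (an assumption, not a standard list).
STOPWORDS = {"a", "an", "the", "is", "in", "at", "of", "and", "to"}

def cooccurring_terms(pages, keyword, top_n=5):
    """Count, per page, the words that appear alongside `keyword`.

    `pages` is a list of page texts you gathered for the keyword.
    Returns (word, number_of_pages) pairs, most frequent first.
    """
    keyword = keyword.lower()
    counts = Counter()
    for text in pages:
        words = set(re.findall(r"[a-z']+", text.lower()))
        if keyword in words:
            # Each word counts once per page, not once per mention.
            counts.update(words - STOPWORDS - {keyword})
    return counts.most_common(top_n)

pages = [
    "The jaguar is a big cat found in the rainforest",
    "A jaguar cat hunts in the rainforest at night",
    "Java is a programming language",
]
print(cooccurring_terms(pages, "jaguar"))
```

Words that show up on most of the pages ranking for one meaning of a keyword are candidates for the context words mentioned above.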
19. #pubcon
Phrase-Based Indexing
• Look for co-occurring phrases on pages that rank highly
for a query.
• Using these related phrases on a page can boost how it
ranks for that query (body hits)
• Using those related phrases as anchors can boost how
the page targeted ranks for that query (anchor hits)
23. #pubcon
Predictive Aspects of Phrases
• Semantically, related phrases will be those that are
commonly used to discuss or describe a given topic or
concept, such as "President of the United States" and
"White House." For a given phrase, the related
phrases can be ordered according to their relevance
or significance based on their respective prediction
measures.
• Integrating external related phrase information into a
phrase-based indexing information retrieval system
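To make "ordered according to their prediction measures" concrete, here is a rough sketch. It assumes the measure is the ratio of how often two phrases actually co-occur to how often they would co-occur by chance; that ratio is an assumption chosen for illustration, and the names `prediction_measure` and `rank_related` are hypothetical, not taken from the patent.

```python
from collections import Counter

def prediction_measure(docs_with_a, docs_with_b, docs_with_both, total_docs):
    """Actual co-occurrence rate divided by the rate expected by chance."""
    expected = (docs_with_a / total_docs) * (docs_with_b / total_docs)
    actual = docs_with_both / total_docs
    return actual / expected if expected else 0.0

def rank_related(phrase, doc_phrases):
    """Order all other phrases by how strongly `phrase` predicts them.

    `doc_phrases` is a list of sets, one set of phrases per document.
    """
    total = len(doc_phrases)
    doc_freq = Counter(p for doc in doc_phrases for p in doc)
    scores = {}
    for other in doc_freq:
        if other == phrase:
            continue
        both = sum(1 for doc in doc_phrases if phrase in doc and other in doc)
        scores[other] = prediction_measure(
            doc_freq[phrase], doc_freq[other], both, total
        )
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)

docs = [
    {"president of the united states", "white house"},
    {"president of the united states", "white house", "congress"},
    {"congress", "budget"},
    {"weather"},
]
ranked = rank_related("president of the united states", docs)
print(ranked[0])
```

A score above 1 means the two phrases appear together more often than chance would predict, which is the intuition behind treating "White House" as related to "President of the United States."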
25. #pubcon
Clustered Meanings
• Jaguars – Cats, Cars, NFL Football Team
• Java – Programming Language, Island in Indonesia,
Drink
• Bank – A place to store money, a river’s side, to lean
to a side
26. #pubcon
Ranking Documents Based on Contained Phrases (Body Hits)
“…a ranking stage in which the documents in the search
results are ranked, using the phrase information in each
document's related phrase bit vector, and the cluster
bit vector for the query phrases. This approach ranks
documents according to the phrases that are contained
in the document, or informally ‘body hits.’”
Integrating external related phrase information into a
phrase-based indexing information retrieval system
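A simplified sketch of the "body hits" idea follows. It assumes each document gets a bit vector with one bit per related phrase, and it scores a document by the number of bits set. The cluster bit vector from the quote is left out, the scoring rule is an assumption for illustration rather than the patented method, and the function names are hypothetical.

```python
def related_phrase_bits(doc_text, related_phrases):
    """Build an integer bit vector: bit i is set if related phrase i occurs."""
    bits = 0
    text = doc_text.lower()
    for i, phrase in enumerate(related_phrases):
        if phrase.lower() in text:
            bits |= 1 << i
    return bits

def rank_by_body_hits(docs, related_phrases):
    """Rank document names by how many related phrases each body contains."""
    scored = [
        (bin(related_phrase_bits(text, related_phrases)).count("1"), name)
        for name, text in docs.items()
    ]
    # Stable sort: documents with equal scores keep their input order.
    return [name for score, name in sorted(scored, key=lambda s: -s[0])]

related = ["white house", "president", "first lady"]
docs = {
    "a": "The White House hosted the president and the first lady.",
    "b": "The president spoke.",
    "c": "Nothing relevant.",
}
print(rank_by_body_hits(docs, related))
```

The practical takeaway matches the slide: a page that naturally uses more of the phrases related to a query has more to be scored on.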
27. #pubcon
Anchor Hits
“Sorting the documents on the outbound score
component makes documents that have many related
phrases to the query as ‘anchor hits,’ rank most highly,
thus representing these documents as ‘expert’
documents”
• Integrating external related phrase information into a
phrase-based indexing information retrieval system
28. #pubcon
Personalization & Query Classifications
• Depending upon results selected by a searcher, the
results they see may fall into a specific category from
a biased document set
Personalizing Search Results at Google
33. #pubcon
Query Classifications
Search for “Lincoln” and click on the Person (Abe), the
Place (Nebraska), or the Thing (Town Car). What you
click on may determine what you see in the future on
searches for “Lincoln.”
…determining whether to assign the classification to
the first query based upon classifications for the
identified search entities.
• Propagating query classifications
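The "Lincoln" example can be illustrated with a toy model. This sketch assumes a searcher's past clicks are tallied per query and category, and that later results in the most-clicked category are moved to the top. The class `ClickClassifier` and its methods are hypothetical and are not how Google implements query classification.

```python
from collections import Counter, defaultdict

class ClickClassifier:
    """Toy per-user click history: query -> category -> click count."""

    def __init__(self):
        self._clicks = defaultdict(Counter)

    def record_click(self, query, category):
        self._clicks[query.lower()][category] += 1

    def preferred_category(self, query):
        counts = self._clicks.get(query.lower())
        if not counts:
            return None
        return counts.most_common(1)[0][0]

    def rerank(self, query, results):
        """Move results in the preferred category first, keeping order otherwise.

        `results` is a list of (title, category) pairs.
        """
        preferred = self.preferred_category(query)
        if preferred is None:
            return list(results)
        return sorted(results, key=lambda r: r[1] != preferred)

history = ClickClassifier()
history.record_click("Lincoln", "place")
history.record_click("Lincoln", "place")
history.record_click("Lincoln", "person")

results = [
    ("Abraham Lincoln", "person"),
    ("Lincoln, Nebraska", "place"),
    ("Lincoln Town Car", "thing"),
]
print(history.rerank("lincoln", results))
```

A searcher who keeps clicking on the Place results would see "Lincoln, Nebraska" rise to the top in this toy model, which is the behavior the slide describes.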
36. #pubcon
Further Reading
• Knowledge-Based Trust: Estimating the
Trustworthiness of Web Sources
• A Review of Relational Machine Learning for
Knowledge Graphs
• Knowledge Curation and Knowledge Fusion:
Challenges, Models, and Applications
• Improving semantic topic clustering for search queries
with word co-occurrence and bigraph co-clustering
37. #pubcon
Questions? Ask Me At:
• Twitter: https://twitter.com/bill_slawski
• LinkedIn: https://www.linkedin.com/in/slawski/
• Facebook: https://www.facebook.com/bill.slawski
• Google+: https://plus.google.com/+BillSlawski
• SEO by the Sea: http://www.seobythesea.com/
• Go Fish Digital Blog: https://gofishdigital.com/blog/