Jamie Alberico | @Jammer_Volts | #TechSEOBoost
Watching Googlebot Watching You: Optimizing with Server Logs
All of us share a common goal: to be crawled, indexed, and ranked.
We spend a lot of time and energy figuring out how to do it better.
Hey! It’s me talking about how to pwn JS SEO. (That was rad.)
Aaaaaand here’s my Twitter feed two weeks later… #whompwhomp
Happy ending, but more fable than fairy tale.
Actions > Words.
What exactly is Googlebot crawling?
Things this report is not
● solely traditional web pages
● details about which Googlebot is crawling
● just pages with 200 response codes
● reflective of how many unique pages are crawled
● bigger ≠ better
Does reality match our expectations?
https://twitter.com/JohnMu/status/856449976351825921
Was the crawl healthy?
Googlebot is designed to be a good citizen of the web... For Googlebot a speedy site is a sign of healthy servers... If the site slows down or responds with server errors, the [crawl rate] limit goes down and Googlebot crawls less.
Politeness is job 0
https://webmasters.googleblog.com/2017/01/what-crawl-budget-means-for-googlebot.html
Is it already too late?
https://twitter.com/JohnMu/status/1032553570468552704
Server logs are a record of every request a server receives.
How do I get logs?
credit: https://flic.kr/p/cnorAf
Make new allies.
Ask: Is there already a log management platform in place?
Be clear: We do not want Personally Identifiable Information (PII); request that it be removed.
BE SPECIFIC.
Logs can come from multiple places in your stack.
● CDN
● DDoS Mitigation/Bot Manager
● Load Balancer
● Web Server 1
● Web Server 2
● Web Server 3
Accessing Log Files
Apache (Linux Server)
NGINX (Linux Server)
IIS log files (Windows Server)
AWS Load Balancer (Load Balancer)
Google Cloud Load Balancer (Load Balancer)
AWS CloudFront (CDN)
Cloudflare log files (CDN)
Incapsula (CDN/DDoS Mitigation)
Akamai logs (CDN/DDoS Mitigation)
Aggregate Log Data
Validate Googlebot
Read Log Data
Parse logs for meaningful search and analysis
Log Source 1
Some assembly required.
Ways to read logs
Paid: Botify, Logz.io, Sumo Logic, Splunk
Free(mium): Screaming Frog Log File Analyser, BigQuery
Masochistic: Excel, Command Line
216.150.168.131 emeasrvr003 [07/Mar/2018:16:11:58 -0800] 66.249.66.1 GET /twiki/bin/view/TWiki/WikiSyntax?q=ntoon HTTP/1.1 www.arrow.com 200 7352 616 - Mozilla/5.0+(Linux;+Android+6.0.1;+Nexus+5X+Build/MMB29P)+AppleWebKit/537.36+(KHTML,+like+Gecko)+Chrome/41.0.2272.96+Mobile+Safari/537.36+(compatible;+Googlebot/2.1;++http://www.google.com/bot.html) https://www.arrow.com/en/indiegogo
Fields, in order of appearance:
● Server IP: 216.150.168.131
● Server Name*: emeasrvr003
● Date & Time: [07/Mar/2018:16:11:58 -0800]
● Requester’s IP: 66.249.66.1
● Request Method: GET
● URI: /twiki/bin/view/TWiki/WikiSyntax?q=ntoon
● Hostname*: www.arrow.com
● Response Code: 200
● Response Size*: 7352
● Response Time: 616
● Requester’s User Agent: Mozilla/5.0+(Linux;+Android+6.0.1;+Nexus+5X+Build/MMB29P)+AppleWebKit/537.36+(KHTML,+like+Gecko)+Chrome/41.0.2272.96+Mobile+Safari/537.36+(compatible;+Googlebot/2.1;++http://www.google.com/bot.html)
● Referring URL*: https://www.arrow.com/en/indiegogo
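A minimal sketch of splitting the sample entry above into named fields. It assumes whitespace-delimited fields in exactly the order shown, and that the referring URL was wrapped across two lines on the slide. The key names are illustrative labels, not a standard, and real log formats vary with server configuration.

```python
# Illustrative labels for this one sample layout (an assumption, not a standard).
FIELDS = [
    "server_ip", "server_name", "date", "tz_offset", "client_ip",
    "method", "uri", "protocol", "hostname", "status",
    "response_size", "response_time", "unlabeled", "user_agent", "referrer",
]

SAMPLE = (
    "216.150.168.131 emeasrvr003 [07/Mar/2018:16:11:58 -0800] 66.249.66.1 GET "
    "/twiki/bin/view/TWiki/WikiSyntax?q=ntoon HTTP/1.1 www.arrow.com 200 7352 616 - "
    "Mozilla/5.0+(Linux;+Android+6.0.1;+Nexus+5X+Build/MMB29P)+AppleWebKit/537.36+"
    "(KHTML,+like+Gecko)+Chrome/41.0.2272.96+Mobile+Safari/537.36+"
    "(compatible;+Googlebot/2.1;++http://www.google.com/bot.html) "
    "https://www.arrow.com/en/indiegogo"
)


def parse_line(line):
    """Split one log entry on whitespace and map the pieces to FIELDS."""
    parts = line.split()
    if len(parts) != len(FIELDS):
        raise ValueError(f"expected {len(FIELDS)} fields, got {len(parts)}")
    record = dict(zip(FIELDS, parts))
    # The timestamp is bracketed and contains a space, so it arrives as two tokens.
    record["date"] = record["date"].lstrip("[")
    record["tz_offset"] = record["tz_offset"].rstrip("]")
    for key in ("status", "response_size", "response_time"):
        record[key] = int(record[key])
    return record


record = parse_line(SAMPLE)
print(record["client_ip"], record["status"], record["uri"])
```

Splitting on whitespace only works here because this layout encodes spaces in the user agent as `+`. A format that quotes the user agent would need a regular expression or a proper log parser instead.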
https://support.google.com/webmasters/answer/80553?hl=en
Validate Googlebot IPs: Manual
Validate Googlebot IPs: Bulk with Script
https://dzone.com/articles/shell-script-to-detect-if-the-ip-address-is-google-1
Validate Googlebot IPs: Log Analyser Functionality
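The manual check can be sketched in a few lines: a reverse DNS lookup on the requesting IP, a check that the hostname sits under googlebot.com or google.com, then a forward lookup to confirm the hostname resolves back to the same IP. This is a rough sketch, not the linked shell script. The function name and the injectable lookup parameters are assumptions added so the logic can be exercised without network access.

```python
import socket

# Domains Google documents for Googlebot reverse DNS results.
GOOGLE_SUFFIXES = (".googlebot.com", ".google.com")


def is_verified_googlebot(ip, reverse_lookup=None, forward_lookup=None):
    """Return True if ip passes a reverse-then-forward DNS check.

    reverse_lookup and forward_lookup are hypothetical hooks for testing;
    by default the standard library resolver is used (IPv4 only).
    """
    reverse_lookup = reverse_lookup or (lambda addr: socket.gethostbyaddr(addr)[0])
    forward_lookup = forward_lookup or (lambda host: socket.gethostbyname_ex(host)[2])
    try:
        host = reverse_lookup(ip)
    except OSError:  # covers socket.herror and socket.gaierror
        return False
    if not host.lower().rstrip(".").endswith(GOOGLE_SUFFIXES):
        return False
    try:
        return ip in forward_lookup(host)
    except OSError:
        return False
```

Calling `is_verified_googlebot("66.249.66.1")` performs live DNS lookups, so for bulk validation it is worth caching results per IP rather than resolving every log line.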
Unlock logs ≤ 6 lines
• Data Source
• Condition
• Parse
• Aggregate
• Sort
• Limit
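One way to read those six parts is as the stages of a single query. The sketch below mirrors them in plain Python over a handful of hypothetical, simplified log lines. A log management platform would express the same stages in its own query language, and the mapping of each word to an operation is an interpretation rather than something the slide spells out.

```python
from collections import Counter

# Hypothetical, simplified lines: "<client_ip> <uri> <status> <user_agent>"
LOG_LINES = [
    "66.249.66.1 /en/products/a 200 Googlebot/2.1",
    "66.249.66.1 /en/products/a 200 Googlebot/2.1",
    "66.249.66.1 /en/articles/b 404 Googlebot/2.1",
    "198.51.100.7 /en/products/a 200 Mozilla/5.0",
    "66.249.66.1 /en/articles/c 200 Googlebot/2.1",
]


def top_googlebot_uris(lines, limit=2):
    source = lines                                                    # 1. data source
    matching = (line for line in source if "Googlebot" in line)       # 2. condition
    uris = (line.split()[1] for line in matching)                     # 3. parse
    counts = Counter(uris)                                            # 4. aggregate
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))   # 5. sort
    return ranked[:limit]                                             # 6. limit


print(top_googlebot_uris(LOG_LINES))
```

The user-agent match here is only a string filter. It does not replace validating that the requesting IP really belongs to Googlebot.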
Every site will be different.
Make a new engineering ally.
Use Case: Site section with low index coverage
Parsing URL Structure
/en/products/blam-o/log-12345
● Language: en
● App: products
● Manufacturer: blam-o
● SKU: log-12345
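A small sketch of that breakdown, assuming every product URL has exactly four path segments in the order language, app, manufacturer, SKU. The function name and the choice to return `None` for other shapes are illustrative. Your own URL scheme will differ.

```python
def parse_product_path(path):
    """Split a path like /en/products/blam-o/log-12345 into labelled parts."""
    segments = [segment for segment in path.split("/") if segment]
    if len(segments) != 4:
        return None  # not the four-part shape this sketch assumes
    language, app, manufacturer, sku = segments
    return {
        "language": language,
        "app": app,
        "manufacturer": manufacturer,
        "sku": sku,
    }


print(parse_product_path("/en/products/blam-o/log-12345"))
```

Once each request carries these labels, crawl activity can be grouped by section (for example, by `app`) to see which parts of the site Googlebot actually visits.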
LPT: Limit is how you keep your access to server logs.
If 40% of my site is articles, should those URLs represent 80% of crawl?
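To put numbers on that question, compare each section's share of site URLs with its share of Googlebot requests. The counts below are invented to reproduce the 40% versus 80% scenario. In practice they would come from a crawl or sitemap inventory and from the parsed logs.

```python
def share(counts):
    """Convert raw counts per section into fractions of the total."""
    total = sum(counts.values())
    if total == 0:
        return {}
    return {section: count / total for section, count in counts.items()}


# Hypothetical numbers, chosen to match the 40% / 80% example.
site_urls = {"articles": 400, "products": 600}
crawl_hits = {"articles": 8000, "products": 2000}

site_share = share(site_urls)
crawl_share = share(crawl_hits)

for section in site_urls:
    print(section, f"site={site_share[section]:.0%}", f"crawl={crawl_share[section]:.0%}")
```

A large gap between the two shares is a prompt to investigate, not a verdict. Whether it is a problem depends on which sections matter most to the business.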
Use Case: Google chose a different canonical than user.
Query: Find duplicate domains by looking at ‘hostname’ values
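A minimal sketch of that query, assuming each log entry has already been parsed into a dict with a `hostname` key. The records and hostnames below are placeholders.

```python
from collections import Counter

# Hypothetical parsed records; only the hostname field matters here.
records = [
    {"hostname": "www.example.com", "uri": "/en/"},
    {"hostname": "www.example.com", "uri": "/en/products/a"},
    {"hostname": "example.com", "uri": "/en/"},
    {"hostname": "M.example.com", "uri": "/en/"},
]


def hostname_counts(rows):
    """Count requests per hostname, ignoring case."""
    return Counter(row["hostname"].lower() for row in rows)


for host, hits in hostname_counts(records).most_common():
    print(host, hits)
```

More than one hostname serving the same paths is a hint that duplicate domains may be competing for the canonical.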
Use Case: Sudden crawl flux
Query: Count by response code
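As a rough illustration, counting by response code is a one-line aggregation once entries are parsed. The records below are hypothetical dicts with an integer `status` key.

```python
from collections import Counter

# Hypothetical parsed Googlebot requests.
records = [
    {"uri": "/en/", "status": 200},
    {"uri": "/en/products/a", "status": 200},
    {"uri": "/en/old-page", "status": 301},
    {"uri": "/en/missing", "status": 404},
    {"uri": "/en/products/b", "status": 503},
    {"uri": "/en/products/c", "status": 503},
]


def count_by_status(rows):
    """Count requests per HTTP response code."""
    return Counter(row["status"] for row in rows)


for status, hits in sorted(count_by_status(records).items()):
    print(status, hits)
```

Running the same count per day makes a sudden shift toward 5xx or 404 responses easier to spot.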
No clear answers? Dig deeper.
Query: Broken JS, CSS, or AJAX endpoints
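A sketch of that filter: keep requests for script, stylesheet, or AJAX resources that returned a 4xx or 5xx status. The `/api/` prefix stands in for wherever AJAX endpoints live and is an assumption, as are the sample records.

```python
from urllib.parse import urlsplit

RESOURCE_SUFFIXES = (".js", ".css")
AJAX_PREFIX = "/api/"  # hypothetical location of AJAX endpoints

# Hypothetical parsed Googlebot requests.
records = [
    {"uri": "/static/app.js?v=3", "status": 404},
    {"uri": "/static/site.css", "status": 200},
    {"uri": "/api/prices?sku=log-12345", "status": 500},
    {"uri": "/en/products/blam-o/log-12345", "status": 404},
]


def broken_resources(rows):
    """Return (uri, status) for JS, CSS, or AJAX requests that failed."""
    hits = []
    for row in rows:
        path = urlsplit(row["uri"]).path  # drop the query string before matching
        is_resource = path.endswith(RESOURCE_SUFFIXES) or path.startswith(AJAX_PREFIX)
        if is_resource and row["status"] >= 400:
            hits.append((row["uri"], row["status"]))
    return hits


print(broken_resources(records))
```

Failures on these resources matter because a page that depends on them may render incompletely for Googlebot even when the page itself returns 200.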
Use Case: Intermittent crawl errors
Query: Server parity
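One way to check parity is to compare error rates across the servers behind the load balancer. The sketch assumes each parsed entry carries a `server_name` field, as in the sample log line. The server names, the records, and the 5xx threshold are illustrative.

```python
from collections import defaultdict

# Hypothetical parsed Googlebot requests, tagged with the server that answered.
records = [
    {"server_name": "web1", "status": 200},
    {"server_name": "web1", "status": 200},
    {"server_name": "web2", "status": 200},
    {"server_name": "web2", "status": 503},
    {"server_name": "web3", "status": 200},
]


def error_rate_by_server(rows):
    """Return the fraction of 5xx responses per server."""
    totals = defaultdict(int)
    errors = defaultdict(int)
    for row in rows:
        totals[row["server_name"]] += 1
        if row["status"] >= 500:
            errors[row["server_name"]] += 1
    return {name: errors[name] / totals[name] for name in totals}


for name, rate in sorted(error_rate_by_server(records).items()):
    print(name, f"{rate:.0%}")
```

If one server's rate stands well apart from the others, the intermittent errors may trace back to that machine rather than to the site as a whole.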
Analysing Log Files
Screaming Frog Log Analysis
BigQuery + Google Cloud Storage Services
Excel + .csv
BigQuery + .csv
Command Line
Making the most of logs means adapting to your environment and making new friends.
Iterate.
Test.
Share what you learn.
Thank you for your time, energy, and being part of this wonderful community.
@Jammer_Volts
totally@not-a-robot.com

Optimizing with Server Logs | Jamie Alberico @ #TechSEO Boost 2018

  • 1. Jamie Alberico | @Jammer_Volts | #TechSEOBoost Watching Googlebot Watching You – Optimizing with Server Logs
  • 2. All of us share a common goal: to be crawled, indexed, and ranked.
  • 3. We spend a lot of time and energy figuring out how to do it better.
  • 4. Hey! It’s me talking about how to pwn JS SEO. (That was rad.)
  • 5. Aaaaaand here’s my Twitter feed two weeks later… #whompwhomp
  • 6. Happy ending, but more fable than fairy tale.
  • 7. Actions > Words.
  • 8. What exactly is Googlebot crawling?
  • 9. Things this report is not: ● solely traditional web pages ● details about which Googlebot is crawling ● just pages with 200 response codes ● reflective of how many unique pages are crawled ● bigger ≠ better
  • 10. Does reality match our expectations?
  • 11. https://twitter.com/JohnMu/status/856449976351825921
  • 12. Was the crawl healthy?
  • 13. “Googlebot is designed to be a good citizen of the web... For Googlebot a speedy site is a sign of healthy servers... If the site slows down or responds with server errors, the [crawl rate] limit goes down and Googlebot crawls less.” Politeness is job 0. Source: https://webmasters.googleblog.com/2017/01/what-crawl-budget-means-for-googlebot.html
  • 14. Is it already too late?
  • 15. https://twitter.com/JohnMu/status/1032553570468552704
  • 16. Server logs are a record of every request a server receives.
  • 17. [No slide text]
  • 18. How do I get logs?
  • 19. Make new allies. (Photo credit: https://flic.kr/p/cnorAf)
  • 20. Ask: Is there already a log management platform in place? Be clear: we do not want Personally Identifiable Information (PII) and request that it be removed. Be specific.
  • 21. Logs can come from multiple places in your stack: Web Server 1, Web Server 2, Web Server 3, Load Balancer, CDN, DDoS Mitigation/Bot Manager.
  • 22. Accessing log files: Apache (Linux server), NGINX (Linux server), IIS log files (Windows server), AWS Load Balancer (load balancer), Google Cloud Load Balancer (load balancer), AWS CloudFront (CDN), Cloudflare log files (CDN), Incapsula (CDN/DDoS mitigation), Akamai logs (CDN/DDoS mitigation)
  • 23. Aggregate Log Data; Validate Googlebot; Read Log Data; Parse logs for meaningful search and analysis; Log Source 1
  • 24. Some assembly required.
  • 25. Ways to read logs. Paid: Botify, Logz.io, Sumo Logic, Splunk. Free(mium): Screaming Frog Log Analyzer, BigQuery. Masochistic: Excel, command line.
  • 26. [No slide text]
  • 27–38. One sample log entry, repeated on twelve consecutive slides with a different field highlighted on each:
    216.150.168.131 emeasrvr003 [07/Mar/2018:16:11:58 -0800] 66.249.66.1 GET /twiki/bin/view/TWiki/WikiSyntax?q=ntoon HTTP/1.1 www.arrow.com 200 7352 616 - Mozilla/5.0+(Linux;+Android+6.0.1;+Nexus+5X+Build/MMB29P)+AppleWebKit/537.36+(KHTML,+like+Gecko)+Chrome/41.0.2272.96+Mobile+Safari/537.36+(compatible;+Googlebot/2.1;++http://www.google.com/bot.html) https://www.arrow.com/en/indiegogo
    Fields highlighted, in slide order: Server IP, Server Name*, Date & Time, Requester’s IP, Request Method, URI, Hostname*, Response Code, Response Size*, Response Time, Requester’s User Agent, Referring URL*
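
The field walk-through on slides 27 to 38 can be sketched as a small parser. This is a minimal sketch, not the speaker's tooling: the field order is an assumption read off the slide labels from left to right, the names `protocol` and `unlabelled` are placeholders for two tokens the slides do not label, and the unit of the response time is not stated in the deck. Every server's log format differs, so the pattern would need adjusting for a real stack.

```python
import re
from datetime import datetime

# Assumed field order, inferred from the slide labels (not confirmed by the talk).
# "protocol" and "unlabelled" are placeholder names for tokens the slides skip.
LOG_PATTERN = re.compile(
    r"(?P<server_ip>\S+) (?P<server_name>\S+) \[(?P<timestamp>[^\]]+)\] "
    r"(?P<client_ip>\S+) (?P<method>\S+) (?P<uri>\S+) (?P<protocol>\S+) "
    r"(?P<hostname>\S+) (?P<status>\d{3}) (?P<response_size>\d+) "
    r"(?P<response_time>\d+) (?P<unlabelled>\S+) (?P<user_agent>\S+) "
    r"(?P<referrer>\S+)"
)

# The sample entry from the slides, with the line-wrap spaces removed.
SAMPLE = (
    "216.150.168.131 emeasrvr003 [07/Mar/2018:16:11:58 -0800] 66.249.66.1 "
    "GET /twiki/bin/view/TWiki/WikiSyntax?q=ntoon HTTP/1.1 www.arrow.com "
    "200 7352 616 - "
    "Mozilla/5.0+(Linux;+Android+6.0.1;+Nexus+5X+Build/MMB29P)"
    "+AppleWebKit/537.36+(KHTML,+like+Gecko)+Chrome/41.0.2272.96"
    "+Mobile+Safari/537.36+(compatible;+Googlebot/2.1;"
    "++http://www.google.com/bot.html) https://www.arrow.com/en/indiegogo"
)


def parse_line(line):
    """Return a dict of named fields, or None if the line does not match."""
    match = LOG_PATTERN.fullmatch(line.strip())
    if match is None:
        return None
    entry = match.groupdict()
    # Convert numeric fields so they can be counted and compared later.
    for key in ("status", "response_size", "response_time"):
        entry[key] = int(entry[key])
    entry["timestamp"] = datetime.strptime(
        entry["timestamp"], "%d/%b/%Y:%H:%M:%S %z"
    )
    return entry


if __name__ == "__main__":
    parsed = parse_line(SAMPLE)
    print(parsed["client_ip"], parsed["status"], parsed["uri"])
```

Returning `None` for lines that do not match keeps malformed entries out of later counts instead of raising halfway through a large file.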
  • 39. Aggregate Log Data; Validate Googlebot; Read Log Data; Parse logs for meaningful search and analysis; Log Source 1
  • 40. Validate Googlebot IPs: Manual. https://support.google.com/webmasters/answer/80553?hl=en
  • 41. Validate Googlebot IPs: Bulk with Script. https://dzone.com/articles/shell-script-to-detect-if-the-ip-address-is-google-1
  • 42. Validate Googlebot IPs: Log Analyser Functionality
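
The validation step on slides 40 to 42 can be sketched in a few lines. This is a minimal sketch of the reverse-then-forward DNS check that Google documents for verifying Googlebot, not the shell script linked on slide 41. The function name and the injectable `reverse_lookup` / `forward_lookup` parameters are assumptions added here so the logic can be exercised without network access, and the default forward lookup handles IPv4 only.

```python
import socket

# Domains Google documents for Googlebot reverse DNS results.
GOOGLE_SUFFIXES = (".googlebot.com", ".google.com")


def is_verified_googlebot(ip, reverse_lookup=None, forward_lookup=None):
    """Reverse-resolve the IP, check the domain, then forward-resolve it back.

    The two lookup parameters are hypothetical hooks for testing; by default
    the standard library resolvers are used.
    """
    if reverse_lookup is None:
        reverse_lookup = lambda addr: socket.gethostbyaddr(addr)[0]
    if forward_lookup is None:
        # gethostbyname_ex returns (hostname, aliases, ipv4_addresses).
        forward_lookup = lambda host: socket.gethostbyname_ex(host)[2]
    try:
        host = reverse_lookup(ip).rstrip(".").lower()
        if not host.endswith(GOOGLE_SUFFIXES):
            return False
        # The hostname must resolve back to the IP we started with.
        return ip in forward_lookup(host)
    except OSError:
        # socket.herror and socket.gaierror are both OSError subclasses.
        return False


if __name__ == "__main__":
    # Performs live DNS lookups; the result depends on the network.
    print(is_verified_googlebot("66.249.66.1"))
```

A user agent string alone is easy to spoof, which is why the check starts from the requester's IP rather than from the user agent field.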
  • 43. Aggregate Log Data; Validate Googlebot; Read Log Data; Parse logs for meaningful search and analysis; Log Source 1
  • 44. Unlock logs in ≤ 6 lines: Data Source, Condition, Parse, Aggregate, Sort, Limit
  • 45. !! Every site will be different. Make a new engineering ally.
  • 46. Use case: Site section with low index coverage
  • 47. Parsing URL structure: /en/products/blam-o/log-12345 → Language (en), App (products), Manufacturer (blam-o), SKU (log-12345)
  • 48. LPT: Limit is how you keep your access to server logs.
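
The six query parts from slide 44 and the URL structure from slide 47 can be put together as follows. This is a sketch under assumptions: the talk runs these queries in a log platform, while this version uses plain Python over a hypothetical list of already parsed records; the function names, the `prefix` parameter, and every sample URL other than the one on the slide are made up for illustration.

```python
from collections import Counter

# Segment names taken from slide 47: /en/products/blam-o/log-12345
URL_FIELDS = ("language", "app", "manufacturer", "sku")


def parse_uri(uri):
    """Split a four-segment path into named parts, or return None."""
    path = uri.split("?", 1)[0]
    parts = path.strip("/").split("/")
    if len(parts) != len(URL_FIELDS):
        return None
    return dict(zip(URL_FIELDS, parts))


def crawl_by_manufacturer(records, prefix="/en/products/", limit=10):
    counts = Counter()
    for record in records:                        # 1. data source
        if not record["uri"].startswith(prefix):  # 2. condition
            continue
        fields = parse_uri(record["uri"])         # 3. parse
        if fields is None:
            continue
        counts[fields["manufacturer"]] += 1       # 4. aggregate
    return counts.most_common(limit)              # 5. sort and 6. limit


# Hypothetical records standing in for verified Googlebot hits.
RECORDS = [
    {"uri": "/en/products/blam-o/log-12345"},
    {"uri": "/en/products/blam-o/log-67890"},
    {"uri": "/en/products/acme/anvil-1"},
    {"uri": "/en/articles/how-to-log"},
]

if __name__ == "__main__":
    for manufacturer, hits in crawl_by_manufacturer(RECORDS, limit=5):
        print(manufacturer, hits)
```

Keeping `limit` as an explicit argument mirrors the point on slide 48: a bounded result set is cheaper to compute and kinder to whoever owns the log platform.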
  • 49. [No slide text]
  • 50. If 40% of my site is articles, should those URLs represent 80% of crawl?
  • 51. Use case: Google chose a different canonical than user.
  • 52. Query: Duplicate domains by looking for ‘hostname’ values
  • 53. Use case: Sudden crawl flux
  • 54. Query: Count by response code
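
The "count by response code" query from slide 54 can be sketched like this. It is a minimal illustration, not the query shown in the talk: the record shape (`date` and `status` keys), the sample values, and the choice to treat every status of 400 or above as an error are assumptions made here.

```python
from collections import Counter, defaultdict


def status_counts_by_day(entries):
    """Map each date to a Counter of response codes seen that day."""
    daily = defaultdict(Counter)
    for entry in entries:
        daily[entry["date"]][entry["status"]] += 1
    return dict(daily)


def error_share(counts):
    """Fraction of requests with a 4xx or 5xx status (assumed threshold)."""
    total = sum(counts.values())
    errors = sum(n for status, n in counts.items() if status >= 400)
    return errors / total if total else 0.0


# Hypothetical parsed entries for two days of Googlebot requests.
ENTRIES = [
    {"date": "2018-03-06", "status": 200},
    {"date": "2018-03-06", "status": 200},
    {"date": "2018-03-06", "status": 200},
    {"date": "2018-03-06", "status": 404},
    {"date": "2018-03-07", "status": 200},
    {"date": "2018-03-07", "status": 500},
]

if __name__ == "__main__":
    for day, counts in sorted(status_counts_by_day(ENTRIES).items()):
        print(day, dict(counts), round(error_share(counts), 2))
```

Splitting the counts by day is one way to see whether a change in crawl volume lines up with a change in the mix of response codes.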
  • 55. No clear answers? Dig deeper.
  • 56. Query: Broken JS, CSS, or AJAX endpoints
  • 57. Use case: Intermittent crawl errors
  • 58. Query: Server parity
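
One way to read the "server parity" query from slide 58 is to compare error rates across the servers that answered Googlebot. The sketch below assumes that reading. The record shape, the 5xx threshold, and the second server name (`emeasrvr004`) are hypothetical; only `emeasrvr003` appears in the sample log entry from the deck.

```python
from collections import defaultdict


def error_rate_by_server(entries):
    """Return the share of 5xx responses for each server name."""
    totals = defaultdict(int)
    errors = defaultdict(int)
    for entry in entries:
        totals[entry["server_name"]] += 1
        if entry["status"] >= 500:
            errors[entry["server_name"]] += 1
    return {server: errors[server] / totals[server] for server in totals}


# Hypothetical entries: one healthy server and one that fails half the time.
ENTRIES = [
    {"server_name": "emeasrvr003", "status": 200},
    {"server_name": "emeasrvr003", "status": 200},
    {"server_name": "emeasrvr003", "status": 200},
    {"server_name": "emeasrvr003", "status": 304},
    {"server_name": "emeasrvr004", "status": 200},
    {"server_name": "emeasrvr004", "status": 503},
    {"server_name": "emeasrvr004", "status": 200},
    {"server_name": "emeasrvr004", "status": 500},
]

if __name__ == "__main__":
    for server, rate in sorted(error_rate_by_server(ENTRIES).items()):
        print(server, rate)
```

If one server's rate stands well apart from the others, that would be consistent with the intermittent crawl errors described on slide 57, since only requests routed to that server fail.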
  • 59. Analysing log files: Screaming Frog Log Analysis; BigQuery + Google Cloud Storage Services; Excel + .csv; BigQuery + .csv; Command Line
  • 60. Making the most of logs means adapting to your environment and making new friends.
  • 61. Iterate. Test. Share what you learn.
  • 62. Thank you for your time, energy, and being part of this wonderful community – @Jammer_Volts totally@not-a-robot.com