How Google search engine algorithm works 
Prepared by:- Viral Shah (120570107014) 
Guided by :- Prof. Sahista Machhar, MEFGI
A search engine is a program that searches for and identifies items in a database that correspond to keywords or characters specified by the user. It is used especially for finding particular sites on the World Wide Web.
• There are 759 million websites on the Web, containing 60 trillion web pages.
• And the Web is constantly growing!
• Google navigates the Web by crawling.
• To find information on the hundreds of millions of Web pages that exist, a search engine employs special software robots, called spiders, to build lists of the words found on Web sites. When a spider is building its lists, the process is called Web crawling.
• The usual starting points are lists of heavily used servers and very popular pages. The spider begins with a popular site, indexing the words on its pages and following every link found within the site. In this way, the spidering system quickly begins to travel, spreading out across the most widely used portions of the Web.
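The link-following behaviour described above can be sketched as a breadth-first traversal. This is only a minimal illustration, not Google's crawler: the `TOY_WEB` dictionary, the `.example` host names and the `crawl` function are assumptions made up for the example, and a real spider would fetch pages over the network instead of reading a dictionary.

```python
from collections import deque

# Hypothetical in-memory "web": each page maps to the links found on it.
TOY_WEB = {
    "popular.example": ["a.example", "b.example"],
    "a.example": ["b.example", "c.example"],
    "b.example": ["popular.example"],
    "c.example": [],
}


def crawl(seeds, web):
    """Visit pages breadth-first, starting from the seeds and following every link."""
    visited = []            # pages in the order they were crawled
    seen = set(seeds)       # pages already queued, so nothing is crawled twice
    queue = deque(seeds)
    while queue:
        url = queue.popleft()
        visited.append(url)
        for link in web.get(url, []):
            if link not in seen:
                seen.add(link)
                queue.append(link)
    return visited


crawl_order = crawl(["popular.example"], TOY_WEB)
print(crawl_order)
```

Starting from one popular page, the traversal reaches every page that is linked directly or indirectly from it, which is the "spreading out" effect the slide describes.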
• When the Google spider looked at an HTML page, it took note of the words occurring in the title, subtitles, meta tags and other positions of relative importance, for special consideration during a subsequent user search. The Google spider was built to index every significant word on a page, leaving out the articles "a", "an" and "the". Other spiders take different approaches.
• For example, some spiders keep track of the words in the title, sub-headings and links, along with the 100 most frequently used words on the page and each word in the first 20 lines of text. Lycos is said to use this approach to spidering the Web.
• Google built its initial system to use multiple spiders, usually three at one time. Each spider could keep about 300 connections to Web pages open at a time.
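As a rough sketch of the indexing step described above, the snippet below records every word on a page except the articles "a", "an" and "the". The page names, the sample text and the `build_index` function are assumptions for illustration only; the original system's data structures are not described in these slides.

```python
import re
from collections import defaultdict

# The three articles the slide says were left out of the index.
ARTICLES = {"a", "an", "the"}


def build_index(pages):
    """Map each significant word to the set of page names that contain it."""
    index = defaultdict(set)
    for name, text in pages.items():
        for word in re.findall(r"[a-z0-9]+", text.lower()):
            if word not in ARTICLES:
                index[word].add(name)
    return index


# Hypothetical sample pages.
pages = {
    "page1": "The spider builds an index",
    "page2": "A spider follows the links",
}
index = build_index(pages)
print(sorted(index["spider"]))
```

Looking up a word then returns the pages that contain it, which is the basic structure a later user search relies on.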
• Google's spider is named Googlebot.
• Googlebot is the search bot software used by Google, which collects documents from the web to build a searchable index for the Google Search engine.
• The index is prepared by following web pages. It also includes text from millions of books from several libraries and other partners.
• That means Google follows links from page to page and sorts the pages by their content and other factors.
• All of this activity is tracked in the index. Google continuously updates the index, which is stored across large servers.
• Currently, Google's index is over 100 million gigabytes in size.
• Site owners choose whether their sites are crawled.
• To prevent most search engine web crawlers from indexing a page on your site, place the following meta tag into the <head> section of your page:
<meta name="robots" content="noindex">
• To prevent only Google web crawlers from indexing a page:
<meta name="googlebot" content="noindex">
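To show how a crawler might honour these tags, here is a small sketch using Python's standard `html.parser` module. The class `RobotsMetaParser`, the function `may_index` and the crawler name `otherbot` are hypothetical names invented for this example; real crawlers apply more rules than this.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collect the content of every <meta name=... content=...> tag."""

    def __init__(self):
        super().__init__()
        self.directives = {}

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = dict(attrs)
        name = (attr.get("name") or "").lower()
        if name:
            self.directives[name] = (attr.get("content") or "").lower()


def may_index(html, crawler="googlebot"):
    """Return False if a generic or crawler-specific noindex directive is present."""
    parser = RobotsMetaParser()
    parser.feed(html)
    for name in ("robots", crawler):
        content = parser.directives.get(name, "")
        if "noindex" in [part.strip() for part in content.split(",")]:
            return False
    return True


page_all = '<html><head><meta name="robots" content="noindex"></head></html>'
page_google = '<html><head><meta name="googlebot" content="noindex"></head></html>'
print(may_index(page_all, "otherbot"), may_index(page_google, "otherbot"))
```

The first page blocks every crawler that respects the tag, while the second blocks only the crawler that identifies itself as `googlebot`.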
1) AUTOCOMPLETE
Predicts what you might be searching for. This includes understanding terms with more than one meaning.
2) SYNONYMS
Recognizes words with similar meanings.
3) QUERY UNDERSTANDING
Gets to the deeper meaning of the words you type.
4) GOOGLE INSTANT
Displays immediate results as you type.
5) SPELLING
Identifies and corrects possible spelling errors and provides alternatives.
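Two of the features above, autocomplete and spelling correction, can be imitated in a very simplified form. The sketch below uses prefix matching and the standard `difflib` module; the `VOCABULARY` list and both function names are assumptions for illustration, and Google's actual methods are not described in these slides.

```python
import difflib

# Hypothetical list of known search terms.
VOCABULARY = ["search", "search engine", "spider", "spelling", "index"]


def autocomplete(prefix, vocabulary):
    """Return the known terms that start with what the user has typed so far."""
    prefix = prefix.lower()
    return sorted(term for term in vocabulary if term.startswith(prefix))


def suggest_spelling(word, vocabulary):
    """Return the closest known term, or None if nothing is similar enough."""
    matches = difflib.get_close_matches(word.lower(), vocabulary, n=1, cutoff=0.6)
    return matches[0] if matches else None


print(autocomplete("sp", VOCABULARY))
print(suggest_spelling("spidr", VOCABULARY))
```

Here a misspelled query such as "spidr" is mapped to the nearest known term, while a query with no close match returns nothing.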
• Based on all of the above, Google picks a set of web pages from the index.
• Then Google ranks the results on various factors:
1) Site & Page Quality: judged in part by how keywords are written on the page.
2) Freshness: how fresh the content is and how regularly it is updated.
3) Safe-Search: Google tries to determine how safe the page is and whether it contains spam.
Along with these, Google uses 200+ factors to rank any particular web page.
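Combining several factors into one ordering can be pictured as a weighted sum followed by a sort. The example below is a toy only: the three signal values, the `WEIGHTS` table and the page URLs are invented for illustration, and Google's real factors and their weights are not public.

```python
from dataclasses import dataclass


@dataclass
class Page:
    url: str
    quality: float    # hypothetical signal, 0.0 to 1.0
    freshness: float  # hypothetical signal, 0.0 to 1.0
    safety: float     # hypothetical signal, 0.0 to 1.0


# Assumed weights, chosen only to make the example concrete.
WEIGHTS = {"quality": 0.5, "freshness": 0.3, "safety": 0.2}


def score(page):
    """Combine the signals of one page into a single number."""
    return (WEIGHTS["quality"] * page.quality
            + WEIGHTS["freshness"] * page.freshness
            + WEIGHTS["safety"] * page.safety)


def rank(pages):
    """Order pages from highest to lowest combined score."""
    return sorted(pages, key=score, reverse=True)


candidates = [
    Page("a.example", quality=0.9, freshness=0.2, safety=1.0),
    Page("b.example", quality=0.6, freshness=0.9, safety=1.0),
    Page("c.example", quality=0.8, freshness=0.8, safety=0.1),
]
ranked_urls = [page.url for page in rank(candidates)]
print(ranked_urls)
```

With these made-up numbers, the fresher page `b.example` outranks the higher-quality but stale `a.example`, and the unsafe page `c.example` comes last.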
• After all these operations you get the desired results, and all of this happens in a fraction of a second!
• Google fights spam constantly to give true and relevant results.
• The majority of spam removal is automatic. Google examines other questionable documents by hand. If Google finds spam, it takes manual action.
1) PURE SPAM
Site appears to use aggressive spam techniques such as automatically generated gibberish, cloaking, scraping content from other websites, and/or repeated or egregious violations of Google's Webmaster Guidelines.
2) HIDDEN TEXT AND/OR KEYWORD STUFFING
Some of the pages may contain hidden text and/or keyword stuffing.
3) USER-GENERATED SPAM
Site appears to contain spammy user-generated content. The problematic content may appear on forum pages, guestbook pages, or user profiles.
4) PARKED DOMAINS
Parked domains are placeholder sites with little unique content, so Google doesn't typically include them in search results.
5) THIN CONTENT WITH LITTLE OR NO ADDED VALUE
Site appears to consist of low-quality or shallow pages which do not provide users with much added value (such as thin affiliate pages, doorway pages, cookie-cutter sites, automatically generated content, or copied content).
6) UNNATURAL LINKS TO A SITE
Google has detected a pattern of unnatural, artificial, deceptive or manipulative links pointing to the site. These may be the result of buying links that pass PageRank or participating in link schemes.
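Of the categories above, keyword stuffing is the simplest to illustrate with code. The sketch below measures what share of a page's words is taken up by its single most frequent word. The function name and the sample texts are assumptions for this example, and this crude measure is not how Google detects stuffing; it only shows the idea of a repetition signal.

```python
import re
from collections import Counter


def top_keyword_share(text):
    """Return the most frequent word and the fraction of all words it accounts for."""
    words = re.findall(r"[a-z0-9]+", text.lower())
    if not words:
        return None, 0.0
    word, count = Counter(words).most_common(1)[0]
    return word, count / len(words)


# Hypothetical sample texts.
stuffed = "cheap shoes cheap shoes cheap cheap cheap buy cheap"
normal = "Google ranks pages on many factors"

stuffed_word, stuffed_share = top_keyword_share(stuffed)
normal_word, normal_share = top_keyword_share(normal)
print(stuffed_word, round(stuffed_share, 2), round(normal_share, 2))
```

A page where one word makes up most of the text stands out clearly against ordinary prose, where no single word dominates.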
• Besides all of these, Google uses thousands of other factors to detect spam and decide the page rank of each web page accordingly. The rank is constantly updated, and in the end Google keeps only trusted documents in the index.
• And the point of interest is that, to make this presentation on Google, I used
• Behind your simple page of results is a complex system, carefully crafted and tested, to support more than one hundred billion searches each month!