SlideShare a Scribd company logo
1 of 26
Download to read offline
Data Centers Architecture
Presented By Ali Al Ogaili
 About Google
 Google Products
 Background
 Distributed Computing
 Layered Architecture & Abstraction
 Google Architecture
 Computing Infrastructure
 Software Infrastructure
 App Engine: Google platform for your
Enterprise
genda
 Google Mission: to organize the world’s information
and make it universally accessible and useful
 Established in (September 4, 1998)
 Today Google runs over one million servers in centers
around the world
 Processes over one billion search requests[
and twenty petabyte (1015 B) of user-generated data
every day
oogle at glance
 Some of the products Google provide
 Google Search
 Gmail
 Maps
 YouTube
 GoogleDoc
 Google Calendar
 App Engine
 And many more
 Most of their products are web based
 They serve millions of people and they store user’s data in the “cloud”
 How do they do that? What is under the hood?
oogle products
 About Google
 Google Products
 Background
 Distributed Parallel Computing
 Layered Architecture & Abstraction
 Google Architecture
 Computing Infrastructure
 Software Infrastructure
 App Engine: Google platform for your
Enterprise
genda
 One “smart” computer doing the task of summing up the cells of the arrays
sequentially.
ackground: (Distributed Parallel Computing)
1 2 0 3
3 1 2 2
5 1 3 3
4 5 3 6
6
8
12
18
44
Compute
 Five “dummy” distributed computers doing the same task in parallel.
ackground: (Distributed Parallel Computing)
1 2 0 3
3 1 2 2
5 1 3 3
4 5 3 6
6
8
12
18
44
Compute
WorkerServers
Master
Distribute
computation power
and memory
 Division of concern
Structure the system in layers, such as that each layer has a set of
problems, tasks and processes decoupled from the other layers.
 Abstraction
Each layer abstract a set of functions
and concerns to the layer above it
 Flexibility
Replace an implementation while maintaining
the interface
ackground: (Layers & Abstraction)
The trouble with layers of computer
software is that sooner or later you loose
touch with reality.
 About Google
 Google Products
 Background
 Distributed Parallel Computing
 Layered Architecture & Abstraction
 Google Architecture
 Computing Infrastructure
 Software Infrastructure
 App Engine: Google platform for your
Enterprise
genda
rchitecture (General Overview)
Computing Platform:
- Cost Efficiency
- Server Design
- Networking
- Datacenters Technologies
System Infrastructure:
-Google File System (GFS)
-MapReduce
-BigTable
Google Services
Computing
Platform
Clusters of thousands of
commodity-class PC
-Reliable (fault tolerance)
-Scalable
-Cost Efficient (Low end
servers)
System Infrastructure:
A layer of software that abstracts the
hardware complexity from the
developers, it provides features
such as:
-Scheduling
-File access
-Fault management
-And many more
Google Services:
The set of services provided for
the users:
-Usability/User friendliness
-Simplicity
-Performance
-Innovation & solving people’s
problems
 Google Datacenters evolved over time…
 Google.standford.edu (circa 1997)
 Eric & Sergey (google founders)
volunteered to receive a shipments of
machines other research groups order,
and hold on them for sometime.
ompuing Platform
ompuing Platform
 Google Datacenters evolved over time…
ompuing Platform
 Google’s software architecture arises from two
basic insights *:
o Reliability in software rather than in server-class hardware (thus
they can commodity PC)
o Tailor the design for best aggregate request throughput, not
peak server response time (manage request time by
parallelizing individual request)
* WEB SEARCH FOR A PLANET:THE GOOGLE CLUSTER
ARCHITECTURE by Luiz André Barroso , Jeffrey Dean & Urs Hölzle
ompuing Platform
Dual SATA
Disks
RAM
12VDC Sealed
Lead-Acid Battery
Dual CPUs
Power Supply
Google custom made
servers uses consumers
products to get the best
economical value per
performance..
ompuing Platform
 The servers are placed in racks in
a shipment container (Modular
design)
 Plug & play (or serve)
 The servers interconnect via a 100-
Mbps Ethernet switch that has one or
two gigabit uplinks to a core gigabit
switch that connects all racks
together.
 Each shipping container can hold up
to 1,160 servers
 “power above, water below,”
 Modular design
 The Google facility features
a “container hanger” filled
with 45 containers,
ompuing Platform
 Some key challenges with Datacenter
design:
 Powering:
(Google has a backup battery for each
server as a oppose to a centralized UPS)
 Cooling
(Low tech PC generates more heat, thus
the datacenter requires more aggressive
cooling)
 Cabling and modularity
(Low tech pc are more prone to failure and
their life span is shorter; thus, those
machines need to be replaced easily)
 And much more..
ompuing Platform
 What could go wrong? Many things*..
 Overheating (power down most machines)
 PDU failure (machines suddenly
disappear)
 Rack-move (plenty of warnings)
 Rack-failures (40-80 machines instantly
disappear)
 Racks go wonky (40-80 machines see
50% pack loss)
 Network maintenance ( ~ 30 min random
connectivity loss)
 Individual machine failures
 Thousands of hard drive failures
 And much more (slow disk, bad memory,
miss configured machine, etc..)
Thousands of low end
machines clustered
together is
maintenance nightmare
!
*Google Seattle Conference on Scalability
ompuing Platform
 Google datacenters are more a single upgradable machine
Warehouse Scale Machines– (WSM).
ompuing Platform
 “Cloud” computing or back to mainframe computing?
1960s mainframe machines
serving thin clients
2005 Google datacenters hosting
web applications and serving thin
clients
oftware Platform
A software layer on top of computing platform
If one thinks of Google Datacenter as one single machine
(WSM) composted of thousands of individual machines, then
the software platforms managing those machines could be
thought of as an operating system for this machine
 Some of the main custom tools created by Google
 Google File Systems (GFS)
 MapReduce
 BigTable
oftware Platform (GFS)
 Google File System (GFS)
It is designed to provide efficient, reliable access to data
using large clusters of commodity hardware. (from Wikipedia)
 Abstract the storage on distributed unreliable hardware
 Master machines that deals with Metadata(Filename, mapping from
filename to chuck locations)
 64MB chunks (on the disk 8K file system block on the Operating System)
 Every chunk is replicated 3 times on different racks
 Responsible for managing failures (if machine dies,
then replicate the data in another machine)
oftware Platform (MapRecude)
MapReduce
Introduced by Google to support distributed computing on
large data sets on clusters of computers. (from Wikipedia)
 Abstract the computation on distributed unreliable hardware
 User has to write to functions (Map & Redeuce) and the library will take
care of all the hardware related issues (Assigning tasks to machines,
managing machines failures etc)
 The library will try to make the computation faster by pushing the logic
closer to where the chunk data is located
 Deals with scalability
oftware Platform (MapRecude)
 Split the data set into N (mapping)
where N is equal to the number of
available workers
 Wait until all the workers finish their
tasks (some processing is done on
intermediate results)
 Computer the final result (reduce)
functions
oftware Platform (BigTable)
BigTable
A compressed, high performance,
and proprietary database system built on Google File
System (GFS), Chubby Lock Service, and a few
other Google programs (from Wikipedia)
 Non-relational distributed database created by Google
 Built on top of GFS and provides a higher level of abstraction
 Implements a sub-set of typical DBMS (Database management system)
 Google Analytics, Google Earth, Personalized Search, App Engine and many more..
 About Google
 Google Products
 Background
 Distributed Parallel Computing
 Layered Architecture & Abstraction
 Google Architecture
 Computing Infrastructure
 Software Infrastructure
 App Engine: Google platform for your
Enterprise
genda
pp Engine
* From http://code.google.com/appengine

More Related Content

What's hot

From Embedded to IoT and From Cloud to Edge & AIoT -- A computer technology t...
From Embedded to IoT and From Cloud to Edge & AIoT -- A computer technology t...From Embedded to IoT and From Cloud to Edge & AIoT -- A computer technology t...
From Embedded to IoT and From Cloud to Edge & AIoT -- A computer technology t...William Liang
 
Understanding and Testing DDR4 R-DIMM and LR-DIMM Technology
Understanding and Testing DDR4 R-DIMM and LR-DIMM TechnologyUnderstanding and Testing DDR4 R-DIMM and LR-DIMM Technology
Understanding and Testing DDR4 R-DIMM and LR-DIMM TechnologyIntegrated Device Technology
 
IoT Fundamentals.pdf
IoT Fundamentals.pdfIoT Fundamentals.pdf
IoT Fundamentals.pdfVishnumayaN1
 
Internet of Things: state of the art
Internet of Things: state of the artInternet of Things: state of the art
Internet of Things: state of the artMario Kušek
 
Case study mobile computing
Case study mobile computingCase study mobile computing
Case study mobile computingkaleeswaran S
 
Cloud Computing and Vertualization
Cloud Computing and VertualizationCloud Computing and Vertualization
Cloud Computing and VertualizationReach Chirag
 
5G Edge Computing IoT Presentation
5G Edge Computing IoT Presentation 5G Edge Computing IoT Presentation
5G Edge Computing IoT Presentation Rick Stomphorst
 
Iot presentation
Iot presentationIot presentation
Iot presentationhuma742446
 
Data Center Tiers : Tier 1, Tier 2, Tier 3 and Tier 4 data center tiers expla...
Data Center Tiers : Tier 1, Tier 2, Tier 3 and Tier 4 data center tiers expla...Data Center Tiers : Tier 1, Tier 2, Tier 3 and Tier 4 data center tiers expla...
Data Center Tiers : Tier 1, Tier 2, Tier 3 and Tier 4 data center tiers expla...Cloud Computing Wire
 
Protocols for IoT
Protocols for IoTProtocols for IoT
Protocols for IoTAmit Dev
 
Elements of IoT connectivity technologies
Elements of IoT connectivity technologiesElements of IoT connectivity technologies
Elements of IoT connectivity technologiesusman sarwar
 

What's hot (20)

From Embedded to IoT and From Cloud to Edge & AIoT -- A computer technology t...
From Embedded to IoT and From Cloud to Edge & AIoT -- A computer technology t...From Embedded to IoT and From Cloud to Edge & AIoT -- A computer technology t...
From Embedded to IoT and From Cloud to Edge & AIoT -- A computer technology t...
 
Understanding and Testing DDR4 R-DIMM and LR-DIMM Technology
Understanding and Testing DDR4 R-DIMM and LR-DIMM TechnologyUnderstanding and Testing DDR4 R-DIMM and LR-DIMM Technology
Understanding and Testing DDR4 R-DIMM and LR-DIMM Technology
 
IoT Fundamentals.pdf
IoT Fundamentals.pdfIoT Fundamentals.pdf
IoT Fundamentals.pdf
 
Parallel Computing
Parallel ComputingParallel Computing
Parallel Computing
 
FreeRTOS
FreeRTOSFreeRTOS
FreeRTOS
 
Iot - Internet of Things
Iot - Internet of ThingsIot - Internet of Things
Iot - Internet of Things
 
Internet of Things: state of the art
Internet of Things: state of the artInternet of Things: state of the art
Internet of Things: state of the art
 
Case study mobile computing
Case study mobile computingCase study mobile computing
Case study mobile computing
 
Cloud computing
Cloud computingCloud computing
Cloud computing
 
Distributed Computing
Distributed ComputingDistributed Computing
Distributed Computing
 
Cloud Computing and Vertualization
Cloud Computing and VertualizationCloud Computing and Vertualization
Cloud Computing and Vertualization
 
5G Edge Computing IoT Presentation
5G Edge Computing IoT Presentation 5G Edge Computing IoT Presentation
5G Edge Computing IoT Presentation
 
DATA CENTER
DATA CENTER DATA CENTER
DATA CENTER
 
Iot presentation
Iot presentationIot presentation
Iot presentation
 
Data Center
Data CenterData Center
Data Center
 
Energy Aware Clouds
Energy Aware CloudsEnergy Aware Clouds
Energy Aware Clouds
 
Data Center Tiers : Tier 1, Tier 2, Tier 3 and Tier 4 data center tiers expla...
Data Center Tiers : Tier 1, Tier 2, Tier 3 and Tier 4 data center tiers expla...Data Center Tiers : Tier 1, Tier 2, Tier 3 and Tier 4 data center tiers expla...
Data Center Tiers : Tier 1, Tier 2, Tier 3 and Tier 4 data center tiers expla...
 
Protocols for IoT
Protocols for IoTProtocols for IoT
Protocols for IoT
 
Edge computing
Edge computingEdge computing
Edge computing
 
Elements of IoT connectivity technologies
Elements of IoT connectivity technologiesElements of IoT connectivity technologies
Elements of IoT connectivity technologies
 

Similar to Google data centers

The Anatomy Of The Google Architecture Fina Lv1.1
The Anatomy Of The Google Architecture Fina Lv1.1The Anatomy Of The Google Architecture Fina Lv1.1
The Anatomy Of The Google Architecture Fina Lv1.1Hassy Veldstra
 
Google Cloud Computing on Google Developer 2008 Day
Google Cloud Computing on Google Developer 2008 DayGoogle Cloud Computing on Google Developer 2008 Day
Google Cloud Computing on Google Developer 2008 Dayprogrammermag
 
Google cluster architecture
Google cluster architecture Google cluster architecture
Google cluster architecture Abhijeet Desai
 
Infrastructure student
Infrastructure studentInfrastructure student
Infrastructure studentJohn Scrugham
 
An Introduction to Cloud Computing by Robert Grossman 08-06-09 (v19)
An Introduction to Cloud Computing by Robert Grossman 08-06-09 (v19)An Introduction to Cloud Computing by Robert Grossman 08-06-09 (v19)
An Introduction to Cloud Computing by Robert Grossman 08-06-09 (v19)Robert Grossman
 
Scaling Security on 100s of Millions of Mobile Devices Using Apache Kafka® an...
Scaling Security on 100s of Millions of Mobile Devices Using Apache Kafka® an...Scaling Security on 100s of Millions of Mobile Devices Using Apache Kafka® an...
Scaling Security on 100s of Millions of Mobile Devices Using Apache Kafka® an...confluent
 
Google Cluster Innards
Google Cluster InnardsGoogle Cluster Innards
Google Cluster InnardsMartin Dvorak
 
Hadoop online-training
Hadoop online-trainingHadoop online-training
Hadoop online-trainingGeohedrick
 
Journey to Containerized Application / Google Container Engine
Journey to Containerized Application / Google Container EngineJourney to Containerized Application / Google Container Engine
Journey to Containerized Application / Google Container EngineGoogle Cloud Platform - Japan
 
Cloud computing overview
Cloud computing overviewCloud computing overview
Cloud computing overviewKHANSAFEE
 
Hadoop bigdata overview
Hadoop bigdata overviewHadoop bigdata overview
Hadoop bigdata overviewharithakannan
 
Introduction to Cloud computing
Introduction to Cloud computingIntroduction to Cloud computing
Introduction to Cloud computingMathews Job
 
Accelerating open science and AI with automated, portable, customizable and r...
Accelerating open science and AI with automated, portable, customizable and r...Accelerating open science and AI with automated, portable, customizable and r...
Accelerating open science and AI with automated, portable, customizable and r...Grigori Fursin
 
Cluster Computers
Cluster ComputersCluster Computers
Cluster Computersshopnil786
 
Nt1310 Unit 3 Computer Components
Nt1310 Unit 3 Computer ComponentsNt1310 Unit 3 Computer Components
Nt1310 Unit 3 Computer ComponentsKristi Anderson
 

Similar to Google data centers (20)

The Anatomy Of The Google Architecture Fina Lv1.1
The Anatomy Of The Google Architecture Fina Lv1.1The Anatomy Of The Google Architecture Fina Lv1.1
The Anatomy Of The Google Architecture Fina Lv1.1
 
Google Cloud Computing on Google Developer 2008 Day
Google Cloud Computing on Google Developer 2008 DayGoogle Cloud Computing on Google Developer 2008 Day
Google Cloud Computing on Google Developer 2008 Day
 
Google cluster architecture
Google cluster architecture Google cluster architecture
Google cluster architecture
 
Infrastructure student
Infrastructure studentInfrastructure student
Infrastructure student
 
An Introduction to Cloud Computing by Robert Grossman 08-06-09 (v19)
An Introduction to Cloud Computing by Robert Grossman 08-06-09 (v19)An Introduction to Cloud Computing by Robert Grossman 08-06-09 (v19)
An Introduction to Cloud Computing by Robert Grossman 08-06-09 (v19)
 
Scaling Security on 100s of Millions of Mobile Devices Using Apache Kafka® an...
Scaling Security on 100s of Millions of Mobile Devices Using Apache Kafka® an...Scaling Security on 100s of Millions of Mobile Devices Using Apache Kafka® an...
Scaling Security on 100s of Millions of Mobile Devices Using Apache Kafka® an...
 
Google Cluster Innards
Google Cluster InnardsGoogle Cluster Innards
Google Cluster Innards
 
Handout3o
Handout3oHandout3o
Handout3o
 
Hadoop online-training
Hadoop online-trainingHadoop online-training
Hadoop online-training
 
Cluster computing
Cluster computingCluster computing
Cluster computing
 
Journey to Containerized Application / Google Container Engine
Journey to Containerized Application / Google Container EngineJourney to Containerized Application / Google Container Engine
Journey to Containerized Application / Google Container Engine
 
Google warehouse scale computer
Google warehouse scale computerGoogle warehouse scale computer
Google warehouse scale computer
 
Cloud computing overview
Cloud computing overviewCloud computing overview
Cloud computing overview
 
GIST AI-X Computing Cluster
GIST AI-X Computing ClusterGIST AI-X Computing Cluster
GIST AI-X Computing Cluster
 
Hadoop bigdata overview
Hadoop bigdata overviewHadoop bigdata overview
Hadoop bigdata overview
 
Introduction to Cloud computing
Introduction to Cloud computingIntroduction to Cloud computing
Introduction to Cloud computing
 
Accelerating open science and AI with automated, portable, customizable and r...
Accelerating open science and AI with automated, portable, customizable and r...Accelerating open science and AI with automated, portable, customizable and r...
Accelerating open science and AI with automated, portable, customizable and r...
 
Cluster Computers
Cluster ComputersCluster Computers
Cluster Computers
 
Nt1310 Unit 3 Computer Components
Nt1310 Unit 3 Computer ComponentsNt1310 Unit 3 Computer Components
Nt1310 Unit 3 Computer Components
 
Mod 2 hardware_graphics.pdf
Mod 2 hardware_graphics.pdfMod 2 hardware_graphics.pdf
Mod 2 hardware_graphics.pdf
 

Recently uploaded

Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Nikki Chapple
 
Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024TopCSSGallery
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Kaya Weers
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfNeo4j
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesManik S Magar
 

Recently uploaded (20)

Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
 
Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
 

Google data centers

  • 2.  About Google  Google Products  Background  Distributed Computing  Layered Architecture & Abstraction  Google Architecture  Computing Infrastructure  Software Infrastructure  App Engine: Google platform for your Enterprise genda
  • 3.  Google Mission: to organize the world’s information and make it universally accessible and useful  Established in (September 4, 1998)  Today Google runs over one million servers in centers around the world  Processes over one billion search requests[ and twenty petabyte (1015 B) of user-generated data every day oogle at glance
  • 4.  Some of the products Google provide  Google Search  Gmail  Maps  YouTube  GoogleDoc  Google Calendar  App Engine  And many more  Most of their products are web based  They serve millions of people and they store user’s data in the “cloud”  How do they do that? What is under the hood? oogle products
  • 5.  About Google  Google Products  Background  Distributed Parallel Computing  Layered Architecture & Abstraction  Google Architecture  Computing Infrastructure  Software Infrastructure  App Engine: Google platform for your Enterprise genda
  • 6.  One “smart” computer doing the task of summing up the cells of the arrays sequentially. ackground: (Distributed Parallel Computing) 1 2 0 3 3 1 2 2 5 1 3 3 4 5 3 6 6 8 12 18 44 Compute
  • 7.  Five “dummy” distributed computers doing the same task in parallel. ackground: (Distributed Parallel Computing) 1 2 0 3 3 1 2 2 5 1 3 3 4 5 3 6 6 8 12 18 44 Compute WorkerServers Master Distribute computation power and memory
  • 8.  Division of concern Structure the system in layers, such as that each layer has a set of problems, tasks and processes decoupled from the other layers.  Abstraction Each layer abstract a set of functions and concerns to the layer above it  Flexibility Replace an implementation while maintaining the interface ackground: (Layers & Abstraction) The trouble with layers of computer software is that sooner or later you loose touch with reality.
  • 9.  About Google  Google Products  Background  Distributed Parallel Computing  Layered Architecture & Abstraction  Google Architecture  Computing Infrastructure  Software Infrastructure  App Engine: Google platform for your Enterprise genda
  • 10. rchitecture (General Overview) Computing Platform: - Cost Efficiency - Server Design - Networking - Datacenters Technologies System Infrastructure: -Google File System (GFS) -MapReduce -BigTable Google Services Computing Platform Clusters of thousands of commodity-class PC -Reliable (fault tolerance) -Scalable -Cost Efficient (Low end servers) System Infrastructure: A layer of software that abstracts the hardware complexity from the developers, it provides features such as: -Scheduling -File access -Fault management -And many more Google Services: The set of services provided for the users: -Usability/User friendliness -Simplicity -Performance -Innovation & solving people’s problems
  • 11.  Google Datacenters evolved over time…  Google.standford.edu (circa 1997)  Eric & Sergey (google founders) volunteered to receive a shipments of machines other research groups order, and hold on them for sometime. ompuing Platform
  • 12. ompuing Platform  Google Datacenters evolved over time…
  • 13. ompuing Platform  Google’s software architecture arises from two basic insights *: o Reliability in software rather than in server-class hardware (thus they can commodity PC) o Tailor the design for best aggregate request throughput, not peak server response time (manage request time by parallelizing individual request) * WEB SEARCH FOR A PLANET:THE GOOGLE CLUSTER ARCHITECTURE by Luiz André Barroso , Jeffrey Dean & Urs Hölzle
  • 14. ompuing Platform Dual SATA Disks RAM 12VDC Sealed Lead-Acid Battery Dual CPUs Power Supply Google custom made servers uses consumers products to get the best economical value per performance..
  • 15. ompuing Platform  The servers are placed in racks in a shipment container (Modular design)  Plug & play (or serve)  The servers interconnect via a 100- Mbps Ethernet switch that has one or two gigabit uplinks to a core gigabit switch that connects all racks together.  Each shipping container can hold up to 1,160 servers  “power above, water below,”  Modular design  The Google facility features a “container hanger” filled with 45 containers,
  • 16. ompuing Platform  Some key challenges with Datacenter design:  Powering: (Google has a backup battery for each server as a oppose to a centralized UPS)  Cooling (Low tech PC generates more heat, thus the datacenter requires more aggressive cooling)  Cabling and modularity (Low tech pc are more prone to failure and their life span is shorter; thus, those machines need to be replaced easily)  And much more..
  • 17. ompuing Platform  What could go wrong? Many things*..  Overheating (power down most machines)  PDU failure (machines suddenly disappear)  Rack-move (plenty of warnings)  Rack-failures (40-80 machines instantly disappear)  Racks go wonky (40-80 machines see 50% pack loss)  Network maintenance ( ~ 30 min random connectivity loss)  Individual machine failures  Thousands of hard drive failures  And much more (slow disk, bad memory, miss configured machine, etc..) Thousands of low end machines clustered together is maintenance nightmare ! *Google Seattle Conference on Scalability
  • 18. ompuing Platform  Google datacenters are more a single upgradable machine Warehouse Scale Machines– (WSM).
  • 19. ompuing Platform  “Cloud” computing or back to mainframe computing? 1960s mainframe machines serving thin clients 2005 Google datacenters hosting web applications and serving thin clients
  • 20. oftware Platform A software layer on top of computing platform If one thinks of Google Datacenter as one single machine (WSM) composted of thousands of individual machines, then the software platforms managing those machines could be thought of as an operating system for this machine  Some of the main custom tools created by Google  Google File Systems (GFS)  MapReduce  BigTable
  • 21. oftware Platform (GFS)  Google File System (GFS) It is designed to provide efficient, reliable access to data using large clusters of commodity hardware. (from Wikipedia)  Abstract the storage on distributed unreliable hardware  Master machines that deals with Metadata(Filename, mapping from filename to chuck locations)  64MB chunks (on the disk 8K file system block on the Operating System)  Every chunk is replicated 3 times on different racks  Responsible for managing failures (if machine dies, then replicate the data in another machine)
  • 22. oftware Platform (MapRecude) MapReduce Introduced by Google to support distributed computing on large data sets on clusters of computers. (from Wikipedia)  Abstract the computation on distributed unreliable hardware  User has to write to functions (Map & Redeuce) and the library will take care of all the hardware related issues (Assigning tasks to machines, managing machines failures etc)  The library will try to make the computation faster by pushing the logic closer to where the chunk data is located  Deals with scalability
  • 23. oftware Platform (MapRecude)  Split the data set into N (mapping) where N is equal to the number of available workers  Wait until all the workers finish their tasks (some processing is done on intermediate results)  Computer the final result (reduce) functions
  • 24. oftware Platform (BigTable) BigTable A compressed, high performance, and proprietary database system built on Google File System (GFS), Chubby Lock Service, and a few other Google programs (from Wikipedia)  Non-relational distributed database created by Google  Built on top of GFS and provides a higher level of abstraction  Implements a sub-set of typical DBMS (Database management system)  Google Analytics, Google Earth, Personalized Search, App Engine and many more..
  • 25.  About Google  Google Products  Background  Distributed Parallel Computing  Layered Architecture & Abstraction  Google Architecture  Computing Infrastructure  Software Infrastructure  App Engine: Google platform for your Enterprise genda
  • 26. pp Engine * From http://code.google.com/appengine