SlideShare ist ein Scribd-Unternehmen logo
1 von 26
Downloaden Sie, um offline zu lesen
© 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 1
Aerospike aer . o . spike [air-oh- spahyk]
noun, 1. tip of a rocket that enhances speed and stability
YOU SNOOZE YOU LOSE
OR
HOW TO WIN IN AD TECH?
THE ONLY FLASH-OPTIMIZED DATABASE
BRIAN BULKOWSKI
FOUNDER, CTO, PRODUCT
STACK EXCHANGE MEETUP
APRIL 10, 2014
© 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 2
Aerospike: the gold standard for high throughput,
low latency, high reliability transactions
Performance
• Over ten trillion transactions per
month
• 99% of transactions faster than 2
ms
• 150K TPS per server
Scalability
• Billions of Internet users
• Clustered Software
• Automatic Data Rebalancing
Reliability
• 50 customers; zero service down-
time
• Immediate Consistency
• Rapid Failover; Data Center
Replication
Price/Performance
• Makes impossible projects
affordable
• Flash-optimized
• 1/10 the servers required
© 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 3
Aerospike Proven in Production
■  AppNexus - #2 RTB after Google
■  45 Billion auctions per day
■  2M QPS
■  3 12 server clusters
■  4.8T Flash per server
■  120K read TPS, 60K write TPS
■  Chango – #2 Search after Google
■  Sees more Searches than
Yahoo! + bing
■  Data on 300 Million users
■  TradeDesk – first Ad Exchange
■  Facebook Exchange partner
■  FBX serves 25% of Ads on the Internet
■  Snapdeal
■  2 servers replace 10 mongo servers
■  10GB data
■  “changed our company”
“Aerospike has operated
without interruptions
and easily scaled to meet
our performance demands.”
– Mike Nolet, CTO, AppNexus
© 2013 Aerospike. All rights reserved. Pg. 3
© 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 4
MILLIONS OF CONSUMERS
BILLIONS OF DEVICES
AEROSPIKE CLUSTER
APP SERVERS RDBMS
DATA
WAREHOUSE
SEGMENTS
WRITE REAL-TIME CONTEXT
READ RECENT CONTENT
PROFILE STORE
Cookies, email, deviceID, IP address,
location, segments, clicks, likes,
tweets, search terms...
REAL-TIME ANALYTICS
Best sellers, top scores, trending
tweets
BATCH
ANALYTICS
Discover
patterns,
segment data:
location patterns,
audience affinity
TYPICAL REAL-TIME DATABASE
DEPLOYMENT
TRANSACTIONS
WRITE CONTEXT
© 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 5
KEY CHALLENGES
1.  Handle extremely high rates of read/write transactions
over persistent data
2.  Avoid hot spots to maintain tight latency SLAs
3.  Provide immediate consistency with replication
4.  Ensure long running tasks do not slow down
transactions
5.  Scale linearly as data sizes and workloads increase
6.  Add capacity with no service interruption
© 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 6
SYSTEM ARCHITECTURE FOR 100%
UPTIME
© 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 7
SHARED-NOTHING SYSTEM:100% DATA
AVAILABILITY
■  Every node in a cluster is identical,
handles both transactions and long
running tasks
■  Data is replicated synchronously with
immediate consistency within the cluster
■  Data is replicated asynchronously
across data centers
OHIO Data Center
© 2013 Aerospike. All rights reserved Pg. 7
© 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 8
ROBUST DHT TO ELIMINATE HOT SPOTS
How Data Is Distributed (Replication Factor 2)
■  Every key is hashed into a
20 byte (fixed length) string
using the RIPEMD160 hash function
■  This hash + additional data
(fixed 64 bytes)
are stored in RAM in the index
■  Some bits from this hash value are
used to compute the partition id
■  There are 4096 partitions
■  Partition id maps to node id
based on cluster membership
cookie-abcdefg-12345678
182023kh15hh3kahdjsh
Partition
ID
Master
node
Replica
node
… 1 4
1820 2 3
1821 3 2
4096 4 1
© 2013 Aerospike. All rights reserved Pg. 8
© 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 9
REAL-TIME PRIORITIZATION TO MEET SLA
1.  Write sent to row master
2.  Latch against simultaneous writes
3.  Apply write to master memory and replica
memory synchronously
4.  Queue operations to disk
5.  Signal completed transaction (optional
storage commit wait)
6.  Master applies conflict resolution policy
(rollback/ rollforward)
master replica
1.  Cluster discovers new node via gossip
protocol
2.  Paxos vote determines new data
organization
3.  Partition migrations scheduled
4.  When a partition migration starts, write
journal starts on destination
5.  Partition moves atomically
6.  Journal is applied and source data deleted
transactions
continue
Writing with Immediate Consistency Adding a Node
© 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 10
INTELLIGENT CLIENT TO MAKE APPS
SIMPLER
■  Implements Aerospike API
■  Optimistic row locking
■  Optimized binary protocol
■  Cluster tracking
■  Learns about cluster changes,
partition map
■  Gossip protocol
■  Transaction semantics
■  Global transaction ID
■  Retransmit and timeout
■  Linear scale
■  No extra hop
■  No load balancers
© 2013 Aerospike. All rights reserved Pg. 10
© 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 11
OTHER DATABASE
OS FILE SYSTEM
PAGE CACHE
BLOCK INTERFACE
SSD HDD
BLOCK INTERFACE
SSD SSD
OPEN NVM
SSD
OTHER
DATABASE
AEROSPIKE FLASH OPTIMIZED
IN-MEMORY DATABASE
Ask me and I’ll tell you the answer.Ask me. I’ll look up the answer and then tell it to
you.
AEROSPIKE
HYBRID MEMORY SYSTEM™
•  Direct device access
•  Large Block Writes
•  Indexes in DRAM
•  Highly Parallelized
•  Log-structured FS “copy-on-write”
•  Fast restart with shared memory
FLASH OPTIMIZED HIGH
PERFORMANCE
© 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 12
Storage type DRAM & NoSQL SSD & DRAM
Storage per server 180 GB (196 GB Server) 2.4 GB (4 x 700 GB)
TPS per server 500,000 500,000
Cost per server $8,000 $11,000
Server costs $1,488,000 $154,000
Power/server 0.9 kW 1.1 kW
Power (2 years) $0.12 per kWh ave.
US
$352,000 $32,400
Maintenance (2 years) $3,600 per
server
$670,000 $50,400
Total $2,510,000 $236,800
THE BOTTOM LINE
Actual customer analysis.
Customer requires 500K TPS,
10 TB of storage, with
2x replication factor.
186 SERVERS REQUIRED 14 SERVERS REQUIRED
OTHER DATABASES
ONLY
© 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 13
Only up in 2013, 2014
Everyone wants that “Facebook architecture”
Facebook and Apple bought at least
$200+M in FusionIO cards in 2012
+ =
$200+M
© 2013 Aerospike. All rights reserved Pg. 13
© 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 14
© 2012 Aerospike. All rights reserved. Pg. 14
Measure your drives!
Aerospike Certification Tool (ACT)
http://github.com/aerospike/act
Transactional database workload
Reads: 1.5KB
(can’t batch / cache reads, random)
Writes: 128K blocks
(log based layout)
(plus defragmentation)
Turn up the load until
latency is over required SLA
© 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 15
➤  Super Storm Sandy 2012
§  NYC down for 17 hours
§  Back up and synched in 1 hour via
Aerospike Cross-Data Center Replication (XDR)
Replication that Works
“Aerospike allows us to
handle business continuity
and reliability across 4 data
centers seamlessly. And we
can now expand our
deployment to new data
centers in less than a week.”
- Elad Efraim, CTO
© 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 16
HOT ANALYTICS BY ROW
© 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 17
➤  Namespaces (policy containers)
§  Determine storage - DRAM or Flash
§  Determine replication factor
§  Contain records and sets
➤  Sets (tables) of records
§  Arbitrary grouping
➤  Records (rows)
§  Max 128k, contain key and bins
§  Bin with same name can contain
values of different types
u  String, integer, bytes (raw, blob, etc)
u  list ( an ordered collection of
values )
u  map ( a collection of keys and
values )
§  Bins can be added anytime
NOSQL EXTENSIBILITY
© 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 18
DISTRIBUTED QUERIES
1.  “Scatter” requests to all nodes
2.  Indexes in DRAM for fast map of secondary à primary keys
3.  Indexes co-located with data to guarantee ACID,
manage migrations
4.  Records read in parallel from all SSDs
using lock free concurrency control
5.  Aggregate results on each node
6.  “Gather” results from all nodes on client
STREAM AGGREGATIONS
1.  Push Code/ Security Policies/ Rules to Data with UDFs
2.  Pipe Query results through UDFs to
Filter, Transform, Aggregate.. Map, Reduce
REAL-TIME ANALYTICS on OPERATIONAL DATA (No ETL)
➤  In Database, within the same Cluster
➤  On the same Data, on XDR Replicated Clusters
Real-time Analytics on Operational Data
© 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 19
LESSONS LEARNED
© 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 20
NATIVE FLASH à PERFORMANCE
■  Low Latency at High Throughput
0
2.5
5
7.5
10
0 50,000 100,000 150,000 200,000
AverageLatency,ms
Throughput, ops/sec
Balanced Workload Read Latency (Full view)
Aerospike
Cassandra
MongoDB
0
4
8
12
16
0 50,000 100,000 150,000 200,000
AverageLatency,ms
Throughput, ops/sec
Balanced Workload Update Latency (Full view)
Aerospike
Cassandra
MongoDB
0
1
2
3
4
0 75,000 150,000 225,000 300,000
AverageLatency,ms
Throughput, ops/sec
Read-Heavy Workload Read Latency (Full view)
Aerospike
Cassandra
MongoDB
0
6
12
18
24
0 75,000 150,000 225,000 300,000
AverageLatency,ms
Throughput, ops/sec
Read-Heavy Workload Update Latency (Full view)
Aerospike
Cassandra
MongoDB
© 2013 Aerospike. All rights reserved Pg. 20
© 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 21
LESSONS
1.  Keep architecture simple
■  No hot spots (e.g., robust DHT)
■  Scales up easily (e.g., easy to size)
■  Avoids points of failure (e.g., single node type)
2.  Avoid manual operation – automate, automate!
■  Self-managed cluster responds to node failures
■  Data rebalancing requires no intervention
■  Real-time prioritization allows unattended system operation
3.  Keep system asynchronous
■  Shared nothing – nodes are autonomous
■  Async writes across data centers
■  Independent tuning parameters for different classes of tasks
© 2013 Aerospike. All rights reserved Pg. 21
© 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 22
LESSONS (cont’d)
4.  Monitor the Health of the System Extensively
■  Growth in load sneaks up on you over weeks
■  Early detection means better service
■  Most failures can be predicted (e.g., capacity, load, …)
5.  Size clusters properly
■  Have enough capacity ALWAYS!
■  Upgrade SSDs every couple years
■  Reduce cluster sizes to make operations simple
6.  Have geographically distributed data centers
■  Size the distributed data centers properly
■  Use active-active configurations if possible
■  Size bandwidth requirements accurately
© 2013 Aerospike. All rights reserved Pg. 22
© 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 23
LESSONS (CONT’D)
7.  Have plan for unforeseen situations
■  Devise scenarios and practice during normal work time
■  Ensure you can do rolling upgrades during high load time
■  Make sure that your nodes can restart fast (< 1 minute)
8.  Constantly test and monitor app end-to-end
■  Application level metrics are more important than DB metrics
■  Most issues in a service are due to a combination of application, network,
database, storage, etc.
9.  Separate online and offline workloads
■  Reserve real-time edge database for transactions and hot analytics queries
(where newest data is important)
■  Avoid ad-hoc queries on on-line system
■  Perform deep analysis in offline system (Hadoop)
10.  Use the Right Data Management System for the job
■  Fast NoSQL DB for real-time transactions and hot analytics on rapidly
changing data
■  Hadoop or other comparable systems for exhaustive analytics on mostly
read-only data
© 2013 Aerospike. All rights reserved Pg. 23
© 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 24
AEROSPIKE REAL-TIME BIG DATA
PLATFORM
Rapid Development Complete Customizability
➤  Support for popular languages and
tools
§  ASQL and Aerospike Client in
Java, C#, Ruby, Python..
➤  Complex data types
§  Nested documents
(map, list, string, integer)
§  Large (Stack, Set, List) Objects
➤  Queries
§  Single record
§  Batch multi-record lookups
§  Equality and range
§  Aggregations and MapReduce
➤  User Defined Functions
(UDFs)
§  In-DB processing
➤  Aggregation Framework
§  UDF Pipeline
§  MapReduce ++
➤  Time Series Queries
§  Just 2 IOPs for most r/w
independent of object size
© 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 25
HOW TO GET AEROSPIKE?
Free Community
Edition Enterprise Edition
➤  For developers looking
for speed and stability
and transparently scale
as they grow
➤  No transaction limits
➤  No time limit
➤  No production limit
➤  Data per cluster limit
➤  Community support
➤  For mission critical apps
needing to scale right from
the start
§  Unlimited number of
nodes, clusters, data
centers
§  Cross data center
replication
§  Premium 24x7 support
§  Priced by TBs of unique
data (not replicas)
© 2013 Aerospike. All rights reserved Pg. 25
© 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 26
QUESTIONS?
brian@aerospike.com
www.aerospike.com
© 2013 Aerospike. All rights reserved Pg. 26

Weitere ähnliche Inhalte

Was ist angesagt?

A Gentle Introduction to GPU Computing by Armen Donigian
A Gentle Introduction to GPU Computing by Armen DonigianA Gentle Introduction to GPU Computing by Armen Donigian
A Gentle Introduction to GPU Computing by Armen DonigianData Con LA
 
Webinar | Introducing DataStax Enterprise 4.6
Webinar | Introducing DataStax Enterprise 4.6Webinar | Introducing DataStax Enterprise 4.6
Webinar | Introducing DataStax Enterprise 4.6DataStax
 
Cost of Ownership for Hadoop Implementation
Cost of Ownership for Hadoop ImplementationCost of Ownership for Hadoop Implementation
Cost of Ownership for Hadoop ImplementationDataWorks Summit
 
Webinar - Macy’s: Why Your Database Decision Directly Impacts Customer Experi...
Webinar - Macy’s: Why Your Database Decision Directly Impacts Customer Experi...Webinar - Macy’s: Why Your Database Decision Directly Impacts Customer Experi...
Webinar - Macy’s: Why Your Database Decision Directly Impacts Customer Experi...DataStax
 
Gartner Data and Analytics Summit: Bringing Self-Service BI & SQL Analytics ...
 Gartner Data and Analytics Summit: Bringing Self-Service BI & SQL Analytics ... Gartner Data and Analytics Summit: Bringing Self-Service BI & SQL Analytics ...
Gartner Data and Analytics Summit: Bringing Self-Service BI & SQL Analytics ...Cloudera, Inc.
 
Transform Your DBMS to Drive Application Innovation
Transform Your DBMS to Drive Application InnovationTransform Your DBMS to Drive Application Innovation
Transform Your DBMS to Drive Application InnovationEDB
 
Getting Big Value from Big Data
Getting Big Value from Big DataGetting Big Value from Big Data
Getting Big Value from Big DataDataStax
 
NoSQL on MySQL - MySQL Document Store by Vadim Tkachenko
NoSQL on MySQL - MySQL Document Store by Vadim TkachenkoNoSQL on MySQL - MySQL Document Store by Vadim Tkachenko
NoSQL on MySQL - MySQL Document Store by Vadim TkachenkoData Con LA
 
Webinar - Delivering Enhanced Message Processing at Scale With an Always-on D...
Webinar - Delivering Enhanced Message Processing at Scale With an Always-on D...Webinar - Delivering Enhanced Message Processing at Scale With an Always-on D...
Webinar - Delivering Enhanced Message Processing at Scale With an Always-on D...DataStax
 
Apache Kudu: Technical Deep Dive


Apache Kudu: Technical Deep Dive

Apache Kudu: Technical Deep Dive


Apache Kudu: Technical Deep Dive

Cloudera, Inc.
 
Don’t Get Caught in a PCI Pickle: Meet Compliance and Protect Payment Card Da...
Don’t Get Caught in a PCI Pickle: Meet Compliance and Protect Payment Card Da...Don’t Get Caught in a PCI Pickle: Meet Compliance and Protect Payment Card Da...
Don’t Get Caught in a PCI Pickle: Meet Compliance and Protect Payment Card Da...DataStax
 
New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...
New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...
New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...Cloudera, Inc.
 
Welcome: MariaDB today and our vision for the future
Welcome: MariaDB today and our vision for the futureWelcome: MariaDB today and our vision for the future
Welcome: MariaDB today and our vision for the futureMariaDB plc
 
Transforms Document Management at Scale with Distributed Database Solution wi...
Transforms Document Management at Scale with Distributed Database Solution wi...Transforms Document Management at Scale with Distributed Database Solution wi...
Transforms Document Management at Scale with Distributed Database Solution wi...DataStax Academy
 
Next Generation Enterprise Architecture
Next Generation Enterprise ArchitectureNext Generation Enterprise Architecture
Next Generation Enterprise ArchitectureMapR Technologies
 
Delivering fast, powerful and scalable analytics
Delivering fast, powerful and scalable analyticsDelivering fast, powerful and scalable analytics
Delivering fast, powerful and scalable analyticsMariaDB plc
 
Breakout: Operational Analytics with Hadoop
Breakout: Operational Analytics with HadoopBreakout: Operational Analytics with Hadoop
Breakout: Operational Analytics with HadoopCloudera, Inc.
 
Attunity Efficient ODR For Sql Server Using Attunity CDC Suite For SSIS Slide...
Attunity Efficient ODR For Sql Server Using Attunity CDC Suite For SSIS Slide...Attunity Efficient ODR For Sql Server Using Attunity CDC Suite For SSIS Slide...
Attunity Efficient ODR For Sql Server Using Attunity CDC Suite For SSIS Slide...Melissa Kolodziej
 
Webinar: Buckle Up: The Future of the Distributed Database is Here - DataStax...
Webinar: Buckle Up: The Future of the Distributed Database is Here - DataStax...Webinar: Buckle Up: The Future of the Distributed Database is Here - DataStax...
Webinar: Buckle Up: The Future of the Distributed Database is Here - DataStax...DataStax
 
Building a Digital Bank
Building a Digital BankBuilding a Digital Bank
Building a Digital BankDataStax
 

Was ist angesagt? (20)

A Gentle Introduction to GPU Computing by Armen Donigian
A Gentle Introduction to GPU Computing by Armen DonigianA Gentle Introduction to GPU Computing by Armen Donigian
A Gentle Introduction to GPU Computing by Armen Donigian
 
Webinar | Introducing DataStax Enterprise 4.6
Webinar | Introducing DataStax Enterprise 4.6Webinar | Introducing DataStax Enterprise 4.6
Webinar | Introducing DataStax Enterprise 4.6
 
Cost of Ownership for Hadoop Implementation
Cost of Ownership for Hadoop ImplementationCost of Ownership for Hadoop Implementation
Cost of Ownership for Hadoop Implementation
 
Webinar - Macy’s: Why Your Database Decision Directly Impacts Customer Experi...
Webinar - Macy’s: Why Your Database Decision Directly Impacts Customer Experi...Webinar - Macy’s: Why Your Database Decision Directly Impacts Customer Experi...
Webinar - Macy’s: Why Your Database Decision Directly Impacts Customer Experi...
 
Gartner Data and Analytics Summit: Bringing Self-Service BI & SQL Analytics ...
 Gartner Data and Analytics Summit: Bringing Self-Service BI & SQL Analytics ... Gartner Data and Analytics Summit: Bringing Self-Service BI & SQL Analytics ...
Gartner Data and Analytics Summit: Bringing Self-Service BI & SQL Analytics ...
 
Transform Your DBMS to Drive Application Innovation
Transform Your DBMS to Drive Application InnovationTransform Your DBMS to Drive Application Innovation
Transform Your DBMS to Drive Application Innovation
 
Getting Big Value from Big Data
Getting Big Value from Big DataGetting Big Value from Big Data
Getting Big Value from Big Data
 
NoSQL on MySQL - MySQL Document Store by Vadim Tkachenko
NoSQL on MySQL - MySQL Document Store by Vadim TkachenkoNoSQL on MySQL - MySQL Document Store by Vadim Tkachenko
NoSQL on MySQL - MySQL Document Store by Vadim Tkachenko
 
Webinar - Delivering Enhanced Message Processing at Scale With an Always-on D...
Webinar - Delivering Enhanced Message Processing at Scale With an Always-on D...Webinar - Delivering Enhanced Message Processing at Scale With an Always-on D...
Webinar - Delivering Enhanced Message Processing at Scale With an Always-on D...
 
Apache Kudu: Technical Deep Dive


Apache Kudu: Technical Deep Dive

Apache Kudu: Technical Deep Dive


Apache Kudu: Technical Deep Dive


 
Don’t Get Caught in a PCI Pickle: Meet Compliance and Protect Payment Card Da...
Don’t Get Caught in a PCI Pickle: Meet Compliance and Protect Payment Card Da...Don’t Get Caught in a PCI Pickle: Meet Compliance and Protect Payment Card Da...
Don’t Get Caught in a PCI Pickle: Meet Compliance and Protect Payment Card Da...
 
New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...
New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...
New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...
 
Welcome: MariaDB today and our vision for the future
Welcome: MariaDB today and our vision for the futureWelcome: MariaDB today and our vision for the future
Welcome: MariaDB today and our vision for the future
 
Transforms Document Management at Scale with Distributed Database Solution wi...
Transforms Document Management at Scale with Distributed Database Solution wi...Transforms Document Management at Scale with Distributed Database Solution wi...
Transforms Document Management at Scale with Distributed Database Solution wi...
 
Next Generation Enterprise Architecture
Next Generation Enterprise ArchitectureNext Generation Enterprise Architecture
Next Generation Enterprise Architecture
 
Delivering fast, powerful and scalable analytics
Delivering fast, powerful and scalable analyticsDelivering fast, powerful and scalable analytics
Delivering fast, powerful and scalable analytics
 
Breakout: Operational Analytics with Hadoop
Breakout: Operational Analytics with HadoopBreakout: Operational Analytics with Hadoop
Breakout: Operational Analytics with Hadoop
 
Attunity Efficient ODR For Sql Server Using Attunity CDC Suite For SSIS Slide...
Attunity Efficient ODR For Sql Server Using Attunity CDC Suite For SSIS Slide...Attunity Efficient ODR For Sql Server Using Attunity CDC Suite For SSIS Slide...
Attunity Efficient ODR For Sql Server Using Attunity CDC Suite For SSIS Slide...
 
Webinar: Buckle Up: The Future of the Distributed Database is Here - DataStax...
Webinar: Buckle Up: The Future of the Distributed Database is Here - DataStax...Webinar: Buckle Up: The Future of the Distributed Database is Here - DataStax...
Webinar: Buckle Up: The Future of the Distributed Database is Here - DataStax...
 
Building a Digital Bank
Building a Digital BankBuilding a Digital Bank
Building a Digital Bank
 

Ähnlich wie Aerospike AdTech Gets Hacked in Lower Manhattan

What enterprises can learn from Real Time Bidding
What enterprises can learn from Real Time BiddingWhat enterprises can learn from Real Time Bidding
What enterprises can learn from Real Time BiddingAerospike
 
What enterprises can learn from Real Time Bidding (RTB)
What enterprises can learn from Real Time Bidding (RTB)What enterprises can learn from Real Time Bidding (RTB)
What enterprises can learn from Real Time Bidding (RTB)bigdatagurus_meetup
 
Predictable Big Data Performance in Real-time
Predictable Big Data Performance in Real-timePredictable Big Data Performance in Real-time
Predictable Big Data Performance in Real-timeAerospike, Inc.
 
Big Data Learnings from a Vendor's Perspective
Big Data Learnings from a Vendor's PerspectiveBig Data Learnings from a Vendor's Perspective
Big Data Learnings from a Vendor's PerspectiveAerospike, Inc.
 
Configuring Aerospike - Part 2
Configuring Aerospike - Part 2 Configuring Aerospike - Part 2
Configuring Aerospike - Part 2 Aerospike, Inc.
 
Brian Bulkowski. Aerospike
Brian Bulkowski. AerospikeBrian Bulkowski. Aerospike
Brian Bulkowski. AerospikeVolha Banadyseva
 
Streaming solutions for real time problems
Streaming solutions for real time problems Streaming solutions for real time problems
Streaming solutions for real time problems Aparna Gaonkar
 
Aerospike meetup july 2019 | Big Data Demystified
Aerospike meetup july 2019 | Big Data DemystifiedAerospike meetup july 2019 | Big Data Demystified
Aerospike meetup july 2019 | Big Data DemystifiedOmid Vahdaty
 
Handling Increasing Load and Reducing Costs Using Aerospike NoSQL Database - ...
Handling Increasing Load and Reducing Costs Using Aerospike NoSQL Database - ...Handling Increasing Load and Reducing Costs Using Aerospike NoSQL Database - ...
Handling Increasing Load and Reducing Costs Using Aerospike NoSQL Database - ...Aerospike
 
Building Scalable Applications using Pivotal Gemfire/Apache Geode
Building Scalable Applications using Pivotal Gemfire/Apache GeodeBuilding Scalable Applications using Pivotal Gemfire/Apache Geode
Building Scalable Applications using Pivotal Gemfire/Apache Geodeimcpune
 
Apache Kudu Fast Analytics on Fast Data (Hadoop / Spark Conference Japan 2016...
Apache Kudu Fast Analytics on Fast Data (Hadoop / Spark Conference Japan 2016...Apache Kudu Fast Analytics on Fast Data (Hadoop / Spark Conference Japan 2016...
Apache Kudu Fast Analytics on Fast Data (Hadoop / Spark Conference Japan 2016...Hadoop / Spark Conference Japan
 
What's Next for Google's BigTable
What's Next for Google's BigTableWhat's Next for Google's BigTable
What's Next for Google's BigTableSqrrl
 
IMC Summit 2016 Breakout - Brian Bulkowski - NVMe, Storage Class Memory and O...
IMC Summit 2016 Breakout - Brian Bulkowski - NVMe, Storage Class Memory and O...IMC Summit 2016 Breakout - Brian Bulkowski - NVMe, Storage Class Memory and O...
IMC Summit 2016 Breakout - Brian Bulkowski - NVMe, Storage Class Memory and O...In-Memory Computing Summit
 
DatEngConf SF16 - Apache Kudu: Fast Analytics on Fast Data
DatEngConf SF16 - Apache Kudu: Fast Analytics on Fast DataDatEngConf SF16 - Apache Kudu: Fast Analytics on Fast Data
DatEngConf SF16 - Apache Kudu: Fast Analytics on Fast DataHakka Labs
 
Best Practices for Using Alluxio with Spark
Best Practices for Using Alluxio with SparkBest Practices for Using Alluxio with Spark
Best Practices for Using Alluxio with SparkAlluxio, Inc.
 
ACID & CAP: Clearing CAP Confusion and Why C In CAP ≠ C in ACID
ACID & CAP:  Clearing CAP Confusion and Why C In CAP ≠ C in ACIDACID & CAP:  Clearing CAP Confusion and Why C In CAP ≠ C in ACID
ACID & CAP: Clearing CAP Confusion and Why C In CAP ≠ C in ACIDAerospike, Inc.
 
times ten in-memory database for extreme performance
times ten in-memory database for extreme performancetimes ten in-memory database for extreme performance
times ten in-memory database for extreme performanceOracle Korea
 

Ähnlich wie Aerospike AdTech Gets Hacked in Lower Manhattan (20)

What enterprises can learn from Real Time Bidding
What enterprises can learn from Real Time BiddingWhat enterprises can learn from Real Time Bidding
What enterprises can learn from Real Time Bidding
 
What enterprises can learn from Real Time Bidding (RTB)
What enterprises can learn from Real Time Bidding (RTB)What enterprises can learn from Real Time Bidding (RTB)
What enterprises can learn from Real Time Bidding (RTB)
 
Predictable Big Data Performance in Real-time
Predictable Big Data Performance in Real-timePredictable Big Data Performance in Real-time
Predictable Big Data Performance in Real-time
 
Big Data Learnings from a Vendor's Perspective
Big Data Learnings from a Vendor's PerspectiveBig Data Learnings from a Vendor's Perspective
Big Data Learnings from a Vendor's Perspective
 
Configuring Aerospike - Part 2
Configuring Aerospike - Part 2 Configuring Aerospike - Part 2
Configuring Aerospike - Part 2
 
Brian Bulkowski. Aerospike
Brian Bulkowski. AerospikeBrian Bulkowski. Aerospike
Brian Bulkowski. Aerospike
 
Streaming solutions for real time problems
Streaming solutions for real time problems Streaming solutions for real time problems
Streaming solutions for real time problems
 
Aerospike meetup july 2019 | Big Data Demystified
Aerospike meetup july 2019 | Big Data DemystifiedAerospike meetup july 2019 | Big Data Demystified
Aerospike meetup july 2019 | Big Data Demystified
 
Handling Increasing Load and Reducing Costs Using Aerospike NoSQL Database - ...
Handling Increasing Load and Reducing Costs Using Aerospike NoSQL Database - ...Handling Increasing Load and Reducing Costs Using Aerospike NoSQL Database - ...
Handling Increasing Load and Reducing Costs Using Aerospike NoSQL Database - ...
 
Building Scalable Applications using Pivotal Gemfire/Apache Geode
Building Scalable Applications using Pivotal Gemfire/Apache GeodeBuilding Scalable Applications using Pivotal Gemfire/Apache Geode
Building Scalable Applications using Pivotal Gemfire/Apache Geode
 
Apache Kudu Fast Analytics on Fast Data (Hadoop / Spark Conference Japan 2016...
Apache Kudu Fast Analytics on Fast Data (Hadoop / Spark Conference Japan 2016...Apache Kudu Fast Analytics on Fast Data (Hadoop / Spark Conference Japan 2016...
Apache Kudu Fast Analytics on Fast Data (Hadoop / Spark Conference Japan 2016...
 
What's Next for Google's BigTable
What's Next for Google's BigTableWhat's Next for Google's BigTable
What's Next for Google's BigTable
 
What's New in Apache Hive
What's New in Apache HiveWhat's New in Apache Hive
What's New in Apache Hive
 
IMC Summit 2016 Breakout - Brian Bulkowski - NVMe, Storage Class Memory and O...
IMC Summit 2016 Breakout - Brian Bulkowski - NVMe, Storage Class Memory and O...IMC Summit 2016 Breakout - Brian Bulkowski - NVMe, Storage Class Memory and O...
IMC Summit 2016 Breakout - Brian Bulkowski - NVMe, Storage Class Memory and O...
 
Oracle Storage a ochrana dat
Oracle Storage a ochrana datOracle Storage a ochrana dat
Oracle Storage a ochrana dat
 
DatEngConf SF16 - Apache Kudu: Fast Analytics on Fast Data
DatEngConf SF16 - Apache Kudu: Fast Analytics on Fast DataDatEngConf SF16 - Apache Kudu: Fast Analytics on Fast Data
DatEngConf SF16 - Apache Kudu: Fast Analytics on Fast Data
 
Best Practices for Using Alluxio with Spark
Best Practices for Using Alluxio with SparkBest Practices for Using Alluxio with Spark
Best Practices for Using Alluxio with Spark
 
Greenplum Architecture
Greenplum ArchitectureGreenplum Architecture
Greenplum Architecture
 
ACID & CAP: Clearing CAP Confusion and Why C In CAP ≠ C in ACID
ACID & CAP:  Clearing CAP Confusion and Why C In CAP ≠ C in ACIDACID & CAP:  Clearing CAP Confusion and Why C In CAP ≠ C in ACID
ACID & CAP: Clearing CAP Confusion and Why C In CAP ≠ C in ACID
 
times ten in-memory database for extreme performance
times ten in-memory database for extreme performancetimes ten in-memory database for extreme performance
times ten in-memory database for extreme performance
 

Kürzlich hochgeladen

Landscape Catalogue 2024 Australia-1.pdf
Landscape Catalogue 2024 Australia-1.pdfLandscape Catalogue 2024 Australia-1.pdf
Landscape Catalogue 2024 Australia-1.pdfAarwolf Industries LLC
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityIES VE
 
Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...
Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...
Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...itnewsafrica
 
4. Cobus Valentine- Cybersecurity Threats and Solutions for the Public Sector
4. Cobus Valentine- Cybersecurity Threats and Solutions for the Public Sector4. Cobus Valentine- Cybersecurity Threats and Solutions for the Public Sector
4. Cobus Valentine- Cybersecurity Threats and Solutions for the Public Sectoritnewsafrica
 
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)Mark Simos
 
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...Jeffrey Haguewood
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructureitnewsafrica
 
Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...
Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...
Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...BookNet Canada
 
Accelerating Enterprise Software Engineering with Platformless
Accelerating Enterprise Software Engineering with PlatformlessAccelerating Enterprise Software Engineering with Platformless
Accelerating Enterprise Software Engineering with PlatformlessWSO2
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxfnnc6jmgwh
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024TopCSSGallery
 
Infrared simulation and processing on Nvidia platforms
Infrared simulation and processing on Nvidia platformsInfrared simulation and processing on Nvidia platforms
Infrared simulation and processing on Nvidia platformsYoss Cohen
 
Digital Tools & AI in Career Development
Digital Tools & AI in Career DevelopmentDigital Tools & AI in Career Development
Digital Tools & AI in Career DevelopmentMahmoud Rabie
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
React Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkReact Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkPixlogix Infotech
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...itnewsafrica
 

Kürzlich hochgeladen (20)

Landscape Catalogue 2024 Australia-1.pdf
Landscape Catalogue 2024 Australia-1.pdfLandscape Catalogue 2024 Australia-1.pdf
Landscape Catalogue 2024 Australia-1.pdf
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a reality
 
Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...
Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...
Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...
 
4. Cobus Valentine- Cybersecurity Threats and Solutions for the Public Sector
4. Cobus Valentine- Cybersecurity Threats and Solutions for the Public Sector4. Cobus Valentine- Cybersecurity Threats and Solutions for the Public Sector
4. Cobus Valentine- Cybersecurity Threats and Solutions for the Public Sector
 
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)
 
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
 
Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...
Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...
Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...
 
Accelerating Enterprise Software Engineering with Platformless
Accelerating Enterprise Software Engineering with PlatformlessAccelerating Enterprise Software Engineering with Platformless
Accelerating Enterprise Software Engineering with Platformless
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024
 
Infrared simulation and processing on Nvidia platforms
Infrared simulation and processing on Nvidia platformsInfrared simulation and processing on Nvidia platforms
Infrared simulation and processing on Nvidia platforms
 
Digital Tools & AI in Career Development
Digital Tools & AI in Career DevelopmentDigital Tools & AI in Career Development
Digital Tools & AI in Career Development
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
React Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkReact Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App Framework
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
 

Aerospike AdTech Gets Hacked in Lower Manhattan

  • 1. © 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 1 Aerospike aer . o . spike [air-oh- spahyk] noun, 1. tip of a rocket that enhances speed and stability YOU SNOOZE YOU LOSE OR HOW TO WIN IN AD TECH? THE ONLY FLASH-OPTIMIZED DATABASE BRIAN BULKOWSKI FOUNDER, CTO, PRODUCT STACK EXCHANGE MEETUP APRIL 10, 2014
  • 2. © 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 2 Aerospike: the gold standard for high throughput, low latency, high reliability transactions Performance • Over ten trillion transactions per month • 99% of transactions faster than 2 ms • 150K TPS per server Scalability • Billions of Internet users • Clustered Software • Automatic Data Rebalancing Reliability • 50 customers; zero service down- time • Immediate Consistency • Rapid Failover; Data Center Replication Price/Performance • Makes impossible projects affordable • Flash-optimized • 1/10 the servers required
  • 3. © 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 3 Aerospike Proven in Production ■  AppNexus - #2 RTB after Google ■  45 Billion auctions per day ■  2M QPS ■  3 12 server clusters ■  4.8T Flash per server ■  120K read TPS, 60K write TPS ■  Chango – #2 Search after Google ■  Sees more Searches than Yahoo! + bing ■  Data on 300 Million users ■  TradeDesk – first Ad Exchange ■  Facebook Exchange partner ■  FBX serves 25% of Ads on the Internet ■  Snapdeal ■  2 servers replace 10 mongo servers ■  10GB data ■  “changed our company” “Aerospike has operated without interruptions and easily scaled to meet our performance demands.” – Mike Nolet, CTO, AppNexus © 2013 Aerospike. All rights reserved. Pg. 3
  • 4. © 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 4 MILLIONS OF CONSUMERS BILLIONS OF DEVICES AEROSPIKE CLUSTER APP SERVERS RDBMS DATA WAREHOUSE SEGMENTS WRITE REAL-TIME CONTEXT READ RECENT CONTENT PROFILE STORE Cookies, email, deviceID, IP address, location, segments, clicks, likes, tweets, search terms... REAL-TIME ANALYTICS Best sellers, top scores, trending tweets BATCH ANALYTICS Discover patterns, segment data: location patterns, audience affinity TYPICAL REAL-TIME DATABASE DEPLOYMENT TRANSACTIONS WRITE CONTEXT
  • 5. © 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 5 KEY CHALLENGES 1.  Handle extremely high rates of read/write transactions over persistent data 2.  Avoid hot spots to maintain tight latency SLAs 3.  Provide immediate consistency with replication 4.  Ensure long running tasks do not slow down transactions 5.  Scale linearly as data sizes and workloads increase 6.  Add capacity with no service interruption
  • 6. © 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 6 SYSTEM ARCHITECTURE FOR 100% UPTIME
  • 7. © 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 7 SHARED-NOTHING SYSTEM:100% DATA AVAILABILITY ■  Every node in a cluster is identical, handles both transactions and long running tasks ■  Data is replicated synchronously with immediate consistency within the cluster ■  Data is replicated asynchronously across data centers OHIO Data Center © 2013 Aerospike. All rights reserved Pg. 7
  • 8. © 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 8 ROBUST DHT TO ELIMINATE HOT SPOTS How Data Is Distributed (Replication Factor 2) ■  Every key is hashed into a 20 byte (fixed length) string using the RIPEMD160 hash function ■  This hash + additional data (fixed 64 bytes) are stored in RAM in the index ■  Some bits from this hash value are used to compute the partition id ■  There are 4096 partitions ■  Partition id maps to node id based on cluster membership cookie-abcdefg-12345678 182023kh15hh3kahdjsh Partition ID Master node Replica node … 1 4 1820 2 3 1821 3 2 4096 4 1 © 2013 Aerospike. All rights reserved Pg. 8
  • 9. © 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 9 REAL-TIME PRIORITIZATION TO MEET SLA 1.  Write sent to row master 2.  Latch against simultaneous writes 3.  Apply write to master memory and replica memory synchronously 4.  Queue operations to disk 5.  Signal completed transaction (optional storage commit wait) 6.  Master applies conflict resolution policy (rollback/ rollforward) master replica 1.  Cluster discovers new node via gossip protocol 2.  Paxos vote determines new data organization 3.  Partition migrations scheduled 4.  When a partition migration starts, write journal starts on destination 5.  Partition moves atomically 6.  Journal is applied and source data deleted transactions continue Writing with Immediate Consistency Adding a Node
  • 10. © 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 10 INTELLIGENT CLIENT TO MAKE APPS SIMPLER ■  Implements Aerospike API ■  Optimistic row locking ■  Optimized binary protocol ■  Cluster tracking ■  Learns about cluster changes, partition map ■  Gossip protocol ■  Transaction semantics ■  Global transaction ID ■  Retransmit and timeout ■  Linear scale ■  No extra hop ■  No load balancers © 2013 Aerospike. All rights reserved Pg. 10
  • 11. © 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 11 OTHER DATABASE OS FILE SYSTEM PAGE CACHE BLOCK INTERFACE SSD HDD BLOCK INTERFACE SSD SSD OPEN NVM SSD OTHER DATABASE AEROSPIKE FLASH OPTIMIZED IN-MEMORY DATABASE Ask me and I’ll tell you the answer.Ask me. I’ll look up the answer and then tell it to you. AEROSPIKE HYBRID MEMORY SYSTEM™ •  Direct device access •  Large Block Writes •  Indexes in DRAM •  Highly Parallelized •  Log-structured FS “copy-on-write” •  Fast restart with shared memory FLASH OPTIMIZED HIGH PERFORMANCE
  • 12. © 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 12 Storage type DRAM & NoSQL SSD & DRAM Storage per server 180 GB (196 GB Server) 2.4 GB (4 x 700 GB) TPS per server 500,000 500,000 Cost per server $8,000 $11,000 Server costs $1,488,000 $154,000 Power/server 0.9 kW 1.1 kW Power (2 years) $0.12 per kWh ave. US $352,000 $32,400 Maintenance (2 years) $3,600 per server $670,000 $50,400 Total $2,510,000 $236,800 THE BOTTOM LINE Actual customer analysis. Customer requires 500K TPS, 10 TB of storage, with 2x replication factor. 186 SERVERS REQUIRED 14 SERVERS REQUIRED OTHER DATABASES ONLY
  • 13. © 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 13 Only up in 2013, 2014 Everyone wants that “Facebook architecture” Facebook and Apple bought at least $200+M in FusionIO cards in 2012 + = $200+M © 2013 Aerospike. All rights reserved Pg. 13
  • 14. © 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 14 © 2012 Aerospike. All rights reserved. Pg. 14 Measure your drives! Aerospike Certification Tool (ACT) http://github.com/aerospike/act Transactional database workload Reads: 1.5KB (can’t batch / cache reads, random) Writes: 128K blocks (log based layout) (plus defragmentation) Turn up the load until latency is over required SLA
  • 15. © 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 15 ➤  Super Storm Sandy 2012 §  NYC down for 17 hours §  Back up and synched in 1 hour via Aerospike Cross-Data Center Replication (XDR) Replication that Works “Aerospike allows us to handle business continuity and reliability across 4 data centers seamlessly. And we can now expand our deployment to new data centers in less than a week.” - Elad Efraim, CTO
  • 16. © 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 16 HOT ANALYTICS BY ROW
  • 17. © 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 17 ➤  Namespaces (policy containers) §  Determine storage - DRAM or Flash §  Determine replication factor §  Contain records and sets ➤  Sets (tables) of records §  Arbitrary grouping ➤  Records (rows) §  Max 128k, contain key and bins §  Bin with same name can contain values of different types u  String, integer, bytes (raw, blob, etc) u  list ( an ordered collection of values ) u  map ( a collection of keys and values ) §  Bins can be added anytime NOSQL EXTENSIBILITY
  • 18. © 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 18 DISTRIBUTED QUERIES 1.  “Scatter” requests to all nodes 2.  Indexes in DRAM for fast map of secondary à primary keys 3.  Indexes co-located with data to guarantee ACID, manage migrations 4.  Records read in parallel from all SSDs using lock free concurrency control 5.  Aggregate results on each node 6.  “Gather” results from all nodes on client STREAM AGGREGATIONS 1.  Push Code/ Security Policies/ Rules to Data with UDFs 2.  Pipe Query results through UDFs to Filter, Transform, Aggregate.. Map, Reduce REAL-TIME ANALYTICS on OPERATIONAL DATA (No ETL) ➤  In Database, within the same Cluster ➤  On the same Data, on XDR Replicated Clusters Real-time Analytics on Operational Data
  • 19. © 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 19 LESSONS LEARNED
  • 20. © 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 20 NATIVE FLASH à PERFORMANCE ■  Low Latency at High Throughput 0 2.5 5 7.5 10 0 50,000 100,000 150,000 200,000 AverageLatency,ms Throughput, ops/sec Balanced Workload Read Latency (Full view) Aerospike Cassandra MongoDB 0 4 8 12 16 0 50,000 100,000 150,000 200,000 AverageLatency,ms Throughput, ops/sec Balanced Workload Update Latency (Full view) Aerospike Cassandra MongoDB 0 1 2 3 4 0 75,000 150,000 225,000 300,000 AverageLatency,ms Throughput, ops/sec Read-Heavy Workload Read Latency (Full view) Aerospike Cassandra MongoDB 0 6 12 18 24 0 75,000 150,000 225,000 300,000 AverageLatency,ms Throughput, ops/sec Read-Heavy Workload Update Latency (Full view) Aerospike Cassandra MongoDB © 2013 Aerospike. All rights reserved Pg. 20
  • 21. © 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 21 LESSONS 1.  Keep architecture simple ■  No hot spots (e.g., robust DHT) ■  Scales up easily (e.g., easy to size) ■  Avoids points of failure (e.g., single node type) 2.  Avoid manual operation – automate, automate! ■  Self-managed cluster responds to node failures ■  Data rebalancing requires no intervention ■  Real-time prioritization allows unattended system operation 3.  Keep system asynchronous ■  Shared nothing – nodes are autonomous ■  Async writes across data centers ■  Independent tuning parameters for different classes of tasks © 2013 Aerospike. All rights reserved Pg. 21
  • 22. © 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 22 LESSONS (cont’d) 4.  Monitor the Health of the System Extensively ■  Growth in load sneaks up on you over weeks ■  Early detection means better service ■  Most failures can be predicted (e.g., capacity, load, …) 5.  Size clusters properly ■  Have enough capacity ALWAYS! ■  Upgrade SSDs every couple years ■  Reduce cluster sizes to make operations simple 6.  Have geographically distributed data centers ■  Size the distributed data centers properly ■  Use active-active configurations if possible ■  Size bandwidth requirements accurately © 2013 Aerospike. All rights reserved Pg. 22
  • 23. © 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 23 LESSONS (CONT’D) 7.  Have plan for unforeseen situations ■  Devise scenarios and practice during normal work time ■  Ensure you can do rolling upgrades during high load time ■  Make sure that your nodes can restart fast (< 1 minute) 8.  Constantly test and monitor app end-to-end ■  Application level metrics are more important than DB metrics ■  Most issues in a service are due to a combination of application, network, database, storage, etc. 9.  Separate online and offline workloads ■  Reserve real-time edge database for transactions and hot analytics queries (where newest data is important) ■  Avoid ad-hoc queries on on-line system ■  Perform deep analysis in offline system (Hadoop) 10.  Use the Right Data Management System for the job ■  Fast NoSQL DB for real-time transactions and hot analytics on rapidly changing data ■  Hadoop or other comparable systems for exhaustive analytics on mostly read-only data © 2013 Aerospike. All rights reserved Pg. 23
  • 24. © 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 24 AEROSPIKE REAL-TIME BIG DATA PLATFORM Rapid Development Complete Customizability ➤  Support for popular languages and tools §  ASQL and Aerospike Client in Java, C#, Ruby, Python.. ➤  Complex data types §  Nested documents (map, list, string, integer) §  Large (Stack, Set, List) Objects ➤  Queries §  Single record §  Batch multi-record lookups §  Equality and range §  Aggregations and MapReduce ➤  User Defined Functions (UDFs) §  In-DB processing ➤  Aggregation Framework §  UDF Pipeline §  MapReduce ++ ➤  Time Series Queries §  Just 2 IOPs for most r/w independent of object size
  • 25. © 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 25 HOW TO GET AEROSPIKE? Free Community Edition Enterprise Edition ➤  For developers looking for speed and stability and transparently scale as they grow ➤  No transaction limits ➤  No time limit ➤  No production limit ➤  Data per cluster limit ➤  Community support ➤  For mission critical apps needing to scale right from the start §  Unlimited number of nodes, clusters, data centers §  Cross data center replication §  Premium 24x7 support §  Priced by TBs of unique data (not replicas) © 2013 Aerospike. All rights reserved Pg. 25
  • 26. © 2013 Aerospike, Inc. All rights reserved. Confidential. | <Title of Presentation> | 26 QUESTIONS? brian@aerospike.com www.aerospike.com © 2013 Aerospike. All rights reserved Pg. 26