SlideShare ist ein Scribd-Unternehmen logo
1 von 72
1© Copyright 2015 EMC Corporation. All rights reserved.
OBJECT STORAGE
TECHNICAL DISCUSSION
2© Copyright 2015 EMC Corporation. All rights reserved.
DATA GROWTH IS BREAKING TRADITIONAL STORAGE
SCALE-OUTFILE &OBJECT STORAGE CONTINUES TO GROW
• Overly complex
– Multiple data protection schemes,
protocols, management tools
• Can’t economically scale
– Inefficient, high overhead,
especially at geo-scale
• Not cloud-ready
– Not architecturally suited and no
self-service
Source: IDC EMC Digital Universe Study 2014
3© Copyright 2015 EMC Corporation. All rights reserved.
BLOCK FILE FILE BLOCK FILE
Today’s Storage Infrastructure
BLOCK
4© Copyright 2015 EMC Corporation. All rights reserved.
WHY OBJECT?
5© Copyright 2015 EMC Corporation. All rights reserved.
Object Storage Characteristics
Linear Scalability
Scales to billions of
objects
No Locking
No lock on write or
create operations
Geo-scale
Geo-replicated and
distributed
Support for large files
Object sizes are in TBs
Web friendly
Firewall friendly, http,
REST accessibility
Metadata and
extensibility
Objects can be extended
to multiple policies
(Immutability, retention,
etc…)
6© Copyright 2015 EMC Corporation. All rights reserved.
An Object platform offers …
 A flat namespace of millions of buckets
 Buckets that scale to billions of objects
 Geo distribution, protection, access
 User meta data as a first-class entity
 Snapshot consistency semantics
 Multi-tenant access & metering
 Multiple data access methods including
via REST/HTTP (S3, Hadoop, CAS,
Atmos & Swift)
OBJECT
OBJECT
OBJECT
OBJECT
OBJECT
OBJECT
7© Copyright 2015 EMC Corporation. All rights reserved.
File and Object Storage Comparison
File Object
Writing to a file requires exclusive lock Object supports multiple writes, no locking
Limit on number of files in a directory. Objects are limitless in size, 1 MB to TBs, Objects scale
across multiple files
File meta-data is fixed by file system, no user
meta-data
Objects support extensible meta-data
Large files hard to seek Objects can be viewed with no limitation
File create operations require directory to be in
exclusivity lock
No locking required to create files
CIFS/NFS access not Web or firewall-friendly –
relies on file/folder access control and session-
based authentication
Easy, fine-grained authentication and access control
(per object),
HTTP, REST-based access
8© Copyright 2015 EMC Corporation. All rights reserved.
Object Use Cases are Expanding
• Existing Use Cases:
– Scalable content store for cloud-based
applications/services
– Scalable content storage for vertical
applications
– Tape rationalization/elimination
• Emerging Use Cases:
– Storage for Big Data/Hadoop
– NAS replacement/augmentation
– Public IaaS alternative
• Migrate to alternative providers or “in-sourcing”
Applications, analytics and data growth drive Object
Source: 451 Research
9© Copyright 2015 EMC Corporation. All rights reserved.
MODERN APPS ARE BREAKING TRADITIONAL STORAGE
NOT DESIGNED FOR CLOUDAND BIG DATA APPLICATIONS
• Architecture is too complex
– Locking, replication, High Availability, geo-
distribution is complex
• Not Web or firewall friendly
– Distributed (WAN) access is complex
• Storage silos impede development
– Different hardware for every data type and
access protocol
10© Copyright 2015 EMC Corporation. All rights reserved.
USE CASES
11© Copyright 2015 EMC Corporation. All rights reserved.
GLOBAL CONTENT REPOSITORY
ON-PREMISE UNSTRUCTURED STORAGE PLATFORM
PROBLEM
• Can’t cost-effectively manage or scale storage to
support explosive growth in unstructured content.
• Traditional storage not suited for new Web, mobile
and cloud applications.
• Difficult and costly to manage data lifecycle and
retention policies across archive silos and sites
VALUE
• Reduce complexity and cost–one globally
accessible, geo-efficient archive that serves
multiple applications and content types at lower
cost than public cloud.
• Anywhere data access – All data globally
accessible by Web, mobile and cloud apps.
• Enterprise-grade data protection – Efficient
geo-protection and policy-based retention for
basic compliance and governance.
https://accesspoint.yourcompany.com
U.K.L.A.
Memphis
Applications Tiering, Archiving,
Backup
12© Copyright 2015 EMC Corporation. All rights reserved.
MODERN APPLICATION PLATFORM
EFFICIENT GEO-CAPABLE STORAGE & ANYWHERE ACCESS
https://accesspoint.yourcompany.com
U.K.L.A.
Memphis
PROBLEM
• Traditional storage architecture not optimized for
multi-site, mobile access to content
• Writing to multiple file systems and proprietary
APIs complicates development
• Can’t access or process large data sets
VALUE
• Anywhere access - Provides anywhere access to
geo-replicated content
• Simpler, faster development - Supports
multiple industry standard APIs/protocols and
anywhere access with strong consistency
• Unmatched access and efficiency - Geo-
protection, active-active architecture optimizes
both access and storage efficiency for Big Data –
large and small files
13© Copyright 2015 EMC Corporation. All rights reserved.
GEO-SCALE BIG DATA ANALYTICS
EFFICIENT GEO-SCALE STORAGE & GLOBAL BIG DATA ANALYTICS
https://accesspoint.yourcompany.com
U.K.L.A.
New York
ANALYTICS
PROBLEM
• Large (and growing) data volumes lead to
exponential storage costs
• Traditional Hadoop replication leads to
unmanageable DC footprint with data growth
• Always have to move data to the analytics cluster
VALUE
• Cost Efficient Storage
• HDFS Archive – Bring state of the art patented
technology to provide highly dense storage for
Hadoop
• Global Analytics –Bring analytics to geo-
distributed data and archives
14© Copyright 2015 EMC Corporation. All rights reserved.
PROBLEM
• Unstructured data growth - Reclaim costly Tier 1 storage
• Current solutions aren’t scalable or cost efficient
• Instant access to cold-stored data is required
• “No Public Cloud” policy - Data needs to be on-premises
VALUE
Costs less than public cloud - Provides on-premises security
U.K.L.A.
Memphis
LAN/
WAN
Video
Unstructured
Data
Sensory Data
Images
COLD ARCHIVE
COSTEFFECTIVE LONG TERMRETENTION
15© Copyright 2015 EMC Corporation. All rights reserved.
PROBLEM
• Need cost effective solution to store hue amounts of
unstructured data generated by IOT and sensors
• “No Public Cloud” policy - Data needs to be on-premises
• Data collection via modern cloud applications requires
compatibility with APIS’s like S3 and OpenStack
• Analytics workflow is slow, expensive and complicated using
Hadoop direct attach or public cloud storage
VALUE
• Cost per GB is less than public clouds
• Provide high availability with on-premises security
• Compatible with S3, OpenStack, and other popular API’s
• HDFS compatible and enables a streamlined Hadoop workflow
for “data in place” analytics
‘IOT’ CLOUD STORAGE PLATFORM
‘INTERNET OF THINGS’ –SENSORY & TELEMETRY DATACOLLECTION
16© Copyright 2015 EMC Corporation. All rights reserved.
EMC & OBJECT STORAGE
17© Copyright 2015 EMC Corporation. All rights reserved.
 Hyper Scale - Sales Out to Billions of Objects
 “Public Cloud-Like” – Secure Access, Anytime, Anywhere
 Comprehensive Multi-Tenant Management
 Active/Active Geo-Distributed Architecture
 Multiple Protocol Support – REST and HDFS Ready
 Compelling Economics –Appliance or SW Only (DIY)
ECS HYPERSCALE CLOUD STORAGE
18© Copyright 2015 EMC Corporation. All rights reserved.
CUSTOMERS CAN LEVERAGE COMMODITY PLATFORMS
SOFTWARE-DEFINED STORAGE
Software-Defined
Storage
Commodity
Platforms
19© Copyright 2015 EMC Corporation. All rights reserved.
COMMODITY HARDWARE VALUE PROPOSITION
• Utilize standardized, open
technologies and mass market
components
• Individual components provide
lower performance, reliability, etc.
• At sufficient scale, with the right
software, the component pool
provides superior characteristics
20© Copyright 2015 EMC Corporation. All rights reserved.
ECS
SOFTWARE
Enterprise & SPs
Object & HDFS
DIY Commodity
ECS
APPLIANCE
Enterprise & SPs
Object & HDFS
integrated appliance
ViPR
DATASERVICES
Enterprise & SPs
Object & HDFS
File-based Arrays
CHOICE AND FLEXIBILITY
21© Copyright 2015 EMC Corporation. All rights reserved.
EMC OBJECT ECOSYSTEM
Enterprise Information
Archiving
Enterprise Content
Management
Analytics
Cloud Gateways
Migration
Sync & Share
21© Copyright 2015 EMC Corporation. All rights reserved.
Analytics
Protocols CAS
22© Copyright 2015 EMC Corporation. All rights reserved.
Learn | Try | Develop | Collaborate
Explore how-to
videos, helpful
guides, and
training
Download ViPR
FREE with no
time limit – for
non-production
use
Access SDKs,
FAQs, forums,
technical
documentation,
sample apps,
and more
Ask the
experts, talk to
peers, share
ideas and
experiences
www.emc.com/viprcommunity
JOIN THE ViPR COMMUNITY…
24© Copyright 2015 EMC Corporation. All rights reserved.
ECS TECHNICAL DETAIL
• ECSSTORAGE ENGINE
• WRITE PATH, READ PATH, BOX CARTING
• ECSGEO-CAPABILITIES
25© Copyright 2015 EMC Corporation. All rights reserved.
ECS ARCHITECTURE OVERVIEW
Object
Storage Engine
HDFS NFS*
• Multi-head access - ability to access same data concurrently
through multiple access protocols.
• Provides High Availability and Scalability.
• Manages transactions and persistent data.
• Protects data against failures, corruption and disasters
ECS
Appliance
Commodity
EMC and 3rd party
file arrays
26© Copyright 2015 EMC Corporation. All rights reserved.
COMPREHENSIVE DATA ACCESS
COMPATIBILITY WITH COMMONINDUSTRY API’S
• Simultaneous access to underlying data
through multiple interfaces
– Object, HDFS, File (future)
• HDFS compatible with Cloudera,
Hortonworks, Pivotal etc.
• Support for S3, Swift, Atmos and Centera
CAS APIs object
• Extensions to APIs
– Byte-Range updates, Atomic appends, Rich
ACLs etc.
ATMOS
27© Copyright 2015 EMC Corporation. All rights reserved.
DESIGN PRINCIPLE: LAYERED ARCHITECTURE
 Limitless Scale: Each layer is
independently scalable, highly
available, and has no single
point of failure.
 Scale-Out Architecture: Scale
by adding more nodes, no
special nodes or roles
 Global Namespace: Any node
has full system view off data and
meta-data
Persistence Layer
Storage Engine
JBODs
OBJECT
HDFS
OBJECT
HDFS
OBJECT
HDFS
28© Copyright 2015 EMC Corporation. All rights reserved.
TRANSACTION FLOW (WRITE)
Node 1
1. Create Object request
(Name, data, metadata)
Node 1 Node 4 Node 5
2. Write of data and
metadata in chunk
All three copies written in parallel. Write successful only if all copies ack
Node 2
5. Back to client
3. Index update (name,
location) to the owner
partition
4. Journal write
29© Copyright 2015 EMC Corporation. All rights reserved.
ERASURE CODING
 Data is written into chunks - 3 copies
 Erasure coding begins as the chunks are shipped
 Once EC completes, the data becomes fully
protected and the 3 copies are deleted
A
A
AA
30© Copyright 2015 EMC Corporation. All rights reserved.
GARBAGE COLLECTION
 In Append-only systems updates/deletes cause files
to have blocks of data that are unused
 This is done at the level of chunk
 Unused chunks reclaimed by a background task
31© Copyright 2015 EMC Corporation. All rights reserved.
Node 1
1. Read Object request
3. Read data
Node 2
4. Send data back
2. Get Location
TRANSACTION FLOW (READ)
32© Copyright 2015 EMC Corporation. All rights reserved.
BOX CARTING: CHUNK WRITE
Node
Buffered Writer
Acks
(PARALELL SYNC WRITE)
33© Copyright 2015 EMC Corporation. All rights reserved.
DATA WRITTEN IN APPEND ONLY CHUNKS
• Data is written in an append-
only pattern.
• No data is overwritten or
modified.
• No locking required for I/O.
• No cache invalidation
required.
• Journaling, snapshot and
versioning natively built-in
• ECS stores all types of data
and index in “chunks”
• Chunks are
– Logical containers of
contiguous space
(128MB)
– Written in an append-
only pattern
• All data protection operations
are done on chunks
34© Copyright 2015 EMC Corporation. All rights reserved.
ECS STORAGE ENGINE: KEY BENEFITS
• All nodes can process write requests for the same
object simultaneously, and write to different sets of
disks.
• Throughput takes advantage of all spindles and NICs
in cluster.
• Payload from multiple small objects are aggregated
in memory and written in a single disk write
• Efficient storage for both small and large data
35© Copyright 2015 EMC Corporation. All rights reserved.
 Unstructured configurations
– Object & HDFS
 Available in multiple capacities within
a rack
 Clustering across racks scales to 100s
of PBs
HARDWARE CONFIGURATIONS
36© Copyright 2015 EMC Corporation. All rights reserved.
ANALYTICS
37© Copyright 2015 EMC Corporation. All rights reserved.
SAMPLE HADOOP WORKFLOW
HDFS
Analytical Models
(Hive, HAWQ)
Data Visualizations
(Tableau)
Variety
of Data
Sources
Data
Cleansing
Ingest
Data Scientists
Store
Analyze
Surface
38© Copyright 2015 EMC Corporation. All rights reserved.
HADOOP PROCESSING MODEL
SHARED STORAGE MODEL
VNXVMAX Isilon CommodityVNX Isilon 3rd Party Commodity
• Enables common Data
Lake for LOB application
storage and analytics
• Scale compute & storage
independently
• Multiple distributions,
clusters connect to the
same data
ECS
39© Copyright 2015 EMC Corporation. All rights reserved.
CHALLENGES WITH HDFS
• HDFS not Enterprise-Grade
– Requires three full copies of data, no erasure
coding
– No Geo-distribution, Limited DR, Multi-tenancy
– Inefficient for handling small files
• High Availability Still In Progress
– No Active-Active Failover even with the
secondary NN
• DAS architecture not suitable for some
customers
– Lack of Enterprise Data Governance Features
40© Copyright 2015 EMC Corporation. All rights reserved.
HDFS DATA SERVICE OVERVIEW
• Addresses limitations of off-the-
shelf HDFS
• Brings HDFS to existing storage
hardware
• Enables HDFS/Object/File scenarios
• Flexible software model allows for
future co-location of compute and
storage
41© Copyright 2015 EMC Corporation. All rights reserved.
HDFS DATA SERVICE OVERVIEW
• API head
– Custom client/server protocol optimized for
high scale
– Uses the same unstructured storage engine as
ECS/ViPR Object data service
• Client library over the HDFS API
– Provides a viprfs:// drop-in replacement for
HDFS 2.0
– Can be seamlessly added to existing Hadoop
distributions
• Implemented as a Hadoop Compatible
Filesystem (HCFS)
– Supports HDFS 2.0 and 2.2
42© Copyright 2015 EMC Corporation. All rights reserved.
HDFS ARCHITECTURE
RM / AsM
Commodity Compute & Storage
Node Manager
Data Store
MapReduce Task
Client
Node Manager
Data Store
MapReduce Task
Node Manager
Data Store
MapReduce Task
NAME
NODE
SEC
NAME
NODE
43© Copyright 2015 EMC Corporation. All rights reserved.
ViPR/ECS HDFS ARCHITECTURE Client
Node Manager
MapReduce Task
ViPR/ECS
Client
Node Manager
MapReduce Task
ViPR/ECS
Client
Node Manager
MapReduce Task
ViPR/ECS
Client
NAME
NODE
RM/AsM
SEC
NAME
NODE
44© Copyright 2015 EMC Corporation. All rights reserved.
Customer’s Hadoop
Compute Cluster
HDFS – ECS APPLIANCE DEPLOYMENT
ViPR Controller
VMViPR Controller
VMViPR Controller
VM
Data Read/
Write
Object/HDFS
…
Object/HDFS
45© Copyright 2015 EMC Corporation. All rights reserved.
HDFS DATA SERVICE/ECS ARCHITECTURE
ECS Storage Engine
HDFS
API
Head
S3
API
Head
Customer’s Hadoop
Compute Cluster
ViPR Data Service Node
Data Read/Write
via ViPR HDFS
46© Copyright 2015 EMC Corporation. All rights reserved.
HDFS DATA SERVICE VALUE PROPOSTION
• High Availability Built-In, No SPOF
• Avoids multiple copies of data
• Erasure Coding Support
• Geo-Distributed Across Sites
• Multi-tenancy, Metering, Chargeback
• Allows Byte-Range Updates Through
S3 Interface
• ViPR Controller aids in Management &
Monitoring
47© Copyright 2015 EMC Corporation. All rights reserved.
UPCOMING ECS INTEGRATIONS
48© Copyright 2015 EMC Corporation. All rights reserved.
ISILON CLOUDPOOLS
SMART TIERING TO OBJECT STORES
Key Features
Benefits
 Stub to Cloud of choice
 Extending SmartPools workflow to
CloudPools
 Ability to send encrypted data to the
cloud
 Compression for efficient transport
 Simple policy based management
 Combine file & object store benefits
 Use stubs to optimize local storage
space, with offsite archive protection
 Seamless placement and availability
of data per policy
 One Accessible namespace
SmartPools ->
CloudPools
Clients
SMB | NFS | REST| HDFS | SWIFT
OneFS
Service
Provider
Public
Cloud
ECS
49© Copyright 2015 EMC Corporation. All rights reserved.
EMC CLOUDBOOST
LONG TERM RETENTION TO THE CLOUD
LAN LAN/WAN
CloudBoost
appliance
Desktops
Laptops
Files NAS/NDMP
VMware &
Hyper-V
Databases
Email Applications
DB
ROBO
Protected by NetWorker,
Avamar, NetBackup
Key Features
Benefits
 Long-term retention to ECS for
NetWorker, Avamar, NetBackup
 Inline variable de-duplication and
compression
 Data encrypted in-flight and at rest
 Cloud choice: private/public clouds
 Appliance cache for ROBO
 Capacity of up to 6PB logical per
appliance
 Central management via a cloud
portal
 Lower storage cost per TB
 Efficiency: Reduced network and
storage consumption
 Lower risk, operational overhead
than tape.
 Airtight security
50© Copyright 2015 EMC Corporation. All rights reserved.
WRITE PATH, READ PATH,BOX CARTING
51© Copyright 2015 EMC Corporation. All rights reserved.
TRANSACTION FLOW (WRITE)
Node 1
1. Create Object request
(Name, data, metadata)
Node 1 Node 4 Node 5
2. Write of data and
metadata in chunk.
All three copies written in parallel. Write successful only if all copies ack.
Node 2
5. Back to
client
3. Index update
(name, location) to
the owner partition.
4. Journal write
52© Copyright 2015 EMC Corporation. All rights reserved.
ERASURE CODING
 Data is written into chunks 3 copies
 Once a chunk fills to 128 MB, erasure coding starts
 Once it is completed and data is protected the 3
copies are deleted.
A
A
AA
53© Copyright 2015 EMC Corporation. All rights reserved.
GARBAGE COLLECTION
 In Append-only systems updates/deletes cause files
to have blocks of data that are unused.
 This is done at the level of chunk
 Unused chunks reclaimed by a background task
54© Copyright 2015 EMC Corporation. All rights reserved.
Node 1
1. Read Object request
3. Read data
Node 2
4. Send data
back
2. Get Location
TRANSACTION FLOW (READ)
55© Copyright 2015 EMC Corporation. All rights reserved.
BOX CARTING: CHUNK WRITE
Node
Buffered Writer
Acks
(PARALELL SYNC WRITE)
56© Copyright 2015 EMC Corporation. All rights reserved.
• ECSGEO-STORAGE OVERVIEW
• DATAPROTECTION
• GLOBAL DATAACCESS
ECS GEO-CAPABILITES
57© Copyright 2015 EMC Corporation. All rights reserved.
ECS GEO-STORAGE OVERVIEW
• Data Protection
– Protection against data center failure
– Seamless failover and recovery
• Global Data Access
– Global namespace
– Ability to read/write data from any site
• Optimized Storage
– Low storage overhead
– WAN Optimization
• Applicable to all unstructured Storage Engine based
heads
– Object, HDFS
– File when available
58© Copyright 2015 EMC Corporation. All rights reserved.
DATA PROTECTION
59© Copyright 2015 EMC Corporation. All rights reserved.
INDUSTRY SOLUTION: MIRROR COPY
• Mirrored copy in a backup site
• Benefit: Achieves Local
reconstruction on hardware failure
• Shortcoming: Storage overhead
-> 2.66xPrimary Secondary
60© Copyright 2015 EMC Corporation. All rights reserved.
INDUSTRY SOLUTION: DISTRIBUTED ERASURE
CODING
• Distributing fragments across
sites
• Benefit: Achieves low Storage
Overhead ~ 1.6x
• Shortcoming: Disk/Node failure
requires fragments to be fetched
over the WAN.
Site 1 Site 2
Site 3 Site 4
61© Copyright 2015 EMC Corporation. All rights reserved.
ECS MODEL: BEST OF BOTH WORLDS
• Achieves low Storage Overhead
~ 1.8x
• Local hardware failure recovery
requires no WAN traffic.
• Handles local hardware and full
data center failures
– Disk, Node, Rack, Data Center are
failure domains
Site 1 Site 2
Site 3 Site 4
62© Copyright 2015 EMC Corporation. All rights reserved.
GLOBAL DATA ACCESS
63© Copyright 2015 EMC Corporation. All rights reserved.
INDUSTRY SOLUTION: SEGREGATED NAMESPACE
• Customers are asked to pick a
location for each bucket.
• Shortcoming: sites are vertical
silos, unaware of each other’s
namespaces.
Site 1 Site 2
app app
Bucket BBucket A
64© Copyright 2015 EMC Corporation. All rights reserved.
INDUSTRY SOLUTION: MULTI-ACCESS WITH
EVENTUAL CONSISTENCY
• Global Namespace with read only
replicas.
• Replicas have eventual consistency
• Shortcomings: Difficult to write
applications against eventual
consistency models
Site 1 Site 2
app
Bucket A
app
read only
65© Copyright 2015 EMC Corporation. All rights reserved.
ECS: MULTI-ACCESS WITH STRONG
CONSISTENCY
• Global Namespace: buckets
stretches across sites
• Global Access: Any data can be
read and written to any site
• Strongly consistent: Always
returning latest version without
requiring synchronous write.
Site 1 Site 2
Bucket A
app app app
66© Copyright 2015 EMC Corporation. All rights reserved.
OPTIMIZED STORAGE
• Low Storage Overhead: ~1.8x
replication over head across 4 sites
• WAN Optimization: All node and
disk failures are repaired within the
site, without any WAN traffic.
67© Copyright 2015 EMC Corporation. All rights reserved.
STORAGE OVERHEAD
# of Data Centers Overhead
1 1.33 x
2 2.67 x
3 2.00 x
4 1.77 x
5 1.67 x
6 1.60 x
7 1.55 x
8 1.52 x
68© Copyright 2015 EMC Corporation. All rights reserved.
ECS GEO KEY DIFFERENTIATORS
• Tolerates one site disaster along
with up to 2 node failures in all the
rest of the sites.
• Component failures are recovered
using fragments from local site
without WAN traffic
• Geo-efficient (~1.8 copies across 4
sites) without WAN read/write
penalties
69© Copyright 2015 EMC Corporation. All rights reserved.
ECS APPLIANCE
70© Copyright 2015 EMC Corporation. All rights reserved.
 Not Contradictory!
 Components are Commodity
 x86 Servers
 Ethernet Networking
 SATA Disk Drives
 Innovation in how they’re put
together to enable reliability,
availability, and serviceability!
COMMODITY INNOVATION
71© Copyright 2015 EMC Corporation. All rights reserved.
ECS APPLIANCE CHARACTERISTICS
• Use COTS Components
– Economies of scale
• Density Optimized
– Up to 72TB Raw / Rack Unit
– Saves Power/GB, Real Estate costs, etc.
• Labor Optimized
– Manage the cluster, not the devices
– Maximize Serviceability
• Protection Efficiency
– Geo-efficient storage
ECS/Cloud Object Storage - DevOps Day

Weitere ähnliche Inhalte

Was ist angesagt?

Enterprise Backup & Recovery to the Cloud by CommVault
Enterprise Backup & Recovery to the Cloud by CommVaultEnterprise Backup & Recovery to the Cloud by CommVault
Enterprise Backup & Recovery to the Cloud by CommVaultAmazon Web Services
 
Ceph Object Storage Reference Architecture Performance and Sizing Guide
Ceph Object Storage Reference Architecture Performance and Sizing GuideCeph Object Storage Reference Architecture Performance and Sizing Guide
Ceph Object Storage Reference Architecture Performance and Sizing GuideKaran Singh
 
High Availability & Disaster Recovery with SQL Server 2012 AlwaysOn Availabil...
High Availability & Disaster Recovery with SQL Server 2012 AlwaysOn Availabil...High Availability & Disaster Recovery with SQL Server 2012 AlwaysOn Availabil...
High Availability & Disaster Recovery with SQL Server 2012 AlwaysOn Availabil...turgaysahtiyan
 
Introduction to Google Cloud Platform
Introduction to Google Cloud PlatformIntroduction to Google Cloud Platform
Introduction to Google Cloud PlatformOpsta
 
Azure Virtual Desktop Overview.pptx
Azure Virtual Desktop Overview.pptxAzure Virtual Desktop Overview.pptx
Azure Virtual Desktop Overview.pptxceyhan1
 
Ibm pure data system for analytics n200x
Ibm pure data system for analytics n200xIbm pure data system for analytics n200x
Ibm pure data system for analytics n200xIBM Sverige
 
Ceph: Open Source Storage Software Optimizations on Intel® Architecture for C...
Ceph: Open Source Storage Software Optimizations on Intel® Architecture for C...Ceph: Open Source Storage Software Optimizations on Intel® Architecture for C...
Ceph: Open Source Storage Software Optimizations on Intel® Architecture for C...Odinot Stanislas
 
AWS Storage Services - AWS Presentation - AWS Cloud Storage for the Enterpris...
AWS Storage Services - AWS Presentation - AWS Cloud Storage for the Enterpris...AWS Storage Services - AWS Presentation - AWS Cloud Storage for the Enterpris...
AWS Storage Services - AWS Presentation - AWS Cloud Storage for the Enterpris...Amazon Web Services
 
IBM Spectrum Scale for File and Object Storage
IBM Spectrum Scale for File and Object StorageIBM Spectrum Scale for File and Object Storage
IBM Spectrum Scale for File and Object StorageTony Pearson
 
ElastiCache Deep Dive: Design Patterns for In-Memory Data Stores (DAT302-R1) ...
ElastiCache Deep Dive: Design Patterns for In-Memory Data Stores (DAT302-R1) ...ElastiCache Deep Dive: Design Patterns for In-Memory Data Stores (DAT302-R1) ...
ElastiCache Deep Dive: Design Patterns for In-Memory Data Stores (DAT302-R1) ...Amazon Web Services
 
Modern Device Management Intune Policies vs Group Policies
Modern Device Management Intune Policies vs Group PoliciesModern Device Management Intune Policies vs Group Policies
Modern Device Management Intune Policies vs Group PoliciesAnoop Nair
 
Deep dive session - sap and aws - extend and innovate
Deep dive session - sap and aws - extend and innovateDeep dive session - sap and aws - extend and innovate
Deep dive session - sap and aws - extend and innovateRitesh Toshniwal
 
Ceph Performance and Sizing Guide
Ceph Performance and Sizing GuideCeph Performance and Sizing Guide
Ceph Performance and Sizing GuideJose De La Rosa
 
Kata Container - The Security of VM and The Speed of Container | Yuntong Jin
Kata Container - The Security of VM and The Speed of Container | Yuntong Jin	Kata Container - The Security of VM and The Speed of Container | Yuntong Jin
Kata Container - The Security of VM and The Speed of Container | Yuntong Jin Vietnam Open Infrastructure User Group
 
Data Warehouse approaches with Dynamics AX
Data Warehouse  approaches with Dynamics AXData Warehouse  approaches with Dynamics AX
Data Warehouse approaches with Dynamics AXAlvin You
 
Best practices with Microsoft Graph: Making your applications more performant...
Best practices with Microsoft Graph: Making your applications more performant...Best practices with Microsoft Graph: Making your applications more performant...
Best practices with Microsoft Graph: Making your applications more performant...Microsoft Tech Community
 
Demystifying Data Warehouse as a Service
Demystifying Data Warehouse as a ServiceDemystifying Data Warehouse as a Service
Demystifying Data Warehouse as a ServiceSnowflake Computing
 

Was ist angesagt? (20)

Enterprise Backup & Recovery to the Cloud by CommVault
Enterprise Backup & Recovery to the Cloud by CommVaultEnterprise Backup & Recovery to the Cloud by CommVault
Enterprise Backup & Recovery to the Cloud by CommVault
 
Ceph Object Storage Reference Architecture Performance and Sizing Guide
Ceph Object Storage Reference Architecture Performance and Sizing GuideCeph Object Storage Reference Architecture Performance and Sizing Guide
Ceph Object Storage Reference Architecture Performance and Sizing Guide
 
High Availability & Disaster Recovery with SQL Server 2012 AlwaysOn Availabil...
High Availability & Disaster Recovery with SQL Server 2012 AlwaysOn Availabil...High Availability & Disaster Recovery with SQL Server 2012 AlwaysOn Availabil...
High Availability & Disaster Recovery with SQL Server 2012 AlwaysOn Availabil...
 
Introduction to Google Cloud Platform
Introduction to Google Cloud PlatformIntroduction to Google Cloud Platform
Introduction to Google Cloud Platform
 
Azure Virtual Desktop Overview.pptx
Azure Virtual Desktop Overview.pptxAzure Virtual Desktop Overview.pptx
Azure Virtual Desktop Overview.pptx
 
Ibm pure data system for analytics n200x
Ibm pure data system for analytics n200xIbm pure data system for analytics n200x
Ibm pure data system for analytics n200x
 
SCCM 2012 Presentation
SCCM 2012 PresentationSCCM 2012 Presentation
SCCM 2012 Presentation
 
Ceph: Open Source Storage Software Optimizations on Intel® Architecture for C...
Ceph: Open Source Storage Software Optimizations on Intel® Architecture for C...Ceph: Open Source Storage Software Optimizations on Intel® Architecture for C...
Ceph: Open Source Storage Software Optimizations on Intel® Architecture for C...
 
AWS Storage Services - AWS Presentation - AWS Cloud Storage for the Enterpris...
AWS Storage Services - AWS Presentation - AWS Cloud Storage for the Enterpris...AWS Storage Services - AWS Presentation - AWS Cloud Storage for the Enterpris...
AWS Storage Services - AWS Presentation - AWS Cloud Storage for the Enterpris...
 
IBM Spectrum Scale for File and Object Storage
IBM Spectrum Scale for File and Object StorageIBM Spectrum Scale for File and Object Storage
IBM Spectrum Scale for File and Object Storage
 
ElastiCache Deep Dive: Design Patterns for In-Memory Data Stores (DAT302-R1) ...
ElastiCache Deep Dive: Design Patterns for In-Memory Data Stores (DAT302-R1) ...ElastiCache Deep Dive: Design Patterns for In-Memory Data Stores (DAT302-R1) ...
ElastiCache Deep Dive: Design Patterns for In-Memory Data Stores (DAT302-R1) ...
 
Modern Device Management Intune Policies vs Group Policies
Modern Device Management Intune Policies vs Group PoliciesModern Device Management Intune Policies vs Group Policies
Modern Device Management Intune Policies vs Group Policies
 
Deep dive session - sap and aws - extend and innovate
Deep dive session - sap and aws - extend and innovateDeep dive session - sap and aws - extend and innovate
Deep dive session - sap and aws - extend and innovate
 
Ceph Performance and Sizing Guide
Ceph Performance and Sizing GuideCeph Performance and Sizing Guide
Ceph Performance and Sizing Guide
 
SQL Server 2012 Best Practices
SQL Server 2012 Best PracticesSQL Server 2012 Best Practices
SQL Server 2012 Best Practices
 
Kata Container - The Security of VM and The Speed of Container | Yuntong Jin
Kata Container - The Security of VM and The Speed of Container | Yuntong Jin	Kata Container - The Security of VM and The Speed of Container | Yuntong Jin
Kata Container - The Security of VM and The Speed of Container | Yuntong Jin
 
Data Warehouse approaches with Dynamics AX
Data Warehouse  approaches with Dynamics AXData Warehouse  approaches with Dynamics AX
Data Warehouse approaches with Dynamics AX
 
Deep Dive on Amazon RDS
Deep Dive on Amazon RDSDeep Dive on Amazon RDS
Deep Dive on Amazon RDS
 
Best practices with Microsoft Graph: Making your applications more performant...
Best practices with Microsoft Graph: Making your applications more performant...Best practices with Microsoft Graph: Making your applications more performant...
Best practices with Microsoft Graph: Making your applications more performant...
 
Demystifying Data Warehouse as a Service
Demystifying Data Warehouse as a ServiceDemystifying Data Warehouse as a Service
Demystifying Data Warehouse as a Service
 

Andere mochten auch

What is Object storage ?
What is Object storage ?What is Object storage ?
What is Object storage ?Nabil Kassi
 
IBM Object Storage and Software Defined Solutions - Cleversafe
IBM Object Storage and Software Defined Solutions - CleversafeIBM Object Storage and Software Defined Solutions - Cleversafe
IBM Object Storage and Software Defined Solutions - CleversafeDiego Alberto Tamayo
 
IBM Cloud Storage - Cleversafe
IBM Cloud Storage - CleversafeIBM Cloud Storage - Cleversafe
IBM Cloud Storage - CleversafeMichael Beatty
 
cleversafe_definitive_guide_white_paper
cleversafe_definitive_guide_white_papercleversafe_definitive_guide_white_paper
cleversafe_definitive_guide_white_paperChris Woeppel
 
MT126 Virtustream Storage Cloud: Hyperscale Cloud Object Storage Built for th...
MT126 Virtustream Storage Cloud: Hyperscale Cloud Object Storage Built for th...MT126 Virtustream Storage Cloud: Hyperscale Cloud Object Storage Built for th...
MT126 Virtustream Storage Cloud: Hyperscale Cloud Object Storage Built for th...Dell EMC World
 
IBM Spectrum Scale for File and Object Storage
IBM Spectrum Scale for File and Object StorageIBM Spectrum Scale for File and Object Storage
IBM Spectrum Scale for File and Object StorageTony Pearson
 
Emc ecs 2 technical deep dive workshop
Emc ecs 2 technical deep dive workshopEmc ecs 2 technical deep dive workshop
Emc ecs 2 technical deep dive workshopsolarisyougood
 
Introducing Lattus Object Storage
Introducing Lattus Object StorageIntroducing Lattus Object Storage
Introducing Lattus Object StorageQuantum
 
EMC Vnx master-presentation
EMC Vnx master-presentationEMC Vnx master-presentation
EMC Vnx master-presentationsolarisyougood
 
Cleversafe single page
Cleversafe single pageCleversafe single page
Cleversafe single pageJoe Krotz
 
최음제 『 W3.ow.to 』 톡 w2015 ♡ 최음제판매, 최음제 효과,최음제 정품구입,최음제부작용,최음제지속시간
최음제 『 W3.ow.to  』 톡 w2015 ♡ 최음제판매, 최음제 효과,최음제 정품구입,최음제부작용,최음제지속시간최음제 『 W3.ow.to  』 톡 w2015 ♡ 최음제판매, 최음제 효과,최음제 정품구입,최음제부작용,최음제지속시간
최음제 『 W3.ow.to 』 톡 w2015 ♡ 최음제판매, 최음제 효과,최음제 정품구입,최음제부작용,최음제지속시간전 윤희
 
Storage as a service OpenStack
Storage as a service OpenStackStorage as a service OpenStack
Storage as a service OpenStackopenstackindia
 
SNIA : Swift Object Storage adding EC (Erasure Code)
SNIA : Swift Object Storage adding EC (Erasure Code)SNIA : Swift Object Storage adding EC (Erasure Code)
SNIA : Swift Object Storage adding EC (Erasure Code)Odinot Stanislas
 
SoftLayer Storage Services Overview
SoftLayer Storage Services OverviewSoftLayer Storage Services Overview
SoftLayer Storage Services OverviewMichael Fork
 
Openstack Swift Introduction
Openstack Swift IntroductionOpenstack Swift Introduction
Openstack Swift IntroductionPark YounSung
 
Deploying OpenStack Object Storage (Swift)
Deploying OpenStack Object Storage (Swift)Deploying OpenStack Object Storage (Swift)
Deploying OpenStack Object Storage (Swift)Juan José Martínez
 
OpenStack Swift In the Enterprise
OpenStack Swift In the EnterpriseOpenStack Swift In the Enterprise
OpenStack Swift In the EnterpriseHostway|HOSTING
 
Turning OpenStack Swift into a VM storage platform
Turning OpenStack Swift into a VM storage platformTurning OpenStack Swift into a VM storage platform
Turning OpenStack Swift into a VM storage platformOpenStack_Online
 

Andere mochten auch (20)

EMC EC Overview
EMC EC OverviewEMC EC Overview
EMC EC Overview
 
What is Object storage ?
What is Object storage ?What is Object storage ?
What is Object storage ?
 
IBM Object Storage and Software Defined Solutions - Cleversafe
IBM Object Storage and Software Defined Solutions - CleversafeIBM Object Storage and Software Defined Solutions - Cleversafe
IBM Object Storage and Software Defined Solutions - Cleversafe
 
IBM Cloud Storage - Cleversafe
IBM Cloud Storage - CleversafeIBM Cloud Storage - Cleversafe
IBM Cloud Storage - Cleversafe
 
cleversafe_definitive_guide_white_paper
cleversafe_definitive_guide_white_papercleversafe_definitive_guide_white_paper
cleversafe_definitive_guide_white_paper
 
MT126 Virtustream Storage Cloud: Hyperscale Cloud Object Storage Built for th...
MT126 Virtustream Storage Cloud: Hyperscale Cloud Object Storage Built for th...MT126 Virtustream Storage Cloud: Hyperscale Cloud Object Storage Built for th...
MT126 Virtustream Storage Cloud: Hyperscale Cloud Object Storage Built for th...
 
IBM Spectrum Scale for File and Object Storage
IBM Spectrum Scale for File and Object StorageIBM Spectrum Scale for File and Object Storage
IBM Spectrum Scale for File and Object Storage
 
Emc ecs 2 technical deep dive workshop
Emc ecs 2 technical deep dive workshopEmc ecs 2 technical deep dive workshop
Emc ecs 2 technical deep dive workshop
 
Introducing Lattus Object Storage
Introducing Lattus Object StorageIntroducing Lattus Object Storage
Introducing Lattus Object Storage
 
EMC Vnx master-presentation
EMC Vnx master-presentationEMC Vnx master-presentation
EMC Vnx master-presentation
 
EMC VNX
EMC VNXEMC VNX
EMC VNX
 
Cleversafe single page
Cleversafe single pageCleversafe single page
Cleversafe single page
 
최음제 『 W3.ow.to 』 톡 w2015 ♡ 최음제판매, 최음제 효과,최음제 정품구입,최음제부작용,최음제지속시간
최음제 『 W3.ow.to  』 톡 w2015 ♡ 최음제판매, 최음제 효과,최음제 정품구입,최음제부작용,최음제지속시간최음제 『 W3.ow.to  』 톡 w2015 ♡ 최음제판매, 최음제 효과,최음제 정품구입,최음제부작용,최음제지속시간
최음제 『 W3.ow.to 』 톡 w2015 ♡ 최음제판매, 최음제 효과,최음제 정품구입,최음제부작용,최음제지속시간
 
Storage as a service OpenStack
Storage as a service OpenStackStorage as a service OpenStack
Storage as a service OpenStack
 
SNIA : Swift Object Storage adding EC (Erasure Code)
SNIA : Swift Object Storage adding EC (Erasure Code)SNIA : Swift Object Storage adding EC (Erasure Code)
SNIA : Swift Object Storage adding EC (Erasure Code)
 
SoftLayer Storage Services Overview
SoftLayer Storage Services OverviewSoftLayer Storage Services Overview
SoftLayer Storage Services Overview
 
Openstack Swift Introduction
Openstack Swift IntroductionOpenstack Swift Introduction
Openstack Swift Introduction
 
Deploying OpenStack Object Storage (Swift)
Deploying OpenStack Object Storage (Swift)Deploying OpenStack Object Storage (Swift)
Deploying OpenStack Object Storage (Swift)
 
OpenStack Swift In the Enterprise
OpenStack Swift In the EnterpriseOpenStack Swift In the Enterprise
OpenStack Swift In the Enterprise
 
Turning OpenStack Swift into a VM storage platform
Turning OpenStack Swift into a VM storage platformTurning OpenStack Swift into a VM storage platform
Turning OpenStack Swift into a VM storage platform
 

Ähnlich wie ECS/Cloud Object Storage - DevOps Day

Modern infrastructure for business data lake
Modern infrastructure for business data lakeModern infrastructure for business data lake
Modern infrastructure for business data lakeEMC
 
NetApp Se training storage grid webscale technical overview
NetApp Se training   storage grid webscale technical overviewNetApp Se training   storage grid webscale technical overview
NetApp Se training storage grid webscale technical overviewsolarisyougood
 
Se training storage grid webscale technical overview
Se training   storage grid webscale technical overviewSe training   storage grid webscale technical overview
Se training storage grid webscale technical overviewsolarisyougood
 
Emc data domain technical deep dive workshop
Emc data domain  technical deep dive workshopEmc data domain  technical deep dive workshop
Emc data domain technical deep dive workshopsolarisyougood
 
S100293 hybrid-cloud-orlando-v1804a
S100293 hybrid-cloud-orlando-v1804aS100293 hybrid-cloud-orlando-v1804a
S100293 hybrid-cloud-orlando-v1804aTony Pearson
 
Emc vi pr data services
Emc vi pr data servicesEmc vi pr data services
Emc vi pr data servicessolarisyougood
 
Webinar: Cloud Storage: The 5 Reasons IT Can Do it Better
Webinar: Cloud Storage: The 5 Reasons IT Can Do it BetterWebinar: Cloud Storage: The 5 Reasons IT Can Do it Better
Webinar: Cloud Storage: The 5 Reasons IT Can Do it BetterStorage Switzerland
 
Webinar: End NAS Sprawl - Gain Control Over Unstructured Data
Webinar: End NAS Sprawl - Gain Control Over Unstructured DataWebinar: End NAS Sprawl - Gain Control Over Unstructured Data
Webinar: End NAS Sprawl - Gain Control Over Unstructured DataStorage Switzerland
 
BIOIT14: Deploying very low cost cloud storage technology in a traditional re...
BIOIT14: Deploying very low cost cloud storage technology in a traditional re...BIOIT14: Deploying very low cost cloud storage technology in a traditional re...
BIOIT14: Deploying very low cost cloud storage technology in a traditional re...Dirk Petersen
 
Achieving Separation of Compute and Storage in a Cloud World
Achieving Separation of Compute and Storage in a Cloud WorldAchieving Separation of Compute and Storage in a Cloud World
Achieving Separation of Compute and Storage in a Cloud WorldAlluxio, Inc.
 
Emc vi pr global data services
Emc vi pr global data servicesEmc vi pr global data services
Emc vi pr global data servicessolarisyougood
 
Unify Data at Memory Speed
Unify Data at Memory SpeedUnify Data at Memory Speed
Unify Data at Memory SpeedAlluxio, Inc.
 
Introducing IBM Spectrum Scale 4.2 and Elastic Storage Server 3.5
Introducing IBM Spectrum Scale 4.2 and Elastic Storage Server 3.5Introducing IBM Spectrum Scale 4.2 and Elastic Storage Server 3.5
Introducing IBM Spectrum Scale 4.2 and Elastic Storage Server 3.5Doug O'Flaherty
 
Accelerate Analytics and ML in the Hybrid Cloud Era
Accelerate Analytics and ML in the Hybrid Cloud EraAccelerate Analytics and ML in the Hybrid Cloud Era
Accelerate Analytics and ML in the Hybrid Cloud EraAlluxio, Inc.
 
Le Software Defined Solutions, ou comment automatiser les ressources IT ?
Le Software Defined Solutions, ou comment automatiser les ressources IT ?Le Software Defined Solutions, ou comment automatiser les ressources IT ?
Le Software Defined Solutions, ou comment automatiser les ressources IT ?RSD
 
Accelerate Analytics and ML in the Hybrid Cloud Era
Accelerate Analytics and ML in the Hybrid Cloud EraAccelerate Analytics and ML in the Hybrid Cloud Era
Accelerate Analytics and ML in the Hybrid Cloud EraAlluxio, Inc.
 

Ähnlich wie ECS/Cloud Object Storage - DevOps Day (20)

Modern infrastructure for business data lake
Modern infrastructure for business data lakeModern infrastructure for business data lake
Modern infrastructure for business data lake
 
NetApp Se training storage grid webscale technical overview
NetApp Se training   storage grid webscale technical overviewNetApp Se training   storage grid webscale technical overview
NetApp Se training storage grid webscale technical overview
 
Se training storage grid webscale technical overview
Se training   storage grid webscale technical overviewSe training   storage grid webscale technical overview
Se training storage grid webscale technical overview
 
Emc data domain technical deep dive workshop
Emc data domain  technical deep dive workshopEmc data domain  technical deep dive workshop
Emc data domain technical deep dive workshop
 
S100293 hybrid-cloud-orlando-v1804a
S100293 hybrid-cloud-orlando-v1804aS100293 hybrid-cloud-orlando-v1804a
S100293 hybrid-cloud-orlando-v1804a
 
EMC Unified Analytics Platform. Gintaras Pelenis
EMC Unified Analytics Platform. Gintaras PelenisEMC Unified Analytics Platform. Gintaras Pelenis
EMC Unified Analytics Platform. Gintaras Pelenis
 
Emc vi pr data services
Emc vi pr data servicesEmc vi pr data services
Emc vi pr data services
 
Webinar: Cloud Storage: The 5 Reasons IT Can Do it Better
Webinar: Cloud Storage: The 5 Reasons IT Can Do it BetterWebinar: Cloud Storage: The 5 Reasons IT Can Do it Better
Webinar: Cloud Storage: The 5 Reasons IT Can Do it Better
 
Cleversafe.PPTX
Cleversafe.PPTXCleversafe.PPTX
Cleversafe.PPTX
 
Webinar: End NAS Sprawl - Gain Control Over Unstructured Data
Webinar: End NAS Sprawl - Gain Control Over Unstructured DataWebinar: End NAS Sprawl - Gain Control Over Unstructured Data
Webinar: End NAS Sprawl - Gain Control Over Unstructured Data
 
BIOIT14: Deploying very low cost cloud storage technology in a traditional re...
BIOIT14: Deploying very low cost cloud storage technology in a traditional re...BIOIT14: Deploying very low cost cloud storage technology in a traditional re...
BIOIT14: Deploying very low cost cloud storage technology in a traditional re...
 
Achieving Separation of Compute and Storage in a Cloud World
Achieving Separation of Compute and Storage in a Cloud WorldAchieving Separation of Compute and Storage in a Cloud World
Achieving Separation of Compute and Storage in a Cloud World
 
Emc data domain
Emc data domainEmc data domain
Emc data domain
 
Emc vi pr global data services
Emc vi pr global data servicesEmc vi pr global data services
Emc vi pr global data services
 
Unify Data at Memory Speed
Unify Data at Memory SpeedUnify Data at Memory Speed
Unify Data at Memory Speed
 
Introducing IBM Spectrum Scale 4.2 and Elastic Storage Server 3.5
Introducing IBM Spectrum Scale 4.2 and Elastic Storage Server 3.5Introducing IBM Spectrum Scale 4.2 and Elastic Storage Server 3.5
Introducing IBM Spectrum Scale 4.2 and Elastic Storage Server 3.5
 
Accelerate Analytics and ML in the Hybrid Cloud Era
Accelerate Analytics and ML in the Hybrid Cloud EraAccelerate Analytics and ML in the Hybrid Cloud Era
Accelerate Analytics and ML in the Hybrid Cloud Era
 
Le Software Defined Solutions, ou comment automatiser les ressources IT ?
Le Software Defined Solutions, ou comment automatiser les ressources IT ?Le Software Defined Solutions, ou comment automatiser les ressources IT ?
Le Software Defined Solutions, ou comment automatiser les ressources IT ?
 
Data EcoSystem 2.0
Data EcoSystem 2.0Data EcoSystem 2.0
Data EcoSystem 2.0
 
Accelerate Analytics and ML in the Hybrid Cloud Era
Accelerate Analytics and ML in the Hybrid Cloud EraAccelerate Analytics and ML in the Hybrid Cloud Era
Accelerate Analytics and ML in the Hybrid Cloud Era
 

Mehr von Bob Sokol

AppOrbit DevOps NYC
AppOrbit DevOps NYCAppOrbit DevOps NYC
AppOrbit DevOps NYCBob Sokol
 
RackN DevOps meetup NYC
RackN DevOps meetup NYCRackN DevOps meetup NYC
RackN DevOps meetup NYCBob Sokol
 
How (and why!) we built Packet
How (and why!) we built Packet  How (and why!) we built Packet
How (and why!) we built Packet Bob Sokol
 
Accelerating the Software Delivery Pipelinewith Mirantis OpenStack
Accelerating the Software Delivery Pipelinewith Mirantis OpenStackAccelerating the Software Delivery Pipelinewith Mirantis OpenStack
Accelerating the Software Delivery Pipelinewith Mirantis OpenStackBob Sokol
 
More than Technology - The Culture of DevOps
More than Technology - The Culture of DevOpsMore than Technology - The Culture of DevOps
More than Technology - The Culture of DevOpsBob Sokol
 
Cloud Native Applications - DevOps, EMC and Cloud Foundry
Cloud Native Applications - DevOps, EMC and Cloud FoundryCloud Native Applications - DevOps, EMC and Cloud Foundry
Cloud Native Applications - DevOps, EMC and Cloud FoundryBob Sokol
 
Enabling Enterprise DevOps at Scale
Enabling Enterprise DevOps at ScaleEnabling Enterprise DevOps at Scale
Enabling Enterprise DevOps at ScaleBob Sokol
 
IPVS for Docker Containers
IPVS for Docker ContainersIPVS for Docker Containers
IPVS for Docker ContainersBob Sokol
 
XebiaLabs Enterprise DevOps
XebiaLabs Enterprise DevOpsXebiaLabs Enterprise DevOps
XebiaLabs Enterprise DevOpsBob Sokol
 
EMC {code} Open Source
EMC {code} Open SourceEMC {code} Open Source
EMC {code} Open SourceBob Sokol
 
DevOps Toolkit
DevOps ToolkitDevOps Toolkit
DevOps ToolkitBob Sokol
 
Puppet Labs EMC DevOps Day NYC Aug-2015
Puppet Labs  EMC DevOps Day NYC Aug-2015Puppet Labs  EMC DevOps Day NYC Aug-2015
Puppet Labs EMC DevOps Day NYC Aug-2015Bob Sokol
 
EMC DevOps Day Aug-2015 - Stormy Peters, Cloud Foundry Foundation
EMC DevOps Day Aug-2015 - Stormy Peters, Cloud Foundry FoundationEMC DevOps Day Aug-2015 - Stormy Peters, Cloud Foundry Foundation
EMC DevOps Day Aug-2015 - Stormy Peters, Cloud Foundry FoundationBob Sokol
 

Mehr von Bob Sokol (13)

AppOrbit DevOps NYC
AppOrbit DevOps NYCAppOrbit DevOps NYC
AppOrbit DevOps NYC
 
RackN DevOps meetup NYC
RackN DevOps meetup NYCRackN DevOps meetup NYC
RackN DevOps meetup NYC
 
How (and why!) we built Packet
How (and why!) we built Packet  How (and why!) we built Packet
How (and why!) we built Packet
 
Accelerating the Software Delivery Pipelinewith Mirantis OpenStack
Accelerating the Software Delivery Pipelinewith Mirantis OpenStackAccelerating the Software Delivery Pipelinewith Mirantis OpenStack
Accelerating the Software Delivery Pipelinewith Mirantis OpenStack
 
More than Technology - The Culture of DevOps
More than Technology - The Culture of DevOpsMore than Technology - The Culture of DevOps
More than Technology - The Culture of DevOps
 
Cloud Native Applications - DevOps, EMC and Cloud Foundry
Cloud Native Applications - DevOps, EMC and Cloud FoundryCloud Native Applications - DevOps, EMC and Cloud Foundry
Cloud Native Applications - DevOps, EMC and Cloud Foundry
 
Enabling Enterprise DevOps at Scale
Enabling Enterprise DevOps at ScaleEnabling Enterprise DevOps at Scale
Enabling Enterprise DevOps at Scale
 
IPVS for Docker Containers
IPVS for Docker ContainersIPVS for Docker Containers
IPVS for Docker Containers
 
XebiaLabs Enterprise DevOps
XebiaLabs Enterprise DevOpsXebiaLabs Enterprise DevOps
XebiaLabs Enterprise DevOps
 
EMC {code} Open Source
EMC {code} Open SourceEMC {code} Open Source
EMC {code} Open Source
 
DevOps Toolkit
DevOps ToolkitDevOps Toolkit
DevOps Toolkit
 
Puppet Labs EMC DevOps Day NYC Aug-2015
Puppet Labs  EMC DevOps Day NYC Aug-2015Puppet Labs  EMC DevOps Day NYC Aug-2015
Puppet Labs EMC DevOps Day NYC Aug-2015
 
EMC DevOps Day Aug-2015 - Stormy Peters, Cloud Foundry Foundation
EMC DevOps Day Aug-2015 - Stormy Peters, Cloud Foundry FoundationEMC DevOps Day Aug-2015 - Stormy Peters, Cloud Foundry Foundation
EMC DevOps Day Aug-2015 - Stormy Peters, Cloud Foundry Foundation
 

Kürzlich hochgeladen

How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesThousandEyes
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality AssuranceInflectra
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 

Kürzlich hochgeladen (20)

How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 

ECS/Cloud Object Storage - DevOps Day

  • 1. 1© Copyright 2015 EMC Corporation. All rights reserved. OBJECT STORAGE TECHNICAL DISCUSSION
  • 2. 2© Copyright 2015 EMC Corporation. All rights reserved. DATA GROWTH IS BREAKING TRADITIONAL STORAGE SCALE-OUTFILE &OBJECT STORAGE CONTINUES TO GROW • Overly complex – Multiple data protection schemes, protocols, management tools • Can’t economically scale – Inefficient, high overhead, especially at geo-scale • Not cloud-ready – Not architecturally suited and no self-service Source: IDC EMC Digital Universe Study 2014
  • 3. 3© Copyright 2015 EMC Corporation. All rights reserved. BLOCK FILE FILE BLOCK FILE Today’s Storage Infrastructure BLOCK
  • 4. 4© Copyright 2015 EMC Corporation. All rights reserved. WHY OBJECT?
  • 5. 5© Copyright 2015 EMC Corporation. All rights reserved. Object Storage Characteristics Linear Scalability Scales to billions of objects No Locking No lock on write or create operations Geo-scale Geo-replicated and distributed Support for large files Object sizes are in TBs Web friendly Firewall friendly, http, REST accessibility Metadata and extensibility Objects can be extended to multiple policies (Immutability, retention, etc…)
  • 6. 6© Copyright 2015 EMC Corporation. All rights reserved. An Object platform offers …  A flat namespace of millions of buckets  Buckets that scale to billions of objects  Geo distribution, protection, access  User meta data as a first-class entity  Snapshot consistency semantics  Multi-tenant access & metering  Multiple data access methods including via REST/HTTP (S3, Hadoop, CAS, Atmos & Swift) OBJECT OBJECT OBJECT OBJECT OBJECT OBJECT
  • 7. 7© Copyright 2015 EMC Corporation. All rights reserved. File and Object Storage Comparison File Object Writing to a file requires exclusive lock Object supports multiple writes, no locking Limit on number of files in a directory. Objects are limitless in size, 1 MB to TBs, Objects scale across multiple files File meta-data is fixed by file system, no user meta-data Objects support extensible meta-data Large files hard to seek Objects can be viewed with no limitation File create operations require directory to be in exclusivity lock No locking required to create files CIFS/NFS access not Web or firewall-friendly – relies on file/folder access control and session- based authentication Easy, fine-grained authentication and access control (per object), HTTP, REST-based access
  • 8. 8© Copyright 2015 EMC Corporation. All rights reserved. Object Use Cases are Expanding • Existing Use Cases: – Scalable content store for cloud-based applications/services – Scalable content storage for vertical applications – Tape rationalization/elimination • Emerging Use Cases: – Storage for Big Data/Hadoop – NAS replacement/augmentation – Public IaaS alternative • Migrate to alternative providers or “in-sourcing” Applications, analytics and data growth drive Object Source: 451 Research
  • 9. 9© Copyright 2015 EMC Corporation. All rights reserved. MODERN APPS ARE BREAKING TRADITIONAL STORAGE NOT DESIGNED FOR CLOUDAND BIG DATA APPLICATIONS • Architecture is too complex – Locking, replication, High Availability, geo- distribution is complex • Not Web or firewall friendly – Distributed (WAN) access is complex • Storage silos impede development – Different hardware for every data type and access protocol
  • 10. 10© Copyright 2015 EMC Corporation. All rights reserved. USE CASES
  • 11. 11© Copyright 2015 EMC Corporation. All rights reserved. GLOBAL CONTENT REPOSITORY ON-PREMISE UNSTRUCTURED STORAGE PLATFORM PROBLEM • Can’t cost-effectively manage or scale storage to support explosive growth in unstructured content. • Traditional storage not suited for new Web, mobile and cloud applications. • Difficult and costly to manage data lifecycle and retention policies across archive silos and sites VALUE • Reduce complexity and cost–one globally accessible, geo-efficient archive that serves multiple applications and content types at lower cost than public cloud. • Anywhere data access – All data globally accessible by Web, mobile and cloud apps. • Enterprise-grade data protection – Efficient geo-protection and policy-based retention for basic compliance and governance. https://accesspoint.yourcompany.com U.K.L.A. Memphis Applications Tiering, Archiving, Backup
  • 12. 12© Copyright 2015 EMC Corporation. All rights reserved. MODERN APPLICATION PLATFORM EFFICIENT GEO-CAPABLE STORAGE & ANYWHERE ACCESS https://accesspoint.yourcompany.com U.K.L.A. Memphis PROBLEM • Traditional storage architecture not optimized for multi-site, mobile access to content • Writing to multiple file systems and proprietary APIs complicates development • Can’t access or process large data sets VALUE • Anywhere access - Provides anywhere access to geo-replicated content • Simpler, faster development - Supports multiple industry standard APIs/protocols and anywhere access with strong consistency • Unmatched access and efficiency - Geo- protection, active-active architecture optimizes both access and storage efficiency for Big Data – large and small files
  • 13. 13© Copyright 2015 EMC Corporation. All rights reserved. GEO-SCALE BIG DATA ANALYTICS EFFICIENT GEO-SCALE STORAGE & GLOBAL BIG DATA ANALYTICS https://accesspoint.yourcompany.com U.K.L.A. New York ANALYTICS PROBLEM • Large (and growing) data volumes lead to exponential storage costs • Traditional Hadoop replication leads to unmanageable DC footprint with data growth • Always have to move data to the analytics cluster VALUE • Cost Efficient Storage • HDFS Archive – Bring state of the art patented technology to provide highly dense storage for Hadoop • Global Analytics –Bring analytics to geo- distributed data and archives
  • 14. 14© Copyright 2015 EMC Corporation. All rights reserved. PROBLEM • Unstructured data growth - Reclaim costly Tier 1 storage • Current solutions aren’t scalable or cost efficient • Instant access to cold-stored data is required • “No Public Cloud” policy - Data needs to be on-premises VALUE Costs less than public cloud - Provides on-premises security U.K.L.A. Memphis LAN/ WAN Video Unstructured Data Sensory Data Images COLD ARCHIVE COSTEFFECTIVE LONG TERMRETENTION
  • 15. 15© Copyright 2015 EMC Corporation. All rights reserved. PROBLEM • Need cost effective solution to store hue amounts of unstructured data generated by IOT and sensors • “No Public Cloud” policy - Data needs to be on-premises • Data collection via modern cloud applications requires compatibility with APIS’s like S3 and OpenStack • Analytics workflow is slow, expensive and complicated using Hadoop direct attach or public cloud storage VALUE • Cost per GB is less than public clouds • Provide high availability with on-premises security • Compatible with S3, OpenStack, and other popular API’s • HDFS compatible and enables a streamlined Hadoop workflow for “data in place” analytics ‘IOT’ CLOUD STORAGE PLATFORM ‘INTERNET OF THINGS’ –SENSORY & TELEMETRY DATACOLLECTION
  • 16. 16© Copyright 2015 EMC Corporation. All rights reserved. EMC & OBJECT STORAGE
  • 17. 17© Copyright 2015 EMC Corporation. All rights reserved.  Hyper Scale - Sales Out to Billions of Objects  “Public Cloud-Like” – Secure Access, Anytime, Anywhere  Comprehensive Multi-Tenant Management  Active/Active Geo-Distributed Architecture  Multiple Protocol Support – REST and HDFS Ready  Compelling Economics –Appliance or SW Only (DIY) ECS HYPERSCALE CLOUD STORAGE
  • 18. 18© Copyright 2015 EMC Corporation. All rights reserved. CUSTOMERS CAN LEVERAGE COMMODITY PLATFORMS SOFTWARE-DEFINED STORAGE Software-Defined Storage Commodity Platforms
  • 19. 19© Copyright 2015 EMC Corporation. All rights reserved. COMMODITY HARDWARE VALUE PROPOSITION • Utilize standardized, open technologies and mass market components • Individual components provide lower performance, reliability, etc. • At sufficient scale, with the right software, the component pool provides superior characteristics
  • 20. 20© Copyright 2015 EMC Corporation. All rights reserved. ECS SOFTWARE Enterprise & SPs Object & HDFS DIY Commodity ECS APPLIANCE Enterprise & SPs Object & HDFS integrated appliance ViPR DATASERVICES Enterprise & SPs Object & HDFS File-based Arrays CHOICE AND FLEXIBILITY
  • 21. 21© Copyright 2015 EMC Corporation. All rights reserved. EMC OBJECT ECOSYSTEM Enterprise Information Archiving Enterprise Content Management Analytics Cloud Gateways Migration Sync & Share 21© Copyright 2015 EMC Corporation. All rights reserved. Analytics Protocols CAS
  • 22. 22© Copyright 2015 EMC Corporation. All rights reserved. Learn | Try | Develop | Collaborate Explore how-to videos, helpful guides, and training Download ViPR FREE with no time limit – for non-production use Access SDKs, FAQs, forums, technical documentation, sample apps, and more Ask the experts, talk to peers, share ideas and experiences www.emc.com/viprcommunity JOIN THE ViPR COMMUNITY…
  • 23.
  • 24. 24© Copyright 2015 EMC Corporation. All rights reserved. ECS TECHNICAL DETAIL • ECSSTORAGE ENGINE • WRITE PATH, READ PATH, BOX CARTING • ECSGEO-CAPABILITIES
  • 25. 25© Copyright 2015 EMC Corporation. All rights reserved. ECS ARCHITECTURE OVERVIEW Object Storage Engine HDFS NFS* • Multi-head access - ability to access same data concurrently through multiple access protocols. • Provides High Availability and Scalability. • Manages transactions and persistent data. • Protects data against failures, corruption and disasters ECS Appliance Commodity EMC and 3rd party file arrays
  • 26. 26© Copyright 2015 EMC Corporation. All rights reserved. COMPREHENSIVE DATA ACCESS COMPATIBILITY WITH COMMONINDUSTRY API’S • Simultaneous access to underlying data through multiple interfaces – Object, HDFS, File (future) • HDFS compatible with Cloudera, Hortonworks, Pivotal etc. • Support for S3, Swift, Atmos and Centera CAS APIs object • Extensions to APIs – Byte-Range updates, Atomic appends, Rich ACLs etc. ATMOS
  • 27. 27© Copyright 2015 EMC Corporation. All rights reserved. DESIGN PRINCIPLE: LAYERED ARCHITECTURE  Limitless Scale: Each layer is independently scalable, highly available, and has no single point of failure.  Scale-Out Architecture: Scale by adding more nodes, no special nodes or roles  Global Namespace: Any node has full system view off data and meta-data Persistence Layer Storage Engine JBODs OBJECT HDFS OBJECT HDFS OBJECT HDFS
  • 28. 28© Copyright 2015 EMC Corporation. All rights reserved. TRANSACTION FLOW (WRITE) Node 1 1. Create Object request (Name, data, metadata) Node 1 Node 4 Node 5 2. Write of data and metadata in chunk All three copies written in parallel. Write successful only if all copies ack Node 2 5. Back to client 3. Index update (name, location) to the owner partition 4. Journal write
  • 29. 29© Copyright 2015 EMC Corporation. All rights reserved. ERASURE CODING  Data is written into chunks - 3 copies  Erasure coding begins as the chunks are shipped  Once EC completes, the data becomes fully protected and the 3 copies are deleted A A AA
  • 30. 30© Copyright 2015 EMC Corporation. All rights reserved. GARBAGE COLLECTION  In Append-only systems updates/deletes cause files to have blocks of data that are unused  This is done at the level of chunk  Unused chunks reclaimed by a background task
  • 31. 31© Copyright 2015 EMC Corporation. All rights reserved. Node 1 1. Read Object request 3. Read data Node 2 4. Send data back 2. Get Location TRANSACTION FLOW (READ)
  • 32. 32© Copyright 2015 EMC Corporation. All rights reserved. BOX CARTING: CHUNK WRITE Node Buffered Writer Acks (PARALELL SYNC WRITE)
  • 33. 33© Copyright 2015 EMC Corporation. All rights reserved. DATA WRITTEN IN APPEND ONLY CHUNKS • Data is written in an append- only pattern. • No data is overwritten or modified. • No locking required for I/O. • No cache invalidation required. • Journaling, snapshot and versioning natively built-in • ECS stores all types of data and index in “chunks” • Chunks are – Logical containers of contiguous space (128MB) – Written in an append- only pattern • All data protection operations are done on chunks
  • 34. 34© Copyright 2015 EMC Corporation. All rights reserved. ECS STORAGE ENGINE: KEY BENEFITS • All nodes can process write requests for the same object simultaneously, and write to different sets of disks. • Throughput takes advantage of all spindles and NICs in cluster. • Payload from multiple small objects are aggregated in memory and written in a single disk write • Efficient storage for both small and large data
  • 35. 35© Copyright 2015 EMC Corporation. All rights reserved.  Unstructured configurations – Object & HDFS  Available in multiple capacities within a rack  Clustering across racks scales to 100s of PBs HARDWARE CONFIGURATIONS
  • 36. 36© Copyright 2015 EMC Corporation. All rights reserved. ANALYTICS
  • 37. 37© Copyright 2015 EMC Corporation. All rights reserved. SAMPLE HADOOP WORKFLOW HDFS Analytical Models (Hive, HAWQ) Data Visualizations (Tableau) Variety of Data Sources Data Cleansing Ingest Data Scientists Store Analyze Surface
  • 38. 38© Copyright 2015 EMC Corporation. All rights reserved. HADOOP PROCESSING MODEL SHARED STORAGE MODEL VNXVMAX Isilon CommodityVNX Isilon 3rd Party Commodity • Enables common Data Lake for LOB application storage and analytics • Scale compute & storage independently • Multiple distributions, clusters connect to the same data ECS
  • 39. 39© Copyright 2015 EMC Corporation. All rights reserved. CHALLENGES WITH HDFS • HDFS not Enterprise-Grade – Requires three full copies of data, no erasure coding – No Geo-distribution, Limited DR, Multi-tenancy – Inefficient for handling small files • High Availability Still In Progress – No Active-Active Failover even with the secondary NN • DAS architecture not suitable for some customers – Lack of Enterprise Data Governance Features
  • 40. 40© Copyright 2015 EMC Corporation. All rights reserved. HDFS DATA SERVICE OVERVIEW • Addresses limitations of off-the- shelf HDFS • Brings HDFS to existing storage hardware • Enables HDFS/Object/File scenarios • Flexible software model allows for future co-location of compute and storage
  • 41. 41© Copyright 2015 EMC Corporation. All rights reserved. HDFS DATA SERVICE OVERVIEW • API head – Custom client/server protocol optimized for high scale – Uses the same unstructured storage engine as ECS/ViPR Object data service • Client library over the HDFS API – Provides a viprfs:// drop-in replacement for HDFS 2.0 – Can be seamlessly added to existing Hadoop distributions • Implemented as a Hadoop Compatible Filesystem (HCFS) – Supports HDFS 2.0 and 2.2
  • 42. 42© Copyright 2015 EMC Corporation. All rights reserved. HDFS ARCHITECTURE RM / AsM Commodity Compute & Storage Node Manager Data Store MapReduce Task Client Node Manager Data Store MapReduce Task Node Manager Data Store MapReduce Task NAME NODE SEC NAME NODE
  • 43. 43© Copyright 2015 EMC Corporation. All rights reserved. ViPR/ECS HDFS ARCHITECTURE Client Node Manager MapReduce Task ViPR/ECS Client Node Manager MapReduce Task ViPR/ECS Client Node Manager MapReduce Task ViPR/ECS Client NAME NODE RM/AsM SEC NAME NODE
  • 44. 44© Copyright 2015 EMC Corporation. All rights reserved. Customer’s Hadoop Compute Cluster HDFS – ECS APPLIANCE DEPLOYMENT ViPR Controller VMViPR Controller VMViPR Controller VM Data Read/ Write Object/HDFS … Object/HDFS
  • 45. 45© Copyright 2015 EMC Corporation. All rights reserved. HDFS DATA SERVICE/ECS ARCHITECTURE ECS Storage Engine HDFS API Head S3 API Head Customer’s Hadoop Compute Cluster ViPR Data Service Node Data Read/Write via ViPR HDFS
  • 46. 46© Copyright 2015 EMC Corporation. All rights reserved. HDFS DATA SERVICE VALUE PROPOSTION • High Availability Built-In, No SPOF • Avoids multiple copies of data • Erasure Coding Support • Geo-Distributed Across Sites • Multi-tenancy, Metering, Chargeback • Allows Byte-Range Updates Through S3 Interface • ViPR Controller aids in Management & Monitoring
  • 47. 47© Copyright 2015 EMC Corporation. All rights reserved. UPCOMING ECS INTEGRATIONS
  • 48. 48© Copyright 2015 EMC Corporation. All rights reserved. ISILON CLOUDPOOLS SMART TIERING TO OBJECT STORES Key Features Benefits  Stub to Cloud of choice  Extending SmartPools workflow to CloudPools  Ability to send encrypted data to the cloud  Compression for efficient transport  Simple policy based management  Combine file & object store benefits  Use stubs to optimize local storage space, with offsite archive protection  Seamless placement and availability of data per policy  One Accessible namespace SmartPools -> CloudPools Clients SMB | NFS | REST| HDFS | SWIFT OneFS Service Provider Public Cloud ECS
  • 49. 49© Copyright 2015 EMC Corporation. All rights reserved. EMC CLOUDBOOST LONG TERM RETENTION TO THE CLOUD LAN LAN/WAN CloudBoost appliance Desktops Laptops Files NAS/NDMP VMware & Hyper-V Databases Email Applications DB ROBO Protected by NetWorker, Avamar, NetBackup Key Features Benefits  Long-term retention to ECS for NetWorker, Avamar, NetBackup  Inline variable de-duplication and compression  Data encrypted in-flight and at rest  Cloud choice: private/public clouds  Appliance cache for ROBO  Capacity of up to 6PB logical per appliance  Central management via a cloud portal  Lower storage cost per TB  Efficiency: Reduced network and storage consumption  Lower risk, operational overhead than tape.  Airtight security
  • 50. 50© Copyright 2015 EMC Corporation. All rights reserved. WRITE PATH, READ PATH,BOX CARTING
  • 51. 51© Copyright 2015 EMC Corporation. All rights reserved. TRANSACTION FLOW (WRITE) Node 1 1. Create Object request (Name, data, metadata) Node 1 Node 4 Node 5 2. Write of data and metadata in chunk. All three copies written in parallel. Write successful only if all copies ack. Node 2 5. Back to client 3. Index update (name, location) to the owner partition. 4. Journal write
  • 52. 52© Copyright 2015 EMC Corporation. All rights reserved. ERASURE CODING  Data is written into chunks 3 copies  Once a chunk fills to 128 MB, erasure coding starts  Once it is completed and data is protected the 3 copies are deleted. A A AA
  • 53. 53© Copyright 2015 EMC Corporation. All rights reserved. GARBAGE COLLECTION  In Append-only systems updates/deletes cause files to have blocks of data that are unused.  This is done at the level of chunk  Unused chunks reclaimed by a background task
  • 54. 54© Copyright 2015 EMC Corporation. All rights reserved. Node 1 1. Read Object request 3. Read data Node 2 4. Send data back 2. Get Location TRANSACTION FLOW (READ)
  • 55. 55© Copyright 2015 EMC Corporation. All rights reserved. BOX CARTING: CHUNK WRITE Node Buffered Writer Acks (PARALELL SYNC WRITE)
  • 56. 56© Copyright 2015 EMC Corporation. All rights reserved. • ECSGEO-STORAGE OVERVIEW • DATAPROTECTION • GLOBAL DATAACCESS ECS GEO-CAPABILITES
  • 57. 57© Copyright 2015 EMC Corporation. All rights reserved. ECS GEO-STORAGE OVERVIEW • Data Protection – Protection against data center failure – Seamless failover and recovery • Global Data Access – Global namespace – Ability to read/write data from any site • Optimized Storage – Low storage overhead – WAN Optimization • Applicable to all unstructured Storage Engine based heads – Object, HDFS – File when available
  • 58. 58© Copyright 2015 EMC Corporation. All rights reserved. DATA PROTECTION
  • 59. 59© Copyright 2015 EMC Corporation. All rights reserved. INDUSTRY SOLUTION: MIRROR COPY • Mirrored copy in a backup site • Benefit: Achieves Local reconstruction on hardware failure • Shortcoming: Storage overhead -> 2.66xPrimary Secondary
  • 60. 60© Copyright 2015 EMC Corporation. All rights reserved. INDUSTRY SOLUTION: DISTRIBUTED ERASURE CODING • Distributing fragments across sites • Benefit: Achieves low Storage Overhead ~ 1.6x • Shortcoming: Disk/Node failure requires fragments to be fetched over the WAN. Site 1 Site 2 Site 3 Site 4
  • 61. 61© Copyright 2015 EMC Corporation. All rights reserved. ECS MODEL: BEST OF BOTH WORLDS • Achieves low Storage Overhead ~ 1.8x • Local hardware failure recovery requires no WAN traffic. • Handles local hardware and full data center failures – Disk, Node, Rack, Data Center are failure domains Site 1 Site 2 Site 3 Site 4
  • 62. 62© Copyright 2015 EMC Corporation. All rights reserved. GLOBAL DATA ACCESS
  • 63. 63© Copyright 2015 EMC Corporation. All rights reserved. INDUSTRY SOLUTION: SEGREGATED NAMESPACE • Customers are asked to pick a location for each bucket. • Shortcoming: sites are vertical silos, unaware of each other’s namespaces. Site 1 Site 2 app app Bucket BBucket A
  • 64. 64© Copyright 2015 EMC Corporation. All rights reserved. INDUSTRY SOLUTION: MULTI-ACCESS WITH EVENTUAL CONSISTENCY • Global Namespace with read only replicas. • Replicas have eventual consistency • Shortcomings: Difficult to write applications against eventual consistency models Site 1 Site 2 app Bucket A app read only
  • 65. 65© Copyright 2015 EMC Corporation. All rights reserved. ECS: MULTI-ACCESS WITH STRONG CONSISTENCY • Global Namespace: buckets stretches across sites • Global Access: Any data can be read and written to any site • Strongly consistent: Always returning latest version without requiring synchronous write. Site 1 Site 2 Bucket A app app app
  • 66. 66© Copyright 2015 EMC Corporation. All rights reserved. OPTIMIZED STORAGE • Low Storage Overhead: ~1.8x replication over head across 4 sites • WAN Optimization: All node and disk failures are repaired within the site, without any WAN traffic.
  • 67. 67© Copyright 2015 EMC Corporation. All rights reserved. STORAGE OVERHEAD # of Data Centers Overhead 1 1.33 x 2 2.67 x 3 2.00 x 4 1.77 x 5 1.67 x 6 1.60 x 7 1.55 x 8 1.52 x
  • 68. 68© Copyright 2015 EMC Corporation. All rights reserved. ECS GEO KEY DIFFERENTIATORS • Tolerates one site disaster along with up to 2 node failures in all the rest of the sites. • Component failures are recovered using fragments from local site without WAN traffic • Geo-efficient (~1.8 copies across 4 sites) without WAN read/write penalties
  • 69. 69© Copyright 2015 EMC Corporation. All rights reserved. ECS APPLIANCE
  • 70. 70© Copyright 2015 EMC Corporation. All rights reserved.  Not Contradictory!  Components are Commodity  x86 Servers  Ethernet Networking  SATA Disk Drives  Innovation in how they’re put together to enable reliability, availability, and serviceability! COMMODITY INNOVATION
  • 71. 71© Copyright 2015 EMC Corporation. All rights reserved. ECS APPLIANCE CHARACTERISTICS • Use COTS Components – Economies of scale • Density Optimized – Up to 72TB Raw / Rack Unit – Saves Power/GB, Real Estate costs, etc. • Labor Optimized – Manage the cluster, not the devices – Maximize Serviceability • Protection Efficiency – Geo-efficient storage

Hinweis der Redaktion

  1. As a result of the way we’ve been buying and manage storage, data growth is simply breaking it. Organizations are creating more unstructured data like videos, audio files, images, etc. and file system data each year than any other category of data. Traditional file-based storage have been built for traditional applications and they’re critical for those applications, but they’re more complex than necessary for new Web and mobile apps. They feature multiple data protection schemes depending on the arrays features and each arrays has its own APIs and management tools. The complexity also makes it very difficult to scale. One of the biggest challenges of a growing storage environment is that primary storage reaches capacity, and organizations are continually purchasing more storage. This growth causes other problems as well, such as unplanned server outages due to running out of space, excessive administrative costs, backup failures, and more. But the bottom line is that they can’t economically scale – especially when storage spans multiple locations. Data protection, admin costs, etc add storage overhead. Which is important because it’s not just about total capacity – it’s about usable capacity – and how efficient you are drives down your $/GB. And fairly or not, enterprises are being compared to public cloud economics which feature very low $/GB. And speaking of cloud, traditional storage was never architected for new Web, mobile and cloud applications. They were built for access over a LAN for specific applications. Provisioning and access is driven by IT, it’s difficult if not impossible to provide self-service access to traditional storage in an IT-as-a-Service model.
  2. Note to Presenter: There are multiple clicks in this animation. View in Slide Show mode for animation. <<CLICK>> If you were to walk into any data center in the world today you would most likely find a storage infrastructure that consists of multiple platforms from many different vendors set up as individual storage silos based on application, workload and/or data type. With most companies experiencing massive data growth having these storage silos lead to underutilized storage and escalating costs to support new data needs. <<CLICK>>
  3. Writing to a file requires exclusive lock. An object store supports multiple writes with no locking. Object storage is built to allow for eventual consistency. Users can simultaneously write to an object. The object store will eventually reflect all the writes (eventual consistency). A file system, has to lock to prevent data corruptions. Limit on number of files in a directory. - A traditional file system supports about 16TB before a new file system must be added to support more data. Objects are limitless in size and have no file system limitations. This is the value of object – linear scale. Plus, adding capacity does not require an app to be re-coded. In the traditional NAS, adding an additional file system will require re-coding the application to understand multiple file-systems. This also adds to the complexity of scaling the environment. File meta-data is fixed by file system, no user meta-data – Objects simply store an object and its metadata in BLOB storage. To attach metadata to files, you’d need to build a separate metadata database and maintain it separately. Object can manage data according to meta-data driven policy, use meta-data search. In a file system, large files are hard to find. Imagine reading the whole news paper to reach a specific article. Object can read a specific byte range much faster for large files, since it seeks a much smaller file. This makes retrieving information from a large content store – like an archive – faster. File create operations require directory to be exclusivity lock. Object does not require any locking when creating files. There is no directory structure that needs to be locked. File systems must do this to prevent data or file system corruptions. This File systems are not Web or firewall friendly. Security is complex due to… Folder/file access control, inheritance Reliance on complex, session-based authentication Objects interfaces are web friendly and easy to navigate firewalls. Each operation is individually authenticated (each request carries its own credentials)
  4. Hyperscale and Big Data applications are the new normal. For years we’ve discussed the rise of “consumerization” – the trend of mobility, and BYOD and the corresponding rise in unstructured content. Now, we have millions, if not billions of connected devices that are creating and consuming data– everything from POS handhelds, industrial equipment, gaming systems to thermostats. And all these devices and applications function 24x7x365 and know no geographic boundaries. This has implications for how organizations ingest, store and provide access to all this data. Cloud and Big Data applications have a fundamentally different architecture. These modern applications… Are massively multi-tenant Are assembled from well-defined components or “black boxes” Use standard communication protocols to facilitate universal access and interoperability Are built using frameworks like JavaEE, Spring, Struts, .NET, WebObjects, Zend, Ruby on Rails, Grails, Django, Catalyst … Make use of client-side languages like JavaScript/DHTML, Flex, Silverlight … Mix both structured and unstructured content Must store both object- and DB-type content Are extremely scalable in their design and implementation Process massive data sets – often, the information is the crucial asset, not the application functionality Bottom line, we need a data management architecture suited for these new applications. We need simpler geo capabilities, we need Web-friendly access and a storage platform that can support many different data types and access protocols independent of the hardware.
  5. Situation Unstructured content repositories containing images, videos etc. are currently stored in high cost storage systems making it impossible for businesses to cost-effectively manage massive data growth. Desire for on-premise clouds to manage and store cold/archive data with ease. Newer applications e.g. Uber, Instagram are being written to take advantage of massive data availability, anytime, anywhere through open APIs. Enterprise developers are creating shadow IT by deploying applications in public clouds. Other 3rd party solution not enterprise production ready. Solution ECS Object Appliance
  6. Our next use case is really just a sub-category of the complete cloud storage platform. In addition to analytics, enterprises and service providers can use their complete cloud storage platform to support Web, mobile and Cloud applications. Problem: Traditional storage was never architected for new Web, mobile and cloud applications. They were built for access over a LAN for specific applications. Provisioning and access is driven by IT, it’s difficult if not impossible to provide self-service access to traditional storage in an IT-as-a-Service model. Writing to multiple file systems and proprietary APIs increases development time and cost Data locked into on-prem file systems is not accessible by Web-based and mobile applications Developers find ti easier to go to public cloud alternatives Solution: ECS HDFS Appliance, with support for industry standard APIs Value: ECS supports multiple access methods and a very simple geo-capability. Developers only have to worry about the apps, not the ops. ECS is made to support next-gen Web, mobile and Cloud applications. Multi-site read/writes with strong consistency make a developers job much easier. As the ECS capacity changes and grows, developers never need to recode their apps. Again, the target audience are C-level and IT leadership that are looking to deploy new Web, mobile and cloud applications, They may have some apps already deployed in a public cloud. ECS Software and/or ECS appliance lets them deploy on their own infrastructure. The VP of Apps/App architecture will also be interested – especially if they are not able to use public cloud – they can be influencers in an account since ECS appliance will make their development efforts less risky and speed time to production.
  7. Our next use case performance trending and host reporting is specifically focused on host performance troubleshooting. Problem: Inability to unlock business insights from complex datasets. Large (and growing) data volumes prevent timely analysis and insights. Struggle with storing and accessing PBs of data, billions of small files and/or large media files being generated. Data volume & velocity make it costly to store persistently on traditional storage platforms. Need for on-premise, enterprise ready data analytics to meet business requirements. Unmanageable Data center footprint increase due to 3X replication of standard HDFS Solution: ECS HDFS Appliance, ECS Software HDFS Service Value: Time to Market - Improve time to market for new products & applications leveraging Objects and HDFS delivered as a service. “In-place” analytics capabilities reduce risk, resources and time-to-results. Storage Efficiencies - Efficiently store PBs of data, billions of small files and/or large media files in a low cost, state-of-art, commodity-based storage system Future proof Architecture - Addresses challenges with traditional HDFS enabling enterprise features like erasure coding and geo replication with reduced storage overhead. Industry accepted standard API support for all interfaces. Reduce Risk/Deliver Value on existing Infrastructure – Enables analytics on your existing storage infrastructure without moving your data. For analytics, again, this can be a c-level discussion or even in the business units that are trying to better understand their data and extract business insight. Do they have projects for information-based applications? Data scientists are also targets - they are responsible for business intelligence and analytics and are trying to tap new data sources.
  8. Situation Organizations are seeing a massive growth specifically in unstructured data and need to move inactive content off of Tier 1 storage to drive down cost and fully optimize existing storage resources Public cloud archive storage services have unpredictable cost structures and often take an extremely long time to retrieve data. SMB’s and Enterprises cannot afford to wait days/weeks to gain access to archived data. Tape solutions have a low hardware cost but the dollar spend on servicing and storage tape and managing the library can become more expensive. Cloud solutions provide a much easier experience with a much lower price point at scale. Solution EMC Elastic Cloud Storage
  9. The Internet of Things offers a new revenue opportunity for businesses who can extract value from customer data. ECS offers an efficient platform for data collection at massive scale. ECS also streamlines analytics because the data can be analyzed directly on the ECS platform without requiring time consuming ETL (Extract, transform, load) processes. Just point the Hadoop cluster at ECS to start running queries. No expensive DAS is required on your Hadoop cluster.
  10. EMC ECS (Elastic Cloud Storage) is a hyper scale, object-based, cloud storage infrastructure which leverages commodity components and software-defined intelligence to deliver a turnkey solution. ECS delivers the benefits of the “public cloud” in a solution that can be purchased as a turnkey system, or as software which can be deployed over an EMC approved 3rd party storage.
  11. The answer is software-defined storage. It’s not a marketing term as some have dismissed it as. Software defined means storage and data services are delivered as software that can run on any storage device and support many different data types and access protocols. Most importantly, it lets organizations leverage commodity platforms which means they can finally bring hyperscale capabilities and economics to their data center.
  12. The idea of hyperscale is to use standardized, off-the-shelf components that, individually, don’t provide performance and reliability. But at scale, the pool of components, together with intelligent software, provide superior performance and reliability characteristics.
  13. ViPR Data Services:  Used to describe HDFS and Object Services running on traditional (EMC/non-EMC) File arrays ECS Software:  Used to describe HDFS and Object Services running on commodity and ECS Appliance
  14. EMC Elastic Cloud has a fast growing ecosystem of partners who provide various solutions to augment its uses.
  15. The EMC ViPR community provides a single place to learn more about ViPR, ECS Software and ECS Appliance, download a trial version of ViPR at no charge – for non-production use, access developer resources, and get support from EMC Experts, other community members, and more.
  16. As mentioned previously, ECS is a storage engine, architected much like any array. An array exposes an I/O protocol on top and writes to disk underneath. ECS is different in that it is software-defined. Because it’s software-defined it supports multiple access methods – Object, HDFS and in the future, File. The software separates the software layer of storage from the hardware. The storage engine provides HA and scalability. It has the unique ability to write data to both protected and unprotected disks.
  17. ECS Software and ECS Appliance enables simultaneous access to data with multiple interfaces. ECS supports Object and HDFS and will add a file interface in the future. For Object, ECS supports industry-standard Object APIs Amazon S3, OpenStack Swift, and EMC Atmos. Customers can also interact with the same data via HDFS, compatible with Hadoop 2.x distributions such Cloudera, Hortonworks, Pivotal, etc. ECS Software’s Object service is very similar to the Amazon S3 model but ECS adds extensions such as byte-range updates, atomic append, rich ACLs, etc.
  18. ECS is a distributed scale-out software platform, delivered as a virtual appliance that runs on VMware ESX servers or bare metal. ECS leverages the cloud technologies such as the Apache Cassandra database, an open source distributed database management system, to handle large amounts of data with no single point of failure. With this foundation, ECS is capable of executing thousands of provisioning operations per hour. Some of the services inclusive to this high-performance architecture include: The load balancer service: assures any requests to ECS are distributed evenly across virtual appliances; workloads are distributed in a round robin fashion. The workflow automator and controller services: centralize the provisioning, metering, monitoring, reporting, cataloging, and provide a self-service portal. And finally, the coordinator service: coordinates tasks between distributed processes. These features allow ViPR Data Services/ECS users to manage workflows, workloads, and tasks across a variety of arrays, and from one single control point.
  19. ECS efficiently stores objects in append-only containers. This also ensures efficient utilization on commodity. As stated previously, this can be overkill for an existing object or NAS filer that has its own persistent file system, but is important for efficient storage on JBOD. This transaction flow illustrates a protected write. The storage engine receives a Create Object request ECS writes the data and metadata in chunk. It writes 3 copies in parallel Once the 3 copies acknowledge, ECS writes to the index with name and location Journal write Acknowledgement to the client The write is successful only after the 3 copies have acknowledged and the index is successfully updated.
  20. In a protected write, data is written into chunks with 3 copies. Erasure coding starts as the data is written and shipped instantaneously. The container becomes immutable and the 3 copies are then deleted. All data protection operations are now done on the chunk.
  21. ECS is an append-only system which always results in unused blocks of data. When a container is empty, the unused chunk is reclaimed by a background task, sometimes referred to as garbage collection process.
  22. A Read request is simple. The system received a Read object request ECS gets the location from the index Reads the data and send to the client
  23. ECS can also execute a large # of user transactions concurrently with very little latency. ECS supports box-carting to handle workloads with high transaction rates. When an application is writing a lot of small files with high I/O, ECS can take multiple requests together in memory and write them as one. This improves performance by reducing the round trips to the underlying storage
  24. Some of the key benefits include All nodes can process write requests for the same object simultaneously, and write to different sets of disks. Throughput takes advantage of all spindles and NICs in cluster. ECS supports box carting – the payload from multiple small objects are aggregated in memory and written in a single disk write – this improves performance by reducing roundtrips to/from storage ECS can very efficiently ingest and store both small and large data
  25. The unstructured engine deployments are designed to scale across racks as part of a single logical server cluster. The system aggregates all the unstructured nodes from all of the available ECS Appliance resources and groups them into one single, logical VARRAY, and aggregates all the structured nodes from the available resources and aggregates them into a second, logical VARRAY. The unstructured logical VARRAYs are managed by a single ECS instance within a zone, regardless of the number of ECS Appliance units and associated resources provided by those units within a zone.
  26. Before discussing the HDFS capabilities, it’s important to understand a typical Hadoop workflow. There are a variety of data sources that create data – sensor data, industrial automation, etc. This data has to be cleansed and ingested. The storage engine for Hadoop is HDFS (Hadoop Distributed File System). An ETL (extract, transform, load) process via Talend or Sqoop gets the data into HDFS from varied data sources. Data scientists can then run Hive and R queries now on top of this data – results viewed via tableau.
  27. Hadoop was architected for co-location of compute and storage for scenarios where data locality is paramount. The advantage of the shared storage model is the compute and data are independent. This enables a common data lake for all LOB applications. Analytics and compute and storage can be scaled independently. Multiple analytics distributions can connect to and use the same data.
  28. NameNode is a single point of failure Only one NameNode in the cluster (secondary NameNode is not active-active) Keeps metadata in RAM – hence data size bound by RAM
  29. ViPR HDFS gives organizations the ability to run analytics using well known industry Hadoop distributions on existing data stored across heterogeneous systems such as VNX, Isilon and Netapp arrays. Hadoop has become standard for companies that are investigating novel strategies for addressing their Big Data challenges. HDFS is the core distributed file system used by Hadoop. Many organizations have an HDFS project in their labs. However, many of these companies have found Hadoop to be difficult to deploy and manage at scale. The ViPR approach to HDFS takes advantage of proven storage hardware to overcome this challenge. Instead of building a discrete analytics silo with dedicated infrastructure, the ViPR HDFS data service leverages the existing ViPR virtualized storage environment and the backend storage platforms it utilizes. Or it can simply make tall the object data within an ECS environment available to analytics tools.
  30. ViPR HDFS uses the same ECS unstructured storage engine. The API head is a custom client/server protocol optimized for scale. A customer deploys a client on the data nodes of their existing cluster that provides a viprfs:// drop-in replacement for HDFS 2.0. it’s a small jar file. This enables the data node to direct queries to ViPR HDFS which is implemented as a Hadoop Compatible File system (HCFS)
  31. The same process holds true for running using ECS Appliance as the shared storage. The entire ECS Appliance implementation can now be made available as a Big Data repository.
  32. ViPR Data Services/ECS supports access to data via HDFS as well as supporting an S3 API head. The S3 API head allows Byte-Range updates. Data can replicated across nodes using the ECS geo-capabilities. Organizations can interact with the data as an Object as well as HDFS without having to move or copy the data. ECS will add File access in the near future.
  33. The value for customers is that they can more quickly move to a production Hadoop environment at a lower cost and with lower risk. ViPR HDFS makes Hadoop enterprise grade by making their existing storage (mixed, heterogeneous environment or ECS Appliance or commodity) available as a Big Data repository or “data lake” In addition, ViPR HDFS/ECS Appliance mitigates the limitations of off-the-shelf HDFS: No single point of failure namenode. Doesn’t require multiple copies of the data – HDFS requires 3 copies for HA ECS can also distribute the data very efficiently across geos and use erasure coding for efficiency and protection It features multi-tenancy, metering and chargeback for a Hadoop-as-a-service capability The S3 API support enable byte range updates ViPR Controller automates storage and aids in management and monitoring All these things taken together make HDFS enterprise-grade.
  34. ECS 2.0 features a host of improvements and new features. To start ECS 2.0 is now controller-less and does not require the installation of a ViPR Controller instance. In addition ECS 2.0 can now be installed via bare metal installation or with VMware virtual machine. New packaging scripts have been included for a simpler deployment experience.
  35. CloudPools is targeted for GA in the Riptide release. The CloudPools beta is available today for selected customers. Contact product management if a customer is interested. The current CloudPools beta is a special build based on the JAWS code but newest targets such as AWS S3 and ECS will be available for testing as part of Riptide beta. So what the next six year boom if the last one was enterprise scale out NAS? Focusing on cloud as one of those areas where we want to lead and differentiate, let’s walk through the impact of ‘cloud’ on our platform… 1st – new applications require new semantics, first RAN then something else – rest transport, platform value prop, looking at swift and more S3 like semantics AND SOON OpenStack Swift access for all apps developed to this api/protocol 2nd – think of a smart pool, but out of the four walls of the box, also different in that it caches not just moves…. (weave in ever increasing performance here and tier 1.5 ambitions…) First, tier to an on prem isilon cluster – why? Little fast cluster, tiering to cold giant core. Then – what about delivering as a service – think rack space – X on prem, Y off prem Then what about deploying this same thing to CSPs – and leveraging CSPs – cooperating and competing; but offering services on top of Amazon (hat tip to Panzura here) Think more into the future, this isn’t just tiering…. The cloud version of the assets isn’t just data objects – when viewed through the Isilon software we are creating it’s a distributed filesystem.
  36. ECS efficiently stores objects in append-only containers. This also ensures efficient utilization on commodity. As stated previously, this can be overkill for an existing object or NAS filer that has its own persistent file system, but is important for efficient storage on JBOD. This transaction flow illustrates a protected write. The storage engine receives a Create Object request ECS writes the data and metadata in chunk. It writes 3 copies in parallel Once the 3 copies acknowledge, ViPR writes to the index with name and location Journal write Acknowledgement to the client The write is successful only after the 3 copies have acknowledged and the index is successfully updated.
  37. In a protected write, data is written into chunks with 3 copies. Once a container fills to 128MB, erasure coding starts and the container becomes immutable and the 3 copies are deleted. All data protection operations are now done on the chunk.
  38. ECS is an append-only system which always results in unused blocks of data. When a container is empty, the unused chunk is reclaimed by a background task, sometimes referred to as garbage collection process.
  39. A Read request is simple. The system received a Read object request ECS gets the location from the index Reads the data and send to the client
  40. ECS can also execute a large # of user transactions concurrently with very little latency. ECS supports box-carting to handle workloads with high transaction rates. When an application is writing a lot of small files with high I/O, ECS can take multiple requests together in memory and write them as one. This improves performance by reducing the round trips to the underlying storage
  41. There are 3 main things I will discuss today in presenting how ECS enables a true geo-distributed architecture First Data Protection You need to ensure data is protected from site failure. We all need to plan for the unexpected, total site failure or data loss in a site due to a natural disaster How can you make sure the failover is seamless and ensure it meets your business continuity needs. No body expects the business to stop cause you had failure in a data center. Everything needs to be up and running regardless of site disaster. Second Global Data Access Everyone wants data to be accessible across the globe. We don’t have the model you are sitting on a desk and doing everything from your PC or laptop. You are constantly traveling or at least expect your customers to be across the globe and so is your data. You want to access data globally, with consistent views Third Optimized Storage Solution Third as you plan for all that you want to optimize cost. Cost of your storage as well as cost of your network traffic Both are important to you as you scale cause you don’t want to triple or quadruple your costs So lets look at each topic how the industry tried to solve it in the past few years and how ECS solves them using its unique architecture.
  42. The mirrored approach is typically designed to deal with both local and remote failures. Data in this model typically uses a local protection scheme (such as RAID or Erasure Coding) in conjunction with a remote protection scheme which also incurs local protection overhead. Typical protection ratios for both replicas can be 1.33x or more, so typical total protection overhead for both local and remote protection is at least 2.66x overhead.
  43. Customers can also replicate via Distributed erasure coding where they distribute fragments and parity across multiple sites. The benefit is that it is very efficient. The drawback is that a disk or node failure requires reconstruction over the WAN. This increases WAN traffic and affects access and performance.
  44. ECS features a hybrid encoding approach. It is higher performance than geo erasure-code with much lower replication overhead than mirroring All node and disk failures are repaired within the zone, without any WAN traffic.
  45. It’s common in the industry for customers to segregate data into sites. Each site has its own namespace and the system replicates sites asynchronously. Failover handled by adjusting DNS etc. The drawback is that sites are vertical silos, completely unaware of each others namespaces. You need to re-direct traffic in the case of a failure.
  46. To address the issues of segregated namespaces, many object platform s feature a global namespace with read only replicas. The replicas are eventually consistent. The issue is that one site is primary and if that site fails, you only have read access - writes will queue up. You can read from any site but only one site can be written to. The responsibility of avoiding the “stale read” problem associated with eventual consistency is left to the application developer Apps find it difficult to deal with eventual consistency models.
  47. ECS offers the best of both worlds with a global namespace that spans across sites. It’s an active-active architecture that supports writing to and reading from any location PLUS it’s strongly consistent - always returning the most recent version of a file. This makes the developer’s job a lot easier and supports anywhere access to data.
  48. Taken together, ECS presents a optimized storage – it combines lo storage overhead - < 1.8x in a four site implementation, with fast access to content with minimal WAN traffic. Customers no longer have to choose between performance and efficiency.
  49. Storage overhead calculation: N sites Original Chunk 1.33x (Three code fragments for nine data fragments). Chunk Backup 1.33x / (N -1) Total Overhead Original Chunk + Chunk Backup 1.33x * N / (N – 1)
  50. The system will function normally with a 4 or more nodes. With 3 nodes, the system runs in a degraded state where data will be available for read, but data will not be adequately protected. The system will not function with less than 3 nodes. The system tolerates 3 or more component failures so long as the number of available nodes is above these constraints.
  51. So, if you can get superior characteristics at scale on commodity, is there any need for hardware innovation? The idea of “Commodity Innovation” sounds like something of an oxymoron. However, innovation is in how the components are put together to enable RAS (Reliability, Availability, Serviceability).
  52. The ECS Appliance embodies these characteristics of hyperscale based on commodity components.