SlideShare ist ein Scribd-Unternehmen logo
1 von 42
How Can Startups Leverage Big Data? 
Trudging Through Myth To Discover Real Value
• Mostly Unstructured Data 
• Client Data 
• Customer Data 
• Social Data 
• Driving towards insight 
2 
What is Big Data? 
www.rackspace.com
RACKSPACE® HOSTING | WWW.RACKSPACE.COM 
“Big Data is any 
dataset not suited to 
be processed by 
traditional legacy 
technology.”
The Three V’s 
4 
V3C 
Mining social data for sentiment 
Analyzing web clickstreams 
Analyzing log data for security breaches 
Telemetry from sensors and machines 
eCommerce predictive analytics 
VOLUME VELOCITY 
VARIETY COMPLEXITY
The Three V’s 
5 
V3C 
Mining social data for sentiment 
Analyzing web clickstreams 
Analyzing log data for security breaches 
Telemetry from sensors and machines 
eCommerce predictive analytics 
VOLUME VELOCITY 
VARIETY COMPLEXITY
Evolution of Data 
Time
• Big Data is now much more than hype – real 
customers with real use cases are adopting daily 
•Recent survey found that business leaders expected 
the deployment of Hadoop to result in a 3-year benefit 
ranging from $5M to $50M+ 
• Close to 100% of business leaders have already 
deployed or plan to deploy ApacheTM Hadoop® 
"Enterprises are showing increasing interest in the value provided by the large-scale data processing that Hadoop and Spark 
can provide, but can be wary of the upfront cost and complexity of setting up a cluster to prove that value. Managed services 
such as [OnMetalTM Cloud Big Data Platform] enable enterprises to focus their energies on generating business insights rather 
than configuring and managing infrastructure.” 
Matt Aslett 
451 Research Director, Data Platforms and Analytics 
7 
Big Data is Here to Stay 
www.rackspace.com
• To learn more about your customers 
• To optimize your business processes 
• To become a more targeted marketer 
• Interact with users and customers in real time 
• Add additional revenue and services 
8 
Why leverage Big Data? 
www.rackspace.com
www.rackspace.com 9 
What Is the Cost of Lacking a Big Data Strategy? 
• Today every company can be a data company 
• Successful companies will be data companies 
• Under Armour isn’t just a fitness company – they’re a data company
• Open Source 
• Able to process petabytes of data quickly 
• Developed at Google, implemented at scale at Yahoo 
• Handles unstructured data very well 
• One of the fastest growing eco-systems 
10 
Hadoop Has Emerged As A Leader In Distributed Data Sets
Fundamentals of Hadoop v1 
Zookeeper 
Configuration, sync 
and naming registry 
Oozie 
Workflow and job 
scheduling 
Knox 
Auth and access 
Falcon 
Data pipeline 
framework 
Installation, monitoring, administration 
11 
Data 
Services 
Pig 
Data flow 
scripting 
language 
HBase 
Distributed, 
scalable, non 
relational 
database HCatalog 
Metadata and table management system 
Core 
Services HDFS 
Distributed File System 
Hive 
DW analysis layer 
through HiveQL 
(SQL-like) queries 
MapReduce 
Data processing framework 
Ambari 
Operational 
Services 
Flume 
Log data 
aggregation and 
movement 
Sqoop 
Bulk data transfer 
from and to 
relational DB
• Biggest impediments include: 
– Insufficient skills in-house to design and deploy 
– Designing and deploying takes too long 
– High cost of physical infrastructure 
12 
Hadoop is Hard 
www.rackspace.com 
3 10 
only in 
businesses that plan 
to implement Hadoop 
have done so
Hadoop is Changing 
• Original focus on batch processing 
• Streaming and interactive use cases emerging 
• Shift from jobs that take hours to seconds 
• Impala, Spark, and Presto are emerging tools
14 
But what are these companies 
doing with Big Data? 
www.rackspace.com 
Gaining Insights!!!
What are Companies Doing with Hadoop? 
www.rackspace.com 15 
Vertical Use Case Data Type 
Financial Services 
New Account Risk Screens Text, Server Logs 
Fraud Prevention Server Logs 
Trading Risk Server Logs 
Maximize Deposit Spread Text, Server Logs 
Insurance Underwriting Geographic, Sensor, Text 
Accelerate Loan Processing Text 
Telecom 
Call Detail Records (CDRs) Machine, Geographic 
Infrastructure Investment Machine, Server logs 
Next Product to Buy (NPTB) Clickstream 
Real-time Bandwidth Allocation Server Logs, Text, 
Sentiment 
New Product Development Machine, Geographic 
Retail 
360 View of the Customer Clickstream, Text 
Analyze Brand Sentiment Sentiment 
Localized, Personalized Promotions Geographic 
Website Optimization Clickstream 
Optimal Store Layout Sensor 
Manufacturing 
Supply Chain and Logistics Sensor 
Assembly Line Quality Assurance Sensor 
Proactive Maintenance Machine 
Crowdsourced Quality Assurance Sentiment
Application Underpinning 
People are building net-new applications with Hadoop as their database 
• Mobile 
– Enterprises consider support for mobility and productivity enhancement to mobile workers as their top-priority new application 
category, according to a recent survey by CIMI Corp. That means most companies that have adopted, or are adopting, 
Hadoop will likely have to integrate the framework with mobile applications. 
• Data Aggregation 
– The two big use cases we're seeing for Impala are aggregating data in Hadoop to present analytic dashboards and improving 
data-discovery applications by providing faster performance than Hive," Alex Gutow, Cloudera's product marketing 
manager. 
• Dashboarding 
– Users are increasingly choosing Hadoop as the underlying technology to power interactive dashboarding capability. 
• Internet of Things 
– As tech wearables and generated devices start to become common-day solutions the backend of your application needs to 
be built to address these concerns and can handle the velocity and volume of data being produced by the appliance. 
www.rackspace.com 16
Clickstream Analysis 
Understand how your users are behaving on your website and optimize your experience 
Your home page looks great. But how do you move customers on to bigger things—like submitting a form 
or completing a purchase? Get more granular with customer segmentation. Hadoop makes it easier to 
analyze, visualize and ultimately change how visitors behave on your website. 
A clickstream is a series of page requests. Every page requested generates a signal. These signals can be 
graphically represented for clickstream reporting. The main point of clickstream tracking is to give 
webmasters insight into what visitors on their site are doing. 
• Clickpath 
– The study of human clicks on a website 
• Tracking Cookies 
– Tool used to understand and track online activity 
• Data Mining 
– Collecting data from websites and online properties 
www.rackspace.com 17
Sentiment Analysis 
Find out what your users are saying about you. Are they happy? Does your product make them a promoter? 
Your customers are talking. With Hadoop, you can mine Twitter, Facebook and other social media 
conversations for sentiment data about you and your competition, and use it to make targeted, real-time 
decisions that increase market share. 
Sentiment analysis aims to determine the attitude of a speaker or a writer with respect to some topic or the 
overall contextual polarity of a document. 
• Social Media Feeds 
– Many companies are now capturing entire Twitter and Facebook feeds to analyze. 
• Data Mining 
– Users are searching the web for comments, blogs, and whitepapers that can point to overall sentiment 
• E-Communities 
– Forums, user groups, Heroku 
www.rackspace.com 18
Machine Learning 
Interactive devices are now streamlining things like maintenance and troubleshooting 
Your machines know things. From out in the field to the assembly line floor—machines stream low-cost, 
always-on data. Hadoop makes it easier for you to store and refine that data and identify meaningful 
patterns, providing you with the insight to make proactive business decisions. 
Machine Learning is a scientific discipline that deals with the construction and study of algorithms that can 
learn from data. Such algorithms operate by building a model based on inputs and using that to make 
predictions or decisions, rather than following only explicitly programmed instructions. 
• Pattern Recognition 
– Users are building clusters to detect patterns and identify anomalies in data that these devices are generating 
• Decision Tree 
– Allows the system to take action and make choices based on the data 
• Predictive Modeling 
– Aims to automate the most common mistakes and errors as part of a preventative model 
www.rackspace.com 19
Fraud Detection 
Users are detecting fraudulent online behavior and rejecting those users before they commit an offense 
Fraud is a billion-dollar business and it is increasing every year. The PwC global economic crime survey of 
2009 suggests that close to 30% of companies worldwide have reported being victims of fraud in the past 
year. 
Fraud involves one or more persons who intentionally act secretly to deprive another of something of value, 
for their own benefit. Fraud is as old as humanity itself and can take an unlimited variety of different forms. 
However, in recent years, the development of new technologies has also provided further ways in which 
criminals may commit fraud. 
• Rules-Based Detection 
– Even though internet hackers have become better at tricking online systems, they still exhibit very calculated behavior. 
• Machine Learning 
– The aggregation of data points can help you collect more info about the potential sale and detect if it might be fraud. 
• Users Tagging and Tracing 
– Once users are flagged as fraudulent, their repeated attempts can be prevented. 
www.rackspace.com 20
Server Log Data 
Aggregate server logs to find trends and anomalies in your security records 
Security breaches happen. And when they do, your server logs may be your best line of defense. Hadoop 
takes server-log analysis to the next level by speeding and improving security forensics and providing a low 
cost platform to show compliance. 
Generally small files that track user information inside a confined environment; often used to meet 
compliance or troubleshoot an incident. 
• Scrub Data for Forensics 
– If a security incident occurs, it is important to remediate fast 
• Identify Anomalies 
– Anti-patterns are often the first sign 
• Discover Trends 
– Some types of errors might become common; learn to identify them 
• Actively Automate to Solve Issues with Log Files 
– Many of these errors can be proactively eliminated through the use of automation. 
www.rackspace.com 21
360 View of Customer – Dashboards and Analytics 
Create in-depth personas for your customers based on how they are actually behaving. 
Whenever a customer interacts with an organization, it is vital that the richness of information available on 
that customer informs and guides the processes that will help to maximize their experience, while 
simultaneously making the interaction as effective and efficient as possible. This includes everything from 
avoiding repetition or rekeying of information, to viewing customer history, establishing context and initiating 
desired actions. 
A total 360 view often contains 3 views: 
• The Past 
– Understanding how your users act in the past lets you understand who they are and serve them relevant content and 
products 
• The Present 
– Where are users coming from? What is their experience on your site right now? Do they need help? 
• The Future 
– Did they buy? Can we serve them more information to help their choice? Can we market to them better? 
www.rackspace.com 22
What’s Next? Interactive Processing! 
Interact with customers in real-time offering suggestions and inhibiting behavior 
What if instead of reacting to behavior we can engage virtually with the user to inhibit behavior? 
This is called interactive processing and it takes input from humans and reacts based on patterns and 
algorithms. 
The quicker we can server up this interaction, to the user the better equipped we are to inhibit their behavior! 
www.rackspace.com 23 
Input 
data 
Proces 
s 
Output 
data 
source: Teach-ICT.com
• Introducing support of Apache SparkTM 
• Apache Spark enables enterprises to combine the breadth of structured and unstructured data with the 
speed of in-memory processing to build streaming, machine learning, and graph-optimized applications 
that allow businesses to take action at the speed of insight. 
24 
Apache Spark 
www.rackspace.com
• Deeper Integration with SQL Workloads 
• Streaming Applications 
• Machine Learning 
• Iterative Processing 
• Real-time Graphical Dashboards 
25 
New Use Cases 
www.rackspace.com
YES 
26 
Does the delivery method matter? 
www.rackspace.com
Choose The Best Deployment Model 
27 
Public Cloud Managed Cloud 
Your Private Cloud 
(on Premise) 
Private Cloud
28
Advantages of storing data in the cloud: 
29 
Portability between 
providers 
Utility Pricing Minimal 
planning needed 
Scale to meet the exact 
demands 
Integration with data 
platforms
• Dedicated Hosting 
– No Capex Investment 
– Choose new hardware and software versioning easily 
– Rely on extended support personnel 
– Increased security options 
– Concurrent and predictable performance 
• On-Premise 
– Control Data Access 
– Integrate with core mainframe and systems 
– Build your own IP 
– Control every aspect of design and operation 
www.rackspace.com 30 
Advantages of Dedicated Hosting/On-Premise
www.rackspace.com 31 
The Trade Off... 
Custom Built 
Consistent 
Available 
Performant 
Purpose Built 
Elastic 
Flexible 
On-Demand
www.rackspace.com 32 
OnMetal Lets You Scale Like the Internet Giants 
BARE METAL 
SERVERS 
Instantly Available API-driven Highly Specialized No Hypervisor 
“Rackspace Cloud, because of its single-tenant OnMetal line, is the only place on Earth where you can enjoy 
Facebook/Google-style infrastructure rented by the hour.” 
-Ev Kontsevoy 
Director, Product 
Rackspace
Benefits of Outsourced Hosting 
Deliver resources fast 
Offload management responsibilities 
Scale as you grow 
Optimize around specified hardware
www.rackspace.com 34 
The Level of Management You Need 
Only you can decide what model is best for you! 
• DIY 
• Platform 
• Managed Service 
• Turnkey Service
Data as a Service: 
more time building, 
less time managing databases 
• For some businesses, database or 
infrastructure management IS core to the 
business 
• For most software-based businesses, database 
or infrastructure management represents time 
and resources not spent building the 
application 
• You must answer for yourself: are you in the 
business of managing infrastructure, or in the 
business of [your market here]? 
More time 
spent building 
the app 
More tasks performed FOR the 
developer (means that more time can be 
spent building the application) 
Sharding 
Scaling 
Performance 
Availability 
Analytics 
Optimization 
Proactive tasks 
Complex admin 
Patch 
Upgrade 
Backup/Restore 
Monitoring 
Replication 
HW selection 
Installation 
Patch 
Upgrade 
Backup/Restore 
Monitoring 
Replication 
HW selection 
Installation 
Patch 
Upgrade 
Backup/Restore 
Monitoring 
Replication 
HW selection 
Installation 
1 
Do-it-yourself 
database 
2 
Provisioned 
database 
3 
Automated 
database 
4 
Data as a 
Service 
HW selection 
Installation 
Patch 
Upgrade 
Backup/Restore 
Sharding 
Scaling 
Performance 
Availability 
Analytics 
Optimization 
Proactive tasks 
Complex admin 
App-specific 
data mgmt 
Patch 
Upgrade 
Backup/Restore 
Monitoring 
Replication 
Sharding 
Scaling 
Performance 
Availability 
Analytics 
Optimization 
Proactive tasks 
Complex admin 
App-specific 
data mgmt 
Sharding 
Scaling 
Performance 
Availability 
Analytics 
Optimization 
Proactive tasks 
Complex admin 
More tasks performed BY the developer 
(means that more time can be spent 
building the application) 
App-specific 
data mgmt 
App-specific 
data mgmt
www.rackspace.com 36
www.rackspace.com 37
www.rackspace.com 38
39 
Rackspace Offerings for the Data Tier 
www.rackspace.com 
Managed 
Database 
Services for 
Production Apps 
Managed 
Offerings of Most 
Popular 
Big Data, SQL, & 
NoSQL Databases 
Infrastructure 
for Data 
•Automatic DBA: Sharding, 
Backup, & HA 
•Entire Stack Optimized on Bare 
Metal 
•Supported 24x7x365 by experts 
• More than MongoDB… 
Cloud IaaS 
Get started fast 
DBA Services 
Dedicated 
Hosting 
Predictable costs & 
performance 
OnMetal 
Cloud Elasticity & 
Dedicated 
Performance 
•Architecture & Design 
•Tuning & Monitoring 
•24 x 7 x 365 Support 
•Cost Effective
1. Sign up for a free trial 
2. Want to know more? 
– Read my blog and check out the articles 
www.baremetalbigdata.com 
40 
What’s Next? 
www.rackspace.com
41 
Questions? 
www.rackspace.com
THANK YOU 
RACKSPACE® | 1 FANATICAL PLACE, CITY OF WINDCREST | SAN ANTONIO, TX 78218 
US SALES: 1-800-961-2888 | US SUPPORT: 1-800-961-4454 | WWW.RACKSPACE.COM 
© RACKSPACE LTD. | RACKSPACE® AND FANATICAL SUPPORT® ARE SERVICE MARKS OF RACKSPACE US, INC. REGISTERED IN THE UNITED S TATES AND OTHER COUNTRIES. | WWW.RACKSPACE.COM

Weitere ähnliche Inhalte

Was ist angesagt?

Customer Experience: A Catalyst for Digital Transformation
Customer Experience: A Catalyst for Digital TransformationCustomer Experience: A Catalyst for Digital Transformation
Customer Experience: A Catalyst for Digital Transformation
Cloudera, Inc.
 
Srini Data Monetization
Srini Data MonetizationSrini Data Monetization
Srini Data Monetization
Srini Alavala
 
Security and governance
Security and governanceSecurity and governance
Security and governance
DataWorks Summit
 

Was ist angesagt? (20)

Rethinking People Costs in Enterprise IT
Rethinking People Costs in Enterprise ITRethinking People Costs in Enterprise IT
Rethinking People Costs in Enterprise IT
 
Customer Experience: A Catalyst for Digital Transformation
Customer Experience: A Catalyst for Digital TransformationCustomer Experience: A Catalyst for Digital Transformation
Customer Experience: A Catalyst for Digital Transformation
 
Embedded Analytics Expert Session Webinar
Embedded Analytics Expert Session Webinar Embedded Analytics Expert Session Webinar
Embedded Analytics Expert Session Webinar
 
Leverage Big Data to Enhance Customer Experience in Telecommunications – with...
Leverage Big Data to Enhance Customer Experience in Telecommunications – with...Leverage Big Data to Enhance Customer Experience in Telecommunications – with...
Leverage Big Data to Enhance Customer Experience in Telecommunications – with...
 
Srini Data Monetization
Srini Data MonetizationSrini Data Monetization
Srini Data Monetization
 
Extending BI with Big Data Analytics
Extending BI with Big Data AnalyticsExtending BI with Big Data Analytics
Extending BI with Big Data Analytics
 
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
 
Security and governance
Security and governanceSecurity and governance
Security and governance
 
Webinar - Real-Time Customer Experience for the Right-Now Enterprise featurin...
Webinar - Real-Time Customer Experience for the Right-Now Enterprise featurin...Webinar - Real-Time Customer Experience for the Right-Now Enterprise featurin...
Webinar - Real-Time Customer Experience for the Right-Now Enterprise featurin...
 
Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...
Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...
Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...
 
The State of Big Data Adoption: A Glance at Top Industries Adopting Big Data ...
The State of Big Data Adoption: A Glance at Top Industries Adopting Big Data ...The State of Big Data Adoption: A Glance at Top Industries Adopting Big Data ...
The State of Big Data Adoption: A Glance at Top Industries Adopting Big Data ...
 
Modern Data Integration Expert Session Webinar
Modern Data Integration Expert Session Webinar Modern Data Integration Expert Session Webinar
Modern Data Integration Expert Session Webinar
 
CWIN17 san francisco-blockchain three ways to prevent it from failing in the ...
CWIN17 san francisco-blockchain three ways to prevent it from failing in the ...CWIN17 san francisco-blockchain three ways to prevent it from failing in the ...
CWIN17 san francisco-blockchain three ways to prevent it from failing in the ...
 
Competitive edgewithmongod bandpentaho_2014sep_v3[1]
Competitive edgewithmongod bandpentaho_2014sep_v3[1]Competitive edgewithmongod bandpentaho_2014sep_v3[1]
Competitive edgewithmongod bandpentaho_2014sep_v3[1]
 
Advanced Analytics and New Big Data
Advanced Analytics and New Big DataAdvanced Analytics and New Big Data
Advanced Analytics and New Big Data
 
Data Driven Decisions - Big Data Warehousing Meetup, FICO
Data Driven Decisions - Big Data Warehousing Meetup, FICOData Driven Decisions - Big Data Warehousing Meetup, FICO
Data Driven Decisions - Big Data Warehousing Meetup, FICO
 
How to optimize the supply chain with ai
How to optimize the supply chain with ai How to optimize the supply chain with ai
How to optimize the supply chain with ai
 
Cloudera Fast Forward Labs: Accelerate machine learning
Cloudera Fast Forward Labs: Accelerate machine learningCloudera Fast Forward Labs: Accelerate machine learning
Cloudera Fast Forward Labs: Accelerate machine learning
 
Starting the Journey to Managed Infrastructure Services
Starting the Journey to Managed Infrastructure ServicesStarting the Journey to Managed Infrastructure Services
Starting the Journey to Managed Infrastructure Services
 
Webinar - Big Data: Power to the User
Webinar - Big Data: Power to the User Webinar - Big Data: Power to the User
Webinar - Big Data: Power to the User
 

Andere mochten auch

1 encu-escritra-clau-correcion
1 encu-escritra-clau-correcion1 encu-escritra-clau-correcion
1 encu-escritra-clau-correcion
Diego Solano
 
Polyglot Persistence
Polyglot PersistencePolyglot Persistence
Polyglot Persistence
Wayne Walls
 
Making Sense of NoSQL and Big Data Amidst High Expectations
Making Sense of NoSQL and Big Data Amidst High ExpectationsMaking Sense of NoSQL and Big Data Amidst High Expectations
Making Sense of NoSQL and Big Data Amidst High Expectations
Rackspace
 

Andere mochten auch (15)

The Evolution of OpenStack – From Infancy to Enterprise
The Evolution of OpenStack – From Infancy to EnterpriseThe Evolution of OpenStack – From Infancy to Enterprise
The Evolution of OpenStack – From Infancy to Enterprise
 
Rackspace Hosting Presentation
Rackspace Hosting  PresentationRackspace Hosting  Presentation
Rackspace Hosting Presentation
 
What Would You Do With More Time?
What Would You Do With More Time?What Would You Do With More Time?
What Would You Do With More Time?
 
Agile-Techture: Nimble Cloud Engineering at Rackspace
Agile-Techture:  Nimble Cloud Engineering at RackspaceAgile-Techture:  Nimble Cloud Engineering at Rackspace
Agile-Techture: Nimble Cloud Engineering at Rackspace
 
Rackspace::Solve NYC - Welcome Keynote featuring Rackspace CTO John Engates
Rackspace::Solve NYC - Welcome Keynote featuring Rackspace CTO John EngatesRackspace::Solve NYC - Welcome Keynote featuring Rackspace CTO John Engates
Rackspace::Solve NYC - Welcome Keynote featuring Rackspace CTO John Engates
 
Deploy Apache Spark™ on Rackspace OnMetal™ for Cloud Big Data Platform
Deploy Apache Spark™ on Rackspace OnMetal™ for Cloud Big Data PlatformDeploy Apache Spark™ on Rackspace OnMetal™ for Cloud Big Data Platform
Deploy Apache Spark™ on Rackspace OnMetal™ for Cloud Big Data Platform
 
Integration testing for salt states using aws ec2 container service
Integration testing for salt states using aws ec2 container serviceIntegration testing for salt states using aws ec2 container service
Integration testing for salt states using aws ec2 container service
 
Personal Branding 2017
Personal Branding 2017Personal Branding 2017
Personal Branding 2017
 
1 encu-escritra-clau-correcion
1 encu-escritra-clau-correcion1 encu-escritra-clau-correcion
1 encu-escritra-clau-correcion
 
Why VM Replication Is Your Lifeline when Disaster Strikes
Why VM Replication Is Your Lifeline when Disaster StrikesWhy VM Replication Is Your Lifeline when Disaster Strikes
Why VM Replication Is Your Lifeline when Disaster Strikes
 
Tearing Down Silos and Building Your Enterprise Dev/Ops Engine
Tearing Down Silos and Building Your Enterprise Dev/Ops EngineTearing Down Silos and Building Your Enterprise Dev/Ops Engine
Tearing Down Silos and Building Your Enterprise Dev/Ops Engine
 
Ruby + Josy
Ruby + JosyRuby + Josy
Ruby + Josy
 
RMS Security Breakfast
RMS Security BreakfastRMS Security Breakfast
RMS Security Breakfast
 
Polyglot Persistence
Polyglot PersistencePolyglot Persistence
Polyglot Persistence
 
Making Sense of NoSQL and Big Data Amidst High Expectations
Making Sense of NoSQL and Big Data Amidst High ExpectationsMaking Sense of NoSQL and Big Data Amidst High Expectations
Making Sense of NoSQL and Big Data Amidst High Expectations
 

Ähnlich wie How Startups can leverage big data?

02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big data
Raul Chong
 
Latest corp big data and acme
Latest corp   big data and acmeLatest corp   big data and acme
Latest corp big data and acme
hooduku
 

Ähnlich wie How Startups can leverage big data? (20)

Big data Introduction by Mohan
Big data Introduction by MohanBig data Introduction by Mohan
Big data Introduction by Mohan
 
Fast Data and Architecting the Digital Enterprise Fast Data drivers, componen...
Fast Data and Architecting the Digital Enterprise Fast Data drivers, componen...Fast Data and Architecting the Digital Enterprise Fast Data drivers, componen...
Fast Data and Architecting the Digital Enterprise Fast Data drivers, componen...
 
Deteo. Data science, Big Data expertise
Deteo. Data science, Big Data expertise Deteo. Data science, Big Data expertise
Deteo. Data science, Big Data expertise
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big data
 
Big Data Use Cases
Big Data Use CasesBig Data Use Cases
Big Data Use Cases
 
Aziksa hadoop for buisness users2 santosh jha
Aziksa hadoop for buisness users2 santosh jhaAziksa hadoop for buisness users2 santosh jha
Aziksa hadoop for buisness users2 santosh jha
 
In-Memory Computing Webcast. Market Predictions 2017
In-Memory Computing Webcast. Market Predictions 2017In-Memory Computing Webcast. Market Predictions 2017
In-Memory Computing Webcast. Market Predictions 2017
 
Liberating data power of APIs
Liberating data power of APIsLiberating data power of APIs
Liberating data power of APIs
 
Hp big data_casestudy
Hp big data_casestudyHp big data_casestudy
Hp big data_casestudy
 
bigdataintro.pptx
bigdataintro.pptxbigdataintro.pptx
bigdataintro.pptx
 
Riding and Capitalizing the Next Wave of Information Technology
Riding and Capitalizing the Next Wave of Information TechnologyRiding and Capitalizing the Next Wave of Information Technology
Riding and Capitalizing the Next Wave of Information Technology
 
Big Data Solutions on Cloud – The Way Forward by Kiththi Perera SLT
Big Data Solutions on Cloud – The Way Forward by Kiththi Perera SLTBig Data Solutions on Cloud – The Way Forward by Kiththi Perera SLT
Big Data Solutions on Cloud – The Way Forward by Kiththi Perera SLT
 
Big data solutions on cloud – the way forward
Big data solutions on cloud – the way forwardBig data solutions on cloud – the way forward
Big data solutions on cloud – the way forward
 
Latest corp big data and acme
Latest corp   big data and acmeLatest corp   big data and acme
Latest corp big data and acme
 
Hooduku - Big data analytics - case study
Hooduku - Big data analytics - case studyHooduku - Big data analytics - case study
Hooduku - Big data analytics - case study
 
Big data
Big dataBig data
Big data
 
Customer 360
Customer 360Customer 360
Customer 360
 
Big data
Big dataBig data
Big data
 
Big data and data mining
Big data and data miningBig data and data mining
Big data and data mining
 
IARE_BDBA_ PPT_0.pptx
IARE_BDBA_ PPT_0.pptxIARE_BDBA_ PPT_0.pptx
IARE_BDBA_ PPT_0.pptx
 

Mehr von Rackspace

Ignite Innovation: Turn Developers Loose on the Hybrid Cloud”
Ignite Innovation: Turn Developers Loose on the Hybrid Cloud”Ignite Innovation: Turn Developers Loose on the Hybrid Cloud”
Ignite Innovation: Turn Developers Loose on the Hybrid Cloud”
Rackspace
 
The Next Generation IT Department MUST HAVE CLOUD
The Next Generation IT Department MUST HAVE CLOUDThe Next Generation IT Department MUST HAVE CLOUD
The Next Generation IT Department MUST HAVE CLOUD
Rackspace
 
Calculating Downtime Costs: How Much Should You Spend on DR?
Calculating Downtime Costs: How Much Should You Spend on DR?Calculating Downtime Costs: How Much Should You Spend on DR?
Calculating Downtime Costs: How Much Should You Spend on DR?
Rackspace
 

Mehr von Rackspace (20)

Rackspace::Solve NYC - Solving for Rapid Customer Growth and Scale Through De...
Rackspace::Solve NYC - Solving for Rapid Customer Growth and Scale Through De...Rackspace::Solve NYC - Solving for Rapid Customer Growth and Scale Through De...
Rackspace::Solve NYC - Solving for Rapid Customer Growth and Scale Through De...
 
Rackspace::Solve NYC - Second Stage Cloud
Rackspace::Solve NYC - Second Stage CloudRackspace::Solve NYC - Second Stage Cloud
Rackspace::Solve NYC - Second Stage Cloud
 
Rackspace::Solve NYC - Solving for Rapid Customer Growth and Scale Through De...
Rackspace::Solve NYC - Solving for Rapid Customer Growth and Scale Through De...Rackspace::Solve NYC - Solving for Rapid Customer Growth and Scale Through De...
Rackspace::Solve NYC - Solving for Rapid Customer Growth and Scale Through De...
 
Rackspace::Solve NYC - The Future of Applications with Ken Cochrane, Engineer...
Rackspace::Solve NYC - The Future of Applications with Ken Cochrane, Engineer...Rackspace::Solve NYC - The Future of Applications with Ken Cochrane, Engineer...
Rackspace::Solve NYC - The Future of Applications with Ken Cochrane, Engineer...
 
vCenter Site Recovery Manager: Architecting a DR Solution
vCenter Site Recovery Manager: Architecting a DR SolutionvCenter Site Recovery Manager: Architecting a DR Solution
vCenter Site Recovery Manager: Architecting a DR Solution
 
Outsourcing IT Projects to Managed Hosting of the Cloud
Outsourcing IT Projects to Managed Hosting of the CloudOutsourcing IT Projects to Managed Hosting of the Cloud
Outsourcing IT Projects to Managed Hosting of the Cloud
 
How to Bring Shadow IT to the Light
How to Bring Shadow IT to the LightHow to Bring Shadow IT to the Light
How to Bring Shadow IT to the Light
 
DR-to-the-Cloud Best Practices
DR-to-the-Cloud Best PracticesDR-to-the-Cloud Best Practices
DR-to-the-Cloud Best Practices
 
Migrating Traditional Apps from On-Premises to the Hybrid Cloud
Migrating Traditional Apps from On-Premises to the Hybrid CloudMigrating Traditional Apps from On-Premises to the Hybrid Cloud
Migrating Traditional Apps from On-Premises to the Hybrid Cloud
 
Rackspace::Solve SFO - CoreOS CEO Alex Polvi on Solving for What's Next
Rackspace::Solve SFO - CoreOS CEO Alex Polvi on Solving for What's NextRackspace::Solve SFO - CoreOS CEO Alex Polvi on Solving for What's Next
Rackspace::Solve SFO - CoreOS CEO Alex Polvi on Solving for What's Next
 
Rackspace::Solve SFO - Rackspace CEO Taylor Rhodes on the Power of Solving Pr...
Rackspace::Solve SFO - Rackspace CEO Taylor Rhodes on the Power of Solving Pr...Rackspace::Solve SFO - Rackspace CEO Taylor Rhodes on the Power of Solving Pr...
Rackspace::Solve SFO - Rackspace CEO Taylor Rhodes on the Power of Solving Pr...
 
Rackspace::Solve SFO - Solving for the Coming Tidal Wave of Choices with Avai...
Rackspace::Solve SFO - Solving for the Coming Tidal Wave of Choices with Avai...Rackspace::Solve SFO - Solving for the Coming Tidal Wave of Choices with Avai...
Rackspace::Solve SFO - Solving for the Coming Tidal Wave of Choices with Avai...
 
vSphere with Openstack
vSphere with OpenstackvSphere with Openstack
vSphere with Openstack
 
Rackspace::Solve SFO - Solve(Scale) Featuring Docker CEO Ben Golub
Rackspace::Solve SFO - Solve(Scale) Featuring Docker CEO Ben GolubRackspace::Solve SFO - Solve(Scale) Featuring Docker CEO Ben Golub
Rackspace::Solve SFO - Solve(Scale) Featuring Docker CEO Ben Golub
 
Rackspace::Solve SFO - Welcome Keynote featuring Rackspace CTO John Engates
Rackspace::Solve SFO - Welcome Keynote featuring Rackspace CTO John EngatesRackspace::Solve SFO - Welcome Keynote featuring Rackspace CTO John Engates
Rackspace::Solve SFO - Welcome Keynote featuring Rackspace CTO John Engates
 
vSphere with OpenStack
vSphere with OpenStackvSphere with OpenStack
vSphere with OpenStack
 
Pre-Aggregated Analytics And Social Feeds Using MongoDB
Pre-Aggregated Analytics And Social Feeds Using MongoDBPre-Aggregated Analytics And Social Feeds Using MongoDB
Pre-Aggregated Analytics And Social Feeds Using MongoDB
 
Ignite Innovation: Turn Developers Loose on the Hybrid Cloud”
Ignite Innovation: Turn Developers Loose on the Hybrid Cloud”Ignite Innovation: Turn Developers Loose on the Hybrid Cloud”
Ignite Innovation: Turn Developers Loose on the Hybrid Cloud”
 
The Next Generation IT Department MUST HAVE CLOUD
The Next Generation IT Department MUST HAVE CLOUDThe Next Generation IT Department MUST HAVE CLOUD
The Next Generation IT Department MUST HAVE CLOUD
 
Calculating Downtime Costs: How Much Should You Spend on DR?
Calculating Downtime Costs: How Much Should You Spend on DR?Calculating Downtime Costs: How Much Should You Spend on DR?
Calculating Downtime Costs: How Much Should You Spend on DR?
 

Kürzlich hochgeladen

CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
amitlee9823
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
only4webmaster01
 
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get CytotecAbortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
amitlee9823
 
➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men 🔝Thrissur🔝 Escor...
➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men  🔝Thrissur🔝   Escor...➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men  🔝Thrissur🔝   Escor...
➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men 🔝Thrissur🔝 Escor...
amitlee9823
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
amitlee9823
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
amitlee9823
 
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
amitlee9823
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
amitlee9823
 

Kürzlich hochgeladen (20)

CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get CytotecAbortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get Cytotec
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
hybrid Seed Production In Chilli & Capsicum.pptx
hybrid Seed Production In Chilli & Capsicum.pptxhybrid Seed Production In Chilli & Capsicum.pptx
hybrid Seed Production In Chilli & Capsicum.pptx
 
Detecting Credit Card Fraud: A Machine Learning Approach
Detecting Credit Card Fraud: A Machine Learning ApproachDetecting Credit Card Fraud: A Machine Learning Approach
Detecting Credit Card Fraud: A Machine Learning Approach
 
➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men 🔝Thrissur🔝 Escor...
➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men  🔝Thrissur🔝   Escor...➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men  🔝Thrissur🔝   Escor...
➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men 🔝Thrissur🔝 Escor...
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Predicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science ProjectPredicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science Project
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 

How Startups can leverage big data?

  • 1. How Can Startups Leverage Big Data? Trudging Through Myth To Discover Real Value
  • 2. • Mostly Unstructured Data • Client Data • Customer Data • Social Data • Driving towards insight 2 What is Big Data? www.rackspace.com
  • 3. RACKSPACE® HOSTING | WWW.RACKSPACE.COM “Big Data is any dataset not suited to be processed by traditional legacy technology.”
  • 4. The Three V’s 4 V3C Mining social data for sentiment Analyzing web clickstreams Analyzing log data for security breaches Telemetry from sensors and machines eCommerce predictive analytics VOLUME VELOCITY VARIETY COMPLEXITY
  • 5. The Three V’s 5 V3C Mining social data for sentiment Analyzing web clickstreams Analyzing log data for security breaches Telemetry from sensors and machines eCommerce predictive analytics VOLUME VELOCITY VARIETY COMPLEXITY
  • 7. • Big Data is now much more than hype – real customers with real use cases are adopting daily •Recent survey found that business leaders expected the deployment of Hadoop to result in a 3-year benefit ranging from $5M to $50M+ • Close to 100% of business leaders have already deployed or plan to deploy ApacheTM Hadoop® "Enterprises are showing increasing interest in the value provided by the large-scale data processing that Hadoop and Spark can provide, but can be wary of the upfront cost and complexity of setting up a cluster to prove that value. Managed services such as [OnMetalTM Cloud Big Data Platform] enable enterprises to focus their energies on generating business insights rather than configuring and managing infrastructure.” Matt Aslett 451 Research Director, Data Platforms and Analytics 7 Big Data is Here to Stay www.rackspace.com
  • 8. • To learn more about your customers • To optimize your business processes • To become a more targeted marketer • Interact with users and customers in real time • Add additional revenue and services 8 Why leverage Big Data? www.rackspace.com
  • 9. www.rackspace.com 9 What Is the Cost of Lacking a Big Data Strategy? • Today every company can be a data company • Successful companies will be data companies • Under Armour isn’t just a fitness company – they’re a data company
  • 10. • Open Source • Able to process petabytes of data quickly • Developed at Google, implemented at scale at Yahoo • Handles unstructured data very well • One of the fastest growing eco-systems 10 Hadoop Has Emerged As A Leader In Distributed Data Sets
  • 11. Fundamentals of Hadoop v1 Zookeeper Configuration, sync and naming registry Oozie Workflow and job scheduling Knox Auth and access Falcon Data pipeline framework Installation, monitoring, administration 11 Data Services Pig Data flow scripting language HBase Distributed, scalable, non relational database HCatalog Metadata and table management system Core Services HDFS Distributed File System Hive DW analysis layer through HiveQL (SQL-like) queries MapReduce Data processing framework Ambari Operational Services Flume Log data aggregation and movement Sqoop Bulk data transfer from and to relational DB
  • 12. • Biggest impediments include: – Insufficient skills in-house to design and deploy – Designing and deploying takes too long – High cost of physical infrastructure 12 Hadoop is Hard www.rackspace.com 3 10 only in businesses that plan to implement Hadoop have done so
  • 13. Hadoop is Changing • Original focus on batch processing • Streaming and interactive use cases emerging • Shift from jobs that take hours to seconds • Impala, Spark, and Presto are emerging tools
  • 14. 14 But what are these companies doing with Big Data? www.rackspace.com Gaining Insights!!!
  • 15. What are Companies Doing with Hadoop? www.rackspace.com 15 Vertical Use Case Data Type Financial Services New Account Risk Screens Text, Server Logs Fraud Prevention Server Logs Trading Risk Server Logs Maximize Deposit Spread Text, Server Logs Insurance Underwriting Geographic, Sensor, Text Accelerate Loan Processing Text Telecom Call Detail Records (CDRs) Machine, Geographic Infrastructure Investment Machine, Server logs Next Product to Buy (NPTB) Clickstream Real-time Bandwidth Allocation Server Logs, Text, Sentiment New Product Development Machine, Geographic Retail 360 View of the Customer Clickstream, Text Analyze Brand Sentiment Sentiment Localized, Personalized Promotions Geographic Website Optimization Clickstream Optimal Store Layout Sensor Manufacturing Supply Chain and Logistics Sensor Assembly Line Quality Assurance Sensor Proactive Maintenance Machine Crowdsourced Quality Assurance Sentiment
  • 16. Application Underpinning People are building net-new applications with Hadoop as their database • Mobile – Enterprises consider support for mobility and productivity enhancement to mobile workers as their top-priority new application category, according to a recent survey by CIMI Corp. That means most companies that have adopted, or are adopting, Hadoop will likely have to integrate the framework with mobile applications. • Data Aggregation – The two big use cases we're seeing for Impala are aggregating data in Hadoop to present analytic dashboards and improving data-discovery applications by providing faster performance than Hive," Alex Gutow, Cloudera's product marketing manager. • Dashboarding – Users are increasingly choosing Hadoop as the underlying technology to power interactive dashboarding capability. • Internet of Things – As tech wearables and generated devices start to become common-day solutions the backend of your application needs to be built to address these concerns and can handle the velocity and volume of data being produced by the appliance. www.rackspace.com 16
  • 17. Clickstream Analysis Understand how your users are behaving on your website and optimize your experience Your home page looks great. But how do you move customers on to bigger things—like submitting a form or completing a purchase? Get more granular with customer segmentation. Hadoop makes it easier to analyze, visualize and ultimately change how visitors behave on your website. A clickstream is a series of page requests. Every page requested generates a signal. These signals can be graphically represented for clickstream reporting. The main point of clickstream tracking is to give webmasters insight into what visitors on their site are doing. • Clickpath – The study of human clicks on a website • Tracking Cookies – Tool used to understand and track online activity • Data Mining – Collecting data from websites and online properties www.rackspace.com 17
  • 18. Sentiment Analysis Find out what your users are saying about you. Are they happy? Does your product make them a promoter? Your customers are talking. With Hadoop, you can mine Twitter, Facebook and other social media conversations for sentiment data about you and your competition, and use it to make targeted, real-time decisions that increase market share. Sentiment analysis aims to determine the attitude of a speaker or a writer with respect to some topic or the overall contextual polarity of a document. • Social Media Feeds – Many companies are now capturing entire Twitter and Facebook feeds to analyze. • Data Mining – Users are searching the web for comments, blogs, and whitepapers that can point to overall sentiment • E-Communities – Forums, user groups, Heroku www.rackspace.com 18
  • 19. Machine Learning Interactive devices are now streamlining things like maintenance and troubleshooting Your machines know things. From out in the field to the assembly line floor—machines stream low-cost, always-on data. Hadoop makes it easier for you to store and refine that data and identify meaningful patterns, providing you with the insight to make proactive business decisions. Machine Learning is a scientific discipline that deals with the construction and study of algorithms that can learn from data. Such algorithms operate by building a model based on inputs and using that to make predictions or decisions, rather than following only explicitly programmed instructions. • Pattern Recognition – Users are building clusters to detect patterns and identify anomalies in data that these devices are generating • Decision Tree – Allows the system to take action and make choices based on the data • Predictive Modeling – Aims to automate the most common mistakes and errors as part of a preventative model www.rackspace.com 19
  • 20. Fraud Detection Users are detecting fraudulent online behavior and rejecting those users before they commit an offense Fraud is a billion-dollar business and it is increasing every year. The PwC global economic crime survey of 2009 suggests that close to 30% of companies worldwide have reported being victims of fraud in the past year. Fraud involves one or more persons who intentionally act secretly to deprive another of something of value, for their own benefit. Fraud is as old as humanity itself and can take an unlimited variety of different forms. However, in recent years, the development of new technologies has also provided further ways in which criminals may commit fraud. • Rules-Based Detection – Even though internet hackers have become better at tricking online systems, they still exhibit very calculated behavior. • Machine Learning – The aggregation of data points can help you collect more info about the potential sale and detect if it might be fraud. • Users Tagging and Tracing – Once users are flagged as fraudulent, their repeated attempts can be prevented. www.rackspace.com 20
  • 21. Server Log Data Aggregate server logs to find trends and anomalies in your security records Security breaches happen. And when they do, your server logs may be your best line of defense. Hadoop takes server-log analysis to the next level by speeding and improving security forensics and providing a low cost platform to show compliance. Generally small files that track user information inside a confined environment; often used to meet compliance or troubleshoot an incident. • Scrub Data for Forensics – If a security incident occurs, it is important to remediate fast • Identify Anomalies – Anti-patterns are often the first sign • Discover Trends – Some types of errors might become common; learn to identify them • Actively Automate to Solve Issues with Log Files – Many of these errors can be proactively eliminated through the use of automation. www.rackspace.com 21
  • 22. 360 View of Customer – Dashboards and Analytics Create in-depth personas for your customers based on how they are actually behaving. Whenever a customer interacts with an organization, it is vital that the richness of information available on that customer informs and guides the processes that will help to maximize their experience, while simultaneously making the interaction as effective and efficient as possible. This includes everything from avoiding repetition or rekeying of information, to viewing customer history, establishing context and initiating desired actions. A total 360 view often contains 3 views: • The Past – Understanding how your users act in the past lets you understand who they are and serve them relevant content and products • The Present – Where are users coming from? What is their experience on your site right now? Do they need help? • The Future – Did they buy? Can we serve them more information to help their choice? Can we market to them better? www.rackspace.com 22
  • 23. What’s Next? Interactive Processing! Interact with customers in real-time offering suggestions and inhibiting behavior What if instead of reacting to behavior we can engage virtually with the user to inhibit behavior? This is called interactive processing and it takes input from humans and reacts based on patterns and algorithms. The quicker we can server up this interaction, to the user the better equipped we are to inhibit their behavior! www.rackspace.com 23 Input data Proces s Output data source: Teach-ICT.com
  • 24. • Introducing support of Apache SparkTM • Apache Spark enables enterprises to combine the breadth of structured and unstructured data with the speed of in-memory processing to build streaming, machine learning, and graph-optimized applications that allow businesses to take action at the speed of insight. 24 Apache Spark www.rackspace.com
  • 25. • Deeper Integration with SQL Workloads • Streaming Applications • Machine Learning • Iterative Processing • Real-time Graphical Dashboards 25 New Use Cases www.rackspace.com
  • 26. YES 26 Does the delivery method matter? www.rackspace.com
  • 27. Choose The Best Deployment Model 27 Public Cloud Managed Cloud Your Private Cloud (on Premise) Private Cloud
  • 28. 28
  • 29. Advantages of storing data in the cloud: 29 Portability between providers Utility Pricing Minimal planning needed Scale to meet the exact demands Integration with data platforms
  • 30. • Dedicated Hosting – No Capex Investment – Choose new hardware and software versioning easily – Rely on extended support personnel – Increased security options – Concurrent and predictable performance • On-Premise – Control Data Access – Integrate with core mainframe and systems – Build your own IP – Control every aspect of design and operation www.rackspace.com 30 Advantages of Dedicated Hosting/On-Premise
  • 31. www.rackspace.com 31 The Trade Off... Custom Built Consistent Available Performant Purpose Built Elastic Flexible On-Demand
  • 32. www.rackspace.com 32 OnMetal Lets You Scale Like the Internet Giants BARE METAL SERVERS Instantly Available API-driven Highly Specialized No Hypervisor “Rackspace Cloud, because of its single-tenant OnMetal line, is the only place on Earth where you can enjoy Facebook/Google-style infrastructure rented by the hour.” -Ev Kontsevoy Director, Product Rackspace
  • 33. Benefits of Outsourced Hosting Deliver resources fast Offload management responsibilities Scale as you grow Optimize around specified hardware
  • 34. www.rackspace.com 34 The Level of Management You Need Only you can decide what model is best for you! • DIY • Platform • Managed Service • Turnkey Service
  • 35. Data as a Service: more time building, less time managing databases • For some businesses, database or infrastructure management IS core to the business • For most software-based businesses, database or infrastructure management represents time and resources not spent building the application • You must answer for yourself: are you in the business of managing infrastructure, or in the business of [your market here]? More time spent building the app More tasks performed FOR the developer (means that more time can be spent building the application) Sharding Scaling Performance Availability Analytics Optimization Proactive tasks Complex admin Patch Upgrade Backup/Restore Monitoring Replication HW selection Installation Patch Upgrade Backup/Restore Monitoring Replication HW selection Installation Patch Upgrade Backup/Restore Monitoring Replication HW selection Installation 1 Do-it-yourself database 2 Provisioned database 3 Automated database 4 Data as a Service HW selection Installation Patch Upgrade Backup/Restore Sharding Scaling Performance Availability Analytics Optimization Proactive tasks Complex admin App-specific data mgmt Patch Upgrade Backup/Restore Monitoring Replication Sharding Scaling Performance Availability Analytics Optimization Proactive tasks Complex admin App-specific data mgmt Sharding Scaling Performance Availability Analytics Optimization Proactive tasks Complex admin More tasks performed BY the developer (means that more time can be spent building the application) App-specific data mgmt App-specific data mgmt
  • 39. 39 Rackspace Offerings for the Data Tier www.rackspace.com Managed Database Services for Production Apps Managed Offerings of Most Popular Big Data, SQL, & NoSQL Databases Infrastructure for Data •Automatic DBA: Sharding, Backup, & HA •Entire Stack Optimized on Bare Metal •Supported 24x7x365 by experts • More than MongoDB… Cloud IaaS Get started fast DBA Services Dedicated Hosting Predictable costs & performance OnMetal Cloud Elasticity & Dedicated Performance •Architecture & Design •Tuning & Monitoring •24 x 7 x 365 Support •Cost Effective
  • 40. 1. Sign up for a free trial 2. Want to know more? – Read my blog and check out the articles www.baremetalbigdata.com 40 What’s Next? www.rackspace.com
  • 42. THANK YOU RACKSPACE® | 1 FANATICAL PLACE, CITY OF WINDCREST | SAN ANTONIO, TX 78218 US SALES: 1-800-961-2888 | US SUPPORT: 1-800-961-4454 | WWW.RACKSPACE.COM © RACKSPACE LTD. | RACKSPACE® AND FANATICAL SUPPORT® ARE SERVICE MARKS OF RACKSPACE US, INC. REGISTERED IN THE UNITED S TATES AND OTHER COUNTRIES. | WWW.RACKSPACE.COM

Hinweis der Redaktion

  1. If your company doesn’t have a robust big data strategy it’s a real concern. As you can see from the last slide, it’s likely that regardless of industry your competitors are building their own big data initiatives. Examples: Nike Nest MapMyFitness Today we are all data companies. Examples include Nike, Nest, even old guard companies like John Deere. Share the Under Armour story. The data they harvest has the potential to impact every part of their business---from how they manage their supply chain to how the interact with their customers. With consumer movements like “the instrumented self”, it is the differentiator.