• Business Value of Hadoop
• Adoption Success Criteria
• “Jumpstarting” Your Hadoop
Big Data on Hadoop
> OMG! – The Opportunity - What Do We Do With All This Data?
Volume, Variety and Velocity all increasing rapidly
Source: IBM, Oct 2012
A Needed Technology Disruption
Told Us What Happened & Made Us More Efficient
BI Using
Data Cube
Analysis
Structured, Sampled
Transactional, Closed,
Expensive
RDBMS & EDW
SQL
Driven by Vendors
Infrastructure for Sales,
Finance & Operations
CFO
COO
CRO
Supply Chain
ERP
Sales Ops
Technology Disruption
What Can We Make Happen?
“Database of Intent”
Analytics Using
Total Fidelity
Analysis
Unstructured
Behavioral,
Open, Affordable
HADOOP
HQL
BI Using
Data Cube
Analysis
Structured, Sampled
Transactional,
Closed, Expensive
RDBMS & EDW
SQL
Driven by Open Source
Driven by Vendors
Big Data on Hadoop…What is it Good For?
Customer Innovation – Optimize the Customer Engagement Cycle
BUY USE
SUPPORT
Customer
Innovation
Personalized
Cross&Upsell
Channel
Analytics
Superior
Products
Telemetry
Analytics
Proactive
Support
Response
Analytics
Monitoring
Usage
Quality
Churn
Ticket Resolution SLA
Micro
Segmentation
Product
Categorization
Recommendations
Jumpstart Use Cases For Hadoop
Not Really About More BI
MORE BI?
So, We’re On a Journey to Big Data Analytics
You Are Here….Let’s get going!
Early Adopter Success Drivers
But First!
• Education
• Training
• Certifications
#1. IT Partnership With LOB (Marketing)
• Find a use case
• Identify budget
• Form project teams
• Partner on a small POC
• Educate
Success Driver #1
“By 2016 the CMO will have
more budget than the CIO”
- Gartner Group
#2. Use Second-Generation Big Data Analytics
• Support both power analysts & business users
• Work collaboratively on projects
• Empower interactive visualizations
• Facilitate the new “Big Data” workflow
• Gamify
Success Driver #2
#3. Embrace Analytics Hubs…They’re Coming!
• Sharing
• Re-Use
• Enforce Standards
• Analytics “Assembly”
Adoption Driver #3
Go Native & Get Total Fidelity for
Analytics on Hadoop
• Very rich
• Not sampled
• No data replication or movement
• Low complexity and TCO
Jumpstarting Your Hadoop
A Use Case Demo
Data
Warehouse
OLTP to OLAP
Mapping
Analyst
BI Using Data Cube Analysis
Analysts Worked with Transformed, Aggregated, Sampled Data
Ordering App
Financial App
Master Data
Staging
OLAP
Reports
BI Using
Data Cube
Analysis
Structured, Sampled
Transactional,
Closed, Expensive
RDBMS & EDW
SQL
Driven by Vendors
Analysts Access All the Data With a New Data Workflow
Iterative & Adaptive
Big Data Analytics on Hadoop Using Total Fidelity Analytics
Application
Analytics
Data
Unstructured
Behavioral,
Open, Affordable
HADOOP
HQL
Driven by Open Source
Analyst
Analytics Using
Total Fidelity
Analysis
1. How do we increase traffic to our site?
2. How do we make customers stay on the site, come
into the store and engage with our offerings?
3. How do we convert browsers to buyers?
4. What else can we promote, knowing their profile?
5. How can we make them come back and shop for more?
What kind of customer innovation can we drive?
1. How often do they visit, what did they buy, how
much did they spend?
2. What did they view, how long did they stay on the
site? What did they click on? What did they rate?
What did their friends buy?
3. How do we offer the most relevant product or
service and invite them to the right campaign?
What insights do we need?
To better understand my customer we need a more
granular segmentation (micro-segmentation)
The Signals of Big Data on Hadoop
Logs - Search terms, page views,
user agent, geo, IP, duration, size...
Campaigns, keywords,
channels, SEO, display, affiliates
Reviews - SKU, date, who,
comment, rating, location
Products - SKU, categories, bundles,
description, price
Orders - SKU, price, purchased
with, shipping date, status
Profiles - Names, location, gender,
demographics, reach, interests, influence
1. Describe and prepare the data
2. Perform initial analysis on raw data
3. A number of insights can be derived without further
analysis (RFM)
4. Identify feature set and extract training data for
modeling
5. Create the model in any model authoring tool
6. Score the model in Hadoop
7. Use the insight to improve business value
Steps in a typical analysis
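As a rough illustration of step 3, the RFM (recency, frequency, monetary) figures can be computed directly from raw order records. The sketch below is only an assumption of how that might look: the order tuples, the `rfm` helper and the as-of date are hypothetical and not taken from the demo, and on a cluster the same aggregation would more likely be expressed as a Hive query.

```python
from collections import defaultdict
from datetime import date

# Hypothetical order records: (customer_id, order_date, amount).
orders = [
    ("c1", date(2013, 1, 5), 69.99),
    ("c1", date(2013, 2, 20), 49.99),
    ("c2", date(2012, 11, 2), 19.99),
]


def rfm(orders, as_of):
    """Return {customer_id: (recency_days, frequency, monetary)}."""
    last_order = {}
    frequency = defaultdict(int)
    monetary = defaultdict(float)
    for customer_id, order_date, amount in orders:
        # Track the most recent order date per customer.
        if customer_id not in last_order or order_date > last_order[customer_id]:
            last_order[customer_id] = order_date
        frequency[customer_id] += 1
        monetary[customer_id] += amount
    return {
        c: ((as_of - last_order[c]).days, frequency[c], round(monetary[c], 2))
        for c in last_order
    }


scores = rfm(orders, as_of=date(2013, 3, 1))
for customer_id, (recency, freq, spend) in sorted(scores.items()):
    print(customer_id, recency, freq, spend)
```

Customers with low recency, high frequency and high spend are the obvious candidates for the cross-sell and up-sell campaigns discussed earlier.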
Complexity of data compared to transactional world
{"frequentlyPurchasedWith": [], "color": "Black", "skutype": "parent", "productTemplate":
"Computer_Accessory", "salesRankMediumTerm": "3039", "shortDescription": "Compatible with Windows 8 and
RT and Android 3.0 tablets; Bluetooth technology; convertible stand/carrying case", "includedItemList":
[{"includedItem": "Logitech Tablet Keyboard for Windows 8 and RT and Android 3.0+ Tablets"}, {"includedItem":
"4 AAA batteries"}, {"includedItem": "Owner's manual"}], "subclassId": 2409, "sku": 6541967, "width": "12.3\"",
"subclass": "BLUETOOTH KEYBOARDS", "source": "BoxStore", "modelNumber": "920-004569", "digital": false,
"department": "COMPUTERS", "type": "HardGood", "productId": 1218752781558, "description": "None",
"technologyCode": "None", "longDescription": "This Logitech 920-004569 keyboard features a low-profile, 65-key
design for easy, comfortable typing on your Windows 8 or RT or Android 3.0 tablet. The convertible stand allows
comfortable viewing and provides on-the-go protection for the keyboard.", "categoryPath": [{"name": "Box
Store", "id": "cat00000"}, {"name": "Computers & Tablets", "id": "abcat0500000"}, {"name": "iPad, Tablets & E-
Readers", "id": "pcmcat209000050006"}, {"name": "Tablet Accessories", "id": "pcmcat231800050009"}, {"name":
"Tablet Docks, Keyboards & Stands", "id": "pcmcat242000050003"}], "manufacturer": "Logitech", "classId": 492,
"upc": "097855090973", "regularPrice": 69.99, "class_1": "TABLET ACCESSORIES", "relatedProducts": [{"sku":
4974041}, {"sku": 1306578835}, {"sku": 4640745}, {"sku": 9610542}, {"sku": 8785729}, {"sku": 6640676}]}
{"frequentlyPurchasedWith": [], "color": "Gray", "skutype": "parent", "productTemplate": "Computer_Accessory",
"salesRankMediumTerm": "None", "shortDescription": "Compatible with BlackBerry Playbook tablets; wool
construction; TPU plastic cradle; elastic band; metallic clip; functions as a stand; play-through design",
"includedItemList": [{"includedItem": "DICOTA TabBook Case for BlackBerry Playbook Tablets"}], "subclassId":
2404, "sku": 6738835, "width": "5.5\"", "subclass": "SO TABLET ACCY", "source": "BoxStore", "modelNumber":
"D30203", "digital": false, "department": "COMPUTERS", "type": "HardGood", "productId": 1218789793935,
"description": "None", "technologyCode": "None", "longDescription": "This DICOTA TabBook D30203 case helps
keep your BlackBerry Playbook tablet safe from hazards, with wool construction and a TPU plastic cradle for
durability and an elastic band to keep your tablet snug and secure in the case.", "categoryPath": [{"name": "Box
Store", "id": "cat00000"}, {"name": "Computers & Tablets", "id": "abcat0500000"}, {"name": "iPad, Tablets & E-
Readers", "id": "pcmcat209000050006"}, {"name": "Tablet Accessories", "id": "pcmcat231800050009"}, {"name":
"Tablet Cases, Covers & Sleeves", "id": "pcmcat242000050002"}], "manufacturer": "DICOTA", "classId": 492,
"upc": "7332752000964", "regularPrice": 49.99, "class_1": "TABLET ACCESSORIES", "relatedProducts": [{"sku":
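Records like the product entries above are nested, so part of preparing the data is flattening them into analysis-friendly rows. The following is a minimal sketch, assuming a trimmed copy of the first record; the `flatten_product` helper and the output field names are hypothetical.

```python
import json

# Trimmed copy of the first product record shown above (most fields omitted).
raw = """{"sku": 6541967, "manufacturer": "Logitech", "regularPrice": 69.99,
"categoryPath": [{"name": "Box Store", "id": "cat00000"},
                 {"name": "Computers & Tablets", "id": "abcat0500000"}],
"relatedProducts": [{"sku": 4974041}, {"sku": 1306578835}]}"""


def flatten_product(record):
    """Collapse the nested lists into flat, column-like values."""
    return {
        "sku": record["sku"],
        "manufacturer": record.get("manufacturer"),
        "regular_price": record.get("regularPrice"),
        # Join the category hierarchy into a single path string.
        "category_path": " > ".join(c["name"] for c in record.get("categoryPath", [])),
        # Keep only the SKU numbers of related products.
        "related_skus": [r["sku"] for r in record.get("relatedProducts", [])],
    }


row = flatten_product(json.loads(raw))
print(row)
```

Using `.get` with defaults keeps the helper tolerant of records that omit optional fields, which is common in semi-structured catalog data.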
How big can your data grow? Number of unique visitors per day
Why does it matter to track user activity?
Significant amount of time is spent on understanding the data and preparing it
for further analysis.
Describe, transform and prepare the data
Derive insights using Hadoop and Hive User Defined Functions
KNIME - Open-source modeling tool that can build and export
models as PMML
Bring Them Into Karmasphere for Scoring on Hadoop
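To make the idea of scoring an exported model more concrete, here is a minimal sketch that reads a linear regression from a PMML document and applies it to one record. It is not the Karmasphere workflow: the PMML fragment is hand-written and trimmed (a real KNIME export would also carry a Header, DataDictionary and MiningSchema), and the predictor names `visits` and `avg_order_value` are hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical, trimmed PMML fragment describing a linear regression.
PMML = """<PMML xmlns="http://www.dmg.org/PMML-4_1" version="4.1">
  <RegressionModel functionName="regression">
    <RegressionTable intercept="10.0">
      <NumericPredictor name="visits" coefficient="2.0"/>
      <NumericPredictor name="avg_order_value" coefficient="0.5"/>
    </RegressionTable>
  </RegressionModel>
</PMML>"""


def load_linear_model(pmml_text):
    """Extract the intercept and coefficients from the first RegressionTable."""
    root = ET.fromstring(pmml_text)
    # Match on the local tag name so the sketch ignores the version namespace.
    table = next(el for el in root.iter() if el.tag.endswith("RegressionTable"))
    intercept = float(table.get("intercept", "0"))
    coefficients = {
        el.get("name"): float(el.get("coefficient"))
        for el in table
        if el.tag.endswith("NumericPredictor")
    }
    return intercept, coefficients


def score(model, record):
    """Apply the linear model to one record (a dict of feature values)."""
    intercept, coefficients = model
    return intercept + sum(w * record[name] for name, w in coefficients.items())


model = load_linear_model(PMML)
value = score(model, {"visits": 3, "avg_order_value": 40.0})
print(value)
```

On Hadoop, a function like `score` would typically be applied to each row inside a user-defined function or a map task, so the model travels to the data rather than the other way around.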
Thank You
Questions?

 
Optimizing Bursty Hadoop on AWS - Big Data Cloud - June 3rd Meetup
Optimizing Bursty Hadoop on AWS - Big Data Cloud - June 3rd MeetupOptimizing Bursty Hadoop on AWS - Big Data Cloud - June 3rd Meetup
Optimizing Bursty Hadoop on AWS - Big Data Cloud - June 3rd Meetup
 



Why Hadoop is the New Infrastructure for the CMO?

  • 3. Big Data on Hadoop: Business Value of Hadoop; Adoption Success Criteria; “Jumpstarting” Your Hadoop
  • 4. OMG! The Opportunity: What Do We Do With All This Data? Volume, Variety and Velocity are all increasing rapidly. Source: IBM, Oct 2012
  • 5. Big Data on Hadoop: A Needed Technology Disruption. Told Us What Happened & Made Us More Efficient: BI Using Data Cube Analysis (Structured, Sampled, Transactional, Closed, Expensive; RDBMS & EDW; SQL; Driven by Vendors). Infrastructure for Sales, Finance & Operations: CFO, COO, CRO, Supply Chain, ERP, Sales Ops
  • 6. Big Data on Hadoop: Technology Disruption. What Can We Make Happen? “Database of Intent”: Analytics Using Total Fidelity Analysis (Unstructured, Behavioral, Open, Affordable; HADOOP; HQL; Driven by Open Source). BI Using Data Cube Analysis (Structured, Sampled, Transactional, Closed, Expensive; RDBMS & EDW; SQL; Driven by Vendors)
  • 7. Big Data on Hadoop…What is it Good For? Customer Innovation – Optimize the Customer Engagement Cycle. BUY: Personalized Cross & Upsell, Channel Analytics. USE: Superior Products, Telemetry Analytics. SUPPORT: Proactive Support, Response Analytics
  • 9. Big Data on Hadoop Not Really About More BI MORE BI?
  • 10. Big Data on Hadoop So, We’re On a Journey to Big Data Analytics You Are Here….Let’s get going!
  • 12. But First! Education; Training; Certifications
  • 13. #1. IT Partnership With LOB (Marketing): Find Use Case; Identify Budget; Form Project Teams; Partner on a Small POC; Educate. Big Data on Hadoop Success Driver #1
  • 14. “By 2016 the CMO will have more budget than the CIO” - Gartner Group. Big Data on Hadoop
  • 15. #2. Use Second-Generation Big Data Analytics: Support both power analysts & business users; Work collaboratively on projects; Empower Interactive Visualizations; Facilitate the new “Big Data” workflow; Gamify. Big Data on Hadoop Success Driver #2
  • 16. #3. Embrace Analytics Hubs…They’re Coming! Sharing; Re-Use; Enforce Standards; Analytics “Assembly”. Big Data on Hadoop Adoption Driver #3
  • 17. Go Native & Get Total Fidelity for Analytics on Hadoop: Very rich; Not sampled; No data replication or movement; Low complexity and TCO. Big Data on Hadoop
  • 18. Jumpstarting Your Hadoop A Use Case Demo
  • 19. Data Warehouse: OLTP to OLAP Mapping. Analysts Worked with Transformed, Aggregated, Sampled Data. Ordering App, Financial App, Master Data, Staging, OLAP, Reports, Analyst. BI Using Data Cube Analysis (Structured, Sampled, Transactional, Closed, Expensive; RDBMS & EDW; SQL; Driven by Vendors)
  • 20. Analysts Access All the Data With a New Data Workflow. Iterative & Adaptive Big Data Analytics on Hadoop Using Total Fidelity Analytics. Application, Analytics, Data, Analyst. Analytics Using Total Fidelity Analysis (Unstructured, Behavioral, Open, Affordable; HADOOP; HQL; Driven by Open Source)
  • 21. What kind of customer innovation can we drive? 1. How do we increase traffic to our site? 2. How do we make customers stay on the site, come into the store and engage with our offerings? 3. How do we convert browsers to buyers? 4. What else can we promote, knowing their profile? 5. How can we make them come back and shop for more?
  • 22. What insights do we need? To better understand our customers we need a more granular segmentation (micro-segmentation). 1. How often do they visit, what did they buy, how much did they spend? 2. What did they view, how long did they stay on the site? What did they click on? What did they rate? What did their friends buy? 3. How do we offer the most relevant product or service and invite them to the right campaign?
  • 23. The Signals of Big Data on Hadoop. Logs: search terms, page views, user agent, geo, IP, duration, size... Campaigns: keywords, channels, SEO, display, affiliates. Reviews: SKU, date, who, comment, rating, location. Products: SKU, categories, bundles, description, price. Orders: SKU, price, purchased with, shipping date, status. Profiles: names, location, gender, demographics, reach, interests, influence
  • 24. Steps in a typical analysis: 1. Describe and prepare the data 2. Perform initial analysis on raw data 3. A number of insights can be derived without further analysis (RFM) 4. Identify feature set and extract training data for modeling 5. Create the model in any model authoring tool 6. Score the model in Hadoop 7. Use the insight to improve business value
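The RFM step in the list above (recency, frequency, monetary value) can be illustrated with a small sketch. This is not the presenter's implementation: the order tuples, the `rfm` function name and the reference date are all hypothetical, and a real job would run over the full order history on Hadoop rather than an in-memory list.

```python
from datetime import date

# Hypothetical order records: (customer_id, order_date, amount).
ORDERS = [
    ("c1", date(2013, 1, 5), 69.99),
    ("c1", date(2013, 3, 1), 49.99),
    ("c2", date(2012, 11, 20), 19.99),
]


def rfm(orders, as_of):
    """Return {customer_id: (recency_days, frequency, monetary)}."""
    last_seen, count, total = {}, {}, {}
    for customer, day, amount in orders:
        # Recency is based on the most recent order date per customer.
        if customer not in last_seen or day > last_seen[customer]:
            last_seen[customer] = day
        # Frequency is the number of orders, monetary is the total spend.
        count[customer] = count.get(customer, 0) + 1
        total[customer] = total.get(customer, 0.0) + amount
    return {
        c: ((as_of - last_seen[c]).days, count[c], round(total[c], 2))
        for c in last_seen
    }


if __name__ == "__main__":
    for customer, scores in sorted(rfm(ORDERS, date(2013, 3, 31)).items()):
        print(customer, scores)
```

Customers with low recency, high frequency and high monetary value are natural candidates for the micro-segments discussed earlier; how the three numbers are binned into segments is a business decision this sketch leaves open.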
  • 25. Complexity of data compared to the transactional world
{"frequentlyPurchasedWith": [], "color": "Black", "skutype": "parent", "productTemplate": "Computer_Accessory", "salesRankMediumTerm": "3039", "shortDescription": "Compatible with Windows 8 and RT and Android 3.0 tablets; Bluetooth technology; convertible stand/carrying case", "includedItemList": [{"includedItem": "Logitech Tablet Keyboard for Windows 8 and RT and Android 3.0+ Tablets"}, {"includedItem": "4 AAA batteries"}, {"includedItem": "Owner's manual"}], "subclassId": 2409, "sku": 6541967, "width": "12.3\"", "subclass": "BLUETOOTH KEYBOARDS", "source": "BoxStore", "modelNumber": "920-004569", "digital": false, "department": "COMPUTERS", "type": "HardGood", "productId": 1218752781558, "description": "None", "technologyCode": "None", "longDescription": "This Logitech 920-004569 keyboard features a low-profile, 65-key design for easy, comfortable typing on your Windows 8 or RT or Android 3.0 tablet. The convertible stand allows comfortable viewing and provides on-the-go protection for the keyboard.", "categoryPath": [{"name": "Box Store", "id": "cat00000"}, {"name": "Computers & Tablets", "id": "abcat0500000"}, {"name": "iPad, Tablets & E-Readers", "id": "pcmcat209000050006"}, {"name": "Tablet Accessories", "id": "pcmcat231800050009"}, {"name": "Tablet Docks, Keyboards & Stands", "id": "pcmcat242000050003"}], "manufacturer": "Logitech", "classId": 492, "upc": "097855090973", "regularPrice": 69.99, "class_1": "TABLET ACCESSORIES", "relatedProducts": [{"sku": 4974041}, {"sku": 1306578835}, {"sku": 4640745}, {"sku": 9610542}, {"sku": 8785729}, {"sku": 6640676}]}
{"frequentlyPurchasedWith": [], "color": "Gray", "skutype": "parent", "productTemplate": "Computer_Accessory", "salesRankMediumTerm": "None", "shortDescription": "Compatible with BlackBerry Playbook tablets; wool construction; TPU plastic cradle; elastic band; metallic clip; functions as a stand; play-through design", "includedItemList": [{"includedItem": "DICOTA TabBook Case for BlackBerry Playbook Tablets"}], "subclassId": 2404, "sku": 6738835, "width": "5.5\"", "subclass": "SO TABLET ACCY", "source": "BoxStore", "modelNumber": "D30203", "digital": false, "department": "COMPUTERS", "type": "HardGood", "productId": 1218789793935, "description": "None", "technologyCode": "None", "longDescription": "This DICOTA TabBook D30203 case helps keep your BlackBerry Playbook tablet safe from hazards, with wool construction and a TPU plastic cradle for durability and an elastic band to keep your tablet snug and secure in the case.", "categoryPath": [{"name": "Box Store", "id": "cat00000"}, {"name": "Computers & Tablets", "id": "abcat0500000"}, {"name": "iPad, Tablets & E-Readers", "id": "pcmcat209000050006"}, {"name": "Tablet Accessories", "id": "pcmcat231800050009"}, {"name": "Tablet Cases, Covers & Sleeves", "id": "pcmcat242000050002"}], "manufacturer": "DICOTA", "classId": 492, "upc": "7332752000964", "regularPrice": 49.99, "class_1": "TABLET ACCESSORIES", "relatedProducts": [{"sku":
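To make the contrast with flat transactional rows concrete, the sketch below flattens one nested product record into a single row. The `RAW` string is an abridged copy of the first record on the slide (only a few of its fields and category levels are kept), and the `flatten` function and its output column names are hypothetical, chosen only for illustration.

```python
import json

# Abridged copy of the first product record from the slide.
RAW = """{"sku": 6541967, "manufacturer": "Logitech", "regularPrice": 69.99,
 "categoryPath": [{"name": "Box Store", "id": "cat00000"},
                  {"name": "Computers & Tablets", "id": "abcat0500000"},
                  {"name": "Tablet Accessories", "id": "pcmcat231800050009"}],
 "relatedProducts": [{"sku": 4974041}, {"sku": 4640745}]}"""


def flatten(record):
    """Collapse nested lists into scalar columns (hypothetical layout)."""
    path = record.get("categoryPath", [])
    return {
        "sku": record["sku"],
        "manufacturer": record.get("manufacturer"),
        "regular_price": record.get("regularPrice"),
        # Join the category hierarchy into one readable string.
        "category_path": " > ".join(c["name"] for c in path),
        # Keep the most specific category id, if any.
        "leaf_category_id": path[-1]["id"] if path else None,
        # Nested list of objects becomes a plain list of SKUs.
        "related_skus": [r["sku"] for r in record.get("relatedProducts", [])],
    }


if __name__ == "__main__":
    print(flatten(json.loads(RAW)))
```

Even this small record needs decisions a relational schema would have forced up front, such as whether related products become a child table or a list column.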
  • 26. How big can your data grow? Number of unique visitors per day
  • 27. Why does it matter to track user activity?
  • 28. Describe, transform and prepare the data. A significant amount of time is spent on understanding the data and preparing it for further analysis.
  • 29. Derive insights using Hadoop and Hive User Defined Functions
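The slide does not show the UDFs themselves, and Hive UDFs are normally written in Java. As a lighter stand-in, the sketch below uses Hive's streaming `TRANSFORM` mechanism, which pipes tab-separated rows through an external script. The input layout (user id, then URL), the query parameter name `q`, and the script and table names mentioned afterwards are all assumptions.

```python
import sys
from urllib.parse import parse_qs, urlparse


def search_term(url, param="q"):
    """Return the first value of the query parameter, or '' if absent."""
    values = parse_qs(urlparse(url).query).get(param, [])
    return values[0].strip().lower() if values else ""


def transform(lines):
    """Map 'user_id<TAB>url' rows to 'user_id<TAB>search_term' rows."""
    for line in lines:
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 2:
            continue  # skip malformed rows
        user_id, url = fields[0], fields[1]
        yield f"{user_id}\t{search_term(url)}"


if __name__ == "__main__":
    # Hive streams rows on stdin and reads results from stdout.
    for row in transform(sys.stdin):
        print(row)
```

Assuming the script is saved as `extract_term.py` and registered with `ADD FILE`, a query along the lines of `SELECT TRANSFORM(user_id, url) USING 'python extract_term.py' AS (user_id, term) FROM logs` would apply it; the table and column names here are hypothetical.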
  • 30. KNIME - Open-source modeling tool that can build models and export them as PMML
  • 31. Bring Them Into Karmasphere for Scoring on Hadoop