SlideShare ist ein Scribd-Unternehmen logo
1 von 25
The Next Generation
of Big Data Analytics

August 22, 2012




                  © Hortonworks Inc. 2012   © 2012 Teradata Corporation   1
Today’s Speakers




  Jim Walker               Cesar Rojas                           Eric Linden
Dir. Product Marketing   Dir. Solutions Marketing                Technical Marketing
  Hortonworks              Teradata Aster                        Teradata Aster




                                       © Hortonworks Inc. 2012    © 2012 Teradata Corporation   2
Big Data Changes the Game

                                                           Transactions + Interactions
Petabytes
               BIG DATA                 Mobile Web                 + Observations
                                        Sentiment

                                         User Click Stream
                                                          SMS/MMS
                                                                         = BIG DATA
                                                                Speech to Text

                                                      Social Interactions & Feeds
  Terabytes    WEB        Web logs
                                                                Spatial & GPS Coordinates
                                 A/B testing
                                                                         Sensors / RFID / Devices
                                         Behavioral Targeting
   Gigabytes   CRM                                                                  Business Data Feeds
                                                    Dynamic Pricing
                          Segmentation                                                  External Demographics
                                                          Search Marketing
                                 Customer Touches                                        User Generated Content
               ERP
   Megabytes                                                 Affiliate Networks
               Purchase detail        Support Contacts                                      HD Video, Audio, Images
                                                                Dynamic Funnels
               Purchase record
                                          Offer details            Offer history              Product/Service Logs
               Payment record



                                         Increasing Data Variety and Complexity


                                                              © Hortonworks Inc. 2012         © 2012 Teradata Corporation   3
Next Generation Data Architecture Drivers


 Business    •    Enable new business models & drive faster growth (20%+)
  Drivers    •    Find insights for competitive advantage & optimal returns



             •    Data continues to grow exponentially
 Technical   •    Data is increasingly everywhere and in many formats
   Drivers   •    Legacy solutions unfit for new requirements growth



 Financial   •    Cost of data systems, as % of IT spend, continues to grow
   Drivers   •    Cost advantages of commodity hardware & open source




                                            © Hortonworks Inc. 2012   © 2012 Teradata Corporation   4
Fueling Adoption of the Next Generation


 Empower the Ecosystem
 •  Apache Hadoop has to just work with what
    you already have
 •  Apache Hadoop must be a seamless part of
    holistic data management strategy
 •  Leverage existing assets and tools…
    Extend them with new and powerful data
                                                       Data Platform Services & Open APIs


                                                            Hortonworks
                                                            Data Platform

                                        © Hortonworks Inc. 2012   © 2012 Teradata Corporation   5
Hortonworks Data Platform (HDP)

                                                                           •  Simplify deployment to get
                                                                              started quickly and easily

                                                                           •  Monitor, manage any size cluster
                                                                              with familiar console and tools

                                                                           •  Only platform to include data
                                                                              integration services to interact
                            1                                                 with any data source

                                                                           •  Metadata services opens the
                                                                              platform for integration with
         Hortonworks Data Platform                                            existing applications
    Delivers enterprise grade functionality on a proven
    Apache Hadoop distribution to ease management,                         •  Dependable high availability
   simplify use and ease integration into the enterprise                      architecture




The only 100% open source data platform for Apache Hadoop

                                                 © Hortonworks Inc. 2012          © 2012 Teradata Corporation    6
Shift in Paradigm

                                        Classic BI
                                Structured & Repeatable Analysis




Business determines what                                                      IT structures the data to
     questions to ask                                                         answer those questions
                               SQL Performance & Structure
                                                                                “Capture only
                                                                                what’s needed”
“Capture in case it’s
     needed”                 MapReduce Processing Flexibility




 IT delivers platform for      Big Data Analytics
   storing, refining, &                                                      Business explores data for
                             Multi-structured & Iterative Analysis           questions worth answering
analyzing all data sources



                                                   © Hortonworks Inc. 2012    © 2012 Teradata Corporation   7
Transactions + Interactions + Observations
 Audio,              Retain runtime models and
 Video,
Images
                      historical data for ongoing   5      Business                Web, Mobile, CRM,
                           refinement & analysis                                   ERP, SCM, …
                                                         Transactions
 Docs,                                                   & Interactions
 Text,
 XML


  Web
 Logs,
 Clicks
                      Big Data                4                           Data
Social,               Refinery                                        Discovery &                                      Classic
Graph,                                                                                                              1     ETL
Feeds                                                                 Investigative                                 processing
                                                                        Analytics
Sensors,     3                          Share refined
Devices,
  RFID
                                        data & runtime                                2
           Store, aggregate, and        models                                         Interactive
           transform multi-structured                                                  data
Spatial,   data to unlock value                                 Business               exploration
 GPS
                                                               Intelligence
                                                               & Analytics
                            Retain historical data to
Events,
 Other
                            unlock additional value      6
                                                                                   Dashboards, Reports,
                                                                                   Visualization, …

                                                         © Hortonworks Inc. 2012      © 2012 Teradata Corporation           8
Unified Big Data Architecture
            •    Engineers
            •    Data Scientists
                                           Java, C/C++, Pig, Python, R, SAS,
            •    Quants                    SQL, Excel, BI, Visualization, etc.
            •    Business Analysts



        Discovery                                                      Integrated
        Platform                                                       Data Warehouse




                                      Capture, Store, Refine



                               Audio/                                    Web &             Machine
  CRM      SCM         ERP                 Images          Text
                               Video                                     Social             Logs
                                     Sources of data

                                             © Hortonworks Inc. 2012       © 2012 Teradata Corporation   9
Next Generation Big Data Analytics
The Data Discovery Cycle


                                  Analytical Idea




Operational DB   Operationalize                                    Zero-ETL Data
   or EDW         or Move On                                      Load/Integration




                      Evaluate
                       Results
                                                        SQL & non-SQL
                                                           Analysis



                              © Hortonworks Inc. 2012     © 2012 Teradata Corporation   10
Key Elements of a Data Discovery Platform


       Highly Efficient & Performant Big Data Platform
   1   That Allows Quick Iterations


       Hybrid Capabilities that Provide both Legacy
   2
       (SQL, BI) and New (MapReduce) Interfaces


       Significant Out-of-the-Box Analytical Apps that
   3
       Minimize Development



  Democratize Big Data & Maximize Enterprise Adoption


                              © Hortonworks Inc. 2012   © 2012 Teradata Corporation   11
Teradata Aster Data Discovery Platform

               Analysts       Customers        Business Users                 Data Scientists

          Your Analytics & Advanced Reporting Applications

           Pattern
          Matching
                      Graph     Statistical   ELT     •  50+ pre-built analytic modules
Develop                                               •  Visual IDE; develop apps in hours
              Java, C, Python, Perl …                 •  Many programming languages


           SQL             SQL-MapReduce              •  SQL-MapReduce framework
                                                      •  Analyze both non-relational +
Process          Platform Services                       relational data
             (e.g. query planning, dynamic
           workload management, security …)           •  Linear, incremental scalability

                                                      •  Commodity-hardware based
 Store        Relational         Relational           •  Software only, cloud, or appliance
                Row               Column              •  Relational-data architecture can
                                                         be extended for non-relational types

                           External HDFS Data (Using SQL-H and HCatalog)

                                                    © Hortonworks Inc. 2012         © 2012 Teradata Corporation   12
Aster MapReduce Portfolio: the App Store of
Big Data
    50+ out-of-the-box SQL-MapReduce analytic applications


    Path Analysis                             Text Analysis
    Discover patterns in rows of              Derive patterns and extract features
    sequential data                           in textual data



    Statistical Analysis                      Segmentation
    High-performance processing of            Discover natural groupings of data
    common statistical calculations           points



    Marketing Analytics                       Data Transformation
    Analyze customer interactions to          Transform data for more advanced
    optimize marketing decisions              analysis



                                       © Hortonworks Inc. 2012   © 2012 Teradata Corporation   13
Aster SQL-H Enables Data Discovery on
 Hadoop Data

                     Aster SQL-H™
         A Business User’s Bridge to Analyze Hadoop Data



Aster SQL-H gives analysts and data scientists a better way
to analyze data stored cheaply in Hadoop
   •  Allow standard ANSI SQL to Hadoop data

   •  Leverage existing BI tool investments

   •  Enable 50+ prebuilt SQL-MapReduce Apps and IDE

   •  Improve self-sufficiency for analysts going against Hadoop

                                   © Hortonworks Inc. 2012   © 2012 Teradata Corporation   14
Analyst Point of View

                                 Gap 1:
                                Analysts

         Engineers            Data Scientists            Quants                 Business Analysts

      Java, C/C++, Pig, Python, R, SAS, SQL, Excel, BI, Visualization, etc.

         MapReduce
        (Processing)
                                                  Discovery                           Active Data
 Gap 2: File system lacks
                                                  Platform                            Warehouse
 optimizers, data locality,
 indexes
                                                Database and Analytic Processing Layer



   Data Storage and
       Refining

     Audio/                            Web &        Machine
                 Images       Text                                        CRM     SCM             ERP
     Video                             Social        Logs




                                                    © Hortonworks Inc. 2012      © 2012 Teradata Corporation   15
Analyst’s Goal: Get Insights from Data in
Hadoop

  Engineers            Data Scientists           Quants                     Business Analysts




                               Aster MapReduce Portfolio                Teradata Analytics Portfolio
    Custom Code and
      Development

                                 SQL & SQL-MapReduce                                 SQL

      MR, Pig, Hive
                                   Teradata Aster                                Teradata
      IT is the optimizer        Discovery Platform                                IDW




                                              © Hortonworks Inc. 2012         © 2012 Teradata Corporation   16
Analytics on Hadoop Data with Aster SQL-H


 Engineers         Data Scientists            Quants                      Business Analysts




              Aster MapReduce Portfolio
                           Aster MapReduce Portfolio                 Teradata Analytics Portfolio




      SQL-H                  SQL & MapReduce
                    SQL & SQL-MapReduce                                            SQL
                                                                                   SQL



                                Teradata Aster                                 Teradata
                              Discovery Platform                                 IDW




                                           © Hortonworks Inc. 2012          © 2012 Teradata Corporation   17
Aster SQL-H™ Integration with HCatalog
                                                                      Aster is the execution layer,
                                                                       all analytical processing is
              Aster Layer: SQL-H                                           done with Aster SQL-
                                                                          MapReduce functions
                                                                        (no Hive or Hadoop-MR)

                        Hadoop
       Data Filtering




                          MR

                                                                       HCatalog is the metadata
Data




                         Hive    HCatalog                                     repository


                         Pig




                          HDFS                                        HDFS is the data repository



                                            © Hortonworks Inc. 2012        © 2012 Teradata Corporation   18
When to Use What?

  •  The best approach by workload and data type
  •  Processing as a Function of Schema Requirements by Data Type

                                    Loading and Refining
                                                                                                          Analytics
            Low Cost Storage      Data Pre-                                    Reporting                (User-driven,
              & Retention        Processing,     Transformations                                         interactive)
                               Prep, Cleansing


Stable         Teradata /                                                                              Teradata
Schema                            Teradata          Teradata                   Teradata
                Hadoop                                                                               (SQL analytics)


                                                       Aster                                            Aster
Evolving                          Aster /
Schema          Hadoop                             (joining with                Aster             (SQL + MapReduce
                                  Hadoop
                                                 structured data)                                     Analytics)

                                                                                                          Aster
Format,
No Schema       Hadoop            Hadoop            Hadoop                                             (MapReduce
                                                                                                        Analytics)




                                                     © Hortonworks Inc. 2012         © 2012 Teradata Corporation        19
Customer Churn Prevention

Challenge                                                                            Cross-Channel
•    Know when churn will occur                                                   Customer Interactions
•    Data Mining tools predict probability but do not
                                                                                  17,000 Customers, 1 Month
     identify cause events

With Hadoop
•    Capture, retention and transformation of
     customer images (e.g. checks) and customer
     voice records

With Aster & Teradata
•    SQL-MapReduce listens and predicts the
     customer churn event
      –  Identifies all interaction patterns prior to
          acquisition or attrition

Business Impact
•    10-300x less effort to pinpoint a customer in
     the middle of a decision




                                                        © Hortonworks Inc. 2012          © 2012 Teradata Corporation   20
More Accurate Customer Churn Prevention

    Hadoop captures,                                                                                              Aster does path
   stores and transform                                                                                            and sentiment
      images and call                                     Social &
                                                          Web data
                                                                                                                 analysis with multi-
          records                                                                                                  structured data


            Multi-Structured Raw
                     Data
                                                   Call Data                                                               Analysis
                                                                         Aster Data
              Call Center Voice                                          Discovery                                            +
                   Records          Hadoop       Image Data
                                                                          Platform                                        Marketing
                                                                                                                          Automation
                 Images &           Capture, Retention




                                                                                              Analytic Results
                                                                           Dimensional Data
                Documents
                                            &                                                                              (Customer
                                     Transformation                                                                         Retention
                                                                                                                           Campaign)
            Traditional Data Flow         Layer
              Data Sources


                                       ETL Tools                           Teradata
                                                                        Integrated DW




                                              © Hortonworks Inc. 2012                              © 2012 Teradata Corporation          21
Aster-Hadoop Integration Demo
Churn Attrition




                  © Hortonworks Inc. 2012   © 2012 Teradata Corporation   22
Use Cases: Optimize Outcomes at Scale

             Media    optimize                         Content
       Intelligence   optimize                         Detection
       Investment     optimize                         Algorithms
       Advertising    optimize                         Performance
             Fraud    optimize                         Prevention
        Regulation    optimize                         Compliance
 Retail / Wholesale   optimize                         Inventory turns
    Manufacturing     optimize                         Supply chains
        Healthcare    optimize                         Patient outcomes
        Education     optimize                         Learning outcomes
      Government      optimize                         Citizen services
                            Source: Geoffrey Moore. Hadoop Summit 2012 keynote presentation.

                                 © Hortonworks Inc. 2012     © 2012 Teradata Corporation       23
Why Hortonworks and Teradata
Familiar business analysis on Apache Hadoop big data
•  50+ advanced SQL-MapReduce functions (Aster MapReduce Portfolio)
•  SQL-MapReduce development environment to build more functions

Straightforward Database to Apache Hadoop Integration
•  ANSI SQL-based interface to standard HCatalog metadata/schema in Hadoop

Interoperability with existing ecosystem & skillsets
•  BI tools (Tableau, MicroStrategy, Cognos), ETL tools, SQL analysts & existing
   applications

Ease of maintenance, skillset and tools compliant
•  Leverage existing DBA skill-sets without additional overhead
•  Apache Ambari provides management and monitoring of the Hadoop cluster and
   integrates with current administration tools


                                         © Hortonworks Inc. 2012   © 2012 Teradata Corporation   24
Learn More


Big Analytics Best Practices                                     Jim Walker
                                                                 Hortonworks
www.asterdata.com/BigAnalyticsSeries                             jim@hortonworks.com

                                                                 Cesar Rojas
Apache Hadoop                                                    Teradata Aster
                                                                 cesar.rojas@teradata.com
& the Big Data Refinery
www.hortonworks.com                                              Eric Linden
                                                                 Teradata Aster
                                                                 eric.linden@teradata.com




Twitter: @hortonworks   @asterdata @jaymce



                                       © Hortonworks Inc. 2012      © 2012 Teradata Corporation   25

Weitere ähnliche Inhalte

Was ist angesagt?

Rescue your Big Data from Downtime with HP Operations Bridge and Apache Hadoop
Rescue your Big Data from Downtime with HP Operations Bridge and Apache HadoopRescue your Big Data from Downtime with HP Operations Bridge and Apache Hadoop
Rescue your Big Data from Downtime with HP Operations Bridge and Apache HadoopHortonworks
 
Cloudian 451-hortonworks - webinar
Cloudian 451-hortonworks - webinarCloudian 451-hortonworks - webinar
Cloudian 451-hortonworks - webinarHortonworks
 
Big Data, Hadoop, Hortonworks and Microsoft HDInsight
Big Data, Hadoop, Hortonworks and Microsoft HDInsightBig Data, Hadoop, Hortonworks and Microsoft HDInsight
Big Data, Hadoop, Hortonworks and Microsoft HDInsightHortonworks
 
Hortonworks and Clarity Solution Group
Hortonworks and Clarity Solution Group Hortonworks and Clarity Solution Group
Hortonworks and Clarity Solution Group Hortonworks
 
Transform You Business with Big Data and Hortonworks
Transform You Business with Big Data and HortonworksTransform You Business with Big Data and Hortonworks
Transform You Business with Big Data and HortonworksHortonworks
 
Hortonworks and Platfora in Financial Services - Webinar
Hortonworks and Platfora in Financial Services - WebinarHortonworks and Platfora in Financial Services - Webinar
Hortonworks and Platfora in Financial Services - WebinarHortonworks
 
2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoption2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoptionHortonworks
 
Yahoo! Hack Europe
Yahoo! Hack EuropeYahoo! Hack Europe
Yahoo! Hack EuropeHortonworks
 
IDC Retail Insights - What's Possible with a Modern Data Architecture?
IDC Retail Insights - What's Possible with a Modern Data Architecture?IDC Retail Insights - What's Possible with a Modern Data Architecture?
IDC Retail Insights - What's Possible with a Modern Data Architecture?Hortonworks
 
Hortonworks sqrrl webinar v5.pptx
Hortonworks sqrrl webinar v5.pptxHortonworks sqrrl webinar v5.pptx
Hortonworks sqrrl webinar v5.pptxHortonworks
 
Optimizing your Modern Data Architecture - with Attunity, RCG Global Services...
Optimizing your Modern Data Architecture - with Attunity, RCG Global Services...Optimizing your Modern Data Architecture - with Attunity, RCG Global Services...
Optimizing your Modern Data Architecture - with Attunity, RCG Global Services...Hortonworks
 
Simplify and Secure your Hadoop Environment with Hortonworks and Centrify
Simplify and Secure your Hadoop Environment with Hortonworks and CentrifySimplify and Secure your Hadoop Environment with Hortonworks and Centrify
Simplify and Secure your Hadoop Environment with Hortonworks and CentrifyHortonworks
 
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata Hortonworks
 
Enterprise Hadoop with Hortonworks and Nimble Storage
Enterprise Hadoop with Hortonworks and Nimble StorageEnterprise Hadoop with Hortonworks and Nimble Storage
Enterprise Hadoop with Hortonworks and Nimble StorageHortonworks
 
Apache Hadoop on the Open Cloud
Apache Hadoop on the Open CloudApache Hadoop on the Open Cloud
Apache Hadoop on the Open CloudHortonworks
 
Hortonworks and Voltage Security webinar
Hortonworks and Voltage Security webinarHortonworks and Voltage Security webinar
Hortonworks and Voltage Security webinarHortonworks
 
Predicting Customer Experience through Hadoop and Customer Behavior Graphs
Predicting Customer Experience through Hadoop and Customer Behavior GraphsPredicting Customer Experience through Hadoop and Customer Behavior Graphs
Predicting Customer Experience through Hadoop and Customer Behavior GraphsHortonworks
 
Create a Smarter Data Lake with HP Haven and Apache Hadoop
Create a Smarter Data Lake with HP Haven and Apache HadoopCreate a Smarter Data Lake with HP Haven and Apache Hadoop
Create a Smarter Data Lake with HP Haven and Apache HadoopHortonworks
 
Accelerate Big Data Application Development with Cascading and HDP, Hortonwor...
Accelerate Big Data Application Development with Cascading and HDP, Hortonwor...Accelerate Big Data Application Development with Cascading and HDP, Hortonwor...
Accelerate Big Data Application Development with Cascading and HDP, Hortonwor...Hortonworks
 
10 Amazing Things To Do With a Hadoop-Based Data Lake
10 Amazing Things To Do With a Hadoop-Based Data Lake10 Amazing Things To Do With a Hadoop-Based Data Lake
10 Amazing Things To Do With a Hadoop-Based Data LakeVMware Tanzu
 

Was ist angesagt? (20)

Rescue your Big Data from Downtime with HP Operations Bridge and Apache Hadoop
Rescue your Big Data from Downtime with HP Operations Bridge and Apache HadoopRescue your Big Data from Downtime with HP Operations Bridge and Apache Hadoop
Rescue your Big Data from Downtime with HP Operations Bridge and Apache Hadoop
 
Cloudian 451-hortonworks - webinar
Cloudian 451-hortonworks - webinarCloudian 451-hortonworks - webinar
Cloudian 451-hortonworks - webinar
 
Big Data, Hadoop, Hortonworks and Microsoft HDInsight
Big Data, Hadoop, Hortonworks and Microsoft HDInsightBig Data, Hadoop, Hortonworks and Microsoft HDInsight
Big Data, Hadoop, Hortonworks and Microsoft HDInsight
 
Hortonworks and Clarity Solution Group
Hortonworks and Clarity Solution Group Hortonworks and Clarity Solution Group
Hortonworks and Clarity Solution Group
 
Transform You Business with Big Data and Hortonworks
Transform You Business with Big Data and HortonworksTransform You Business with Big Data and Hortonworks
Transform You Business with Big Data and Hortonworks
 
Hortonworks and Platfora in Financial Services - Webinar
Hortonworks and Platfora in Financial Services - WebinarHortonworks and Platfora in Financial Services - Webinar
Hortonworks and Platfora in Financial Services - Webinar
 
2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoption2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoption
 
Yahoo! Hack Europe
Yahoo! Hack EuropeYahoo! Hack Europe
Yahoo! Hack Europe
 
IDC Retail Insights - What's Possible with a Modern Data Architecture?
IDC Retail Insights - What's Possible with a Modern Data Architecture?IDC Retail Insights - What's Possible with a Modern Data Architecture?
IDC Retail Insights - What's Possible with a Modern Data Architecture?
 
Hortonworks sqrrl webinar v5.pptx
Hortonworks sqrrl webinar v5.pptxHortonworks sqrrl webinar v5.pptx
Hortonworks sqrrl webinar v5.pptx
 
Optimizing your Modern Data Architecture - with Attunity, RCG Global Services...
Optimizing your Modern Data Architecture - with Attunity, RCG Global Services...Optimizing your Modern Data Architecture - with Attunity, RCG Global Services...
Optimizing your Modern Data Architecture - with Attunity, RCG Global Services...
 
Simplify and Secure your Hadoop Environment with Hortonworks and Centrify
Simplify and Secure your Hadoop Environment with Hortonworks and CentrifySimplify and Secure your Hadoop Environment with Hortonworks and Centrify
Simplify and Secure your Hadoop Environment with Hortonworks and Centrify
 
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
 
Enterprise Hadoop with Hortonworks and Nimble Storage
Enterprise Hadoop with Hortonworks and Nimble StorageEnterprise Hadoop with Hortonworks and Nimble Storage
Enterprise Hadoop with Hortonworks and Nimble Storage
 
Apache Hadoop on the Open Cloud
Apache Hadoop on the Open CloudApache Hadoop on the Open Cloud
Apache Hadoop on the Open Cloud
 
Hortonworks and Voltage Security webinar
Hortonworks and Voltage Security webinarHortonworks and Voltage Security webinar
Hortonworks and Voltage Security webinar
 
Predicting Customer Experience through Hadoop and Customer Behavior Graphs
Predicting Customer Experience through Hadoop and Customer Behavior GraphsPredicting Customer Experience through Hadoop and Customer Behavior Graphs
Predicting Customer Experience through Hadoop and Customer Behavior Graphs
 
Create a Smarter Data Lake with HP Haven and Apache Hadoop
Create a Smarter Data Lake with HP Haven and Apache HadoopCreate a Smarter Data Lake with HP Haven and Apache Hadoop
Create a Smarter Data Lake with HP Haven and Apache Hadoop
 
Accelerate Big Data Application Development with Cascading and HDP, Hortonwor...
Accelerate Big Data Application Development with Cascading and HDP, Hortonwor...Accelerate Big Data Application Development with Cascading and HDP, Hortonwor...
Accelerate Big Data Application Development with Cascading and HDP, Hortonwor...
 
10 Amazing Things To Do With a Hadoop-Based Data Lake
10 Amazing Things To Do With a Hadoop-Based Data Lake10 Amazing Things To Do With a Hadoop-Based Data Lake
10 Amazing Things To Do With a Hadoop-Based Data Lake
 

Ähnlich wie The Next Generation of Big Data Analytics

Introduction to Hortonworks Data Platform for Windows
Introduction to Hortonworks Data Platform for WindowsIntroduction to Hortonworks Data Platform for Windows
Introduction to Hortonworks Data Platform for WindowsHortonworks
 
Hortonworks roadshow
Hortonworks roadshowHortonworks roadshow
Hortonworks roadshowAccenture
 
Break Through the Traditional Advertisement Services with Big Data and Apache...
Break Through the Traditional Advertisement Services with Big Data and Apache...Break Through the Traditional Advertisement Services with Big Data and Apache...
Break Through the Traditional Advertisement Services with Big Data and Apache...Hortonworks
 
Tackling big data with hadoop and open source integration
Tackling big data with hadoop and open source integrationTackling big data with hadoop and open source integration
Tackling big data with hadoop and open source integrationDataWorks Summit
 
Hadoop's Opportunity to Power Next-Generation Architectures
Hadoop's Opportunity to Power Next-Generation ArchitecturesHadoop's Opportunity to Power Next-Generation Architectures
Hadoop's Opportunity to Power Next-Generation ArchitecturesDataWorks Summit
 
Powering Next Generation Data Architecture With Apache Hadoop
Powering Next Generation Data Architecture With Apache HadoopPowering Next Generation Data Architecture With Apache Hadoop
Powering Next Generation Data Architecture With Apache HadoopHortonworks
 
Hadoop: What It Is and What It's Not
Hadoop: What It Is and What It's NotHadoop: What It Is and What It's Not
Hadoop: What It Is and What It's NotInside Analysis
 
Unified big data architecture
Unified big data architectureUnified big data architecture
Unified big data architectureDataWorks Summit
 
The Comprehensive Approach: A Unified Information Architecture
The Comprehensive Approach: A Unified Information ArchitectureThe Comprehensive Approach: A Unified Information Architecture
The Comprehensive Approach: A Unified Information ArchitectureInside Analysis
 
Hadoop's Role in the Big Data Architecture, OW2con'12, Paris
Hadoop's Role in the Big Data Architecture, OW2con'12, ParisHadoop's Role in the Big Data Architecture, OW2con'12, Paris
Hadoop's Role in the Big Data Architecture, OW2con'12, ParisOW2
 
Hadoop - Now, Next and Beyond
Hadoop - Now, Next and BeyondHadoop - Now, Next and Beyond
Hadoop - Now, Next and BeyondTeradata Aster
 
Teradata Big Data London Seminar
Teradata Big Data London SeminarTeradata Big Data London Seminar
Teradata Big Data London SeminarHortonworks
 
Talk IT_ Oracle_김태완_110831
Talk IT_ Oracle_김태완_110831Talk IT_ Oracle_김태완_110831
Talk IT_ Oracle_김태완_110831Cana Ko
 
2011 Sharepoint Summit - Microsoft's vision and strategy for the future of bu...
2011 Sharepoint Summit - Microsoft's vision and strategy for the future of bu...2011 Sharepoint Summit - Microsoft's vision and strategy for the future of bu...
2011 Sharepoint Summit - Microsoft's vision and strategy for the future of bu...MSHOWTO Bilisim Toplulugu
 
Informatica World 2006 - MDM Data Quality
Informatica World 2006 - MDM Data QualityInformatica World 2006 - MDM Data Quality
Informatica World 2006 - MDM Data QualityDatabase Architechs
 
Scaling MySQL: Catch 22 of Read Write Splitting
Scaling MySQL: Catch 22 of Read Write SplittingScaling MySQL: Catch 22 of Read Write Splitting
Scaling MySQL: Catch 22 of Read Write SplittingScaleBase
 
Tera stream for datastreams
Tera stream for datastreamsTera stream for datastreams
Tera stream for datastreams치민 최
 

Ähnlich wie The Next Generation of Big Data Analytics (20)

Introduction to Hortonworks Data Platform for Windows
Introduction to Hortonworks Data Platform for WindowsIntroduction to Hortonworks Data Platform for Windows
Introduction to Hortonworks Data Platform for Windows
 
vBACD July 2012 - Apache Hadoop, Now and Beyond
vBACD July 2012 - Apache Hadoop, Now and BeyondvBACD July 2012 - Apache Hadoop, Now and Beyond
vBACD July 2012 - Apache Hadoop, Now and Beyond
 
2012 06 hortonworks paris hug
2012 06 hortonworks paris hug2012 06 hortonworks paris hug
2012 06 hortonworks paris hug
 
Hortonworks roadshow
Hortonworks roadshowHortonworks roadshow
Hortonworks roadshow
 
Break Through the Traditional Advertisement Services with Big Data and Apache...
Break Through the Traditional Advertisement Services with Big Data and Apache...Break Through the Traditional Advertisement Services with Big Data and Apache...
Break Through the Traditional Advertisement Services with Big Data and Apache...
 
Tackling big data with hadoop and open source integration
Tackling big data with hadoop and open source integrationTackling big data with hadoop and open source integration
Tackling big data with hadoop and open source integration
 
Hadoop's Opportunity to Power Next-Generation Architectures
Hadoop's Opportunity to Power Next-Generation ArchitecturesHadoop's Opportunity to Power Next-Generation Architectures
Hadoop's Opportunity to Power Next-Generation Architectures
 
Powering Next Generation Data Architecture With Apache Hadoop
Powering Next Generation Data Architecture With Apache HadoopPowering Next Generation Data Architecture With Apache Hadoop
Powering Next Generation Data Architecture With Apache Hadoop
 
Hadoop: What It Is and What It's Not
Hadoop: What It Is and What It's NotHadoop: What It Is and What It's Not
Hadoop: What It Is and What It's Not
 
Unified big data architecture
Unified big data architectureUnified big data architecture
Unified big data architecture
 
The Comprehensive Approach: A Unified Information Architecture
The Comprehensive Approach: A Unified Information ArchitectureThe Comprehensive Approach: A Unified Information Architecture
The Comprehensive Approach: A Unified Information Architecture
 
Hadoop's Role in the Big Data Architecture, OW2con'12, Paris
Hadoop's Role in the Big Data Architecture, OW2con'12, ParisHadoop's Role in the Big Data Architecture, OW2con'12, Paris
Hadoop's Role in the Big Data Architecture, OW2con'12, Paris
 
Hadoop - Now, Next and Beyond
Hadoop - Now, Next and BeyondHadoop - Now, Next and Beyond
Hadoop - Now, Next and Beyond
 
Teradata Big Data London Seminar
Teradata Big Data London SeminarTeradata Big Data London Seminar
Teradata Big Data London Seminar
 
Talk IT_ Oracle_김태완_110831
Talk IT_ Oracle_김태완_110831Talk IT_ Oracle_김태완_110831
Talk IT_ Oracle_김태완_110831
 
2011 Sharepoint Summit - Microsoft's vision and strategy for the future of bu...
2011 Sharepoint Summit - Microsoft's vision and strategy for the future of bu...2011 Sharepoint Summit - Microsoft's vision and strategy for the future of bu...
2011 Sharepoint Summit - Microsoft's vision and strategy for the future of bu...
 
Informatica World 2006 - MDM Data Quality
Informatica World 2006 - MDM Data QualityInformatica World 2006 - MDM Data Quality
Informatica World 2006 - MDM Data Quality
 
Scaling MySQL: Catch 22 of Read Write Splitting
Scaling MySQL: Catch 22 of Read Write SplittingScaling MySQL: Catch 22 of Read Write Splitting
Scaling MySQL: Catch 22 of Read Write Splitting
 
Tera stream for datastreams
Tera stream for datastreamsTera stream for datastreams
Tera stream for datastreams
 
Enterprise Services Solutions
Enterprise Services SolutionsEnterprise Services Solutions
Enterprise Services Solutions
 

Mehr von Hortonworks

Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next LevelHortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next LevelHortonworks
 
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT StrategyIoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT StrategyHortonworks
 
Getting the Most Out of Your Data in the Cloud with Cloudbreak
Getting the Most Out of Your Data in the Cloud with CloudbreakGetting the Most Out of Your Data in the Cloud with Cloudbreak
Getting the Most Out of Your Data in the Cloud with CloudbreakHortonworks
 
Johns Hopkins - Using Hadoop to Secure Access Log Events
Johns Hopkins - Using Hadoop to Secure Access Log EventsJohns Hopkins - Using Hadoop to Secure Access Log Events
Johns Hopkins - Using Hadoop to Secure Access Log EventsHortonworks
 
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad GuysCatch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad GuysHortonworks
 
HDF 3.2 - What's New
HDF 3.2 - What's NewHDF 3.2 - What's New
HDF 3.2 - What's NewHortonworks
 
Curing Kafka Blindness with Hortonworks Streams Messaging Manager
Curing Kafka Blindness with Hortonworks Streams Messaging ManagerCuring Kafka Blindness with Hortonworks Streams Messaging Manager
Curing Kafka Blindness with Hortonworks Streams Messaging ManagerHortonworks
 
Interpretation Tool for Genomic Sequencing Data in Clinical Environments
Interpretation Tool for Genomic Sequencing Data in Clinical EnvironmentsInterpretation Tool for Genomic Sequencing Data in Clinical Environments
Interpretation Tool for Genomic Sequencing Data in Clinical EnvironmentsHortonworks
 
IBM+Hortonworks = Transformation of the Big Data Landscape
IBM+Hortonworks = Transformation of the Big Data LandscapeIBM+Hortonworks = Transformation of the Big Data Landscape
IBM+Hortonworks = Transformation of the Big Data LandscapeHortonworks
 
Premier Inside-Out: Apache Druid
Premier Inside-Out: Apache DruidPremier Inside-Out: Apache Druid
Premier Inside-Out: Apache DruidHortonworks
 
Accelerating Data Science and Real Time Analytics at Scale
Accelerating Data Science and Real Time Analytics at ScaleAccelerating Data Science and Real Time Analytics at Scale
Accelerating Data Science and Real Time Analytics at ScaleHortonworks
 
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATATIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATAHortonworks
 
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...Hortonworks
 
Delivering Real-Time Streaming Data for Healthcare Customers: Clearsense
Delivering Real-Time Streaming Data for Healthcare Customers: ClearsenseDelivering Real-Time Streaming Data for Healthcare Customers: Clearsense
Delivering Real-Time Streaming Data for Healthcare Customers: ClearsenseHortonworks
 
Making Enterprise Big Data Small with Ease
Making Enterprise Big Data Small with EaseMaking Enterprise Big Data Small with Ease
Making Enterprise Big Data Small with EaseHortonworks
 
Webinewbie to Webinerd in 30 Days - Webinar World Presentation
Webinewbie to Webinerd in 30 Days - Webinar World PresentationWebinewbie to Webinerd in 30 Days - Webinar World Presentation
Webinewbie to Webinerd in 30 Days - Webinar World PresentationHortonworks
 
Driving Digital Transformation Through Global Data Management
Driving Digital Transformation Through Global Data ManagementDriving Digital Transformation Through Global Data Management
Driving Digital Transformation Through Global Data ManagementHortonworks
 
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming FeaturesHDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming FeaturesHortonworks
 
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...Hortonworks
 
Unlock Value from Big Data with Apache NiFi and Streaming CDC
Unlock Value from Big Data with Apache NiFi and Streaming CDCUnlock Value from Big Data with Apache NiFi and Streaming CDC
Unlock Value from Big Data with Apache NiFi and Streaming CDCHortonworks
 

Mehr von Hortonworks (20)

Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next LevelHortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
 
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT StrategyIoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
 
Getting the Most Out of Your Data in the Cloud with Cloudbreak
Getting the Most Out of Your Data in the Cloud with CloudbreakGetting the Most Out of Your Data in the Cloud with Cloudbreak
Getting the Most Out of Your Data in the Cloud with Cloudbreak
 
Johns Hopkins - Using Hadoop to Secure Access Log Events
Johns Hopkins - Using Hadoop to Secure Access Log EventsJohns Hopkins - Using Hadoop to Secure Access Log Events
Johns Hopkins - Using Hadoop to Secure Access Log Events
 
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad GuysCatch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
 
HDF 3.2 - What's New
HDF 3.2 - What's NewHDF 3.2 - What's New
HDF 3.2 - What's New
 
Curing Kafka Blindness with Hortonworks Streams Messaging Manager
Curing Kafka Blindness with Hortonworks Streams Messaging ManagerCuring Kafka Blindness with Hortonworks Streams Messaging Manager
Curing Kafka Blindness with Hortonworks Streams Messaging Manager
 
Interpretation Tool for Genomic Sequencing Data in Clinical Environments
Interpretation Tool for Genomic Sequencing Data in Clinical EnvironmentsInterpretation Tool for Genomic Sequencing Data in Clinical Environments
Interpretation Tool for Genomic Sequencing Data in Clinical Environments
 
IBM+Hortonworks = Transformation of the Big Data Landscape
IBM+Hortonworks = Transformation of the Big Data LandscapeIBM+Hortonworks = Transformation of the Big Data Landscape
IBM+Hortonworks = Transformation of the Big Data Landscape
 
Premier Inside-Out: Apache Druid
Premier Inside-Out: Apache DruidPremier Inside-Out: Apache Druid
Premier Inside-Out: Apache Druid
 
Accelerating Data Science and Real Time Analytics at Scale
Accelerating Data Science and Real Time Analytics at ScaleAccelerating Data Science and Real Time Analytics at Scale
Accelerating Data Science and Real Time Analytics at Scale
 
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATATIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
 
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
 
Delivering Real-Time Streaming Data for Healthcare Customers: Clearsense
Delivering Real-Time Streaming Data for Healthcare Customers: ClearsenseDelivering Real-Time Streaming Data for Healthcare Customers: Clearsense
Delivering Real-Time Streaming Data for Healthcare Customers: Clearsense
 
Making Enterprise Big Data Small with Ease
Making Enterprise Big Data Small with EaseMaking Enterprise Big Data Small with Ease
Making Enterprise Big Data Small with Ease
 
Webinewbie to Webinerd in 30 Days - Webinar World Presentation
Webinewbie to Webinerd in 30 Days - Webinar World PresentationWebinewbie to Webinerd in 30 Days - Webinar World Presentation
Webinewbie to Webinerd in 30 Days - Webinar World Presentation
 
Driving Digital Transformation Through Global Data Management
Driving Digital Transformation Through Global Data ManagementDriving Digital Transformation Through Global Data Management
Driving Digital Transformation Through Global Data Management
 
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming FeaturesHDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
 
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
 
Unlock Value from Big Data with Apache NiFi and Streaming CDC
Unlock Value from Big Data with Apache NiFi and Streaming CDCUnlock Value from Big Data with Apache NiFi and Streaming CDC
Unlock Value from Big Data with Apache NiFi and Streaming CDC
 

Kürzlich hochgeladen

Grade Three -ELLNA-REVIEWER-ENGLISH.pptx
Grade Three -ELLNA-REVIEWER-ENGLISH.pptxGrade Three -ELLNA-REVIEWER-ENGLISH.pptx
Grade Three -ELLNA-REVIEWER-ENGLISH.pptxkarenfajardo43
 
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITWQ-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITWQuiz Club NITW
 
Mythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITWMythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITWQuiz Club NITW
 
ClimART Action | eTwinning Project
ClimART Action    |    eTwinning ProjectClimART Action    |    eTwinning Project
ClimART Action | eTwinning Projectjordimapav
 
ROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxVanesaIglesias10
 
Q-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITWQ-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITWQuiz Club NITW
 
Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4JOYLYNSAMANIEGO
 
Oppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmOppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmStan Meyer
 
Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1GloryAnnCastre1
 
ICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfVanessa Camilleri
 
Using Grammatical Signals Suitable to Patterns of Idea Development
Using Grammatical Signals Suitable to Patterns of Idea DevelopmentUsing Grammatical Signals Suitable to Patterns of Idea Development
Using Grammatical Signals Suitable to Patterns of Idea Developmentchesterberbo7
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management systemChristalin Nelson
 
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptxBIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptxSayali Powar
 
MS4 level being good citizen -imperative- (1) (1).pdf
MS4 level   being good citizen -imperative- (1) (1).pdfMS4 level   being good citizen -imperative- (1) (1).pdf
MS4 level being good citizen -imperative- (1) (1).pdfMr Bounab Samir
 
Active Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfActive Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfPatidar M
 
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...DhatriParmar
 
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...Association for Project Management
 
Scientific Writing :Research Discourse
Scientific  Writing :Research  DiscourseScientific  Writing :Research  Discourse
Scientific Writing :Research DiscourseAnita GoswamiGiri
 

Kürzlich hochgeladen (20)

Grade Three -ELLNA-REVIEWER-ENGLISH.pptx
Grade Three -ELLNA-REVIEWER-ENGLISH.pptxGrade Three -ELLNA-REVIEWER-ENGLISH.pptx
Grade Three -ELLNA-REVIEWER-ENGLISH.pptx
 
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITWQ-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
 
Mythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITWMythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITW
 
ClimART Action | eTwinning Project
ClimART Action    |    eTwinning ProjectClimART Action    |    eTwinning Project
ClimART Action | eTwinning Project
 
ROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptx
 
Q-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITWQ-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITW
 
Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4
 
Oppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmOppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and Film
 
Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1
 
Faculty Profile prashantha K EEE dept Sri Sairam college of Engineering
Faculty Profile prashantha K EEE dept Sri Sairam college of EngineeringFaculty Profile prashantha K EEE dept Sri Sairam college of Engineering
Faculty Profile prashantha K EEE dept Sri Sairam college of Engineering
 
ICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdf
 
Using Grammatical Signals Suitable to Patterns of Idea Development
Using Grammatical Signals Suitable to Patterns of Idea DevelopmentUsing Grammatical Signals Suitable to Patterns of Idea Development
Using Grammatical Signals Suitable to Patterns of Idea Development
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management system
 
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptxBIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
 
MS4 level being good citizen -imperative- (1) (1).pdf
MS4 level   being good citizen -imperative- (1) (1).pdfMS4 level   being good citizen -imperative- (1) (1).pdf
MS4 level being good citizen -imperative- (1) (1).pdf
 
Active Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfActive Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdf
 
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
 
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
 
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptxINCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
 
Scientific Writing :Research Discourse
Scientific  Writing :Research  DiscourseScientific  Writing :Research  Discourse
Scientific Writing :Research Discourse
 

The Next Generation of Big Data Analytics

  • 1. The Next Generation of Big Data Analytics August 22, 2012 © Hortonworks Inc. 2012 © 2012 Teradata Corporation 1
  • 2. Today’s Speakers Jim Walker Cesar Rojas Eric Linden Dir. Product Marketing Dir. Solutions Marketing Technical Marketing Hortonworks Teradata Aster Teradata Aster © Hortonworks Inc. 2012 © 2012 Teradata Corporation 2
  • 3. Big Data Changes the Game Transactions + Interactions Petabytes BIG DATA Mobile Web + Observations Sentiment User Click Stream SMS/MMS = BIG DATA Speech to Text Social Interactions & Feeds Terabytes WEB Web logs Spatial & GPS Coordinates A/B testing Sensors / RFID / Devices Behavioral Targeting Gigabytes CRM Business Data Feeds Dynamic Pricing Segmentation External Demographics Search Marketing Customer Touches User Generated Content ERP Megabytes Affiliate Networks Purchase detail Support Contacts HD Video, Audio, Images Dynamic Funnels Purchase record Offer details Offer history Product/Service Logs Payment record Increasing Data Variety and Complexity © Hortonworks Inc. 2012 © 2012 Teradata Corporation 3
  • 4. Next Generation Data Architecture Drivers Business •  Enable new business models & drive faster growth (20%+) Drivers •  Find insights for competitive advantage & optimal returns •  Data continues to grow exponentially Technical •  Data is increasingly everywhere and in many formats Drivers •  Legacy solutions unfit for new requirements growth Financial •  Cost of data systems, as % of IT spend, continues to grow Drivers •  Cost advantages of commodity hardware & open source © Hortonworks Inc. 2012 © 2012 Teradata Corporation 4
  • 5. Fueling Adoption of the Next Generation Empower the Ecosystem •  Apache Hadoop has to just work with what you already have •  Apache Hadoop must be a seamless part of holistic data management strategy •  Leverage existing assets and tools… Extend them with new and powerful data Data Platform Services & Open APIs Hortonworks Data Platform © Hortonworks Inc. 2012 © 2012 Teradata Corporation 5
  • 6. Hortonworks Data Platform (HDP) •  Simplify deployment to get started quickly and easily •  Monitor, manage any size cluster with familiar console and tools •  Only platform to include data integration services to interact 1 with any data source •  Metadata services opens the platform for integration with Hortonworks Data Platform existing applications Delivers enterprise grade functionality on a proven Apache Hadoop distribution to ease management, •  Dependable high availability simplify use and ease integration into the enterprise architecture The only 100% open source data platform for Apache Hadoop © Hortonworks Inc. 2012 © 2012 Teradata Corporation 6
  • 7. Shift in Paradigm Classic BI Structured & Repeatable Analysis Business determines what IT structures the data to questions to ask answer those questions SQL Performance & Structure “Capture only what’s needed” “Capture in case it’s needed” MapReduce Processing Flexibility IT delivers platform for Big Data Analytics storing, refining, & Business explores data for Multi-structured & Iterative Analysis questions worth answering analyzing all data sources © Hortonworks Inc. 2012 © 2012 Teradata Corporation 7
  • 8. Transactions + Interactions + Observations Audio, Retain runtime models and Video, Images historical data for ongoing 5 Business Web, Mobile, CRM, refinement & analysis ERP, SCM, … Transactions Docs, & Interactions Text, XML Web Logs, Clicks Big Data 4 Data Social, Refinery Discovery & Classic Graph, 1 ETL Feeds Investigative processing Analytics Sensors, 3 Share refined Devices, RFID data & runtime 2 Store, aggregate, and models Interactive transform multi-structured data Spatial, data to unlock value Business exploration GPS Intelligence & Analytics Retain historical data to Events, Other unlock additional value 6 Dashboards, Reports, Visualization, … © Hortonworks Inc. 2012 © 2012 Teradata Corporation 8
  • 9. Unified Big Data Architecture •  Engineers •  Data Scientists Java, C/C++, Pig, Python, R, SAS, •  Quants SQL, Excel, BI, Visualization, etc. •  Business Analysts Discovery Integrated Platform Data Warehouse Capture, Store, Refine Audio/ Web & Machine CRM SCM ERP Images Text Video Social Logs Sources of data © Hortonworks Inc. 2012 © 2012 Teradata Corporation 9
  • 10. Next Generation Big Data Analytics The Data Discovery Cycle Analytical Idea Operational DB Operationalize Zero-ETL Data or EDW or Move On Load/Integration Evaluate Results SQL & non-SQL Analysis © Hortonworks Inc. 2012 © 2012 Teradata Corporation 10
  • 11. Key Elements of a Data Discovery Platform Highly Efficient & Performant Big Data Platform 1 That Allows Quick Iterations Hybrid Capabilities that Provide both Legacy 2 (SQL, BI) and New (MapReduce) Interfaces Significant Out-of-the-Box Analytical Apps that 3 Minimize Development Democratize Big Data & Maximize Enterprise Adoption © Hortonworks Inc. 2012 © 2012 Teradata Corporation 11
  • 12. Teradata Aster Data Discovery Platform Analysts Customers Business Users Data Scientists Your Analytics & Advanced Reporting Applications Pattern Matching Graph Statistical ELT •  50+ pre-built analytic modules Develop •  Visual IDE; develop apps in hours Java, C, Python, Perl … •  Many programming languages SQL SQL-MapReduce •  SQL-MapReduce framework •  Analyze both non-relational + Process Platform Services relational data (e.g. query planning, dynamic workload management, security …) •  Linear, incremental scalability •  Commodity-hardware based Store Relational Relational •  Software only, cloud, or appliance Row Column •  Relational-data architecture can be extended for non-relational types External HDFS Data (Using SQL-H and HCatalog) © Hortonworks Inc. 2012 © 2012 Teradata Corporation 12
  • 13. Aster MapReduce Portfolio: the App Store of Big Data 50+ out-of-the-box SQL-MapReduce analytic applications Path Analysis Text Analysis Discover patterns in rows of Derive patterns and extract features sequential data in textual data Statistical Analysis Segmentation High-performance processing of Discover natural groupings of data common statistical calculations points Marketing Analytics Data Transformation Analyze customer interactions to Transform data for more advanced optimize marketing decisions analysis © Hortonworks Inc. 2012 © 2012 Teradata Corporation 13
  • 14. Aster SQL-H Enables Data Discovery on Hadoop Data Aster SQL-H™ A Business User’s Bridge to Analyze Hadoop Data Aster SQL-H gives analysts and data scientists a better way to analyze data stored cheaply in Hadoop •  Allow standard ANSI SQL to Hadoop data •  Leverage existing BI tool investments •  Enable 50+ prebuilt SQL-MapReduce Apps and IDE •  Improve self-sufficiency for analysts going against Hadoop © Hortonworks Inc. 2012 © 2012 Teradata Corporation 14
  • 15. Analyst Point of View Gap 1: Analysts Engineers Data Scientists Quants Business Analysts Java, C/C++, Pig, Python, R, SAS, SQL, Excel, BI, Visualization, etc. MapReduce (Processing) Discovery Active Data Gap 2: File system lacks Platform Warehouse optimizers, data locality, indexes Database and Analytic Processing Layer Data Storage and Refining Audio/ Web & Machine Images Text CRM SCM ERP Video Social Logs © Hortonworks Inc. 2012 © 2012 Teradata Corporation 15
  • 16. Analyst’s Goal: Get Insights from Data in Hadoop Engineers Data Scientists Quants Business Analysts Aster MapReduce Portfolio Teradata Analytics Portfolio Custom Code and Development SQL & SQL-MapReduce SQL MR, Pig, Hive Teradata Aster Teradata IT is the optimizer Discovery Platform IDW © Hortonworks Inc. 2012 © 2012 Teradata Corporation 16
  • 17. Analytics on Hadoop Data with Aster SQL-H Engineers Data Scientists Quants Business Analysts Aster MapReduce Portfolio Aster MapReduce Portfolio Teradata Analytics Portfolio SQL-H SQL & MapReduce SQL & SQL-MapReduce SQL SQL Teradata Aster Teradata Discovery Platform IDW © Hortonworks Inc. 2012 © 2012 Teradata Corporation 17
  • 18. Aster SQL-H™ Integration with HCatalog Aster is the execution layer, all analytical processing is Aster Layer: SQL-H done with Aster SQL- MapReduce functions (no Hive or Hadoop-MR) Hadoop Data Filtering MR HCatalog is the metadata Data Hive HCatalog repository Pig HDFS HDFS is the data repository © Hortonworks Inc. 2012 © 2012 Teradata Corporation 18
  • 19. When to Use What? •  The best approach by workload and data type •  Processing as a Function of Schema Requirements by Data Type Loading and Refining Analytics Low Cost Storage Data Pre- Reporting (User-driven, & Retention Processing, Transformations interactive) Prep, Cleansing Stable Teradata / Teradata Schema Teradata Teradata Teradata Hadoop (SQL analytics) Aster Aster Evolving Aster / Schema Hadoop (joining with Aster (SQL + MapReduce Hadoop structured data) Analytics) Aster Format, No Schema Hadoop Hadoop Hadoop (MapReduce Analytics) © Hortonworks Inc. 2012 © 2012 Teradata Corporation 19
  • 20. Customer Churn Prevention Challenge Cross-Channel •  Know when churn will occur Customer Interactions •  Data Mining tools predict probability but do not 17,000 Customers, 1 Month identify cause events With Hadoop •  Capture, retention and transformation of customer images (e.g. checks) and customer voice records With Aster & Teradata •  SQL-MapReduce listens and predicts the customer churn event –  Identifies all interaction patterns prior to acquisition or attrition Business Impact •  10-300x less effort to pinpoint a customer in the middle of a decision © Hortonworks Inc. 2012 © 2012 Teradata Corporation 20
  • 21. More Accurate Customer Churn Prevention Hadoop captures, Aster does path stores and transform and sentiment images and call Social & Web data analysis with multi- records structured data Multi-Structured Raw Data Call Data Analysis Aster Data Call Center Voice Discovery + Records Hadoop Image Data Platform Marketing Automation Images & Capture, Retention Analytic Results Dimensional Data Documents & (Customer Transformation Retention Campaign) Traditional Data Flow Layer Data Sources ETL Tools Teradata Integrated DW © Hortonworks Inc. 2012 © 2012 Teradata Corporation 21
  • 22. Aster-Hadoop Integration Demo Churn Attrition © Hortonworks Inc. 2012 © 2012 Teradata Corporation 22
  • 23. Use Cases: Optimize Outcomes at Scale Media optimize Content Intelligence optimize Detection Investment optimize Algorithms Advertising optimize Performance Fraud optimize Prevention Regulation optimize Compliance Retail / Wholesale optimize Inventory turns Manufacturing optimize Supply chains Healthcare optimize Patient outcomes Education optimize Learning outcomes Government optimize Citizen services Source: Geoffrey Moore. Hadoop Summit 2012 keynote presentation. © Hortonworks Inc. 2012 © 2012 Teradata Corporation 23
  • 24. Why Hortonworks and Teradata Familiar business analysis on Apache Hadoop big data •  50+ advanced SQL-MapReduce functions (Aster MapReduce Portfolio) •  SQL-MapReduce development environment to build more functions Straightforward Database to Apache Hadoop Integration •  ANSI SQL-based interface to standard HCatalog metadata/schema in Hadoop Interoperability with existing ecosystem & skillsets •  BI tools (Tableau, MicroStrategy, Cognos), ETL tools, SQL analysts & existing applications Ease of maintenance, skillset and tools compliant •  Leverage existing DBA skill-sets without additional overhead •  Apache Ambari provides management and monitoring of the Hadoop cluster and integrates with current administration tools © Hortonworks Inc. 2012 © 2012 Teradata Corporation 24
  • 25. Learn More Big Analytics Best Practices Jim Walker Hortonworks www.asterdata.com/BigAnalyticsSeries jim@hortonworks.com Cesar Rojas Apache Hadoop Teradata Aster cesar.rojas@teradata.com & the Big Data Refinery www.hortonworks.com Eric Linden Teradata Aster eric.linden@teradata.com Twitter: @hortonworks @asterdata @jaymce © Hortonworks Inc. 2012 © 2012 Teradata Corporation 25