Real-Time Analytics Visualized w/ Kafka + Streamliner + MemSQL + ZoomData, Anton Gorshkov

The document summarizes a presentation about using Kafka, Streamliner, MemSQL and ZoomData for real-time analytics visualization. It shows an initial setup with one producer and a queue feeding into Kafka, then adds a sink to an in-memory SQL database and a real-time visualization consumer. It then asks whether the system is resilient, how it handles bad data and schema evolution, whether it stays consistent with other visualization layers, and whether it can scale in throughput, concurrency and size.

Anton Gorshkov
Real-Time Analytics Visualized
Kafka + Streamliner + MemSQL + ZoomData
Please note that during the course of this presentation ZoomData products will be used and shown on the
screen. Goldman Sachs has an ownership interest in ZoomData, Inc. and may have other business relationships
with ZoomData, Inc. Nothing herein shall constitute an offer to sell or a solicitation of an offer to buy an interest in
any entity or product.
Learn more at GS.com/Engineering

Initial Set-Up
>docker run kafka
>docker run memsql
>docker run zoomdata
2 x 4-CPU / 16GB / 80GB SSD / Intel Xeon E5-2670 @ 2.5GHz

Start with one producer & a queue
[Diagram: Producer 1, Kafka]

Add-a-Sink
[Diagram: Producer 1, Kafka, Consumer, In-Memory SQL RDB]
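
A minimal sketch of the producer, queue, consumer and sink chain on this slide, under loud assumptions: a standard-library `queue.Queue` stands in for the Kafka topic and SQLite in `:memory:` mode stands in for the in-memory SQL RDB. Neither is the software used in the talk, and the `orders` table and the `produce` and `consume_into` helpers are hypothetical names. The events borrow the fruit and quantity shape from the audience-participation slide.

```python
import queue
import sqlite3

# Stand-in for a Kafka topic: a thread-safe FIFO queue (assumption, not Kafka).
topic = queue.Queue()


def produce(events):
    """Producer 1: append each (fruit, quantity) event to the queue."""
    for event in events:
        topic.put(event)


def consume_into(db):
    """Consumer: drain the queue and insert each event into the SQL sink."""
    written = 0
    while True:
        try:
            fruit, quantity = topic.get_nowait()
        except queue.Empty:
            return written
        db.execute(
            "INSERT INTO orders (fruit, quantity) VALUES (?, ?)",
            (fruit, quantity),
        )
        written += 1


# Stand-in for the in-memory SQL RDB: SQLite in ":memory:" mode (assumption).
db = sqlite3.connect(":memory:")
db.execute("CREATE TABLE orders (fruit TEXT, quantity INTEGER)")

produce([("mango", 540), ("apple", 12), ("mango", 60)])
rows_written = consume_into(db)

# The kind of aggregate a visualization layer might query from the sink.
totals = db.execute(
    "SELECT fruit, SUM(quantity) FROM orders GROUP BY fruit ORDER BY fruit"
).fetchall()
print(rows_written, totals)
```

Because the sink is plain SQL, anything that can issue a `GROUP BY` query can sit on top of it, which is the role the visualization layer takes on the next slide.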

Add a Real-Time Visualization
[Diagram: Producer 1, Kafka, Consumer, In-Memory SQL RDB, RT Vis]

Resilience
- Don’t lose data
- Be Up
- Deliver (at least once and in-order)
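
One common way to get "at least once and in-order" without duplicate rows is to write to the sink first, commit the read position afterwards, and make the write idempotent. The sketch below illustrates that idea only. The `log` list, the `events` table and the simulated crash are hypothetical, and SQLite stands in for the real sink. It is not taken from the talk.

```python
import sqlite3

# A hypothetical partition log: the list index plays the role of the offset.
log = ["mango 540", "apple 12", "kiwi 7", "mango 60"]

db = sqlite3.connect(":memory:")
# The offset is the primary key, so a redelivered message cannot be stored twice.
db.execute("CREATE TABLE events (log_offset INTEGER PRIMARY KEY, payload TEXT)")


def consume(start, crash_before_commit_at=None):
    """Read the log in order from `start` and return the committed position.

    Each message is written to the sink first and its offset is committed
    afterwards, so a crash between the two steps leads to a redelivery
    (at-least-once) rather than a lost message.
    """
    committed = start
    for offset in range(start, len(log)):
        db.execute(
            "INSERT OR IGNORE INTO events VALUES (?, ?)", (offset, log[offset])
        )
        if offset == crash_before_commit_at:
            return committed  # simulated crash: the write happened, the commit did not
        committed = offset + 1
    return committed


committed = consume(0, crash_before_commit_at=2)  # crash after writing offset 2
committed_after_restart = consume(committed)      # restart re-reads offset 2
stored = db.execute(
    "SELECT log_offset, payload FROM events ORDER BY log_offset"
).fetchall()
print(committed, committed_after_restart, stored)
```

After the restart, offset 2 is delivered a second time, but `INSERT OR IGNORE` on the offset key keeps the sink at one row per message and in log order.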

How “I” is the BI?
- Is it a “view” or a “do work” layer?
- Data-at-Rest vs Data-at-Motion
- Pull vs Push
- Consistency to other vis layers

Will It Scale?
Throughput, Concurrency, Size
(and at what cost…)
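
Kafka scales throughput by splitting a topic into partitions, and keyed records are routed by a hash of the key so that each key keeps its order within one partition. The sketch below shows that routing idea only. `zlib.crc32` is a stand-in hash chosen for determinism and is not the hash Kafka's clients use, and `partition_for` and `route` are hypothetical helper names.

```python
import zlib


def partition_for(key, num_partitions):
    """Map a key to a partition index with a deterministic stand-in hash."""
    return zlib.crc32(key.encode("utf-8")) % num_partitions


def route(events, num_partitions):
    """Distribute (key, value) events across partitions, keeping arrival order."""
    partitions = [[] for _ in range(num_partitions)]
    for key, value in events:
        partitions[partition_for(key, num_partitions)].append((key, value))
    return partitions


events = [("mango", 1), ("apple", 2), ("mango", 3), ("kiwi", 4), ("mango", 5)]
partitions = route(events, 3)

# All "mango" events land in one partition, still in their original order.
mango_values = [v for p in partitions for k, v in p if k == "mango"]
print(mango_values)
```

Adding partitions lets more consumers read in parallel, at the cost that ordering is only guaranteed per key and not across the whole topic.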

Will it Scale?
[Diagram: Producer 1, Producer 2, Producer n, Kafka, Consumer, In-Memory SQL RDB, RT Vis]

Audience Participation Time…
[Diagram: (620) 487-2222, Kafka, Consumer, In-Memory SQL RDB, RT Vis]

Audience Participation Time…
send a text [Fruit] [Quantity]
(620) 487-2222
example: mango 540
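
A small sketch of how incoming texts in the `[Fruit] [Quantity]` format might be parsed and tallied, with malformed messages set aside instead of crashing the consumer. The function names `parse_text` and `tally` and the rejection rules are assumptions for illustration. The slides do not show how the demo handled input.

```python
from collections import Counter


def parse_text(message):
    """Parse '[Fruit] [Quantity]' into (fruit, quantity); return None for bad data."""
    parts = message.strip().split()
    if len(parts) != 2:
        return None
    fruit, quantity = parts
    # Accept plain ASCII digits only, so int() below cannot fail.
    if not (quantity.isascii() and quantity.isdigit()):
        return None
    return fruit.lower(), int(quantity)


def tally(messages):
    """Sum quantities per fruit and collect messages that failed to parse."""
    totals = Counter()
    rejected = []
    for message in messages:
        parsed = parse_text(message)
        if parsed is None:
            rejected.append(message)
        else:
            fruit, quantity = parsed
            totals[fruit] += quantity
    return totals, rejected


totals, rejected = tally(["mango 540", "Mango 10", "apple", "kiwi lots", " pear 3 "])
print(dict(totals), rejected)
```

Keeping the rejected messages in a separate list, instead of dropping them silently, makes it possible to inspect bad input later.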

Adaptability
[Diagram: Producer 1, Producer 2, Producer n, Kafka, Consumer, In-Memory SQL RDB, RT Vis, Kafka Connect, Elastic, Kibana]

Representative Deployment
[Diagram: Source DBs, Order Mgmt, Batch ETL, Kafka, Direct Sink, Spark Streaming, IMRDBMS, Data Lake, Cache, VertX]

Real-Time Analytics Visualized w/ Kafka + Streamliner + MemSQL + ZoomData, Anton Gorshkov

1. Anton Gorshkov. Real-Time Analytics Visualized: Kafka + Streamliner + MemSQL + ZoomData. Please note that during the course of this presentation ZoomData products will be used and shown on the screen. Goldman Sachs has an ownership interest in ZoomData, Inc. and may have other business relationships with ZoomData, Inc. Nothing herein shall constitute an offer to sell or a solicitation of an offer to buy an interest in any entity or product. Learn more at GS.com/Engineering

2. Initial Set-Up: >docker run kafka >docker run memsql >docker run zoomdata. 2 x 4-CPU / 16GB / 80GB SSD / Intel Xeon E5-2670 @ 2.5GHz

3. Context

4. Start with one producer & a queue Producer 1 Kafka

5. Add-a-Sink Producer 1 Kafka In-Memory SQL RDB Consumer

6. Add a Real-Time Visualization Producer 1 Kafka In-Memory SQL RDB RT Vis Consumer

7. Enterprise Grade?

8. Resilience - Don’t lose data - Be Up - Deliver (at least once and in-order)

9. Bad Data

10. Schema Evolution

11. How “I” is the BI? - Is it a “view” or a “do work” layer? - Data-at-Rest vs Data-at-Motion - Pull vs Push - Consistency to other vis layers

12. Will It Scale? Throughput, Concurrency, Size (and at what cost…)

13. Will it Scale? Producer 1 Kafka In-Memory SQL RDB RT Vis Producer 2 Producer n Consumer

14. Audience Participation Time… Kafka In-Memory SQL RDB RT Vis (620) 487-2222 Consumer

15. Audience Participation Time… send a text [Fruit] [Quantity] (620) 487-2222 example: mango 540

16. Adaptability Producer 1 Kafka In-Memory SQL RDB RT Vis Producer 2 Producer n Elastic Kibana Consumer Kafka Connect

17. Representative Deployment Direct Sink Spark Streaming IMRDBMS Data Lake Kafka Source DBs Order Mgmt Batch ETL Cache VertX

18. Learn more at GS.com/Engineering

Editor's notes

Our online Engineering Hub (gs.com/engineering) is regularly updated with profiles of our latest projects, our engineers and our activities in the community. Take a moment to visit the hub yourself and find at least one article you can reference or talk to….