There is growing interest in running Apache Spark natively on Kubernetes (see https://github.com/apache-spark-on-k8s/spark). Spark applications often access data in HDFS, and Spark supports HDFS data locality by scheduling tasks on the nodes that hold each task’s input data on their local disks. When Spark runs on Kubernetes but the HDFS daemons run outside it, applications slow down because they must read the data remotely.
This session will demonstrate how to run HDFS inside Kubernetes to speed up Spark. In particular, it will show how the Spark scheduler can still provide HDFS data locality on Kubernetes by discovering the mapping from Kubernetes containers to physical nodes, and from nodes to HDFS datanode daemons. You’ll also learn how to provide high availability for the critical HDFS namenode service when running HDFS in Kubernetes.
4. “My app was running fine until someone installed their software”
5. More isolation is good
Kubernetes provides each program with:
• a lightweight virtual file system
– an independent set of software packages
• a virtual network interface
– a unique virtual IP address
– an entire range of ports
• etc
6. Kubernetes architecture
[Diagram: node A (196.0.0.5) runs Pod 1 (10.0.0.2) and Pod 2 (10.0.0.3); node B (196.0.0.6) runs Pod 3 (10.0.1.2)]
Pod, a unit of scheduling and isolation.
• runs a user program in a primary container
• holds isolation layers, like a virtual IP, in an infra container
7. Big Data on Kubernetes
github.com/apache-spark-on-k8s
• Google, Haiwen, Hyperpilot, Intel, Palantir, Pepperdata, Red Hat, and growing.
• patching up Spark Driver and Executor code to work on Kubernetes.
Yesterday’s talk: spark-summit.org/2017/events/apache-spark-on-kubernetes/
8. Spark on Kubernetes
[Diagram: two Spark jobs, each submitted by a client. Job 1: Driver Pod (10.0.0.2) and Executor Pod 1 (10.0.0.3) on node A (196.0.0.5), Executor Pod 2 (10.0.1.2) on node B (196.0.0.6). Job 2: Driver Pod (10.0.0.4), Executor Pod 1 (10.0.0.5), Executor Pod 2 (10.0.1.3)]
10. Hadoop Distributed File System
Namenode
• runs on a central cluster node.
• maintains file system metadata.
Datanodes
• run on every cluster node.
• read and write file data on local disks.
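To make the split concrete, here is a minimal Scala sketch against the standard Hadoop FileSystem API (not from the talk; the path /fileA is illustrative): metadata calls are answered by the namenode, while reads stream bytes from datanodes.

import org.apache.hadoop.conf.Configuration
import org.apache.hadoop.fs.{FileSystem, Path}

// Sketch: the namenode/datanode split as seen from a client.
// Assumes fs.defaultFS in the loaded configuration points at an HDFS cluster.
val fs = FileSystem.get(new Configuration())

// Metadata (file size, replication, block list) is served by the namenode.
val status = fs.getFileStatus(new Path("/fileA"))
println(s"size=${status.getLen} replication=${status.getReplication}")

// The bytes themselves are streamed from the datanodes storing the blocks.
val in = fs.open(new Path("/fileA"))
val firstByte = in.read()
in.close()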
11. HDFS on Kubernetes
[Diagram: the same two Spark jobs, now with HDFS running in pods on the same cluster nodes: a Namenode Pod, plus Datanode Pod 1 on node A (196.0.0.5) and Datanode Pod 2 on node B (196.0.0.6)]
HDFS clients locate the namenode via fs.defaultFS, set to the namenode pod’s stable DNS name:
hdfs://hdfs-namenode-0.hdfs-namenode.default.svc.cluster.local:8020
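To illustrate (a minimal sketch, not part of the deck): a Spark application can forward this setting to its Hadoop client through Spark’s spark.hadoop.* configuration prefix. The application name and the /fileA path are assumptions.

import org.apache.spark.sql.SparkSession

// Sketch: point Spark's Hadoop client at the in-cluster namenode.
// "spark.hadoop.fs.defaultFS" is forwarded to Hadoop as fs.defaultFS.
val spark = SparkSession.builder()
  .appName("hdfs-on-k8s-demo")  // hypothetical app name
  .config("spark.hadoop.fs.defaultFS",
    "hdfs://hdfs-namenode-0.hdfs-namenode.default.svc.cluster.local:8020")
  .getOrCreate()

// Paths now resolve against the in-cluster HDFS.
println(spark.read.textFile("/fileA").count())  // /fileA is illustrative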
12. Demo
1. Label cluster nodes
2. Stand up HDFS
3. Launch a Spark job
4. Check Spark job output
13. What about data locality?
• Read data from local disks when possible
• Remote reads can be slow if the network is slow
• Implemented in HDFS daemons, integrated with app frameworks like Spark
[Diagram: Client 1 on node A reads from the colocated Datanode 1 (FAST); Client 2 on node B reads from Datanode 1 across the network (SLOW)]
14. HDFS data locality in YARN
The Spark driver sends tasks to the right executors: the ones colocated with the tasks’ HDFS data.
[Diagram, YARN example: Job 1’s driver runs Executor 1 on node A (196.0.0.5) and Executor 2 on node B (196.0.0.6); HDFS Datanode 1 on node A stores /fileA and Datanode 2 on node B stores /fileB. The driver sends the Read /fileA task to Executor 1 and the Read /fileB task to Executor 2]
(/fileA → Datanode 1 → 196.0.0.5) == (Executor 1 → 196.0.0.5)
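Sketched in Scala against the Hadoop FileSystem API (an illustration of the matching step, not the actual Spark scheduler code; the file path and host values come from the slide):

import org.apache.hadoop.conf.Configuration
import org.apache.hadoop.fs.{FileSystem, Path}

// Which hosts store /fileA's blocks, according to the namenode?
val fs = FileSystem.get(new Configuration())
val status = fs.getFileStatus(new Path("/fileA"))
val blockHosts = fs.getFileBlockLocations(status, 0, status.getLen)
  .flatMap(_.getHosts).toSet  // e.g. Set("196.0.0.5")

// Executor addresses as the driver sees them (values from the slide).
val executorHosts = Map("Executor 1" -> "196.0.0.5", "Executor 2" -> "196.0.0.6")

// Node-local executors are those whose host matches a block host.
val localExecutors = executorHosts.collect {
  case (exec, host) if blockHosts.contains(host) => exec
}
println(localExecutors.mkString(", "))  // Executor 1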
15. Hmm, how do I find the right executors in Kubernetes...
(/fileA → Datanode 1 → 196.0.0.5) != (Executor 1 → 10.0.0.3)
[Diagram, Kubernetes example: Driver Pod (10.0.0.2) and Executor Pod 1 (10.0.0.3) on node A (196.0.0.5), Executor Pod 2 (10.0.1.2) on node B (196.0.0.6); Datanode Pod 1 on node A stores /fileA, Datanode Pod 2 on node B stores /fileB. The executor pod IPs no longer match the datanode node IPs, so locality matching fails]
16. Fix Spark Driver code
1. Ask the Kubernetes master to find the cluster node where the executor pod is running.
2. Get the node IP.
3. Compare it with the datanode IPs.
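A minimal sketch of that lookup using the fabric8 Kubernetes client (the real change lives in the apache-spark-on-k8s fork’s driver; the namespace, pod name, and datanode IPs below are illustrative assumptions):

import io.fabric8.kubernetes.client.DefaultKubernetesClient

// Sketch: resolve an executor pod to the IP of the cluster node running it.
val client = new DefaultKubernetesClient()  // in-cluster or kubeconfig credentials

val pod = client.pods()
  .inNamespace("default")          // assumption
  .withName("executor-pod-1")      // assumption
  .get()

// status.hostIP is the IP of the node hosting the pod (196.0.0.5 on the slide).
val nodeIp = pod.getStatus.getHostIP

// Locality check: does the node IP match a datanode IP for the block?
val datanodeIps = Set("196.0.0.5", "196.0.0.6")  // illustrative
println(s"executor node=$nodeIp nodeLocal=${datanodeIps.contains(nodeIp)}")

client.close()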
17. Rescued data locality
(/fileA → Datanode 1 → 196.0.0.5) == (Executor 1 → 10.0.0.3 → 196.0.0.5)
[Diagram: with the fix, the driver maps Executor Pod 1 (10.0.0.3) to node A (196.0.0.5) and Executor Pod 2 (10.0.1.2) to node B (196.0.0.6), and again sends the Read /fileA task to Executor 1, next to Datanode Pod 1, and the Read /fileB task to Executor 2, next to Datanode Pod 2]
18. Rescued data locality!
with data locality fix: duration 10 minutes
without data locality fix: duration 25 minutes
20. Recap
Got HDFS up and running.
Basic data locality works.
Open problems:
• Remaining data locality issues: rack locality, node preference, etc.
• Kerberos support
• Namenode High Availability