SlideShare ist ein Scribd-Unternehmen logo
1 von 54
Page1 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
What’s New in Ambari?
June 2015
Yusaku Sako @ Hortonworks (Ambari PMC Chair)
Sumit Mohanty @ Hortonworks (Ambari PMC)
Page2 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
What’s Apache Ambari?
100% open-source
platform for simplifying
Hadoop cluster
management and use.
Highly extensible.
Page3 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Open Source Activity
Page4 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Inception: AMBARI-1 (Sept, 2011)
Page5 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Fast forward 4 years to today… (June, 2015)
• Latest JIRA: AMBARI-11854
• 100+ Contributors
• 50 Committers
• ~12k JIRAs filed
• ~11k JIRAs resolved
At 1.5 day per JIRA -> 66 person years! (probably more)
• Used by hundreds of companies
Page6 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Ambari – 4th Biggest Project* @ Apache
* Based on total JIRAs filed on a project basis out of 162 projects as of June 10, 2015
#2: Hadoop at ~28k as it is split across multiple JIRA Projects
#1
#3
#4
#5
Page7 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Timeline: Past 1 Year
Ambari 1.6.*
May 2014
907 JIRAs
Ambari 1.5.*
Apr 2014
1218 JIRAs
Ambari 1.7.*
Dec 2014
1620 JIRAs
Ambari 2.0.*
April 2015
1784 JIRAs
Current GA Version (2.0.1)
Ambari 2.1
Coming Soon
1520+ JIRAs
Focus of today’s talk
Resolution of 7k+ JIRAs
Page8 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
What’s new?
• Rolling Upgrade
• Alerts
• Metrics
• Enhanced Dashboard
• Smart Configurations
• Views
• Kerberos Automation
• Blueprints
Page9 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Rolling Upgrade
Page10 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Rolling Upgrade of Stack
Side-by-Side Bits and Configs
Bits:
/usr/hdp/2.2.0.0-2041
/usr/hdp/2.2.4.2-2
/usr/hdp/2.3.0.0-3000
Configs:
/etc/hive/conf/ (initial)
/etc/hive/conf/v0 (HDP 2.2.4.2)
/etc/hive/conf/v1 (HDP 2.3)
2.2.0.0 2.2.4.2 2.3.0.0minor jump major jump
Page11 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Rolling Upgrade – Manage Versions
Install bits in parallel on all agents
No down-time
Page12 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Rolling Upgrade – Orchestration
Not necessarily “one-click” but fully guided
Services are up the entire time
Upgrade one component at a time
Robust and fault-tolerant
Service-checks performed throughout
2.3.0.0-2283 2.3.0.0-2283
Page13 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Rolling Upgrade – Upgrade Catalog
Grouping and order
Page14 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Rolling Upgrade – Upgrade Catalog
Run custom scripts (python and Server-side)
Page15 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Rolling Upgrade – Upgrade Catalog
Mark steps are skippable, retryable
All service checks are skippable, all steps retryable
Page16 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Rolling Upgrade – Upgrade Catalog
Set, move, delete, transform configurations
Page17 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Rolling Upgrade – Downgrade
Page18 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Alerts
Page19 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Alert – Types
Type Description Status
Thresholds
Configurable?
PORT
Watches a port based on a configuration property such as
the URI.
OK, WARN, CRIT Yes (seconds)
WEB
Watches an HTTP or HTTPS endpoint and determines
connectivity and HTTP status code.
OK, WARN, CRIT No
AGGREGATE Aggregate of status for another alert definition. OK, WARN, CRIT Yes (percentage)
METRIC
Watches a metric or series of metrics in JMX and compares
a mathematical result against a threshold.
OK, WARN, CRIT Yes (variable)
SCRIPT Uses a custom script to handle checking. OK or CRIT No
Page20 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
UI – Current Alerts
Configured by default; managed via the the web client
Page21 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
UI – Host Alerts
Automatically refreshes
Query alert history
Page22 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
UI– Customization & Instances
Status text, thresholds, and interval
Page23 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Metrics
Page24 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Ambari Metrics Service (AMS) - Goals
Ability to collect metrics from Hadoop and other Stack services
Ability to retain metrics at a high precision for a configurable time period
Ability to automatically purge metrics after retention period
At collection time, provide clear integration point for external system
At purge time, provide clear integration point for metrics retention by
external system
Should provide default options for external metrics retention
Provide tools / utilities for analyzing metrics in retention system
Page25 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Aggregators
Metrics Collector
HTTP REST endpoint
Metrics API
Query Layer
HBASE
Phoenix server
Phoenix client
Namenode
Datanode
Nodemanager
Regionserver
Nimbus
Flume Agent
Kafka worker
Metrics Sinks Metrics Monitor
AMBARI
DashboardsViews REST API
Ambari Metrics System - Architecture
Page26 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Sample Stats
Total number of raw uncompressed Hadoop metrics written per day on a 300 node cluster =
100 GB
Rows in Phoenix table ~ 100 million
Raw query time: 500 rows selected (1.923 seconds)
Aggregate query time: 204 rows selected (0.19 seconds)
SELECT METRIC_NAME, APP_ID, INSTANCE_ID, TIMESTAMP, METRIC_SUM, HOSTS_COUNT, METRIC_MAX,
METRIC_MIN FROM METRIC_AGGREGATE WHERE METRIC_NAME IN ('dfs.datanode.BytesWritten',
'dfs.datanode.BytesRead') AND APP_ID = 'datanode' AND TIMESTAMP >= 1409770831000 and TIMESTAMP <
1409774431000;
SELECT METRIC_NAME, HOSTNAME, APP_ID, INSTANCE_ID, START_TIME, METRICS FROM METRIC_RECORD
WHERE METRIC_NAME IN ('dfs.datanode.BytesWritten','dfs.datanode.BytesRead') AND APP_ID = 'datanode'
AND START_TIME >= 1409770831000 AND START_TIME < 1409774431000 ORDER BY METRIC_NAME,
START_TIME LIMIT 500;
Page27 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Key takeaways
Using Phoenix query hints to avoid full table scans
PHOENIX-914 – Use Native Hbase timestamp to skip HFiles
Client side buffering and aggregation built into Sinks and Monitor
Cluster and Host level aggregations across various time dimensions
Table schema optimized for reads and Hbase tuned to support heavy write
loads
Page28 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Enhanced Dashboard
Page29 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Customizable Service Dashboards
Service dashboards are now customizable in Ambari 2.1
• Create new widgets
• Graphs, Dial Gauge, Number, Template
• Customize layout
• Share widgets
Future:
• Make Layouts shareable
Page30 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Recorded Demo
Page31 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Easy to expose widgets for new services
Out-of-the-box widgets are defined in the stack as JSON files
No frontend code changes necessary
Page32 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Smart Configurations
Page33 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Hadoop Configuration Challenges
• Too many configurations
• which ones are important?
• Too easy to mess up
• What are valid/reasonable values?
• What are the units?
• Ok, what about dependencies?
• Gets harder with combinations of services, host assignments, enabled
features, CPU/RAM/disks, etc
• Any recommendations? What am I doing wrong?
• Smart Configs to the rescue! (Ambari 2.1)
Page34 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Smart Configs Demo
Page35 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Ambari Smart Configs UI
Customizable layout
- Tabs
- Sections
- Sub-sections
- Simple grid layout
(Advanced Tab contains
remaining configurations)
New Widgets
- Sliders
- Recommended
- Minimum
- Maximum
- Increment Step
- Combos
- Enumerated values
- Toggles
- Binary options
- Spinners
- Splits value into multiple
controls. Time in
milliseconds split into days,
hours, minutes.
- Lists
- Enumerated values
- Single select
- Multi select
Implemented
- HDFS
- YARN
- MapReduce
- Hive
- HBase
Page36 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Stack Driven Layouts
{
"name": "default",
"description": "Default theme for HBASE service",
"configuration": {
"layouts": [
{
"name": "default",
"tabs": [
{
"name": "settings",
"display-name": "Settings",
"layout": {
"tab-columns": "3",
"tab-rows": "3",
"sections": [
...
]
}
}
]
}
],
"placement": {
"configuration-layout": "default",
"configs": [...]
},
"widgets": [
{
"config": "hbase-env/hbase_master_heapsize",
"widget": {
"type": "slider",
"units": [
{
"unit-name": "GB"
}
]
}
},
...
]
}
}
• Stacks has theme.json file
• Layout
– Tabs
– Sections
– Sub-sections
• Placement
– Configs placement in sub-sections
• Widgets
– Widget type
– Optional Units
- Bytes (B, KB, MB, GB, TB, PB)
- Time (Millis, Seconds, Minutes, Hours, Days, Months, Years)
Page37 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Config Metadata and Dependencies
{
"StackConfigurations": {
"final": "false",
"property_depends_on": [
{
"type": "yarn-site",
"name": "yarn.nodemanager.resource.memory-mb"
}
],
"property_description": “The minimum allocation for every",
"property_display_name": "Minimum Container Size (Memory)",
"property_name": "yarn.scheduler.minimum-allocation-mb",
"property_type": [],
"property_value": "512",
"property_value_attributes": {
"type": "int",
"maximum": "5120",
"minimum": "0",
"unit": "MB",
"increment_step": "256"
},
"type": "yarn-site.xml"
},
"dependencies": [
{
"StackConfigurationDependency": {
"dependency_name": "hive.tez.container.size",
"property_name": "yarn.scheduler.minimum-allocation-mb”
}
},
{
"StackConfigurationDependency": {
"dependency_name": "mapreduce.map.memory.mb",
"property_name": "yarn.scheduler.minimum-allocation-mb”
}
},
{
"StackConfigurationDependency": {
"dependency_name": "mapreduce.reduce.memory.mb",
"property_name": "yarn.scheduler.minimum-allocation-mb”
}
}…
]
}
• Extended Metadata
• Defined in property_value_attributes
• Hold non-UI metadata about value range, increment,
unit, etc
• Dependencies
• Models bi-directional relationship between configs
• Depends On (property_depends_on)
– Answers ‘which configs do I depend on?’
• Depended By (dependencies)
– Answers ‘which configs are dependent on me?’
• Ambari automatically updates dependencies
Page38 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Views
Page39 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
What are Views?
View Framework
• Provide various applications accessible from Ambari Web UI – interact
with the cluster via a browser from a single place for all users (cluster
operators, data analysis, developers, etc)
Easy to develop
• No need to understand Ambari core code – view development is just
like creating any other web application
Easy to deploy
• Packaged as a single jar file
• Auto create / auto configure
Page40 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
CS Queue Manager for Cluster Operators
Capacity Scheduler
Queue Manager
Page41 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
HDFS File Browser for General Users
HDFS File Browser
Page42 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Job Analysis for Developers
Troubleshoot Tez JobsTroubleshoot / Improve Hive queries
Page43 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Query Editors for Data Analysts
Create, edit, execute, and analyze Hive queries Create, edit, and execute Pig scripts
Page44 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Ambari Server in Views-Only mode
• Use Views on existing clusters not under Ambari’s management
• Can use Views against multiple clusters
Ambari
Server
Cluster managed by Ambari
Ambari
Server “Views-only” mode
(aka “Stand-alone” mode)
Cluster not managed by Ambari
Management
Use Views
Use Views
Use Views
Page45 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Kerberos Automation
Page46 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Kerberos Automation
New in Ambari 2.0:
• Have Ambari manage Kerberos principals and keytabs
• Once Kerberized, seamlessly handle:
• Adding new hosts
• Adding new components to existing hosts
• Adding new services
• Moving components to different hosts
• Works with existing MIT KDC or Active Directory
Page47 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Blueprints
Page48 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Automated Cluster Deployment
Simple
• Making two REST calls is all it takes to provision a cluster
Who uses it?
• Cloudbreak
• Microsoft Azure Marketplace (portal.azure.com)
• Hortonworks QA
• and many many others
Page49 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Cluster Replication
Export blueprint of source cluster
Import blueprint to replicate clusters
Page50 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Example: Create a 100-node Cluster
{
"configurations" : [
{
”hdfs-site" : {
"dfs.datanode.data.dir" : ”/hadoop/1,/hadoop/2,/hadoop/3"
}
}
],
"host_groups" : [
{
"name" : ”master-host",
"components" : [
{ "name" : "NAMENODE” },
{ "name" : "RESOURCEMANAGER” },
…
],
"cardinality" : "1"
},
{
"name" : ”worker-host",
"components" : [
{ "name" : ”DATANODE” },
{ "name" : ”NODEMANAGER” },
…
],
"cardinality" : "1+"
},
],
"Blueprints" : {
"blueprint_name" : ”multi-node-hdfs-yarn",
"stack_name" : "HDP",
"stack_version" : "2.0"
}
}
{
"blueprint" : ”multi-node-hdfs-yarn",
"host_groups" :[
{
"name" : ”master-host",
"hosts" : [
{
"fqdn" : ”master001.ambari.apache.org”
}
]
},
{
"name" : ”worker-host",
"hosts" : [
{
"fqdn" : ”worker001.ambari.apache.org”
},
{
"fqdn" : ”worker002.ambari.apache.org”
},
…
{
"fqdn" : ”worker099.ambari.apache.org”
}
]
}
]
}
1. POST /blueprints/my-blueprint 2. POST /clusters/MyCluster
Page51 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
What’s New in Blueprint
New in Ambari 2.0:
• Supports various HA configurations:
• NameNode, ResourceManager, RegionServer, Oozie Server, Hive Metastore, HiveServer2, WebHCat
Server
• Adding hosts “blueprint style” (AMBARI-8458)
New in Ambari 2.1:
• Advanced Cluster Creation – more flexible, scalable, and robust
(AMBARI-10750)
Page52 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Blueprint Caveats
As of Ambari 2.1:
• Stack advisor (smart/dynamic config recommendation/validation) is not
used (yet)
• Can’t create Kerberized Cluster (yet)
Page53 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Thank You!
Try Ambari
• Follow the Ambari Quick Start Guide (search online for “Ambari Quick Start Guide”)
Learn more
• Visit the project website (search online for “Ambari”)
Get Involved
• User Mailing List: user-subscribe@ambari.apache.org
• Developer Mailing List: dev-subscribe@ambari.apache.org
• Use JIRA to file bugs and improvement requests (search online for “Ambari JIRA”)
Page54 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Q & A
Yusaku Sako yusaku@hortonworks.com (Ambari PMC)
Sumit Mohanty smohanty@hortonworks.com (Ambari PMC)

Weitere ähnliche Inhalte

Was ist angesagt?

Format Wars: from VHS and Beta to Avro and Parquet
Format Wars: from VHS and Beta to Avro and ParquetFormat Wars: from VHS and Beta to Avro and Parquet
Format Wars: from VHS and Beta to Avro and ParquetDataWorks Summit
 
YARN Ready: Apache Spark
YARN Ready: Apache Spark YARN Ready: Apache Spark
YARN Ready: Apache Spark Hortonworks
 
Internet of things Crash Course Workshop
Internet of things Crash Course WorkshopInternet of things Crash Course Workshop
Internet of things Crash Course WorkshopDataWorks Summit
 
Combine SAS High-Performance Capabilities with Hadoop YARN
Combine SAS High-Performance Capabilities with Hadoop YARNCombine SAS High-Performance Capabilities with Hadoop YARN
Combine SAS High-Performance Capabilities with Hadoop YARNHortonworks
 
Hadoop & Cloud Storage: Object Store Integration in Production
Hadoop & Cloud Storage: Object Store Integration in ProductionHadoop & Cloud Storage: Object Store Integration in Production
Hadoop & Cloud Storage: Object Store Integration in ProductionDataWorks Summit/Hadoop Summit
 
Hortonworks tech workshop in-memory processing with spark
Hortonworks tech workshop   in-memory processing with sparkHortonworks tech workshop   in-memory processing with spark
Hortonworks tech workshop in-memory processing with sparkHortonworks
 
Hadoop first ETL on Apache Falcon
Hadoop first ETL on Apache FalconHadoop first ETL on Apache Falcon
Hadoop first ETL on Apache FalconDataWorks Summit
 
Hortonworks Technical Workshop - Operational Best Practices Workshop
Hortonworks Technical Workshop - Operational Best Practices WorkshopHortonworks Technical Workshop - Operational Best Practices Workshop
Hortonworks Technical Workshop - Operational Best Practices WorkshopHortonworks
 
Coexistence and Migration of Vendor HPC based infrastructure to Hadoop Ecosys...
Coexistence and Migration of Vendor HPC based infrastructure to Hadoop Ecosys...Coexistence and Migration of Vendor HPC based infrastructure to Hadoop Ecosys...
Coexistence and Migration of Vendor HPC based infrastructure to Hadoop Ecosys...DataWorks Summit
 
Hortonworks Technical Workshop: Real Time Monitoring with Apache Hadoop
Hortonworks Technical Workshop: Real Time Monitoring with Apache HadoopHortonworks Technical Workshop: Real Time Monitoring with Apache Hadoop
Hortonworks Technical Workshop: Real Time Monitoring with Apache HadoopHortonworks
 
Visualizing Big Data in Realtime
Visualizing Big Data in RealtimeVisualizing Big Data in Realtime
Visualizing Big Data in RealtimeDataWorks Summit
 
Introduction to the Hortonworks YARN Ready Program
Introduction to the Hortonworks YARN Ready ProgramIntroduction to the Hortonworks YARN Ready Program
Introduction to the Hortonworks YARN Ready ProgramHortonworks
 
What's new in apache hive
What's new in apache hive What's new in apache hive
What's new in apache hive DataWorks Summit
 
Realizing the Promise of Portable Data Processing with Apache Beam
Realizing the Promise of Portable Data Processing with Apache BeamRealizing the Promise of Portable Data Processing with Apache Beam
Realizing the Promise of Portable Data Processing with Apache BeamDataWorks Summit
 
Apache Falcon : 22 Sept 2014 for Hadoop User Group France (@Criteo)
Apache Falcon : 22 Sept 2014 for Hadoop User Group France (@Criteo)Apache Falcon : 22 Sept 2014 for Hadoop User Group France (@Criteo)
Apache Falcon : 22 Sept 2014 for Hadoop User Group France (@Criteo)Cedric CARBONE
 
Hadoop in the Cloud - The what, why and how from the experts
Hadoop in the Cloud - The what, why and how from the expertsHadoop in the Cloud - The what, why and how from the experts
Hadoop in the Cloud - The what, why and how from the expertsDataWorks Summit/Hadoop Summit
 
Data Regions: Modernizing your company's data ecosystem
Data Regions: Modernizing your company's data ecosystemData Regions: Modernizing your company's data ecosystem
Data Regions: Modernizing your company's data ecosystemDataWorks Summit/Hadoop Summit
 

Was ist angesagt? (20)

Format Wars: from VHS and Beta to Avro and Parquet
Format Wars: from VHS and Beta to Avro and ParquetFormat Wars: from VHS and Beta to Avro and Parquet
Format Wars: from VHS and Beta to Avro and Parquet
 
YARN Ready: Apache Spark
YARN Ready: Apache Spark YARN Ready: Apache Spark
YARN Ready: Apache Spark
 
Internet of things Crash Course Workshop
Internet of things Crash Course WorkshopInternet of things Crash Course Workshop
Internet of things Crash Course Workshop
 
Combine SAS High-Performance Capabilities with Hadoop YARN
Combine SAS High-Performance Capabilities with Hadoop YARNCombine SAS High-Performance Capabilities with Hadoop YARN
Combine SAS High-Performance Capabilities with Hadoop YARN
 
Hadoop & Cloud Storage: Object Store Integration in Production
Hadoop & Cloud Storage: Object Store Integration in ProductionHadoop & Cloud Storage: Object Store Integration in Production
Hadoop & Cloud Storage: Object Store Integration in Production
 
Apache Hive 2.0: SQL, Speed, Scale
Apache Hive 2.0: SQL, Speed, ScaleApache Hive 2.0: SQL, Speed, Scale
Apache Hive 2.0: SQL, Speed, Scale
 
Hortonworks tech workshop in-memory processing with spark
Hortonworks tech workshop   in-memory processing with sparkHortonworks tech workshop   in-memory processing with spark
Hortonworks tech workshop in-memory processing with spark
 
Hadoop first ETL on Apache Falcon
Hadoop first ETL on Apache FalconHadoop first ETL on Apache Falcon
Hadoop first ETL on Apache Falcon
 
Hortonworks Technical Workshop - Operational Best Practices Workshop
Hortonworks Technical Workshop - Operational Best Practices WorkshopHortonworks Technical Workshop - Operational Best Practices Workshop
Hortonworks Technical Workshop - Operational Best Practices Workshop
 
Coexistence and Migration of Vendor HPC based infrastructure to Hadoop Ecosys...
Coexistence and Migration of Vendor HPC based infrastructure to Hadoop Ecosys...Coexistence and Migration of Vendor HPC based infrastructure to Hadoop Ecosys...
Coexistence and Migration of Vendor HPC based infrastructure to Hadoop Ecosys...
 
Hortonworks Technical Workshop: Real Time Monitoring with Apache Hadoop
Hortonworks Technical Workshop: Real Time Monitoring with Apache HadoopHortonworks Technical Workshop: Real Time Monitoring with Apache Hadoop
Hortonworks Technical Workshop: Real Time Monitoring with Apache Hadoop
 
Visualizing Big Data in Realtime
Visualizing Big Data in RealtimeVisualizing Big Data in Realtime
Visualizing Big Data in Realtime
 
Introduction to the Hortonworks YARN Ready Program
Introduction to the Hortonworks YARN Ready ProgramIntroduction to the Hortonworks YARN Ready Program
Introduction to the Hortonworks YARN Ready Program
 
What's new in apache hive
What's new in apache hive What's new in apache hive
What's new in apache hive
 
Realizing the Promise of Portable Data Processing with Apache Beam
Realizing the Promise of Portable Data Processing with Apache BeamRealizing the Promise of Portable Data Processing with Apache Beam
Realizing the Promise of Portable Data Processing with Apache Beam
 
Apache Falcon : 22 Sept 2014 for Hadoop User Group France (@Criteo)
Apache Falcon : 22 Sept 2014 for Hadoop User Group France (@Criteo)Apache Falcon : 22 Sept 2014 for Hadoop User Group France (@Criteo)
Apache Falcon : 22 Sept 2014 for Hadoop User Group France (@Criteo)
 
Spark Uber Development Kit
Spark Uber Development KitSpark Uber Development Kit
Spark Uber Development Kit
 
Hadoop in the Cloud - The what, why and how from the experts
Hadoop in the Cloud - The what, why and how from the expertsHadoop in the Cloud - The what, why and how from the experts
Hadoop in the Cloud - The what, why and how from the experts
 
Data Regions: Modernizing your company's data ecosystem
Data Regions: Modernizing your company's data ecosystemData Regions: Modernizing your company's data ecosystem
Data Regions: Modernizing your company's data ecosystem
 
LLAP: Sub-Second Analytical Queries in Hive
LLAP: Sub-Second Analytical Queries in HiveLLAP: Sub-Second Analytical Queries in Hive
LLAP: Sub-Second Analytical Queries in Hive
 

Andere mochten auch

Internet of Things Crash Course Workshop at Hadoop Summit
Internet of Things Crash Course Workshop at Hadoop SummitInternet of Things Crash Course Workshop at Hadoop Summit
Internet of Things Crash Course Workshop at Hadoop SummitDataWorks Summit
 
Spark crash course workshop at Hadoop Summit
Spark crash course workshop at Hadoop SummitSpark crash course workshop at Hadoop Summit
Spark crash course workshop at Hadoop SummitDataWorks Summit
 
Evolution of Big Data at Intel - Crawl, Walk and Run Approach
Evolution of Big Data at Intel - Crawl, Walk and Run ApproachEvolution of Big Data at Intel - Crawl, Walk and Run Approach
Evolution of Big Data at Intel - Crawl, Walk and Run ApproachDataWorks Summit
 
Millions of Regions in HBase: Size Matters
Millions of Regions in HBase: Size MattersMillions of Regions in HBase: Size Matters
Millions of Regions in HBase: Size MattersDataWorks Summit
 
Authoring and Hosting Applications on YARN using Slider
Authoring and Hosting Applications on YARN using SliderAuthoring and Hosting Applications on YARN using Slider
Authoring and Hosting Applications on YARN using SliderDataWorks Summit
 
NextGen Apache Hadoop MapReduce
NextGen Apache Hadoop MapReduceNextGen Apache Hadoop MapReduce
NextGen Apache Hadoop MapReduceHortonworks
 
Hadoop Operations - Best Practices from the Field
Hadoop Operations - Best Practices from the FieldHadoop Operations - Best Practices from the Field
Hadoop Operations - Best Practices from the FieldDataWorks Summit
 
Data Science: Driving Smarter Finance and Workforce Decsions for the Enterprise
Data Science: Driving Smarter Finance and Workforce Decsions for the EnterpriseData Science: Driving Smarter Finance and Workforce Decsions for the Enterprise
Data Science: Driving Smarter Finance and Workforce Decsions for the EnterpriseDataWorks Summit
 
Overview of Apache Flink: the 4G of Big Data Analytics Frameworks
Overview of Apache Flink: the 4G of Big Data Analytics FrameworksOverview of Apache Flink: the 4G of Big Data Analytics Frameworks
Overview of Apache Flink: the 4G of Big Data Analytics FrameworksDataWorks Summit/Hadoop Summit
 
Real-Time Clinical Analytics
Real-Time Clinical AnalyticsReal-Time Clinical Analytics
Real-Time Clinical AnalyticsDataWorks Summit
 
Apache Tez - A unifying Framework for Hadoop Data Processing
Apache Tez - A unifying Framework for Hadoop Data ProcessingApache Tez - A unifying Framework for Hadoop Data Processing
Apache Tez - A unifying Framework for Hadoop Data ProcessingDataWorks Summit
 
Scaling HDFS to Manage Billions of Files with Key-Value Stores
Scaling HDFS to Manage Billions of Files with Key-Value StoresScaling HDFS to Manage Billions of Files with Key-Value Stores
Scaling HDFS to Manage Billions of Files with Key-Value StoresDataWorks Summit
 
Big Data Platform Processes Daily Healthcare Data for Clinic Use at Mayo Clinic
Big Data Platform Processes Daily Healthcare Data for Clinic Use at Mayo ClinicBig Data Platform Processes Daily Healthcare Data for Clinic Use at Mayo Clinic
Big Data Platform Processes Daily Healthcare Data for Clinic Use at Mayo ClinicDataWorks Summit
 
June 10 145pm hortonworks_tan & welch_v2
June 10 145pm hortonworks_tan & welch_v2June 10 145pm hortonworks_tan & welch_v2
June 10 145pm hortonworks_tan & welch_v2DataWorks Summit
 
Securing Hadoop with Apache Ranger
Securing Hadoop with Apache RangerSecuring Hadoop with Apache Ranger
Securing Hadoop with Apache RangerDataWorks Summit
 

Andere mochten auch (20)

Internet of Things Crash Course Workshop at Hadoop Summit
Internet of Things Crash Course Workshop at Hadoop SummitInternet of Things Crash Course Workshop at Hadoop Summit
Internet of Things Crash Course Workshop at Hadoop Summit
 
Spark crash course workshop at Hadoop Summit
Spark crash course workshop at Hadoop SummitSpark crash course workshop at Hadoop Summit
Spark crash course workshop at Hadoop Summit
 
Evolution of Big Data at Intel - Crawl, Walk and Run Approach
Evolution of Big Data at Intel - Crawl, Walk and Run ApproachEvolution of Big Data at Intel - Crawl, Walk and Run Approach
Evolution of Big Data at Intel - Crawl, Walk and Run Approach
 
Millions of Regions in HBase: Size Matters
Millions of Regions in HBase: Size MattersMillions of Regions in HBase: Size Matters
Millions of Regions in HBase: Size Matters
 
Authoring and Hosting Applications on YARN using Slider
Authoring and Hosting Applications on YARN using SliderAuthoring and Hosting Applications on YARN using Slider
Authoring and Hosting Applications on YARN using Slider
 
NextGen Apache Hadoop MapReduce
NextGen Apache Hadoop MapReduceNextGen Apache Hadoop MapReduce
NextGen Apache Hadoop MapReduce
 
LinkedIn
LinkedInLinkedIn
LinkedIn
 
Hadoop Operations - Best Practices from the Field
Hadoop Operations - Best Practices from the FieldHadoop Operations - Best Practices from the Field
Hadoop Operations - Best Practices from the Field
 
Rocking the World of Big Data at Centrica
Rocking the World of Big Data at CentricaRocking the World of Big Data at Centrica
Rocking the World of Big Data at Centrica
 
Data Science: Driving Smarter Finance and Workforce Decsions for the Enterprise
Data Science: Driving Smarter Finance and Workforce Decsions for the EnterpriseData Science: Driving Smarter Finance and Workforce Decsions for the Enterprise
Data Science: Driving Smarter Finance and Workforce Decsions for the Enterprise
 
Overview of Apache Flink: the 4G of Big Data Analytics Frameworks
Overview of Apache Flink: the 4G of Big Data Analytics FrameworksOverview of Apache Flink: the 4G of Big Data Analytics Frameworks
Overview of Apache Flink: the 4G of Big Data Analytics Frameworks
 
Real-Time Clinical Analytics
Real-Time Clinical AnalyticsReal-Time Clinical Analytics
Real-Time Clinical Analytics
 
Ingest and Stream Processing - What will you choose?
Ingest and Stream Processing - What will you choose?Ingest and Stream Processing - What will you choose?
Ingest and Stream Processing - What will you choose?
 
Apache Tez - A unifying Framework for Hadoop Data Processing
Apache Tez - A unifying Framework for Hadoop Data ProcessingApache Tez - A unifying Framework for Hadoop Data Processing
Apache Tez - A unifying Framework for Hadoop Data Processing
 
Scaling HDFS to Manage Billions of Files with Key-Value Stores
Scaling HDFS to Manage Billions of Files with Key-Value StoresScaling HDFS to Manage Billions of Files with Key-Value Stores
Scaling HDFS to Manage Billions of Files with Key-Value Stores
 
Big Data Platform Processes Daily Healthcare Data for Clinic Use at Mayo Clinic
Big Data Platform Processes Daily Healthcare Data for Clinic Use at Mayo ClinicBig Data Platform Processes Daily Healthcare Data for Clinic Use at Mayo Clinic
Big Data Platform Processes Daily Healthcare Data for Clinic Use at Mayo Clinic
 
June 10 145pm hortonworks_tan & welch_v2
June 10 145pm hortonworks_tan & welch_v2June 10 145pm hortonworks_tan & welch_v2
June 10 145pm hortonworks_tan & welch_v2
 
Running Spark in Production
Running Spark in ProductionRunning Spark in Production
Running Spark in Production
 
Securing Hadoop with Apache Ranger
Securing Hadoop with Apache RangerSecuring Hadoop with Apache Ranger
Securing Hadoop with Apache Ranger
 
YARN and the Docker container runtime
YARN and the Docker container runtimeYARN and the Docker container runtime
YARN and the Docker container runtime
 

Ähnlich wie What's new in Ambari

Apache Ambari: Past, Present, Future
Apache Ambari: Past, Present, FutureApache Ambari: Past, Present, Future
Apache Ambari: Past, Present, FutureHortonworks
 
Apache Ambari - What's New in 2.1
Apache Ambari - What's New in 2.1Apache Ambari - What's New in 2.1
Apache Ambari - What's New in 2.1Hortonworks
 
Managing Enterprise Hadoop Clusters with Apache Ambari
Managing Enterprise Hadoop Clusters with Apache AmbariManaging Enterprise Hadoop Clusters with Apache Ambari
Managing Enterprise Hadoop Clusters with Apache AmbariJayush Luniya
 
Managing Enterprise Hadoop Clusters with Apache Ambari
Managing Enterprise Hadoop Clusters with Apache AmbariManaging Enterprise Hadoop Clusters with Apache Ambari
Managing Enterprise Hadoop Clusters with Apache AmbariHortonworks
 
Hortonworks Technical Workshop: What's New in HDP 2.3
Hortonworks Technical Workshop: What's New in HDP 2.3Hortonworks Technical Workshop: What's New in HDP 2.3
Hortonworks Technical Workshop: What's New in HDP 2.3Hortonworks
 
Hadoop Operations - Past, Present, and Future
Hadoop Operations - Past, Present, and FutureHadoop Operations - Past, Present, and Future
Hadoop Operations - Past, Present, and FutureDataWorks Summit
 
SAM—streaming analytics made easy
SAM—streaming analytics made easySAM—streaming analytics made easy
SAM—streaming analytics made easyDataWorks Summit
 
Apache Hadoop YARN – Multi-Tenancy, Capacity Scheduler & Preemption - Stamped...
Apache Hadoop YARN – Multi-Tenancy, Capacity Scheduler & Preemption - Stamped...Apache Hadoop YARN – Multi-Tenancy, Capacity Scheduler & Preemption - Stamped...
Apache Hadoop YARN – Multi-Tenancy, Capacity Scheduler & Preemption - Stamped...StampedeCon
 
Hortonworks technical workshop operations with ambari
Hortonworks technical workshop   operations with ambariHortonworks technical workshop   operations with ambari
Hortonworks technical workshop operations with ambariHortonworks
 
Fast SQL on Hadoop, really?
Fast SQL on Hadoop, really?Fast SQL on Hadoop, really?
Fast SQL on Hadoop, really?DataWorks Summit
 
Apache Ambari - What's New in 2.4
Apache Ambari - What's New in 2.4 Apache Ambari - What's New in 2.4
Apache Ambari - What's New in 2.4 Hortonworks
 
Apache Ambari - HDP Cluster Upgrades Operational Deep Dive and Troubleshooting
Apache Ambari - HDP Cluster Upgrades Operational Deep Dive and TroubleshootingApache Ambari - HDP Cluster Upgrades Operational Deep Dive and Troubleshooting
Apache Ambari - HDP Cluster Upgrades Operational Deep Dive and TroubleshootingDataWorks Summit/Hadoop Summit
 
SAM - Streaming Analytics Made Easy
SAM - Streaming Analytics Made EasySAM - Streaming Analytics Made Easy
SAM - Streaming Analytics Made EasyDataWorks Summit
 
Hive present-and-feature-shanghai
Hive present-and-feature-shanghaiHive present-and-feature-shanghai
Hive present-and-feature-shanghaiYifeng Jiang
 
Its Finally Here! Building Complex Streaming Analytics Apps in under 10 min w...
Its Finally Here! Building Complex Streaming Analytics Apps in under 10 min w...Its Finally Here! Building Complex Streaming Analytics Apps in under 10 min w...
Its Finally Here! Building Complex Streaming Analytics Apps in under 10 min w...DataWorks Summit
 
Pivotal cf for_devops_mkim_20141209
Pivotal cf for_devops_mkim_20141209Pivotal cf for_devops_mkim_20141209
Pivotal cf for_devops_mkim_20141209minseok kim
 
Hive Performance Dataworks Summit Melbourne February 2019
Hive Performance Dataworks Summit Melbourne February 2019Hive Performance Dataworks Summit Melbourne February 2019
Hive Performance Dataworks Summit Melbourne February 2019alanfgates
 
Fast SQL on Hadoop, Really?
Fast SQL on Hadoop, Really?Fast SQL on Hadoop, Really?
Fast SQL on Hadoop, Really?DataWorks Summit
 
Manage Add-on Services in Apache Ambari
Manage Add-on Services in Apache AmbariManage Add-on Services in Apache Ambari
Manage Add-on Services in Apache AmbariJayush Luniya
 

Ähnlich wie What's new in Ambari (20)

Apache Ambari: Past, Present, Future
Apache Ambari: Past, Present, FutureApache Ambari: Past, Present, Future
Apache Ambari: Past, Present, Future
 
Apache Ambari - What's New in 2.1
Apache Ambari - What's New in 2.1Apache Ambari - What's New in 2.1
Apache Ambari - What's New in 2.1
 
Managing Enterprise Hadoop Clusters with Apache Ambari
Managing Enterprise Hadoop Clusters with Apache AmbariManaging Enterprise Hadoop Clusters with Apache Ambari
Managing Enterprise Hadoop Clusters with Apache Ambari
 
Managing Enterprise Hadoop Clusters with Apache Ambari
Managing Enterprise Hadoop Clusters with Apache AmbariManaging Enterprise Hadoop Clusters with Apache Ambari
Managing Enterprise Hadoop Clusters with Apache Ambari
 
Hortonworks Technical Workshop: What's New in HDP 2.3
Hortonworks Technical Workshop: What's New in HDP 2.3Hortonworks Technical Workshop: What's New in HDP 2.3
Hortonworks Technical Workshop: What's New in HDP 2.3
 
Hadoop Operations - Past, Present, and Future
Hadoop Operations - Past, Present, and FutureHadoop Operations - Past, Present, and Future
Hadoop Operations - Past, Present, and Future
 
SAM—streaming analytics made easy
SAM—streaming analytics made easySAM—streaming analytics made easy
SAM—streaming analytics made easy
 
Apache Hadoop YARN – Multi-Tenancy, Capacity Scheduler & Preemption - Stamped...
Apache Hadoop YARN – Multi-Tenancy, Capacity Scheduler & Preemption - Stamped...Apache Hadoop YARN – Multi-Tenancy, Capacity Scheduler & Preemption - Stamped...
Apache Hadoop YARN – Multi-Tenancy, Capacity Scheduler & Preemption - Stamped...
 
Hortonworks technical workshop operations with ambari
Hortonworks technical workshop   operations with ambariHortonworks technical workshop   operations with ambari
Hortonworks technical workshop operations with ambari
 
Fast SQL on Hadoop, really?
Fast SQL on Hadoop, really?Fast SQL on Hadoop, really?
Fast SQL on Hadoop, really?
 
Apache Ambari - What's New in 2.4
Apache Ambari - What's New in 2.4 Apache Ambari - What's New in 2.4
Apache Ambari - What's New in 2.4
 
Apache Ambari - HDP Cluster Upgrades Operational Deep Dive and Troubleshooting
Apache Ambari - HDP Cluster Upgrades Operational Deep Dive and TroubleshootingApache Ambari - HDP Cluster Upgrades Operational Deep Dive and Troubleshooting
Apache Ambari - HDP Cluster Upgrades Operational Deep Dive and Troubleshooting
 
Streaming analytics manager
Streaming analytics managerStreaming analytics manager
Streaming analytics manager
 
SAM - Streaming Analytics Made Easy
SAM - Streaming Analytics Made EasySAM - Streaming Analytics Made Easy
SAM - Streaming Analytics Made Easy
 
Hive present-and-feature-shanghai
Hive present-and-feature-shanghaiHive present-and-feature-shanghai
Hive present-and-feature-shanghai
 
Its Finally Here! Building Complex Streaming Analytics Apps in under 10 min w...
Its Finally Here! Building Complex Streaming Analytics Apps in under 10 min w...Its Finally Here! Building Complex Streaming Analytics Apps in under 10 min w...
Its Finally Here! Building Complex Streaming Analytics Apps in under 10 min w...
 
Pivotal cf for_devops_mkim_20141209
Pivotal cf for_devops_mkim_20141209Pivotal cf for_devops_mkim_20141209
Pivotal cf for_devops_mkim_20141209
 
Hive Performance Dataworks Summit Melbourne February 2019
Hive Performance Dataworks Summit Melbourne February 2019Hive Performance Dataworks Summit Melbourne February 2019
Hive Performance Dataworks Summit Melbourne February 2019
 
Fast SQL on Hadoop, Really?
Fast SQL on Hadoop, Really?Fast SQL on Hadoop, Really?
Fast SQL on Hadoop, Really?
 
Manage Add-on Services in Apache Ambari
Manage Add-on Services in Apache AmbariManage Add-on Services in Apache Ambari
Manage Add-on Services in Apache Ambari
 

Mehr von DataWorks Summit

Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisDataWorks Summit
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiDataWorks Summit
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...DataWorks Summit
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...DataWorks Summit
 
Managing the Dewey Decimal System
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal SystemDataWorks Summit
 
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExamplePractical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExampleDataWorks Summit
 
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberDataWorks Summit
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixScaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixDataWorks Summit
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiBuilding the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiDataWorks Summit
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsDataWorks Summit
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureDataWorks Summit
 
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EnginePresto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EngineDataWorks Summit
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...DataWorks Summit
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudDataWorks Summit
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiEvent-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiDataWorks Summit
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerDataWorks Summit
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...DataWorks Summit
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouDataWorks Summit
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkBig Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkDataWorks Summit
 

Mehr von DataWorks Summit (20)

Data Science Crash Course
Data Science Crash CourseData Science Crash Course
Data Science Crash Course
 
Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache Ratis
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
 
Managing the Dewey Decimal System
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal System
 
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExamplePractical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist Example
 
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at Uber
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixScaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiBuilding the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability Improvements
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant Architecture
 
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EnginePresto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything Engine
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google Cloud
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiEvent-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near You
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkBig Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
 

Kürzlich hochgeladen

The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...panagenda
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Hiroshi SHIBATA
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch TuesdayIvanti
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality AssuranceInflectra
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesThousandEyes
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 

Kürzlich hochgeladen (20)

The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch Tuesday
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 

What's new in Ambari

  • 1. Page1 © Hortonworks Inc. 2011 – 2015. All Rights Reserved What’s New in Ambari? June 2015 Yusaku Sako @ Hortonworks (Ambari PMC Chair) Sumit Mohanty @ Hortonworks (Ambari PMC)
  • 2. Page2 © Hortonworks Inc. 2011 – 2015. All Rights Reserved What’s Apache Ambari? 100% open-source platform for simplifying Hadoop cluster management and use. Highly extensible.
  • 3. Page3 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Open Source Activity
  • 4. Page4 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Inception: AMBARI-1 (Sept, 2011)
  • 5. Page5 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Fast forward 4 years to today… (June, 2015) • Latest JIRA: AMBARI-11854 • 100+ Contributors • 50 Committers • ~12k JIRAs filed • ~11k JIRAs resolved At 1.5 day per JIRA -> 66 person years! (probably more) • Used by hundreds of companies
  • 6. Page6 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Ambari – 4th Biggest Project* @ Apache * Based on total JIRAs filed on a project basis out of 162 projects as of June 10, 2015 #2: Hadoop at ~28k as it is split across multiple JIRA Projects #1 #3 #4 #5
  • 7. Page7 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Timeline: Past 1 Year Ambari 1.6.* May 2014 907 JIRAs Ambari 1.5.* Apr 2014 1218 JIRAs Ambari 1.7.* Dec 2014 1620 JIRAs Ambari 2.0.* April 2015 1784 JIRAs Current GA Version (2.0.1) Ambari 2.1 Coming Soon 1520+ JIRAs Focus of today’s talk Resolution of 7k+ JIRAs
  • 8. Page8 © Hortonworks Inc. 2011 – 2015. All Rights Reserved What’s new? • Rolling Upgrade • Alerts • Metrics • Enhanced Dashboard • Smart Configurations • Views • Kerberos Automation • Blueprints
  • 9. Page9 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Rolling Upgrade
  • 10. Page10 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Rolling Upgrade of Stack Side-by-Side Bits and Configs Bits: /usr/hdp/2.2.0.0-2041 /usr/hdp/2.2.4.2-2 /usr/hdp/2.3.0.0-3000 Configs: /etc/hive/conf/ (initial) /etc/hive/conf/v0 (HDP 2.2.4.2) /etc/hive/conf/v1 (HDP 2.3) 2.2.0.0 2.2.4.2 2.3.0.0minor jump major jump
  • 11. Page11 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Rolling Upgrade – Manage Versions Install bits in parallel on all agents No down-time
  • 12. Page12 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Rolling Upgrade – Orchestration Not necessarily “one-click” but fully guided Services are up the entire time Upgrade one component at a time Robust and fault-tolerant Service-checks performed throughout 2.3.0.0-2283 2.3.0.0-2283
  • 13. Page13 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Rolling Upgrade – Upgrade Catalog Grouping and order
  • 14. Page14 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Rolling Upgrade – Upgrade Catalog Run custom scripts (python and Server-side)
  • 15. Page15 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Rolling Upgrade – Upgrade Catalog Mark steps are skippable, retryable All service checks are skippable, all steps retryable
  • 16. Page16 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Rolling Upgrade – Upgrade Catalog Set, move, delete, transform configurations
  • 17. Page17 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Rolling Upgrade – Downgrade
  • 18. Page18 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Alerts
  • 19. Page19 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Alert – Types Type Description Status Thresholds Configurable? PORT Watches a port based on a configuration property such as the URI. OK, WARN, CRIT Yes (seconds) WEB Watches an HTTP or HTTPS endpoint and determines connectivity and HTTP status code. OK, WARN, CRIT No AGGREGATE Aggregate of status for another alert definition. OK, WARN, CRIT Yes (percentage) METRIC Watches a metric or series of metrics in JMX and compares a mathematical result against a threshold. OK, WARN, CRIT Yes (variable) SCRIPT Uses a custom script to handle checking. OK or CRIT No
  • 20. Page20 © Hortonworks Inc. 2011 – 2015. All Rights Reserved UI – Current Alerts Configured by default; managed via the the web client
  • 21. Page21 © Hortonworks Inc. 2011 – 2015. All Rights Reserved UI – Host Alerts Automatically refreshes Query alert history
  • 22. Page22 © Hortonworks Inc. 2011 – 2015. All Rights Reserved UI– Customization & Instances Status text, thresholds, and interval
  • 23. Page23 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Metrics
  • 24. Page24 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Ambari Metrics Service (AMS) - Goals Ability to collect metrics from Hadoop and other Stack services Ability to retain metrics at a high precision for a configurable time period Ability to automatically purge metrics after retention period At collection time, provide clear integration point for external system At purge time, provide clear integration point for metrics retention by external system Should provide default options for external metrics retention Provide tools / utilities for analyzing metrics in retention system
  • 25. Page25 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Aggregators Metrics Collector HTTP REST endpoint Metrics API Query Layer HBASE Phoenix server Phoenix client Namenode Datanode Nodemanager Regionserver Nimbus Flume Agent Kafka worker Metrics Sinks Metrics Monitor AMBARI DashboardsViews REST API Ambari Metrics System - Architecture
  • 26. Page26 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Sample Stats Total number of raw uncompressed Hadoop metrics written per day on a 300 node cluster = 100 GB Rows in Phoenix table ~ 100 million Raw query time: 500 rows selected (1.923 seconds) Aggregate query time: 204 rows selected (0.19 seconds) SELECT METRIC_NAME, APP_ID, INSTANCE_ID, TIMESTAMP, METRIC_SUM, HOSTS_COUNT, METRIC_MAX, METRIC_MIN FROM METRIC_AGGREGATE WHERE METRIC_NAME IN ('dfs.datanode.BytesWritten', 'dfs.datanode.BytesRead') AND APP_ID = 'datanode' AND TIMESTAMP >= 1409770831000 and TIMESTAMP < 1409774431000; SELECT METRIC_NAME, HOSTNAME, APP_ID, INSTANCE_ID, START_TIME, METRICS FROM METRIC_RECORD WHERE METRIC_NAME IN ('dfs.datanode.BytesWritten','dfs.datanode.BytesRead') AND APP_ID = 'datanode' AND START_TIME >= 1409770831000 AND START_TIME < 1409774431000 ORDER BY METRIC_NAME, START_TIME LIMIT 500;
  • 27. Page27 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Key takeaways Using Phoenix query hints to avoid full table scans PHOENIX-914 – Use Native Hbase timestamp to skip HFiles Client side buffering and aggregation built into Sinks and Monitor Cluster and Host level aggregations across various time dimensions Table schema optimized for reads and Hbase tuned to support heavy write loads
  • 28. Page28 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Enhanced Dashboard
  • 29. Page29 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Customizable Service Dashboards Service dashboards are now customizable in Ambari 2.1 • Create new widgets • Graphs, Dial Gauge, Number, Template • Customize layout • Share widgets Future: • Make Layouts shareable
  • 30. Page30 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Recorded Demo
  • 31. Page31 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Easy to expose widgets for new services Out-of-the-box widgets are defined in the stack as JSON files No frontend code changes necessary
  • 32. Page32 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Smart Configurations
  • 33. Page33 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Hadoop Configuration Challenges • Too many configurations • which ones are important? • Too easy to mess up • What are valid/reasonable values? • What are the units? • Ok, what about dependencies? • Gets harder with combinations of services, host assignments, enabled features, CPU/RAM/disks, etc • Any recommendations? What am I doing wrong? • Smart Configs to the rescue! (Ambari 2.1)
  • 34. Page34 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Smart Configs Demo
  • 35. Page35 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Ambari Smart Configs UI Customizable layout - Tabs - Sections - Sub-sections - Simple grid layout (Advanced Tab contains remaining configurations) New Widgets - Sliders - Recommended - Minimum - Maximum - Increment Step - Combos - Enumerated values - Toggles - Binary options - Spinners - Splits value into multiple controls. Time in milliseconds split into days, hours, minutes. - Lists - Enumerated values - Single select - Multi select Implemented - HDFS - YARN - MapReduce - Hive - HBase
  • 36. Page36 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Stack Driven Layouts { "name": "default", "description": "Default theme for HBASE service", "configuration": { "layouts": [ { "name": "default", "tabs": [ { "name": "settings", "display-name": "Settings", "layout": { "tab-columns": "3", "tab-rows": "3", "sections": [ ... ] } } ] } ], "placement": { "configuration-layout": "default", "configs": [...] }, "widgets": [ { "config": "hbase-env/hbase_master_heapsize", "widget": { "type": "slider", "units": [ { "unit-name": "GB" } ] } }, ... ] } } • Stacks has theme.json file • Layout – Tabs – Sections – Sub-sections • Placement – Configs placement in sub-sections • Widgets – Widget type – Optional Units - Bytes (B, KB, MB, GB, TB, PB) - Time (Millis, Seconds, Minutes, Hours, Days, Months, Years)
  • 37. Page37 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Config Metadata and Dependencies { "StackConfigurations": { "final": "false", "property_depends_on": [ { "type": "yarn-site", "name": "yarn.nodemanager.resource.memory-mb" } ], "property_description": “The minimum allocation for every", "property_display_name": "Minimum Container Size (Memory)", "property_name": "yarn.scheduler.minimum-allocation-mb", "property_type": [], "property_value": "512", "property_value_attributes": { "type": "int", "maximum": "5120", "minimum": "0", "unit": "MB", "increment_step": "256" }, "type": "yarn-site.xml" }, "dependencies": [ { "StackConfigurationDependency": { "dependency_name": "hive.tez.container.size", "property_name": "yarn.scheduler.minimum-allocation-mb” } }, { "StackConfigurationDependency": { "dependency_name": "mapreduce.map.memory.mb", "property_name": "yarn.scheduler.minimum-allocation-mb” } }, { "StackConfigurationDependency": { "dependency_name": "mapreduce.reduce.memory.mb", "property_name": "yarn.scheduler.minimum-allocation-mb” } }… ] } • Extended Metadata • Defined in property_value_attributes • Hold non-UI metadata about value range, increment, unit, etc • Dependencies • Models bi-directional relationship between configs • Depends On (property_depends_on) – Answers ‘which configs do I depend on?’ • Depended By (dependencies) – Answers ‘which configs are dependent on me?’ • Ambari automatically updates dependencies
  • 38. Page38 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Views
  • 39. Page39 © Hortonworks Inc. 2011 – 2015. All Rights Reserved What are Views? View Framework • Provide various applications accessible from Ambari Web UI – interact with the cluster via a browser from a single place for all users (cluster operators, data analysis, developers, etc) Easy to develop • No need to understand Ambari core code – view development is just like creating any other web application Easy to deploy • Packaged as a single jar file • Auto create / auto configure
  • 40. Page40 © Hortonworks Inc. 2011 – 2015. All Rights Reserved CS Queue Manager for Cluster Operators Capacity Scheduler Queue Manager
  • 41. Page41 © Hortonworks Inc. 2011 – 2015. All Rights Reserved HDFS File Browser for General Users HDFS File Browser
  • 42. Page42 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Job Analysis for Developers Troubleshoot Tez JobsTroubleshoot / Improve Hive queries
  • 43. Page43 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Query Editors for Data Analysts Create, edit, execute, and analyze Hive queries Create, edit, and execute Pig scripts
  • 44. Page44 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Ambari Server in Views-Only mode • Use Views on existing clusters not under Ambari’s management • Can use Views against multiple clusters Ambari Server Cluster managed by Ambari Ambari Server “Views-only” mode (aka “Stand-alone” mode) Cluster not managed by Ambari Management Use Views Use Views Use Views
  • 45. Page45 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Kerberos Automation
  • 46. Page46 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Kerberos Automation New in Ambari 2.0: • Have Ambari manage Kerberos principals and keytabs • Once Kerberized, seamlessly handle: • Adding new hosts • Adding new components to existing hosts • Adding new services • Moving components to different hosts • Works with existing MIT KDC or Active Directory
  • 47. Page47 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Blueprints
  • 48. Page48 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Automated Cluster Deployment Simple • Making two REST calls is all it takes to provision a cluster Who uses it? • Cloudbreak • Microsoft Azure Marketplace (portal.azure.com) • Hortonworks QA • and many many others
  • 49. Page49 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Cluster Replication Export blueprint of source cluster Import blueprint to replicate clusters
  • 50. Page50 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Example: Create a 100-node Cluster { "configurations" : [ { ”hdfs-site" : { "dfs.datanode.data.dir" : ”/hadoop/1,/hadoop/2,/hadoop/3" } } ], "host_groups" : [ { "name" : ”master-host", "components" : [ { "name" : "NAMENODE” }, { "name" : "RESOURCEMANAGER” }, … ], "cardinality" : "1" }, { "name" : ”worker-host", "components" : [ { "name" : ”DATANODE” }, { "name" : ”NODEMANAGER” }, … ], "cardinality" : "1+" }, ], "Blueprints" : { "blueprint_name" : ”multi-node-hdfs-yarn", "stack_name" : "HDP", "stack_version" : "2.0" } } { "blueprint" : ”multi-node-hdfs-yarn", "host_groups" :[ { "name" : ”master-host", "hosts" : [ { "fqdn" : ”master001.ambari.apache.org” } ] }, { "name" : ”worker-host", "hosts" : [ { "fqdn" : ”worker001.ambari.apache.org” }, { "fqdn" : ”worker002.ambari.apache.org” }, … { "fqdn" : ”worker099.ambari.apache.org” } ] } ] } 1. POST /blueprints/my-blueprint 2. POST /clusters/MyCluster
  • 51. Page51 © Hortonworks Inc. 2011 – 2015. All Rights Reserved What’s New in Blueprint New in Ambari 2.0: • Supports various HA configurations: • NameNode, ResourceManager, RegionServer, Oozie Server, Hive Metastore, HiveServer2, WebHCat Server • Adding hosts “blueprint style” (AMBARI-8458) New in Ambari 2.1: • Advanced Cluster Creation – more flexible, scalable, and robust (AMBARI-10750)
  • 52. Page52 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Blueprint Caveats As of Ambari 2.1: • Stack advisor (smart/dynamic config recommendation/validation) is not used (yet) • Can’t create Kerberized Cluster (yet)
  • 53. Page53 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Thank You! Try Ambari • Follow the Ambari Quick Start Guide (search online for “Ambari Quick Start Guide”) Learn more • Visit the project website (search online for “Ambari”) Get Involved • User Mailing List: user-subscribe@ambari.apache.org • Developer Mailing List: dev-subscribe@ambari.apache.org • Use JIRA to file bugs and improvement requests (search online for “Ambari JIRA”)
  • 54. Page54 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Q & A Yusaku Sako yusaku@hortonworks.com (Ambari PMC) Sumit Mohanty smohanty@hortonworks.com (Ambari PMC)

Hinweis der Redaktion

  1. Hadoop projects combined is: ~28k HDFS: 8247 MapReduce: 6117 YARN: 3664 Hadoop Common: 10072
  2. Too many configurations – which ones are important? 2 Configurations from 1 section and 2 from another section might be most important No easy way to group across sections Majority Text fields Configs almost always shown as text fields Can be shown in more intuitive controls No units help Configs might shown to user in one unit (days, GB), and be saved in a different unit (milliseconds, B) What are acceptable values? Open ended text fields don’t help when values have to been within a minimum/maximum values No support for a enum of values No configuration dependencies After install if you change one config, you have to remember to change others
  3. Notice that can upgrade in either same stack e.g., 2.2.*, or 2.2 -> 2.3
  4. OpenTSDB is popular solution on top of HBASE. Time Series DB
  5. -
  6. Too many configurations – which ones are important? 2 Configurations from 1 section and 2 from another section might be most important No easy way to group across sections Majority Text fields Configs almost always shown as text fields Can be shown in more intuitive controls No units help Configs might shown to user in one unit (days, GB), and be saved in a different unit (milliseconds, B) What are acceptable values? Open ended text fields don’t help when values have to been within a minimum/maximum values No support for a enum of values No configuration dependencies After install if you change one config, you have to remember to change others
  7. Introduced in Ambari 1.7
  8. Allow cluster creation or scaling to be started via the REST API prior to all/any hosts being available. As hosts register with Ambari server they will be matched to request host groups and provisioned according to the requested topology Allow host predicates to be specified along with host count to provide more flexibility in matching hosts to host groups. This will allow for host flavors where different host groups are matched to different host flavors Break up the current monolithic provisioning request into a request for each host operation. For example, install on host A, start on host A, install on hostB, etc. This will allow hosts to make progress even when another host encounters a failure. Allow a host count to be specified in the cluster creation template instead of host names. This is documented in https://issues.apache.org/jira/browse/AMBARI-6275