University of Minnesota 
GeoJinni 
Spatial Data processing with Hadoop 
http://spatialhadoop.cs.umn.edu/ 
@spatialhadoop 
Ahmed Eldawy
Claudius Ptolemy (AD 90 – AD 168)
Al Idrisi (1099–1165)
Cholera cases in the London epidemic of 1854
Cool technology..!! Can I use it in my application?
Oh..!! But, it is not made for me. Can’t make use of it as is
My pleasure. Here it is..
Kindly let me get the technology you have
Kindly let me understand your needs
HELP..!! I have too much data. Your technology is not helping me
mmm…Let me check with my good friends there.
My pleasure. Here it is..
Cool DBMS technology..!! Can I use it in my application?
Oh..!! But, it is not made for me. Can’t make use of it as is
Kindly let me understand your needs
Kindly let me get the technology you have
Let me check with my other good friends there.
HELP..!! Again, I have too much data. Your technology is not helping me
Cool MapReduce technology..!! Can I use it in my application?
Sorry, seems like the DBMS technology cannot scale more
My pleasure. Here it is..
Oh..!! But, it is not made for me. Can’t make use of it as is
Kindly let me understand your needs
Kindly let me get the technology you have
Kindly let me understand your needs
Kindly let me get the technology you have
aka 
GeoJinni
Tons of Spatial data out there…
VGI
Sensor networks
Smart phones
Satellite Images
Medical data
Traffic data
Geotagged Microblogs
Geotagged pictures
GeoJinni 
Website: http://spatialhadoop.cs.umn.edu/ 
Download source code, binary distribution, and instructions 
Email us at: shadoop@cs.umn.edu 
■ Released in March 2013; 75,000 downloads since then 
Spatial language
Built-in spatial data types
Spatial Indexes
Spatial Operations
The Built-in Approach of GeoJinni
Figure: three ways to add spatial support to the Hadoop stack (User Programs, Pig Latin, Hadoop Java APIs, Job Monitoring and Scheduling, MapReduce Runtime, Storage (HDFS))
The On-top Approach: Spatial Modules on top of the Hadoop stack
From Scratch Approach: (Spatial) User Program + MapReduce APIs + Job Monitoring and Scheduling + MapReduce Runtime + Storage + …
The Built-in Approach (GeoJinni): Spatial Language, Spatial Operators, Early Pruning, Spatial Indexing
Spatial Data & Hadoop
Hadoop
    points = LOAD 'points' AS (id:int, x:int, y:int);
    result = FILTER points BY
        x < xmax AND x >= xmin AND
        y < ymax AND y >= ymin;
Takes 193 seconds
GeoJinni
    points = LOAD 'points' AS (id:int, location:point);
    result = FILTER points BY
        IsOverlap(location, rectangle(xmin, ymin, xmax, ymax));
Finishes in 2 seconds
GeoJinni Architecture
Applications: MNTG [SSTD’13, ICDE’14] – SHAHED [ICDE’15] – TAREEG [SIGMOD’14, SIGSPATIAL’14]
Spatio-temporal Hadoop
Language: Pigeon [ICDE’14]
Operations: Basic [VLDB’13] – CG_Hadoop [SIGSPATIAL’13] – Data Mining – Visualization [Under submission]
MapReduce: Spatial File Splitter – Spatial Record Reader
Indexing: Grid File – R-tree – R+-tree [ICDE’15]
Language Layer: Pigeon
■ Extends Pig Latin with OGC-compliant primitives
  - Spatial data types (e.g., Polygon)
  - Basic operations (e.g., Area)
  - Spatial predicates (e.g., Touches)
  - Spatial analysis (e.g., Union)
  - Spatial aggregate functions (e.g., Convex Hull)
    cities = LOAD 'cities'
        AS (city_id: int, city_geom);
    City_area = FOREACH cities
        GENERATE Area(city_geom) AS area;
A. Eldawy and M. F. Mokbel. Pigeon: A Spatial MapReduce Language. In ICDE, 2014
Indexing Layer: R+-tree
Indexing Layer: Grid File
Non-indexed Heap File
Range Query
SpatialFileSplitter prunes blocks outside the query range
SpatialRecordReader passes local indexes to the map function
Map function selects records in range
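The three range-query steps can be sketched outside Hadoop. This is a minimal Python illustration, assuming a uniform grid where each cell stands in for one indexed block. The names `Rect`, `build_grid_index`, `cell_rect`, and `range_query` are hypothetical and are not GeoJinni APIs, and the per-block scan is linear rather than using a local index.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rect:
    xmin: float
    ymin: float
    xmax: float
    ymax: float

    def overlaps(self, other: "Rect") -> bool:
        # Rectangles are treated as half-open: [xmin, xmax) x [ymin, ymax).
        return (self.xmin < other.xmax and other.xmin < self.xmax
                and self.ymin < other.ymax and other.ymin < self.ymax)

    def contains(self, x: float, y: float) -> bool:
        return self.xmin <= x < self.xmax and self.ymin <= y < self.ymax


def build_grid_index(points, cell_size):
    """Group points by grid cell; each cell plays the role of one block."""
    blocks = {}
    for x, y in points:
        key = (int(x // cell_size), int(y // cell_size))
        blocks.setdefault(key, []).append((x, y))
    return blocks


def cell_rect(key, cell_size):
    """Bounding rectangle of a grid cell."""
    cx, cy = key
    return Rect(cx * cell_size, cy * cell_size,
                (cx + 1) * cell_size, (cy + 1) * cell_size)


def range_query(blocks, cell_size, query):
    # Splitter role: drop blocks whose cell does not overlap the query.
    selected = [k for k in blocks if cell_rect(k, cell_size).overlaps(query)]
    # Record reader + map role: scan surviving blocks, keep records in range.
    result = [p for k in selected for p in blocks[k] if query.contains(*p)]
    return sorted(result), len(selected)
```

For example, with six points indexed into 5x5 cells, a query over `Rect(4, 4, 10, 10)` reads three of the five non-empty blocks and skips the rest without scanning their records.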
CG_Hadoop
■ Make use of GeoJinni to speed up computational geometry algorithms
  - Polygon union, Skyline, Convex Hull, Farthest/Closest Pair
■ Single machine implementation
  - E.g., Skyline of 4 billion points takes three hours
■ Straightforward implementation in Hadoop
  - Hadoop parallel execution
■ More efficient implementation in GeoJinni
  - Spatial indexing
  - Early pruning
■ Free open source as part of GeoJinni
Speedup: Single Machine 1x – Hadoop 29x – GeoJinni 260x
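As an illustration of the skyline operation named above, here is a minimal Python sketch of the "local result per partition, then merge" pattern that a parallel implementation can follow. It assumes max-max dominance (a point is dominated if another point is at least as large in both coordinates); the slides do not state the direction. The function names are hypothetical, and the index-based pruning in GeoJinni is not modeled.

```python
def skyline(points):
    """Max-max skyline: points not dominated by any other point."""
    result = []
    best_y = float("-inf")
    # Sweep from largest x to smallest; for equal x, larger y comes first.
    for x, y in sorted(set(points), key=lambda p: (-p[0], -p[1])):
        if y > best_y:  # no point with larger-or-equal x has y this high
            result.append((x, y))
            best_y = y
    return result


def partitioned_skyline(points, num_partitions):
    """Compute local skylines per partition, then the skyline of their union."""
    # Round-robin split; a real job would use file blocks (assumption).
    chunks = [points[i::num_partitions] for i in range(num_partitions)]
    local = [skyline(chunk) for chunk in chunks]       # map side
    candidates = [p for part in local for p in part]
    return skyline(candidates)                         # reduce side
```

The merge is safe because a point dominated inside its own partition is also dominated in the full dataset, so it can never appear in the final skyline.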
Convex Hull 
Find the minimal convex polygon that contains all points 
Input Output
Convex Hull in CG_Hadoop 
Hadoop CG_Hadoop 
Partition 
Pruning 
Local hull 
Global hull
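The partition, local hull, and global hull steps can be sketched as follows. This is a minimal Python illustration that uses Andrew's monotone chain for each hull, which is a choice made here and not necessarily the algorithm used in CG_Hadoop. The pruning step is omitted, partitions are plain slices of the x-sorted input, and the function names are hypothetical.

```python
def cross(o, a, b):
    """Z-component of (a - o) x (b - o); positive for a left turn."""
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])


def convex_hull(points):
    """Andrew's monotone chain; returns hull vertices counter-clockwise."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts
    lower, upper = [], []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    # The last point of each chain is the first point of the other.
    return lower[:-1] + upper[:-1]


def partitioned_convex_hull(points, num_partitions):
    """Local hull per partition, then one global hull over the survivors."""
    pts = sorted(points)
    size = max(1, -(-len(pts) // num_partitions))  # ceiling division
    partitions = [pts[i:i + size] for i in range(0, len(pts), size)]
    local_hulls = [convex_hull(part) for part in partitions]   # local hull
    candidates = [p for hull in local_hulls for p in hull]
    return convex_hull(candidates)                             # global hull
```

Every vertex of the global hull is also a vertex of the local hull of its partition, so the global step only needs the local hull vertices rather than all input points.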
Map rendering
■ Map rendering creates an image that represents the data
■ Visualization is an international language
■ Can reveal patterns that are otherwise hard to spot
■ The visual system occupies about one third of the human brain
210 LINESTRING (-2.3634904 51.3845649, -2.3634254 51.3843983, -2.3631927 51.3838436) [highway#primary,ref#A4,name#Gay Street]
420 LINESTRING (-1.8230973 52.5541131, -1.8230368 52.5540756, -1.8229324 52.5540109, -1.8227961 52.5539014, -1.8227365 52.5538461, -1.8226952 52.5538058, -1.8226204 52.5537103, -1.8223988 52.5534041, -1.8221814 52.5531498, -1.8218478 52.5528188, -1.8215581 52.5525626, -1.8213525 52.5524042) [source#GPS Survey,highway#residential,postal_code#B72,name#Moss Drive,is_in#Sutton Coldfield,maxspeed#30,abutters#residential]
490 LINESTRING (-0.1896508 51.6456414, -0.1895803 51.6456036, -0.1895245 51.645551, -0.1890055 51.6450801, -0.1887808 51.6448764, -0.1885605 51.6446756, -0.1883084 51.6443753, -0.1875496 51.6433375, -0.1864572 51.6415288, -0.1862165 51.6411939, -0.1859495 51.6406583, -0.1858855 51.6405461) [lit#yes,surface#asphalt,maxspeed#30 mph,highway#residential,abutters#residential,name#Sherrards Way]
770 LINESTRING (-1.8184653 52.5723683, -1.8182353 52.5723576, - 
…
Smoothing
Input
Buffer Only
Buffer + Merge
Multi-level Image
■ Many images at different zoom levels
  - Pan
  - Zoom in/out
  - Fly to
■ More details as the zoom level increases
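One common way to organize images at several zoom levels is a tile pyramid, sketched below in Python. The slides do not say which tiling scheme GeoJinni uses, so the layout here is an assumption: 2^z by 2^z tiles at zoom level z over a plain longitude/latitude grid. The function names are hypothetical.

```python
def tile_of(lon, lat, zoom):
    """Return (zoom, column, row) of the tile containing a lon/lat point.

    Assumes an equirectangular world: longitude in [-180, 180] maps to
    columns left to right, latitude in [90, -90] maps to rows top to bottom.
    """
    n = 2 ** zoom                      # tiles per axis at this level
    u = (lon + 180.0) / 360.0          # normalized x in [0, 1]
    v = (90.0 - lat) / 180.0           # normalized y in [0, 1]
    col = min(int(u * n), n - 1)       # clamp the lon == 180 edge
    row = min(int(v * n), n - 1)       # clamp the lat == -90 edge
    return zoom, col, row


def tiles_for_point(lon, lat, max_zoom):
    """All tiles, one per level from 0 to max_zoom, that contain the point."""
    return [tile_of(lon, lat, z) for z in range(max_zoom + 1)]


def tile_count(zoom):
    """Number of tiles at one zoom level under this scheme."""
    return 4 ** zoom
```

Under this scheme each extra zoom level quadruples the number of tiles, which is why deeper levels can show more detail for the same image size per tile.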
MNTG - World-wide traffic generator for road networks
http://mntg.cs.umn.edu/
M. F. Mokbel, L. Alarabi, J. Bao, A. Eldawy, A. Magdy, M. Sarwat, E. Waytas, and S. Yackel. MNTG: An Extensible Web-based Traffic Generator. In SSTD, 2013
SHAHED – A tool for querying and visualizing spatio-temporal satellite data
http://shahed.cs.umn.edu/
"SHAHED: A MapReduce-based System for Querying and Visualizing Spatio-temporal Satellite Data", Ahmed Eldawy et al., ICDE 2015
World Temperature
Smooth World Temperature
World Heat Map on Google Earth
TAREEG – Web-based extractor for OpenStreetMap data using MapReduce
http://tareeg.net/
L. Alarabi, A. Eldawy, R. Alghamdi, and M. F. Mokbel. TAREEG: A MapReduce-Based Web Service for Extracting Spatial Data from OpenStreetMap. In SIGMOD, 2014
Extracted Road Network
GeoJinni
Analyze your spatial data efficiently
Language: Spatial high level language. Interact with the system and express your queries in a simple high level language with built-in spatial support
Data types: Built-in spatial data types. Have all your spatial datasets ready to load in SpatialHadoop with the built-in spatial data types
Indexes: Spatial Indexes. Datasets are organized on large clusters with spatial indexes (Grid or R-tree) that are adapted to MapReduce
Operations: Efficient Spatial Operations. Analyze your data efficiently using built-in spatial operations that run efficiently using spatial indexes
Website: http://spatialhadoop.cs.umn.edu/
Download source code, binary distribution, and instructions
Email us at: shadoop@cs.umn.edu

Weitere ähnliche Inhalte

Was ist angesagt?

Introduction to Map Reduce
Introduction to Map ReduceIntroduction to Map Reduce
Introduction to Map ReduceApache Apex
 
Analysing of big data using map reduce
Analysing of big data using map reduceAnalysing of big data using map reduce
Analysing of big data using map reducePaladion Networks
 
Hadoop for High-Performance Climate Analytics - Use Cases and Lessons Learned
Hadoop for High-Performance Climate Analytics - Use Cases and Lessons LearnedHadoop for High-Performance Climate Analytics - Use Cases and Lessons Learned
Hadoop for High-Performance Climate Analytics - Use Cases and Lessons LearnedDataWorks Summit
 
GeoSpatially enabling your Spark and Accumulo clusters with LocationTech
GeoSpatially enabling your Spark and Accumulo clusters with LocationTechGeoSpatially enabling your Spark and Accumulo clusters with LocationTech
GeoSpatially enabling your Spark and Accumulo clusters with LocationTechRob Emanuele
 
Thorny path to the Large-Scale Graph Processing (Highload++, 2014)
Thorny path to the Large-Scale Graph Processing (Highload++, 2014)Thorny path to the Large-Scale Graph Processing (Highload++, 2014)
Thorny path to the Large-Scale Graph Processing (Highload++, 2014)Alexey Zinoviev
 
Apache Hadoop Big Data Technology
Apache Hadoop Big Data TechnologyApache Hadoop Big Data Technology
Apache Hadoop Big Data TechnologyJay Nagar
 
Scaling Storage and Computation with Hadoop
Scaling Storage and Computation with HadoopScaling Storage and Computation with Hadoop
Scaling Storage and Computation with Hadoopyaevents
 
Mastering Hadoop Map Reduce - Custom Types and Other Optimizations
Mastering Hadoop Map Reduce - Custom Types and Other OptimizationsMastering Hadoop Map Reduce - Custom Types and Other Optimizations
Mastering Hadoop Map Reduce - Custom Types and Other Optimizationsscottcrespo
 
Hadoop - Introduction to map reduce programming - Reunião 12/04/2014
Hadoop - Introduction to map reduce programming - Reunião 12/04/2014Hadoop - Introduction to map reduce programming - Reunião 12/04/2014
Hadoop - Introduction to map reduce programming - Reunião 12/04/2014soujavajug
 
[Harvard CS264] 08b - MapReduce and Hadoop (Zak Stone, Harvard)
[Harvard CS264] 08b - MapReduce and Hadoop (Zak Stone, Harvard)[Harvard CS264] 08b - MapReduce and Hadoop (Zak Stone, Harvard)
[Harvard CS264] 08b - MapReduce and Hadoop (Zak Stone, Harvard)npinto
 
Extending Hadoop for Fun & Profit
Extending Hadoop for Fun & ProfitExtending Hadoop for Fun & Profit
Extending Hadoop for Fun & ProfitMilind Bhandarkar
 
Hadoop eco system-first class
Hadoop eco system-first classHadoop eco system-first class
Hadoop eco system-first classalogarg
 
Python in an Evolving Enterprise System (PyData SV 2013)
Python in an Evolving Enterprise System (PyData SV 2013)Python in an Evolving Enterprise System (PyData SV 2013)
Python in an Evolving Enterprise System (PyData SV 2013)PyData
 

Was ist angesagt? (20)

Map Reduce introduction
Map Reduce introductionMap Reduce introduction
Map Reduce introduction
 
Introduction to Map Reduce
Introduction to Map ReduceIntroduction to Map Reduce
Introduction to Map Reduce
 
Analysing of big data using map reduce
Analysing of big data using map reduceAnalysing of big data using map reduce
Analysing of big data using map reduce
 
Hadoop for High-Performance Climate Analytics - Use Cases and Lessons Learned
Hadoop for High-Performance Climate Analytics - Use Cases and Lessons LearnedHadoop for High-Performance Climate Analytics - Use Cases and Lessons Learned
Hadoop for High-Performance Climate Analytics - Use Cases and Lessons Learned
 
Map Reduce
Map ReduceMap Reduce
Map Reduce
 
Working with Scientific Data in MATLAB
Working with Scientific Data in MATLABWorking with Scientific Data in MATLAB
Working with Scientific Data in MATLAB
 
GeoSpatially enabling your Spark and Accumulo clusters with LocationTech
GeoSpatially enabling your Spark and Accumulo clusters with LocationTechGeoSpatially enabling your Spark and Accumulo clusters with LocationTech
GeoSpatially enabling your Spark and Accumulo clusters with LocationTech
 
Thorny path to the Large-Scale Graph Processing (Highload++, 2014)
Thorny path to the Large-Scale Graph Processing (Highload++, 2014)Thorny path to the Large-Scale Graph Processing (Highload++, 2014)
Thorny path to the Large-Scale Graph Processing (Highload++, 2014)
 
Apache Hadoop Big Data Technology
Apache Hadoop Big Data TechnologyApache Hadoop Big Data Technology
Apache Hadoop Big Data Technology
 
Scaling Storage and Computation with Hadoop
Scaling Storage and Computation with HadoopScaling Storage and Computation with Hadoop
Scaling Storage and Computation with Hadoop
 
Map Reduce
Map ReduceMap Reduce
Map Reduce
 
Mastering Hadoop Map Reduce - Custom Types and Other Optimizations
Mastering Hadoop Map Reduce - Custom Types and Other OptimizationsMastering Hadoop Map Reduce - Custom Types and Other Optimizations
Mastering Hadoop Map Reduce - Custom Types and Other Optimizations
 
Hadoop - Introduction to map reduce programming - Reunião 12/04/2014
Hadoop - Introduction to map reduce programming - Reunião 12/04/2014Hadoop - Introduction to map reduce programming - Reunião 12/04/2014
Hadoop - Introduction to map reduce programming - Reunião 12/04/2014
 
[Harvard CS264] 08b - MapReduce and Hadoop (Zak Stone, Harvard)
[Harvard CS264] 08b - MapReduce and Hadoop (Zak Stone, Harvard)[Harvard CS264] 08b - MapReduce and Hadoop (Zak Stone, Harvard)
[Harvard CS264] 08b - MapReduce and Hadoop (Zak Stone, Harvard)
 
Apache Spark Overview
Apache Spark OverviewApache Spark Overview
Apache Spark Overview
 
MapReduce Algorithm Design
MapReduce Algorithm DesignMapReduce Algorithm Design
MapReduce Algorithm Design
 
Extending Hadoop for Fun & Profit
Extending Hadoop for Fun & ProfitExtending Hadoop for Fun & Profit
Extending Hadoop for Fun & Profit
 
Pig Experience
Pig ExperiencePig Experience
Pig Experience
 
Hadoop eco system-first class
Hadoop eco system-first classHadoop eco system-first class
Hadoop eco system-first class
 
Python in an Evolving Enterprise System (PyData SV 2013)
Python in an Evolving Enterprise System (PyData SV 2013)Python in an Evolving Enterprise System (PyData SV 2013)
Python in an Evolving Enterprise System (PyData SV 2013)
 

Andere mochten auch

Sept 17 2013 - THUG - HBase a Technical Introduction
Sept 17 2013 - THUG - HBase a Technical IntroductionSept 17 2013 - THUG - HBase a Technical Introduction
Sept 17 2013 - THUG - HBase a Technical IntroductionAdam Muise
 
Geo-based content processing using hbase
Geo-based content processing using hbaseGeo-based content processing using hbase
Geo-based content processing using hbaseRavi Veeramachaneni
 
MD-HBase: A Scalable Multi-dimensional Data Infrastructure for Location Aware...
MD-HBase: A Scalable Multi-dimensional Data Infrastructure for Location Aware...MD-HBase: A Scalable Multi-dimensional Data Infrastructure for Location Aware...
MD-HBase: A Scalable Multi-dimensional Data Infrastructure for Location Aware...nishimurashoji
 
Computation of spatial data on Hadoop Cluster
Computation of spatial data on Hadoop ClusterComputation of spatial data on Hadoop Cluster
Computation of spatial data on Hadoop ClusterAbhishek Sagar
 
Big data landscape version 2.0
Big data landscape version 2.0Big data landscape version 2.0
Big data landscape version 2.0Matt Turck
 
AECOsim_Building_Designer_LTR_LoRes_SinglePgs_F_913
AECOsim_Building_Designer_LTR_LoRes_SinglePgs_F_913AECOsim_Building_Designer_LTR_LoRes_SinglePgs_F_913
AECOsim_Building_Designer_LTR_LoRes_SinglePgs_F_913Allwyn Antony
 
Spatial Indexing
Spatial IndexingSpatial Indexing
Spatial Indexingtorp42
 
The DE-9IM Matrix in Details using ST_Relate: In Picture and SQL
The DE-9IM Matrix in Details using ST_Relate: In Picture and SQLThe DE-9IM Matrix in Details using ST_Relate: In Picture and SQL
The DE-9IM Matrix in Details using ST_Relate: In Picture and SQLtorp42
 
SQLBits X SQL Server 2012 Spatial Indexing
SQLBits X SQL Server 2012 Spatial IndexingSQLBits X SQL Server 2012 Spatial Indexing
SQLBits X SQL Server 2012 Spatial IndexingMichael Rys
 
European creativity festival 2014: DataViz workshop
European creativity festival 2014: DataViz workshopEuropean creativity festival 2014: DataViz workshop
European creativity festival 2014: DataViz workshopOutliers Collective
 
Recasting the Role of Big (or Little) Data
Recasting the Role of Big (or Little) DataRecasting the Role of Big (or Little) Data
Recasting the Role of Big (or Little) DataMerck
 
Becoming a Smarter City by Analyzing & Visualizing Spatial Data
Becoming a Smarter City by Analyzing & Visualizing Spatial DataBecoming a Smarter City by Analyzing & Visualizing Spatial Data
Becoming a Smarter City by Analyzing & Visualizing Spatial DataPatrick Stotz
 
Enabling Access to Big Geospatial Data with LocationTech and Apache projects
Enabling Access to Big Geospatial Data with LocationTech and Apache projectsEnabling Access to Big Geospatial Data with LocationTech and Apache projects
Enabling Access to Big Geospatial Data with LocationTech and Apache projectsRob Emanuele
 
Where Do We Put It All? Lessons Learned Housing Large Geospatial Data Collect...
Where Do We Put It All? Lessons Learned Housing Large Geospatial Data Collect...Where Do We Put It All? Lessons Learned Housing Large Geospatial Data Collect...
Where Do We Put It All? Lessons Learned Housing Large Geospatial Data Collect...nacis_slides
 
Application of probability in daily life and in civil engineering
Application of probability in daily life and in civil engineeringApplication of probability in daily life and in civil engineering
Application of probability in daily life and in civil engineeringEngr Habib ur Rehman
 
a Secure Public Cache for YARN Application Resources
a Secure Public Cache for YARN Application Resourcesa Secure Public Cache for YARN Application Resources
a Secure Public Cache for YARN Application ResourcesDataWorks Summit
 
Improving HDFS Availability with IPC Quality of Service
Improving HDFS Availability with IPC Quality of ServiceImproving HDFS Availability with IPC Quality of Service
Improving HDFS Availability with IPC Quality of ServiceDataWorks Summit
 
Map reduce: beyond word count
Map reduce: beyond word countMap reduce: beyond word count
Map reduce: beyond word countJeff Patti
 
How to use Parquet as a Sasis for ETL and Analytics
How to use Parquet as a Sasis for ETL and AnalyticsHow to use Parquet as a Sasis for ETL and Analytics
How to use Parquet as a Sasis for ETL and AnalyticsDataWorks Summit
 

Andere mochten auch (20)

Sept 17 2013 - THUG - HBase a Technical Introduction
Sept 17 2013 - THUG - HBase a Technical IntroductionSept 17 2013 - THUG - HBase a Technical Introduction
Sept 17 2013 - THUG - HBase a Technical Introduction
 
Geo-based content processing using hbase
Geo-based content processing using hbaseGeo-based content processing using hbase
Geo-based content processing using hbase
 
MD-HBase: A Scalable Multi-dimensional Data Infrastructure for Location Aware...
MD-HBase: A Scalable Multi-dimensional Data Infrastructure for Location Aware...MD-HBase: A Scalable Multi-dimensional Data Infrastructure for Location Aware...
MD-HBase: A Scalable Multi-dimensional Data Infrastructure for Location Aware...
 
Computation of spatial data on Hadoop Cluster
Computation of spatial data on Hadoop ClusterComputation of spatial data on Hadoop Cluster
Computation of spatial data on Hadoop Cluster
 
Big data landscape version 2.0
Big data landscape version 2.0Big data landscape version 2.0
Big data landscape version 2.0
 
AECOsim_Building_Designer_LTR_LoRes_SinglePgs_F_913
AECOsim_Building_Designer_LTR_LoRes_SinglePgs_F_913AECOsim_Building_Designer_LTR_LoRes_SinglePgs_F_913
AECOsim_Building_Designer_LTR_LoRes_SinglePgs_F_913
 
Spatial Indexing
Spatial IndexingSpatial Indexing
Spatial Indexing
 
The DE-9IM Matrix in Details using ST_Relate: In Picture and SQL
The DE-9IM Matrix in Details using ST_Relate: In Picture and SQLThe DE-9IM Matrix in Details using ST_Relate: In Picture and SQL
The DE-9IM Matrix in Details using ST_Relate: In Picture and SQL
 
VO Course 11: Spatial indexing
VO Course 11: Spatial indexingVO Course 11: Spatial indexing
VO Course 11: Spatial indexing
 
SQLBits X SQL Server 2012 Spatial Indexing
SQLBits X SQL Server 2012 Spatial IndexingSQLBits X SQL Server 2012 Spatial Indexing
SQLBits X SQL Server 2012 Spatial Indexing
 
European creativity festival 2014: DataViz workshop
European creativity festival 2014: DataViz workshopEuropean creativity festival 2014: DataViz workshop
European creativity festival 2014: DataViz workshop
 
Recasting the Role of Big (or Little) Data
Recasting the Role of Big (or Little) DataRecasting the Role of Big (or Little) Data
Recasting the Role of Big (or Little) Data
 
Becoming a Smarter City by Analyzing & Visualizing Spatial Data
Becoming a Smarter City by Analyzing & Visualizing Spatial DataBecoming a Smarter City by Analyzing & Visualizing Spatial Data
Becoming a Smarter City by Analyzing & Visualizing Spatial Data
 
Enabling Access to Big Geospatial Data with LocationTech and Apache projects
Enabling Access to Big Geospatial Data with LocationTech and Apache projectsEnabling Access to Big Geospatial Data with LocationTech and Apache projects
Enabling Access to Big Geospatial Data with LocationTech and Apache projects
 
Where Do We Put It All? Lessons Learned Housing Large Geospatial Data Collect...
Where Do We Put It All? Lessons Learned Housing Large Geospatial Data Collect...Where Do We Put It All? Lessons Learned Housing Large Geospatial Data Collect...
Where Do We Put It All? Lessons Learned Housing Large Geospatial Data Collect...
 
Application of probability in daily life and in civil engineering
Application of probability in daily life and in civil engineeringApplication of probability in daily life and in civil engineering
Application of probability in daily life and in civil engineering
 
a Secure Public Cache for YARN Application Resources
a Secure Public Cache for YARN Application Resourcesa Secure Public Cache for YARN Application Resources
a Secure Public Cache for YARN Application Resources
 
Improving HDFS Availability with IPC Quality of Service
Improving HDFS Availability with IPC Quality of ServiceImproving HDFS Availability with IPC Quality of Service
Improving HDFS Availability with IPC Quality of Service
 
Map reduce: beyond word count
Map reduce: beyond word countMap reduce: beyond word count
Map reduce: beyond word count
 
How to use Parquet as a Sasis for ETL and Analytics
How to use Parquet as a Sasis for ETL and AnalyticsHow to use Parquet as a Sasis for ETL and Analytics
How to use Parquet as a Sasis for ETL and Analytics
 

Ähnlich wie Spatial Data processing with Hadoop

How to empower community by using GIS lecture 1
How to empower community by using GIS lecture 1How to empower community by using GIS lecture 1
How to empower community by using GIS lecture 1wang yaohui
 
Large Scale Geo Processing on Hadoop
Large Scale Geo Processing on HadoopLarge Scale Geo Processing on Hadoop
Large Scale Geo Processing on HadoopChristoph Körner
 
Visualising large spatial databases and Building bespoke geodemographics
Visualising large spatial databases and Building bespoke geodemographicsVisualising large spatial databases and Building bespoke geodemographics
Visualising large spatial databases and Building bespoke geodemographicsDr Muhammad Adnan
 
Synthetic Data and Graphics Techniques in Robotics
Synthetic Data and Graphics Techniques in RoboticsSynthetic Data and Graphics Techniques in Robotics
Synthetic Data and Graphics Techniques in RoboticsPrabindh Sundareson
 
Euro30 2019 - Benchmarking tree approaches on street data
Euro30 2019 - Benchmarking tree approaches on street dataEuro30 2019 - Benchmarking tree approaches on street data
Euro30 2019 - Benchmarking tree approaches on street dataFabion Kauker
 
SD-miner System to Retrieve Probabilistic Neighborhood Points in Spatial Dat...
SD-miner System to Retrieve Probabilistic Neighborhood Points  in Spatial Dat...SD-miner System to Retrieve Probabilistic Neighborhood Points  in Spatial Dat...
SD-miner System to Retrieve Probabilistic Neighborhood Points in Spatial Dat...IOSR Journals
 
mini project_shortest path visualizer.pptx
mini project_shortest path visualizer.pptxmini project_shortest path visualizer.pptx
mini project_shortest path visualizer.pptxtusharpawar803067
 
Of Nodes and Maps (Web Mapping with Drupal - Part II)
Of Nodes and Maps (Web Mapping with Drupal - Part II)Of Nodes and Maps (Web Mapping with Drupal - Part II)
Of Nodes and Maps (Web Mapping with Drupal - Part II)Ranel Padon
 
Field Data Collecting, Processing and Sharing: Using web Service Technologies
Field Data Collecting, Processing and Sharing: Using web Service TechnologiesField Data Collecting, Processing and Sharing: Using web Service Technologies
Field Data Collecting, Processing and Sharing: Using web Service TechnologiesNiroshan Sanjaya
 
Walking in the Cloud: A New Paradigm in Geospatial World
Walking in the Cloud: A New Paradigm in Geospatial WorldWalking in the Cloud: A New Paradigm in Geospatial World
Walking in the Cloud: A New Paradigm in Geospatial WorldICIMOD
 
Using python to analyze spatial data
Using python to analyze spatial dataUsing python to analyze spatial data
Using python to analyze spatial dataKudos S.A.S
 
CrowdMap: Accurate Reconstruction of Indoor Floor Plan from Crowdsourced Sens...
CrowdMap: Accurate Reconstruction of Indoor Floor Plan from Crowdsourced Sens...CrowdMap: Accurate Reconstruction of Indoor Floor Plan from Crowdsourced Sens...
CrowdMap: Accurate Reconstruction of Indoor Floor Plan from Crowdsourced Sens...Si Chen
 
Introduction of MAGO3D
Introduction of MAGO3DIntroduction of MAGO3D
Introduction of MAGO3DSANGHEE SHIN
 
Geolocation in Drupal
Geolocation in DrupalGeolocation in Drupal
Geolocation in DrupalMediacurrent
 
Drupal Day 2011 - Thinking spatially with your open data
Drupal Day 2011 - Thinking spatially with your open dataDrupal Day 2011 - Thinking spatially with your open data
Drupal Day 2011 - Thinking spatially with your open dataDrupalDay
 
Thinking spatially with your open data
Thinking spatially with your open dataThinking spatially with your open data
Thinking spatially with your open dataTwinbit
 
Open Source GIS
Open Source GISOpen Source GIS
Open Source GISJoe Larson
 
Web Mapping with Drupal
Web Mapping with DrupalWeb Mapping with Drupal
Web Mapping with DrupalRanel Padon
 
Arnold webuquerque20110302
Arnold webuquerque20110302Arnold webuquerque20110302
Arnold webuquerque20110302lisaarn
 

Ähnlich wie Spatial Data processing with Hadoop (20)

How to empower community by using GIS lecture 1
How to empower community by using GIS lecture 1How to empower community by using GIS lecture 1
How to empower community by using GIS lecture 1
 
Large Scale Geo Processing on Hadoop
Large Scale Geo Processing on HadoopLarge Scale Geo Processing on Hadoop
Large Scale Geo Processing on Hadoop
 
Visualising large spatial databases and Building bespoke geodemographics
Visualising large spatial databases and Building bespoke geodemographicsVisualising large spatial databases and Building bespoke geodemographics
Visualising large spatial databases and Building bespoke geodemographics
 
Synthetic Data and Graphics Techniques in Robotics
Synthetic Data and Graphics Techniques in RoboticsSynthetic Data and Graphics Techniques in Robotics
Synthetic Data and Graphics Techniques in Robotics
 
Euro30 2019 - Benchmarking tree approaches on street data
Euro30 2019 - Benchmarking tree approaches on street dataEuro30 2019 - Benchmarking tree approaches on street data
Euro30 2019 - Benchmarking tree approaches on street data
 
SD-miner System to Retrieve Probabilistic Neighborhood Points in Spatial Dat...
SD-miner System to Retrieve Probabilistic Neighborhood Points  in Spatial Dat...SD-miner System to Retrieve Probabilistic Neighborhood Points  in Spatial Dat...
SD-miner System to Retrieve Probabilistic Neighborhood Points in Spatial Dat...
 
mini project_shortest path visualizer.pptx
mini project_shortest path visualizer.pptxmini project_shortest path visualizer.pptx
mini project_shortest path visualizer.pptx
 
Of Nodes and Maps (Web Mapping with Drupal - Part II)
Of Nodes and Maps (Web Mapping with Drupal - Part II)Of Nodes and Maps (Web Mapping with Drupal - Part II)
Of Nodes and Maps (Web Mapping with Drupal - Part II)
 
Field Data Collecting, Processing and Sharing: Using web Service Technologies
Field Data Collecting, Processing and Sharing: Using web Service TechnologiesField Data Collecting, Processing and Sharing: Using web Service Technologies
Field Data Collecting, Processing and Sharing: Using web Service Technologies
 
Walking in the Cloud: A New Paradigm in Geospatial World
Walking in the Cloud: A New Paradigm in Geospatial WorldWalking in the Cloud: A New Paradigm in Geospatial World
Walking in the Cloud: A New Paradigm in Geospatial World
 
Using python to analyze spatial data
Using python to analyze spatial dataUsing python to analyze spatial data
Using python to analyze spatial data
 
CrowdMap: Accurate Reconstruction of Indoor Floor Plan from Crowdsourced Sens...
CrowdMap: Accurate Reconstruction of Indoor Floor Plan from Crowdsourced Sens...CrowdMap: Accurate Reconstruction of Indoor Floor Plan from Crowdsourced Sens...
CrowdMap: Accurate Reconstruction of Indoor Floor Plan from Crowdsourced Sens...
 
Introduction of MAGO3D
Introduction of MAGO3DIntroduction of MAGO3D
Introduction of MAGO3D
 
Geolocation in Drupal
Geolocation in DrupalGeolocation in Drupal
Geolocation in Drupal
 
Drupal Day 2011 - Thinking spatially with your open data
Drupal Day 2011 - Thinking spatially with your open dataDrupal Day 2011 - Thinking spatially with your open data
Drupal Day 2011 - Thinking spatially with your open data
 
Thinking spatially with your open data
Thinking spatially with your open dataThinking spatially with your open data
Thinking spatially with your open data
 
Open Source GIS
Open Source GISOpen Source GIS
Open Source GIS
 
Web Mapping with Drupal
Web Mapping with DrupalWeb Mapping with Drupal
Web Mapping with Drupal
 
Arnold webuquerque20110302
Arnold webuquerque20110302Arnold webuquerque20110302
Arnold webuquerque20110302
 
1 mapreduce-fest
1 mapreduce-fest1 mapreduce-fest
1 mapreduce-fest
 

Spatial Data processing with Hadoop

  • 1. GeoJinni: Spatial Data Processing with Hadoop. Ahmed Eldawy, University of Minnesota. http://spatialhadoop.cs.umn.edu/ @spatialhadoop
  • 3. Claudius Ptolemy (AD 90 – AD 168)
  • 8. Cholera cases in the London epidemic of 1854
  • 12. "Cool technology..!! Can I use it in my application?" "My pleasure. Here it is.." "Oh..!! But, it is not made for me. Can’t make use of it as is."
  • 14. "Kindly let me get the technology you have." "Kindly let me understand your needs."
  • 16. "HELP..!! I have too much data. Your technology is not helping me." "Mmm… Let me check with my good friends there." "Cool DBMS technology..!! Can I use it in my application?" "My pleasure. Here it is.." "Oh..!! But, it is not made for me. Can’t make use of it as is."
  • 18. "Kindly let me understand your needs." "Kindly let me get the technology you have."
  • 23. "HELP..!! Again, I have too much data. Your technology is not helping me." "Sorry, seems like the DBMS technology cannot scale more." "Let me check with my other good friends there." "Cool MapReduce technology..!! Can I use it in my application?" "My pleasure. Here it is.." "Oh..!! But, it is not made for me. Can’t make use of it as is."
  • 25. "Kindly let me understand your needs." "Kindly let me get the technology you have."
  • 26. "Kindly let me understand your needs." "Kindly let me get the technology you have." aka GeoJinni
  • 27. Tons of spatial data out there…: VGI, sensor networks, smart phones, satellite images, medical data, traffic data, geotagged microblogs, geotagged pictures.
  • 28. GeoJinni: spatial language, built-in spatial data types, spatial indexes, spatial operations. Released in March 2013; 75,000 downloads since then. Website: http://spatialhadoop.cs.umn.edu/ Download source code, binary distribution, and instructions. Email us at: shadoop@cs.umn.edu
  • 29. The Built-in Approach of GeoJinni. Hadoop stack: User Programs; Pig Latin; Hadoop Java APIs; Job Monitoring and Scheduling; MapReduce Runtime; Storage (HDFS). The On-top Approach: Spatial Modules on top of the Hadoop stack. From Scratch Approach: (Spatial) User Program + MapReduce APIs + Job Monitoring and Scheduling + MapReduce Runtime + Storage + … The Built-in Approach (GeoJinni): Spatial Language, Spatial Operators, Early Pruning, and Spatial Indexing inside the Hadoop stack.
  • 30. Spatial Data & Hadoop. Hadoop: points = LOAD 'points' AS (id:int, x:int, y:int); result = FILTER points BY x < xmax AND x >= xmin AND y < ymax AND y >= ymin; (takes 193 seconds). GeoJinni: points = LOAD 'points' AS (id:int, location:point); result = FILTER points BY IsOverlap(location, rectangle(xmin, ymin, xmax, ymax)); (finishes in 2 seconds).
  • 31. GeoJinni Architecture. Applications: MNTG [SSTD’13, ICDE’14] – SHAHED [ICDE’15] – TAREEG [SIGMOD’14, SIGSPATIAL’14]; Spatio-temporal Hadoop. Language: Pigeon [ICDE’14]. Operations: Basic [VLDB’13] – CG_Hadoop [SIGSPATIAL’13] – Data Mining – Visualization [Under submission]. MapReduce: Spatial File Splitter – Spatial Record Reader. Indexing: Grid File – R-tree – R+-tree [ICDE’15].
  • 32. Language Layer: Pigeon. Extends Pig Latin with OGC-compliant primitives: spatial data types (e.g., Polygon); basic operations (e.g., Area); spatial predicates (e.g., Touches); spatial analysis (e.g., Union); spatial aggregate functions (e.g., Convex Hull). Example: cities = LOAD 'cities' AS (city_id: int, city_geom); City_area = FOREACH cities GENERATE Area(city_geom) AS area; Reference: A. Eldawy and M. F. Mokbel. Pigeon: A Spatial MapReduce Language. In ICDE, 2014.
  • 34. Indexing Layer: Grid File
  • 36. Range Query. SpatialFileSplitter prunes blocks outside the query range. SpatialRecordReader passes local indexes to the map function. The map function selects records in range.
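The pruning idea on slides 34 and 36 can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not GeoJinni's Java implementation: `build_grid_index`, `cell_overlaps`, and `range_query` are hypothetical names, a dictionary of grid cells stands in for HDFS blocks, and the range test uses the same half-open bounds as the Pig Latin filter on slide 30.

```python
from collections import defaultdict

def build_grid_index(points, cell_size):
    """Group (x, y) points by the grid cell that contains them."""
    index = defaultdict(list)
    for x, y in points:
        cell = (int(x // cell_size), int(y // cell_size))
        index[cell].append((x, y))
    return index

def cell_overlaps(cell, cell_size, query):
    """True if the cell's rectangle intersects the query rectangle."""
    cx, cy = cell
    xmin, ymin, xmax, ymax = query
    return (cx * cell_size < xmax and (cx + 1) * cell_size > xmin and
            cy * cell_size < ymax and (cy + 1) * cell_size > ymin)

def range_query(index, cell_size, query):
    """Return the matching points and the number of cells actually scanned."""
    xmin, ymin, xmax, ymax = query
    result, scanned = [], 0
    for cell, records in index.items():
        if not cell_overlaps(cell, cell_size, query):
            continue  # pruned: this cell (block) is never read
        scanned += 1
        # Only records in the surviving cells are tested one by one.
        result.extend(p for p in records
                      if xmin <= p[0] < xmax and ymin <= p[1] < ymax)
    return sorted(result), scanned

points = [(1, 1), (2, 8), (5, 5), (6, 4), (9, 9), (12, 3)]
index = build_grid_index(points, cell_size=4)
matches, scanned = range_query(index, 4, (4, 4, 8, 8))
print(matches, scanned, len(index))
```

In this toy dataset only one of the five occupied cells overlaps the query rectangle, so the other four are skipped without touching their records; that skipped work is what the block-level pruning step is meant to save.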
  • 37. CG_Hadoop. Makes use of GeoJinni to speed up computational geometry algorithms: polygon union, skyline, convex hull, farthest/closest pair. Single-machine implementation: e.g., the skyline of 4 billion points takes three hours. Straightforward implementation in Hadoop: Hadoop parallel execution. More efficient implementation in GeoJinni: spatial indexing and early pruning. Free and open source as part of GeoJinni. Speedup chart: Single Machine 1x, Hadoop 29x, GeoJinni 260x.
  • 38. Convex Hull. Find the minimal convex polygon that contains all points. (Figure panels: Input, Output.)
  • 39. Convex Hull in CG_Hadoop. (Figure labels: Hadoop, CG_Hadoop; Partition, Pruning, Local hull, Global hull.)
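The partition, local hull, and global hull steps named on slide 39 can be sketched as follows. This is a minimal single-process Python sketch, not the CG_Hadoop code: the hull routine is Andrew's monotone chain (chosen here for brevity, the slides do not say which algorithm CG_Hadoop uses), `partition` and `distributed_hull` are hypothetical names, the partitioning is a simple split by x-order, and the separate pruning step is left out.

```python
def cross(o, a, b):
    """Cross product of vectors OA and OB; positive for a left turn."""
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])

def convex_hull(points):
    """Andrew's monotone chain; returns hull vertices counter-clockwise."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts
    lower, upper = [], []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    # The last point of each chain is the first point of the other.
    return lower[:-1] + upper[:-1]

def partition(points, n_parts):
    """Split the points into roughly equal chunks by x-order."""
    pts = sorted(points)
    size = -(-len(pts) // n_parts)  # ceiling division
    return [pts[i:i + size] for i in range(0, len(pts), size)]

def distributed_hull(points, n_parts):
    # "Map" side: each partition computes its local hull independently.
    local_hulls = [convex_hull(part) for part in partition(points, n_parts)]
    # "Reduce" side: the global hull only needs the local hull vertices.
    candidates = [p for hull in local_hulls for p in hull]
    return convex_hull(candidates), len(candidates)

points = [(x, y) for x in range(5) for y in range(5)]  # 5 x 5 grid, 25 points
hull, n_candidates = distributed_hull(points, n_parts=5)
print(hull, n_candidates)
```

The sketch relies on the fact that a point on the global hull must also be on the hull of its own partition, so the final step sees 10 candidate points here instead of all 25.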
  • 40. Map rendering. Map rendering creates an image that represents the data. Visualization is an international language. Can reveal patterns that are otherwise hard to spot. The visual system occupies about one third of the human brain. Sample input records: 210 LINESTRING (-2.3634904 51.3845649, -2.3634254 51.3843983, -2.3631927 51.3838436) [highway#primary,ref#A4,name#Gay Street]; 420 LINESTRING (-1.8230973 52.5541131, -1.8230368 52.5540756, -1.8229324 52.5540109, -1.8227961 52.5539014, -1.8227365 52.5538461, -1.8226952 52.5538058, -1.8226204 52.5537103, -1.8223988 52.5534041, -1.8221814 52.5531498, -1.8218478 52.5528188, -1.8215581 52.5525626, -1.8213525 52.5524042) [source#GPS Survey,highway#residential,postal_code#B72,name#Moss Drive,is_in#Sutton Coldfield,maxspeed#30,abutters#residential]; 490 LINESTRING (-0.1896508 51.6456414, -0.1895803 51.6456036, -0.1895245 51.645551, -0.1890055 51.6450801, -0.1887808 51.6448764, -0.1885605 51.6446756, -0.1883084 51.6443753, -0.1875496 51.6433375, -0.1864572 51.6415288, -0.1862165 51.6411939, -0.1859495 51.6406583, -0.1858855 51.6405461) [lit#yes,surface#asphalt,maxspeed#30 mph,highway#residential,abutters#residential,name#Sherrards Way]; 770 LINESTRING (-1.8184653 52.5723683, -1.8182353 52.5723576, - …
  • 41. Smoothing. (Figure panels: Input, Buffer Only, Buffer + Merge.)
  • 42. Multi-level Image. Many images at different zoom levels: pan, zoom in/out, fly to. More details as the zoom level increases.
  • 43. MNTG: World-wide traffic generator for road networks. http://mntg.cs.umn.edu/ M. F. Mokbel, L. Alarabi, J. Bao, A. Eldawy, A. Magdy, M. Sarwat, E. Waytas, and S. Yackel. MNTG: An Extensible Web-based Traffic Generator. In SSTD, 2013
  • 44. SHAHED: A tool for querying and visualizing spatio-temporal satellite data. http://shahed.cs.umn.edu/ "SHAHED: A MapReduce-based System for Querying and Visualizing Spatio-temporal Satellite Data", Ahmed Eldawy et al., ICDE 2015
  • 46. Smooth World Temperature
  • 47. World Heat Map on Google Earth
  • 48. TAREEG: Web-based extractor for OpenStreetMap data using MapReduce. http://tareeg.net/ L. Alarabi, A. Eldawy, R. Alghamdi, and M. F. Mokbel. TAREEG: A MapReduce-Based Web Service for Extracting Spatial Data from OpenStreetMap. In SIGMOD, 2014
  • 49. Extracted Road Network
  • 50. GeoJinni: Analyze your spatial data efficiently. Data types (built-in spatial data types): have all your spatial datasets ready to load in SpatialHadoop with built-in spatial data types. Language (spatial high level language): interact with the system and express your queries in a simple high level language with the built-in spatial support. Indexes (spatial indexes): datasets are organized efficiently using spatial indexes (Grid or R-tree) that are adapted to MapReduce. Operations (efficient spatial operations): analyze your data on large clusters with built-in spatial operations that run efficiently using spatial indexes. Website: http://spatialhadoop.cs.umn.edu/ Download source code, binary distribution, and instructions. Email us at: shadoop@cs.umn.edu