SlideShare ist ein Scribd-Unternehmen logo
1 von 56
Geo-spatial Research for Impact:
Beyond Analysis  Towards Synthesis
Ramesh Raskar
Assoc. Professor
MIT Media Lab
with
Nikhil Naik
Ilke Demir
Poverty, Pop-density, Crop, Methane .. maps
Google Street View
Online User Photos: Social voting in the real world
Phototourism by Seitz at al
Analysis: Perceived Safety
Naik, Raskar, Hidalgo 2016
Analysis  Synthesis
Street Address: Assign | Adopt
• 75% of the world population without street addresses
• Timely ambulance delivery
• Stimulate digital economy via eCommerce
• Protect property rights of the marginalized
• Crisis response (Haiti, 48 hrs to coord aid)
Beyond Analysis  Towards Synthesis
Actionable Insights
Traffic Nudge
Crisis Response
Street Addr
Govt Policies
Large Scale Visual Study
Visual Census
Socio-eco observations
Sentiment maps
Occasional Frequent
Low
Quality
Census
High
Quality
Satellite,
Street view
Phone, Mobility,
Social, Financial
AV,
PlanetLabs etc
Sense
Process
Respond
Capture Analyze Act
Low
Level
Mid
Level
High
Level
Capture Analyze Act
Low
Level
Street-level
Measure
•Visual Census
•Counting/Density
Alert
•Traffic Interventions
Mid
Level
Aerial
Understand
•Geolocalization
•Crowds
•3D Modeling
Assist
•Crisis Response
•Agriculture
•Insurance/Hedge
Funds
High
Level
Satellite
Predict
• Economic activity
Change
•Street Addr
•Government planning
Capture  Present  Future
Low
Level
Street-level
Street View
Crowdsourcing
(mapillary)
Traffic Cameras
Autonomous
Vehicles
Cloud Dashcams
Mid
Level
Aerial
Commercial airplanes
Amateur Drones
Autonomous
Drones
High
Level
Satellite
Meter-resolution
Limited coverage for
high-res
Sub-meter, real-
time, entire Earth
(Planet Lab,
OneWeb)
Ack: Nikhil Naik
Analyze Present  Future
Low
Level
Measure
•Visual Census
•Counting/Density
Streetscore,
Streetchange,
Visual Census
(Fei-Fei Li)
Generative
modeling of
cities
Mid
Level
Understand
•Geolocalization
•Crowds
•3D Modeling
DeepRoadMap
per (Urtasun et
al.)
Design
Suggestions
High
Level
Aggregate and
Predict
• Economic activity
Jean and
Ermon
Jayachandran
Deforestation
Study
Real-time data
analysis.
Actionable
information
Socio-Economic Inference from Digital Patterns
Satellite ImageryStreet-level Imagery Aerial Imagery
Phone Mobility Data
Social Networks, Photos
Device
Activity
Built
Environment
Human
Activity
84 countries, no data
Ack: Nikhil Naik
400K Google StreetViews
Socio Economic and Physical environment
4k Images, 200K pairwise comparisons
Place Pulse 1.0
Naik, Raskar, Hidalgo 2016
1.7K Ranked 400K Predicted
Naik, Raskar, Hidalgo 2016
New York Boston
Detroit
Chicago
Validated with Crime Stats
streetscore.media.mit.edu
105 Kent Avenue, Brooklyn, NY
August 2007
Streetscore = 1.8/10
105 Kent Avenue, Brooklyn, NY
September 2014
Streetscore = 7.2/10
Urban Growth in New York
2007 - 2014
Raskar, Camera Culture, MIT Media Lab
http://cameraculture.info
MIT Media Lab
Ilke Demir, Nikhil Naik
Capture Analyze Act
Low
Level
Street-level
Measure
•Visual Census
•Counting/Density
Alert
•Traffic Interventions
Mid
Level
Aerial
Understand
•Geolocalization
•Crowds
•3D Modeling
Assist
•Crisis Response
•Agriculture
•Insurance/Hedge
Funds
High
Level
Satellite
Aggregate and Predict
• Economic activity
Change
•Street Addr
•Government planning
Example: Congestion Pricing
(Nikhil Naik et al.)
• Jakarta: Response of vehicles by types
• CV + traffic cameras across the city to detect types + number of vehicles
• Understand traffic flows before/after congestion pricing is introduced
• Adjust rates for different types of vehicles/ in different areas
Street Address: Assign | Adopt
• 75% of the world population
without street addresses
• Timely ambulance delivery
• Stimulate digital economy via eCommerce
• Protect property rights of the marginalized
• Crisis response (Haiti, 48 hrs to coord aid)
Street Addresses
from Satellite Imagery
İlke Demir, Forest Hughes,
Aman Raj, Kaunil Dhruv,
Suryanarayana M. Muddala,
Sanyam Garg, Barrett Doo,
Ramesh Raskar
IJCG 2018
Thanks
Nikhil Naik Tristan Swedish Praneeth Vepakomma Ilke Demir
Forest Hughes Jatin Malhotra SuryaNarayana Murthy
Kavnil Dhruv Aman Raj Barrett Doo Praveen Gedam Anna Roy
Cesar A. Hidalgo Guan Pang Jing Huang Daniel Aliaga
Manohar Paluri Pierre Roux Yael Maguire Leo Tsourides Divyaa
Ravichandran Sanyam Garg Sai Sri Sathya Grace Kermani
Tobias Tiecke Andreas Gros Santanu Bhattacharya
Kabir Rustogi Will Marshall
Pilot 1:
Will people use street names?
In use After 6 monthsCommunity assigned names + Signs
Pilot 2
Will SMEs use the labeling scheme?
Measured relative efficiency for Pizza delivery
Pilot 3
Suitable for E-commerce work-flow?
What3words
parrot.casino.failed
Robocodes
75D. Road NE27.
Dhule.MhIn
Landmarks
Green Park
Poor Coverage
Lat/lon
Goal: Memorable Geocoding
Physical
1050 Market St,
Fresno, CA, US
.
What3words:
A: parrot.casino.failed
B: issuer.lollipop.ripe
- Irrelevant words
based on lat/lon.
Robocodes:
75D.NE27.Dhule.MhIn
76C.NE27.Dhule.MhIn
- Hierarchical and
linear addresses.
Google Maps:
Near Green Park
Near Green Park
- No street names or numbers
Point vs Edges for Geometric Queries
Addressing Schemes Around the World
London postal code system:
Radial regions based on orientation and distance
South Korea streets:
Meter markers
Japan block system:
Hard to decipher
Dubai addressing:
Uses districts
Berlin numbering:
Zigzag house pattern
Robocode Scheme
• 5 alphanumeric fields
• Hierarchical and linear descriptors
• To close the gap between physical
addresses and automated geocoding
Road naming scheme:
- distance from the center
- orientation in odd parity
i.e. WB14
Region naming scheme:
- orientation wrt downtown
- distance from downtown
i.e. WB
House numbering scheme:
- meter markers on the road
- block letters from the road
i.e. 38K WB14
“I7 Hacker Way, Menlo Park, CA, US”
Design Choices
Linear: similar addresses stored in a linear fashion
Hierarchical: top-down structure for spatial encapsulation
Compressible: 5x4 max (chars x words)
Universal: independent of local language
Inquirable: useful for geometric, proximity-based, and type-ahead queries
Extendible: dynamically modifiable for new places
Robust: flexible for overestimation and noise
StructuralDesignParameters
forefficientcomputerimplementation
Linear: closer addresses are given related names
Hierarchical: top-down subdivision of the world
Memorable: short and alphanumeric, easily convertible
Intuitive: with a sense of direction and distance
Topological: consistent with road topology
Inclusive: with local names (city, state)
Physical: consistent with natural boundaries
SemanticDesignParameters
foruserfriendliness
Machine
Needs
Human
Needs
Our Pipeline
Satellite Images Predictions Road Segments
Clustering
Segmentation
RegionsRoad IDsMarkers + Blocks
Labeling
Satellite Images
ç
• Irregular urban structure
• Illumination/weather/country
• Different road types
Segmentation
• Binary road masks
• 19K*19K, 0.5m/pixel
• SegNet
Extracting Road Segments
• Orientation based median filtering
• Road segments by orientation bucketing
NF
NH
NE
Region Creation
• Road graph: Node=intersection,
edge=road, weight=length
• Partition for max inter, min intra
connectivity, using normalized min-cut.
• 𝑛 𝑚𝑎𝑥 = 𝑐𝑒𝑖𝑙
𝑟𝑜𝑎𝑑𝑠
88
Region and Road Naming
• Cmax
𝑟𝑜𝑎𝑑𝑠
𝐴
→ 𝐶𝐴 (downtown)
• Orientation bucketing into N, S, W, E
• Trace regions based on distance to CA
• Orientation bucketing into major axes
• Trace roads based on order
Offsetting and Meter Marking
• 5 meter marker along the road
• Odd/even based on RHR
• Distance field of roads: block offset
Unmapped Developing Country
Regions follow natural boundaries
Street Address with Robocodes
• From Satellite Imagery to Deployed Street Addresses
• Generative address : linear, hierarchical, and intuitive
• Human friendly rather than machine friendly
Occasional Frequent
Low
Quality
Census
High
Quality
Satellite,
Street view
Phone, Mobility,
Social, Financial
AV,
PlanetLabs etc
Sense
Process
Respond
Capture Analyze  Act
Low
Level
Street-level
Measure
•Visual Census
•Counting/Density
Alert
•Traffic Interventions
Mid
Level
Aerial
Understand
•Geolocalization
•Crowds
•3D Modeling
Assist
•Crisis Response
•Agriculture
•Insurance/Hedge
Funds
High
Level
Satellite
Aggregate and Predict
• Economic activity
Change
•Street Addr
•Government planning
Pervasive
Recording,
Incentives,
Distributed
ownership,
Privacy,
Full proof
authenticity,
Equality
Act Alert Assist Change
Low/Mi
d/High
Level
Alert about the
state of
people/economy/b
uilt environment
(e.g., predict crop
yield from satellite
imagery, predict
insurance price
from street view)
Assist in acting on
information by
providing
suggestions based
on data
(e.g., design
optimal congestion
pricing based on
detected cars,
design crisis
response in
hurricanes)
?
Inaccessible Areas
• To extend our format to cover areas that are not accessible by
streets, we explored different implementations to cover such
areas, which are 26*5 m away from any street.
• Geocoding as a function (excluding the version field):
f (info, lat, lon) = x.y.z.t
• For places with roads, info={road network, city, country}
f (R, C) = x.y.city.country
• Extreme case: only reliable information is latitude/longitude!
52
f(C,lat,lon) = hash(round(lat,3)) + dir(lat) .
hash(round(lon,3)) +dir(lon) . C
L-A-T-dir.L-O-N-dir.name.area
Inaccessible Areas: Blackholes!
• Linear hashing:
• 26 letters + 10 digits
• 100m x 100 m granularity
• Last letter is the hemisphere
• Range: 359.999, longitude: 7PRZ W
• Hierarchical hashing:
• Enlarge the grid from to 1 km x 1 km
• Using two floating points = three letters
• Within each cell, re-hash it to a 36 x 36 grid = one letter
• New resolution: 30m, represented by five letters
53
f(C,lat,lon) = hash(round(lat,2)) + hash(lat - round(lat,2)) + dir(lat) .
hash(round(lon,2)) + hash(lon - round(lon,2)) + dir(lon) . C
LlatLlatHlatDlat .LlonLlonHlonDlon . name . Ocean /Continent /etc
Thanks!
What next?
Today
Tomorrow
Friday
A month
• Robocode.info
• Join our presentation in CVPR WiCV. Friday 10am
• Join us with your new ideas at SIGGRAPH 2018 Maps & Urban Data session.
Code: https://github.com/facebookresearch/street-addresses
Paper: https://research.fb.com/publications/generative-street-
addresses-from-satellite-imagery/
Bonus: Blackholes!
Main aim: f(<place>)=robocode
Base case:
<place> = <house, street, city, country>
“12C.NA14.PALO.CAUS”
No street:
<place> = <lat, lon, city, country>
“F12.HN3.PALO.CAUS”
No city/country:
<place> = <lat, lon, other info (ocean, dessert, etc.)>
“JK3.3DF.PAC.OCEA”
Region Experiments
• Experimented with (a) normalized min-cut, (b) Newman-
Girvan, (c) modularity based partitioning.
• Experimented with image based methods (superpixels,
region growing) and the dual of the road graph.
• Evaluated with urban rules (geography, population,
road distribution)
Output Maps and Tools
• .osm maps with roads (meter marking and offsetting on the fly)
• ID-tool of MapBox for on-demand inverse/forward geocoding
• rtree extension for efficient spatial querying
• Experimental mobile app for self navigation
• 21.7% decrease in arrival time using Robocodes
Analyze: Three Types of Outputs
1. Semantic
2. Objective
Population Density
76000/sq. mile
3. Qualitative
Assign a semantic
label to each pixel
Label road quality
as
“Bad” or “Good”

Weitere ähnliche Inhalte

Ähnlich wie Geo-spatial Research: Transition from Analysis to Synthesis

Improve site selection and network planning with Dynamic Demographics
Improve site selection and network planning with Dynamic DemographicsImprove site selection and network planning with Dynamic Demographics
Improve site selection and network planning with Dynamic DemographicsPrecisely
 
Data-driven urbanism (Amsterdam, Jan 2017)
Data-driven urbanism (Amsterdam, Jan 2017)Data-driven urbanism (Amsterdam, Jan 2017)
Data-driven urbanism (Amsterdam, Jan 2017)robkitchin
 
The Smart City Opportunity - The CTO Perspective
The Smart City Opportunity - The CTO PerspectiveThe Smart City Opportunity - The CTO Perspective
The Smart City Opportunity - The CTO PerspectiveCrowdsourcing Week
 
BDVe Webinar Series - QROWD: The Human Factor in Big Data
BDVe Webinar Series - QROWD: The Human Factor in Big DataBDVe Webinar Series - QROWD: The Human Factor in Big Data
BDVe Webinar Series - QROWD: The Human Factor in Big DataBig Data Value Association
 
BDVe Webinar Series - QROWD: The Human Factor in Big Data
BDVe Webinar Series - QROWD: The Human Factor in Big DataBDVe Webinar Series - QROWD: The Human Factor in Big Data
BDVe Webinar Series - QROWD: The Human Factor in Big DataBig Data Value Association
 
The Real-Time City? Data-driven, networked urbanism and the production of sm...
The Real-Time City? Data-driven, networked urbanism  and the production of sm...The Real-Time City? Data-driven, networked urbanism  and the production of sm...
The Real-Time City? Data-driven, networked urbanism and the production of sm...robkitchin
 
Understanding Human Mobility
Understanding Human MobilityUnderstanding Human Mobility
Understanding Human MobilityWidy Widyawan
 
Smart cities: realising the promises while minimizing the perils
Smart cities: realising the promises while minimizing the perilsSmart cities: realising the promises while minimizing the perils
Smart cities: realising the promises while minimizing the perilsrobkitchin
 
Human factor in big data qrowd bdve
Human factor in big data qrowd bdveHuman factor in big data qrowd bdve
Human factor in big data qrowd bdveLuis Daniel Ibáñez
 
NDGeospatialSummit2022 - A Geographic Approach to Public Works
NDGeospatialSummit2022 - A Geographic Approach to Public WorksNDGeospatialSummit2022 - A Geographic Approach to Public Works
NDGeospatialSummit2022 - A Geographic Approach to Public WorksNorth Dakota GIS Hub
 
Geographical information system : GIS and Social Media
Geographical information system : GIS and Social Media Geographical information system : GIS and Social Media
Geographical information system : GIS and Social Media Imran Ghaznavi
 
UrbanFootprint at NPC 2018
UrbanFootprint at NPC 2018UrbanFootprint at NPC 2018
UrbanFootprint at NPC 2018UrbanFootprint
 
Access to markets, technologies, and services (Carlo Azzarri, IFPRI)
Access to markets, technologies, and services (Carlo Azzarri, IFPRI)Access to markets, technologies, and services (Carlo Azzarri, IFPRI)
Access to markets, technologies, and services (Carlo Azzarri, IFPRI)ExternalEvents
 
La telefonía móvil como fuente de información para el estudio de la movilidad...
La telefonía móvil como fuente de información para el estudio de la movilidad...La telefonía móvil como fuente de información para el estudio de la movilidad...
La telefonía móvil como fuente de información para el estudio de la movilidad...Esri España
 

Ähnlich wie Geo-spatial Research: Transition from Analysis to Synthesis (20)

Improve site selection and network planning with Dynamic Demographics
Improve site selection and network planning with Dynamic DemographicsImprove site selection and network planning with Dynamic Demographics
Improve site selection and network planning with Dynamic Demographics
 
Data-driven urbanism (Amsterdam, Jan 2017)
Data-driven urbanism (Amsterdam, Jan 2017)Data-driven urbanism (Amsterdam, Jan 2017)
Data-driven urbanism (Amsterdam, Jan 2017)
 
The Smart City Opportunity - The CTO Perspective
The Smart City Opportunity - The CTO PerspectiveThe Smart City Opportunity - The CTO Perspective
The Smart City Opportunity - The CTO Perspective
 
Gis dr rahul
Gis dr rahulGis dr rahul
Gis dr rahul
 
BDVe Webinar Series - QROWD: The Human Factor in Big Data
BDVe Webinar Series - QROWD: The Human Factor in Big DataBDVe Webinar Series - QROWD: The Human Factor in Big Data
BDVe Webinar Series - QROWD: The Human Factor in Big Data
 
BDVe Webinar Series - QROWD: The Human Factor in Big Data
BDVe Webinar Series - QROWD: The Human Factor in Big DataBDVe Webinar Series - QROWD: The Human Factor in Big Data
BDVe Webinar Series - QROWD: The Human Factor in Big Data
 
The Real-Time City? Data-driven, networked urbanism and the production of sm...
The Real-Time City? Data-driven, networked urbanism  and the production of sm...The Real-Time City? Data-driven, networked urbanism  and the production of sm...
The Real-Time City? Data-driven, networked urbanism and the production of sm...
 
Understanding Human Mobility
Understanding Human MobilityUnderstanding Human Mobility
Understanding Human Mobility
 
Smart cities: realising the promises while minimizing the perils
Smart cities: realising the promises while minimizing the perilsSmart cities: realising the promises while minimizing the perils
Smart cities: realising the promises while minimizing the perils
 
What is-gis
What is-gisWhat is-gis
What is-gis
 
Human factor in big data qrowd bdve
Human factor in big data qrowd bdveHuman factor in big data qrowd bdve
Human factor in big data qrowd bdve
 
Basic Gis
Basic GisBasic Gis
Basic Gis
 
NDGeospatialSummit2022 - A Geographic Approach to Public Works
NDGeospatialSummit2022 - A Geographic Approach to Public WorksNDGeospatialSummit2022 - A Geographic Approach to Public Works
NDGeospatialSummit2022 - A Geographic Approach to Public Works
 
Geographical information system : GIS and Social Media
Geographical information system : GIS and Social Media Geographical information system : GIS and Social Media
Geographical information system : GIS and Social Media
 
UrbanFootprint at NPC 2018
UrbanFootprint at NPC 2018UrbanFootprint at NPC 2018
UrbanFootprint at NPC 2018
 
Access to markets, technologies, and services (Carlo Azzarri, IFPRI)
Access to markets, technologies, and services (Carlo Azzarri, IFPRI)Access to markets, technologies, and services (Carlo Azzarri, IFPRI)
Access to markets, technologies, and services (Carlo Azzarri, IFPRI)
 
Intro_GIS.ppt
Intro_GIS.pptIntro_GIS.ppt
Intro_GIS.ppt
 
CKX: Wellbeing Toronto - More Than Just a Map
CKX: Wellbeing Toronto - More Than Just a MapCKX: Wellbeing Toronto - More Than Just a Map
CKX: Wellbeing Toronto - More Than Just a Map
 
Building a City Dashboard
Building a City DashboardBuilding a City Dashboard
Building a City Dashboard
 
La telefonía móvil como fuente de información para el estudio de la movilidad...
La telefonía móvil como fuente de información para el estudio de la movilidad...La telefonía móvil como fuente de información para el estudio de la movilidad...
La telefonía móvil como fuente de información para el estudio de la movilidad...
 

Mehr von Camera Culture Group, MIT Media Lab

God’s Eye View: Will global AI empower us or destroy us? | Ramesh Raskar
God’s Eye View: Will global AI empower us or destroy us? | Ramesh Raskar God’s Eye View: Will global AI empower us or destroy us? | Ramesh Raskar
God’s Eye View: Will global AI empower us or destroy us? | Ramesh Raskar Camera Culture Group, MIT Media Lab
 
Dont follow the rainbow: How to avoid career traps that can lead you to fail,...
Dont follow the rainbow: How to avoid career traps that can lead you to fail,...Dont follow the rainbow: How to avoid career traps that can lead you to fail,...
Dont follow the rainbow: How to avoid career traps that can lead you to fail,...Camera Culture Group, MIT Media Lab
 
Making Invisible Visible, Ramesh Raskar Keynote at Embedded Vision 2019
Making Invisible Visible, Ramesh Raskar Keynote at Embedded Vision 2019Making Invisible Visible, Ramesh Raskar Keynote at Embedded Vision 2019
Making Invisible Visible, Ramesh Raskar Keynote at Embedded Vision 2019Camera Culture Group, MIT Media Lab
 
Split Learning versus Federated Learning for Data Transparent ML, Camera Cult...
Split Learning versus Federated Learning for Data Transparent ML, Camera Cult...Split Learning versus Federated Learning for Data Transparent ML, Camera Cult...
Split Learning versus Federated Learning for Data Transparent ML, Camera Cult...Camera Culture Group, MIT Media Lab
 

Mehr von Camera Culture Group, MIT Media Lab (20)

Raskar Sig2017 Siggraph Achievement Award Talk
Raskar Sig2017 Siggraph Achievement Award TalkRaskar Sig2017 Siggraph Achievement Award Talk
Raskar Sig2017 Siggraph Achievement Award Talk
 
Lost Decade of Computational Photography
Lost Decade of Computational PhotographyLost Decade of Computational Photography
Lost Decade of Computational Photography
 
Covid Safe Paths
Covid Safe PathsCovid Safe Paths
Covid Safe Paths
 
God’s Eye View: Will global AI empower us or destroy us? | Ramesh Raskar
God’s Eye View: Will global AI empower us or destroy us? | Ramesh Raskar God’s Eye View: Will global AI empower us or destroy us? | Ramesh Raskar
God’s Eye View: Will global AI empower us or destroy us? | Ramesh Raskar
 
Dont follow the rainbow: How to avoid career traps that can lead you to fail,...
Dont follow the rainbow: How to avoid career traps that can lead you to fail,...Dont follow the rainbow: How to avoid career traps that can lead you to fail,...
Dont follow the rainbow: How to avoid career traps that can lead you to fail,...
 
Raskar PhD and MS Thesis Guidance
Raskar PhD and MS Thesis GuidanceRaskar PhD and MS Thesis Guidance
Raskar PhD and MS Thesis Guidance
 
Making Invisible Visible, Ramesh Raskar Keynote at Embedded Vision 2019
Making Invisible Visible, Ramesh Raskar Keynote at Embedded Vision 2019Making Invisible Visible, Ramesh Raskar Keynote at Embedded Vision 2019
Making Invisible Visible, Ramesh Raskar Keynote at Embedded Vision 2019
 
Augmented Surgeons: AI AR for Anatome, Raskar Aria 2019
Augmented Surgeons: AI AR for Anatome, Raskar Aria 2019Augmented Surgeons: AI AR for Anatome, Raskar Aria 2019
Augmented Surgeons: AI AR for Anatome, Raskar Aria 2019
 
Split Learning versus Federated Learning for Data Transparent ML, Camera Cult...
Split Learning versus Federated Learning for Data Transparent ML, Camera Cult...Split Learning versus Federated Learning for Data Transparent ML, Camera Cult...
Split Learning versus Federated Learning for Data Transparent ML, Camera Cult...
 
Unspoken Challenges in AR and XR
Unspoken Challenges in AR and XRUnspoken Challenges in AR and XR
Unspoken Challenges in AR and XR
 
Raskar stanfordextremecompuimagingapr2016
Raskar stanfordextremecompuimagingapr2016Raskar stanfordextremecompuimagingapr2016
Raskar stanfordextremecompuimagingapr2016
 
What is SIGGRAPH NEXT? Intro by Ramesh Raskar
What is SIGGRAPH NEXT? Intro by Ramesh RaskarWhat is SIGGRAPH NEXT? Intro by Ramesh Raskar
What is SIGGRAPH NEXT? Intro by Ramesh Raskar
 
What is Media in MIT Media Lab, Why 'Camera Culture'
What is Media in MIT Media Lab, Why 'Camera Culture'What is Media in MIT Media Lab, Why 'Camera Culture'
What is Media in MIT Media Lab, Why 'Camera Culture'
 
Raskar UIST Keynote 2015 November
Raskar UIST Keynote 2015 NovemberRaskar UIST Keynote 2015 November
Raskar UIST Keynote 2015 November
 
Multiview Imaging HW Overview
Multiview Imaging HW OverviewMultiview Imaging HW Overview
Multiview Imaging HW Overview
 
Time of Flight Cameras - Refael Whyte
Time of Flight Cameras - Refael WhyteTime of Flight Cameras - Refael Whyte
Time of Flight Cameras - Refael Whyte
 
Leap Motion Development (Rohan Puri)
Leap Motion Development (Rohan Puri)Leap Motion Development (Rohan Puri)
Leap Motion Development (Rohan Puri)
 
Compressed Sensing - Achuta Kadambi
Compressed Sensing - Achuta KadambiCompressed Sensing - Achuta Kadambi
Compressed Sensing - Achuta Kadambi
 
Coded Photography - Ramesh Raskar
Coded Photography - Ramesh RaskarCoded Photography - Ramesh Raskar
Coded Photography - Ramesh Raskar
 
Stereo and 3D Displays - Matt Hirsch
Stereo and 3D Displays - Matt HirschStereo and 3D Displays - Matt Hirsch
Stereo and 3D Displays - Matt Hirsch
 

Kürzlich hochgeladen

Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityIES VE
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...AliaaTarek5
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Scott Andery
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...panagenda
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesThousandEyes
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 

Kürzlich hochgeladen (20)

Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a reality
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 

Geo-spatial Research: Transition from Analysis to Synthesis

  • 1. Geo-spatial Research for Impact: Beyond Analysis  Towards Synthesis Ramesh Raskar Assoc. Professor MIT Media Lab with Nikhil Naik Ilke Demir
  • 2. Poverty, Pop-density, Crop, Methane .. maps
  • 4. Online User Photos: Social voting in the real world Phototourism by Seitz at al
  • 5. Analysis: Perceived Safety Naik, Raskar, Hidalgo 2016
  • 7. Street Address: Assign | Adopt • 75% of the world population without street addresses • Timely ambulance delivery • Stimulate digital economy via eCommerce • Protect property rights of the marginalized • Crisis response (Haiti, 48 hrs to coord aid)
  • 8. Beyond Analysis  Towards Synthesis Actionable Insights Traffic Nudge Crisis Response Street Addr Govt Policies Large Scale Visual Study Visual Census Socio-eco observations Sentiment maps
  • 9. Occasional Frequent Low Quality Census High Quality Satellite, Street view Phone, Mobility, Social, Financial AV, PlanetLabs etc Sense Process Respond
  • 11. Capture Analyze Act Low Level Street-level Measure •Visual Census •Counting/Density Alert •Traffic Interventions Mid Level Aerial Understand •Geolocalization •Crowds •3D Modeling Assist •Crisis Response •Agriculture •Insurance/Hedge Funds High Level Satellite Predict • Economic activity Change •Street Addr •Government planning
  • 12. Capture  Present  Future Low Level Street-level Street View Crowdsourcing (mapillary) Traffic Cameras Autonomous Vehicles Cloud Dashcams Mid Level Aerial Commercial airplanes Amateur Drones Autonomous Drones High Level Satellite Meter-resolution Limited coverage for high-res Sub-meter, real- time, entire Earth (Planet Lab, OneWeb) Ack: Nikhil Naik
  • 13. Analyze Present  Future Low Level Measure •Visual Census •Counting/Density Streetscore, Streetchange, Visual Census (Fei-Fei Li) Generative modeling of cities Mid Level Understand •Geolocalization •Crowds •3D Modeling DeepRoadMap per (Urtasun et al.) Design Suggestions High Level Aggregate and Predict • Economic activity Jean and Ermon Jayachandran Deforestation Study Real-time data analysis. Actionable information
  • 14. Socio-Economic Inference from Digital Patterns Satellite ImageryStreet-level Imagery Aerial Imagery Phone Mobility Data Social Networks, Photos Device Activity Built Environment Human Activity 84 countries, no data Ack: Nikhil Naik
  • 15. 400K Google StreetViews Socio Economic and Physical environment
  • 16. 4k Images, 200K pairwise comparisons Place Pulse 1.0 Naik, Raskar, Hidalgo 2016
  • 17. 1.7K Ranked 400K Predicted
  • 19. New York Boston Detroit Chicago Validated with Crime Stats streetscore.media.mit.edu
  • 20. 105 Kent Avenue, Brooklyn, NY August 2007 Streetscore = 1.8/10
  • 21. 105 Kent Avenue, Brooklyn, NY September 2014 Streetscore = 7.2/10
  • 22. Urban Growth in New York 2007 - 2014
  • 23. Raskar, Camera Culture, MIT Media Lab http://cameraculture.info MIT Media Lab Ilke Demir, Nikhil Naik
  • 24. Capture Analyze Act Low Level Street-level Measure •Visual Census •Counting/Density Alert •Traffic Interventions Mid Level Aerial Understand •Geolocalization •Crowds •3D Modeling Assist •Crisis Response •Agriculture •Insurance/Hedge Funds High Level Satellite Aggregate and Predict • Economic activity Change •Street Addr •Government planning
  • 25. Example: Congestion Pricing (Nikhil Naik et al.) • Jakarta: Response of vehicles by types • CV + traffic cameras across the city to detect types + number of vehicles • Understand traffic flows before/after congestion pricing is introduced • Adjust rates for different types of vehicles/ in different areas
  • 26. Street Address: Assign | Adopt • 75% of the world population without street addresses • Timely ambulance delivery • Stimulate digital economy via eCommerce • Protect property rights of the marginalized • Crisis response (Haiti, 48 hrs to coord aid)
  • 27. Street Addresses from Satellite Imagery İlke Demir, Forest Hughes, Aman Raj, Kaunil Dhruv, Suryanarayana M. Muddala, Sanyam Garg, Barrett Doo, Ramesh Raskar IJCG 2018
  • 28. Thanks Nikhil Naik Tristan Swedish Praneeth Vepakomma Ilke Demir Forest Hughes Jatin Malhotra SuryaNarayana Murthy Kavnil Dhruv Aman Raj Barrett Doo Praveen Gedam Anna Roy Cesar A. Hidalgo Guan Pang Jing Huang Daniel Aliaga Manohar Paluri Pierre Roux Yael Maguire Leo Tsourides Divyaa Ravichandran Sanyam Garg Sai Sri Sathya Grace Kermani Tobias Tiecke Andreas Gros Santanu Bhattacharya Kabir Rustogi Will Marshall
  • 29. Pilot 1: Will people use street names? In use After 6 monthsCommunity assigned names + Signs
  • 30. Pilot 2 Will SMEs use the labeling scheme? Measured relative efficiency for Pizza delivery
  • 31. Pilot 3 Suitable for E-commerce work-flow?
  • 32. What3words parrot.casino.failed Robocodes 75D. Road NE27. Dhule.MhIn Landmarks Green Park Poor Coverage Lat/lon Goal: Memorable Geocoding Physical 1050 Market St, Fresno, CA, US .
  • 33. What3words: A: parrot.casino.failed B: issuer.lollipop.ripe - Irrelevant words based on lat/lon. Robocodes: 75D.NE27.Dhule.MhIn 76C.NE27.Dhule.MhIn - Hierarchical and linear addresses. Google Maps: Near Green Park Near Green Park - No street names or numbers Point vs Edges for Geometric Queries
  • 34. Addressing Schemes Around the World London postal code system: Radial regions based on orientation and distance South Korea streets: Meter markers Japan block system: Hard to decipher Dubai addressing: Uses districts Berlin numbering: Zigzag house pattern
  • 35. Robocode Scheme • 5 alphanumeric fields • Hierarchical and linear descriptors • To close the gap between physical addresses and automated geocoding Road naming scheme: - distance from the center - orientation in odd parity i.e. WB14 Region naming scheme: - orientation wrt downtown - distance from downtown i.e. WB House numbering scheme: - meter markers on the road - block letters from the road i.e. 38K WB14 “I7 Hacker Way, Menlo Park, CA, US”
  • 36. Design Choices Linear: similar addresses stored in a linear fashion Hierarchical: top-down structure for spatial encapsulation Compressible: 5x4 max (chars x words) Universal: independent of local language Inquirable: useful for geometric, proximity-based, and type-ahead queries Extendible: dynamically modifiable for new places Robust: flexible for overestimation and noise StructuralDesignParameters forefficientcomputerimplementation Linear: closer addresses are given related names Hierarchical: top-down subdivision of the world Memorable: short and alphanumeric, easily convertible Intuitive: with a sense of direction and distance Topological: consistent with road topology Inclusive: with local names (city, state) Physical: consistent with natural boundaries SemanticDesignParameters foruserfriendliness Machine Needs Human Needs
  • 37. Our Pipeline Satellite Images Predictions Road Segments Clustering Segmentation RegionsRoad IDsMarkers + Blocks Labeling
  • 38. Satellite Images ç • Irregular urban structure • Illumination/weather/country • Different road types
  • 39. Segmentation • Binary road masks • 19K*19K, 0.5m/pixel • SegNet
  • 40. Extracting Road Segments • Orientation based median filtering • Road segments by orientation bucketing
  • 41. NF NH NE Region Creation • Road graph: Node=intersection, edge=road, weight=length • Partition for max inter, min intra connectivity, using normalized min-cut. • 𝑛 𝑚𝑎𝑥 = 𝑐𝑒𝑖𝑙 𝑟𝑜𝑎𝑑𝑠 88
  • 42. Region and Road Naming • Cmax 𝑟𝑜𝑎𝑑𝑠 𝐴 → 𝐶𝐴 (downtown) • Orientation bucketing into N, S, W, E • Trace regions based on distance to CA • Orientation bucketing into major axes • Trace roads based on order
  • 43. Offsetting and Meter Marking • 5 meter marker along the road • Odd/even based on RHR • Distance field of roads: block offset
  • 44. Unmapped Developing Country Regions follow natural boundaries
  • 45. Street Address with Robocodes • From Satellite Imagery to Deployed Street Addresses • Generative address : linear, hierarchical, and intuitive • Human friendly rather than machine friendly
  • 46. Occasional Frequent Low Quality Census High Quality Satellite, Street view Phone, Mobility, Social, Financial AV, PlanetLabs etc Sense Process Respond
  • 47. Capture Analyze  Act Low Level Street-level Measure •Visual Census •Counting/Density Alert •Traffic Interventions Mid Level Aerial Understand •Geolocalization •Crowds •3D Modeling Assist •Crisis Response •Agriculture •Insurance/Hedge Funds High Level Satellite Aggregate and Predict • Economic activity Change •Street Addr •Government planning Pervasive Recording, Incentives, Distributed ownership, Privacy, Full proof authenticity, Equality
  • 48.
  • 49. Act Alert Assist Change Low/Mi d/High Level Alert about the state of people/economy/b uilt environment (e.g., predict crop yield from satellite imagery, predict insurance price from street view) Assist in acting on information by providing suggestions based on data (e.g., design optimal congestion pricing based on detected cars, design crisis response in hurricanes) ?
  • 50. Inaccessible Areas • To extend our format to cover areas that are not accessible by streets, we explored different implementations to cover such areas, which are 26*5 m away from any street. • Geocoding as a function (excluding the version field): f (info, lat, lon) = x.y.z.t • For places with roads, info={road network, city, country} f (R, C) = x.y.city.country • Extreme case: only reliable information is latitude/longitude! 52
  • 51. f(C,lat,lon) = hash(round(lat,3)) + dir(lat) . hash(round(lon,3)) +dir(lon) . C L-A-T-dir.L-O-N-dir.name.area Inaccessible Areas: Blackholes! • Linear hashing: • 26 letters + 10 digits • 100m x 100 m granularity • Last letter is the hemisphere • Range: 359.999, longitude: 7PRZ W • Hierarchical hashing: • Enlarge the grid from to 1 km x 1 km • Using two floating points = three letters • Within each cell, re-hash it to a 36 x 36 grid = one letter • New resolution: 30m, represented by five letters 53 f(C,lat,lon) = hash(round(lat,2)) + hash(lat - round(lat,2)) + dir(lat) . hash(round(lon,2)) + hash(lon - round(lon,2)) + dir(lon) . C LlatLlatHlatDlat .LlonLlonHlonDlon . name . Ocean /Continent /etc
  • 52. Thanks! What next? Today Tomorrow Friday A month • Robocode.info • Join our presentation in CVPR WiCV. Friday 10am • Join us with your new ideas at SIGGRAPH 2018 Maps & Urban Data session. Code: https://github.com/facebookresearch/street-addresses Paper: https://research.fb.com/publications/generative-street- addresses-from-satellite-imagery/
  • 53. Bonus: Blackholes! Main aim: f(<place>)=robocode Base case: <place> = <house, street, city, country> “12C.NA14.PALO.CAUS” No street: <place> = <lat, lon, city, country> “F12.HN3.PALO.CAUS” No city/country: <place> = <lat, lon, other info (ocean, dessert, etc.)> “JK3.3DF.PAC.OCEA”
  • 54. Region Experiments • Experimented with (a) normalized min-cut, (b) Newman- Girvan, (c) modularity based partitioning. • Experimented with image based methods (superpixels, region growing) and the dual of the road graph. • Evaluated with urban rules (geography, population, road distribution)
  • 55. Output Maps and Tools • .osm maps with roads (meter marking and offsetting on the fly) • ID-tool of MapBox for on-demand inverse/forward geocoding • rtree extension for efficient spatial querying • Experimental mobile app for self navigation • 21.7% decrease in arrival time using Robocodes
  • 56. Analyze: Three Types of Outputs 1. Semantic 2. Objective Population Density 76000/sq. mile 3. Qualitative Assign a semantic label to each pixel Label road quality as “Bad” or “Good”

Hinweis der Redaktion

  1. Various Street View datasets are now available online. And perhaps the most popular one is Google Street View, which has covered more than a hundred countries to date. And interestingly Street View provides researchers with a new way to observe neighborhood.
  2. Check Steve Seitz and U of Washington Phototourism Page
  3. So why street addresses are important, why we need adequate mapping. Let me ask you, how many of you had a unique address up to your house or flat number, back home? According to geocoding companies, 75% of the world is unmapped and UN says 4 billion people are invisible because of that. This lack of addressing is even worse in disaster zones, for example in Haiti Earthquake, Humanatarian Openstreetmap community started remotely mapping the disaster area in 48 hours, and mapped adequately in 6 months for NGOs and aid agencies to use. But are curves on a plane enough to be a map? How do we refer to places, how do we define locations? Are those maps complete without any labels?
  4. Capture, analyze, act
  5. Capture, analyze, act
  6. Capture, analyze, act
  7. Capture, analyze, act
  8. Socio Economic and Physical environment,
  9. Cities are physical too: Using computer vision to measure the quality and impact of urban appearance N Naik, R Raskar, CA Hidalgo - American Economic Review, 2016 - aeaweb.org Streetscore-predicting the perceived safety of one million streetscapes N Naik, J Philipoom, R Raskar… - Proceedings of the …, 2014 - openaccess.thecvf.com
  10. Capture, analyze, act
  11. So why street addresses are important, why we need adequate mapping. Let me ask you, how many of you had a unique address up to your house or flat number, back home? According to geocoding companies, 75% of the world is unmapped and UN says 4 billion people are invisible because of that. This lack of addressing is even worse in disaster zones, for example in Haiti Earthquake, Humanatarian Openstreetmap community started remotely mapping the disaster area in 48 hours, and mapped adequately in 6 months for NGOs and aid agencies to use. But are curves on a plane enough to be a map? How do we refer to places, how do we define locations? Are those maps complete without any labels?
  12. Welcome to our presentation of Robocodes! Hopefully we’ll jump-start the conference with an impactful presentation to keep you pumped up for the rest of the week. Today I will introduce you our approach for generative street addressing from satellite imagery. This project is developed at Facebook and MIT Media Lab.
  13. Will people use street Signs $40 per sign City counsel used approved Naming streets is politically unsavvy 20% of first time deliveries undelivered $300M/year loss because of inadequate addressing Need an addressing system for emergency, businesses, and residents
  14. On the bright side, we don’t need to re-invent addresses, as there are many addressing schemes already organically being developed and experimented throughout the history. Within these addressing schemes, London postal code system caught our attention with its linear and hierarchical features. Also other addressing schemes guided us for some design choices that I will discuss next.
  15. Our addresses consist of four alphanumeric fields, hierarchically designating larger areas in order. It resembles real world addresses, containing house number, street name, city, state and country. The region naming scheme is inspired from the London scheme in the previous slide. The first letter is dedicated for orientation from the downtown (north, south, west, east) and the second letter cues the distance from the downtown. So the yellow house on the left is in region WB. Then, within a region, the roads are named according to their orientation and order. Finally, the house numbering is based on the distances along and from the road.
  16. We followed some design principles for making our system both efficient, and user friendly. I won’t go through all, but for example, the storage should be supporting forward and inverse geoqueries, should be extendible for address changes, should be compressible, etc. On the other hand, the linear and hierarchical properties of the scheme supports user integration, intuitiveness and memorability.
  17. Back to our pipeline, we start by extracting road masks from satellite images by deep learning, then we find the individual road segments from those predictions. We create regions from the road graph, and label the parcels by distance fields from the roads.
  18. Focusing on developing countries, the urban structure in those satellite images are not easy to detect and organize. Also there are different weather and illumination conditions per country.
  19. Our annotaters created binary road masks from satellite images, on zoom level 19 satellite images. Then we train a SegNet model on those images, and we are able to learn road predictions as shown here, with 72.6% precision and 57.2% recall.
  20. After we have the predictions, we threshold and thin the roads to find the skeleton. Then we use orientation bucketing to find individual road segments, which creates the base for streets.
  21. After we have the road segments we create a road graph where nodes represent intersections and edges represents road weighted by their length. We partition the graph for maximum inter minimum intra connectivity of clusters, using normalized min cut. We also use some urban rules to limit the clusters.
  22. After the regions are created, we mark the densest are as the downtown, and bucket other regions based on their orientation with respect to the downtown, in south, north, east, and west directions. After the regions are named, we find the two major axes of the roads within each region, and name the roads in each region based on their orientation and order.
  23. Finally, we put meter markers on the roads by pixel distance, and we compute distance fields to create the block offsets for the house numbering. The gradient in the left image shows those offsets
  24. As defined in our motivation, we want to locate and connect the invisible 4 billion, right? So we tested our system on unmapped developing countries. The map coverage improved up to 80% in some areas,
  25. In conclusion, we presented a robust addressing scheme, a deep learning and graph partitioning approach for street extraction, and ready to deploy maps and tools for easy geoqueries.
  26. Capture, analyze, act
  27. Capture, analyze, act
  28. Capture, analyze, act
  29. To partition the road graph we experimented with different algorithms as newman girvan and modularity based partitioning. Since there is no clear definition of regions, our domain experts evaluated the success of region creation by using urban rules as geography, population, and road distribution. We also experimented with partitioning the dual of the road graph and traditional image based approaches as mean shift and super pixels
  30. As an output, our system generates .osm files with roads. However since it does not make sense to output addresses for each five by five area, we compute the meter marking and offsetting on the fly. We modified the id-tool of MapBox, which is also the tool that openstreetmap uses, to integrate that last mile computation. We also developed an rtree extension for efficient querying. Lastly, as you can imagine, our addresses needs to be evaluated by real users: so we developed a mobile app to enable self navigation with our robocodes. In our user studies, we observed that using our smart addresses decreased the arrival time of the agents by 21.7 percent.