SlideShare a Scribd company logo
1 of 52
Download to read offline
Everything is not a graph problem.
Lessons from life in the trenches.
Denise Koessler Gosnell, Ph.D.
Senior Graph Consultant, DataStax
@denisekgosnell
Theory vs. Reality?
© DataStax, All Rights Reserved.2
@denisekgosnell
© DataStax, All Rights Reserved.3
Agenda
@denisekgosnell
© DataStax, All Rights Reserved.4
The Graph
@denisekgosnell
© DataStax, All Rights Reserved.5
The Graph
The vision
@denisekgosnell@denisekgosnell
person
service
device
product
device
addressstore
emailcredit
card
© DataStax, All Rights Reserved.6
The Graph
Reality
@denisekgosnell
What does your problem need?
@denisekgosnell@denisekgosnell
• Across all of my data, how
many [___] are there?
• How many of [___] happened
in the past x amount of time?
• What most closely matches
[___]?
© DataStax, All Rights Reserved.8
@denisekgosnell
The starting point
The Graph
© DataStax, All Rights Reserved.9
Goals?
@denisekgosnell
© DataStax, All Rights Reserved.10
@denisekgosnell
Goals?
@denisekgosnell
© DataStax, All Rights Reserved.11
Goals?
@denisekgosnell
Not a graph problem.
@denisekgosnell@denisekgosnell
© DataStax, All Rights Reserved.13
Status
@denisekgosnell
You have graph data.
What do you plan on doing with it?
@denisekgosnell@denisekgosnell
How many sessions are needed?
© DataStax, All Rights Reserved.15
How many sessions are needed?
Brynn Lender is organizing a conference with 4 speakers:
Arya, Bran, Rob, and Sansa.
Logistic Issues:
1. Arya’s and Sansa’s conference talk requires the same
room, as does Bran’s and Rob’s.
2. Mr. Lender does not want to have Bran’s and Sansa’s
talk at the same time.
3. Arya and Bran are helping each other with their
presentations and need to attend each other’s talks.
© DataStax, All Rights Reserved.16
We have graph data?
@denisekgosnell@denisekgosnell
How many sessions are needed?
© DataStax, All Rights Reserved.18
How many sessions are needed?
© DataStax, All Rights Reserved.19
@denisekgosnell
Bran
SansaArya
Rob
How many sessions are needed?
© DataStax, All Rights Reserved.20
@denisekgosnell
Bran
SansaArya
Rob
How many sessions are needed?
© DataStax, All Rights Reserved.21
@denisekgosnell
Bran
SansaArya
Rob
How many sessions are needed?
© DataStax, All Rights Reserved.22
@denisekgosnell
Bran
SansaArya
Rob
How many sessions are needed?
© DataStax, All Rights Reserved.23
@denisekgosnell
Bran
SansaArya
Rob
How many sessions are needed?
© DataStax, All Rights Reserved.24
@denisekgosnell
Bran
SansaArya
Rob
How many sessions are needed?
© DataStax, All Rights Reserved.25
@denisekgosnell
Bran
SansaArya
Rob
How many sessions are needed?
© DataStax, All Rights Reserved.26
Session 1
Session 2
Session 3
@denisekgosnell
Bran
SansaArya
Rob
Graph Analytics Problems.
@denisekgosnell@denisekgosnell
• For all options, what is the best
route from city a to city b?
• What is a way to schedule
[____]?
• How do I place alarms in a
building to uniquely identify a
room according to alerts?
© DataStax, All Rights Reserved.28
Common Qs
@denisekgosnell
• All pairs, shortest path
• Maximum clique
• Chromatic number
• Vertex degree distribution
• Page Rank
• …
© DataStax, All Rights Reserved.29
Common
Algorithms
@denisekgosnell
© DataStax, All Rights Reserved.30
Status
@denisekgosnell
Graph Database Problems.
@denisekgosnell@denisekgosnell
@denisekgosnell@denisekgosnell
First: Graph Based Entity Resolution
Telecom Social Networks
© DataStax, All Rights Reserved.33
@denisekgosnell
Telecom Social Networks
© DataStax, All Rights Reserved.34
@denisekgosnell
Telecom Social Fingerprinting
© DataStax, All Rights Reserved.35
@denisekgosnell
Telecom Social Fingerprinting
© DataStax, All Rights Reserved.36
August 2017
@denisekgosnell
Telecom Social Fingerprinting
© DataStax, All Rights Reserved.37
August 2017 September 2017
@denisekgosnell
Telecom Social Fingerprinting
© DataStax, All Rights Reserved.38
August 2017 September 2017
@denisekgosnell
Telecom Social Fingerprinting
© DataStax, All Rights Reserved.39
August 2017 September 2017
@denisekgosnell
Telecom Social Fingerprinting
© DataStax, All Rights Reserved.40
August 2017 September 2017
@denisekgosnell
Telecom Social Fingerprinting
© DataStax, All Rights Reserved.41
August 2017 September 2017
@denisekgosnell
Telecom Social Fingerprinting
© DataStax, All Rights Reserved.42
@denisekgosnell
@denisekgosnell@denisekgosnell
Second: Graph DB as a Master Identity Store
© DataStax, All Rights Reserved.44
The Graph
@denisekgosnell
We’re back.
@denisekgosnell
person
service
device
product
device
addressstore
emailcredit
card
© DataStax, All Rights Reserved.45
Good?
@denisekgosnell
person person
person
@denisekgosnell@denisekgosnell
What are you trying to read from your database?
© DataStax, All Rights Reserved.47
Better.
@denisekgosnell@denisekgosnell
person
master_uuid
person
master_uuid
address
device payment
device
flight
© DataStax, All Rights Reserved.48
Best?
@denisekgosnell
provenance provenance provenance provenance
person
master_uuid
person
master_uuid
address
device payment
device
flight
© DataStax, All Rights Reserved.49
Status:
@denisekgosnell
© DataStax, All Rights Reserved.50
Status:
@denisekgosnell
Everything is not a graph problem.
(but there are plenty)
Denise Koessler Gosnell, Ph.D.
Senior Graph Consultant, DataStax
@denisekgosnell
• GitHub:
• github.com/datastax/graph-examples
• @DeniseKGosnell
• Twitter: @DeniseKGosnell
• Email: Denise.Gosnell@datastax.com
© DataStax, All Rights Reserved.52
Contact

More Related Content

Similar to Everything is not a graph problem. But, there are plenty.

ML Workshop 2: Machine Learning Model Comparison & Evaluation
ML Workshop 2: Machine Learning Model Comparison & EvaluationML Workshop 2: Machine Learning Model Comparison & Evaluation
ML Workshop 2: Machine Learning Model Comparison & EvaluationMapR Technologies
 
Fighting financial fraud at Danske Bank with artificial intelligence
Fighting financial fraud at Danske Bank with artificial intelligenceFighting financial fraud at Danske Bank with artificial intelligence
Fighting financial fraud at Danske Bank with artificial intelligenceRon Bodkin
 
Streaming Architecture including Rendezvous for Machine Learning
Streaming Architecture including Rendezvous for Machine LearningStreaming Architecture including Rendezvous for Machine Learning
Streaming Architecture including Rendezvous for Machine LearningTed Dunning
 
STING: Spatio-Temporal Interaction Networks and Graphs for Intel Platforms
STING: Spatio-Temporal Interaction Networks and Graphs for Intel PlatformsSTING: Spatio-Temporal Interaction Networks and Graphs for Intel Platforms
STING: Spatio-Temporal Interaction Networks and Graphs for Intel PlatformsJason Riedy
 
Application Development Trends Webinar
Application Development Trends WebinarApplication Development Trends Webinar
Application Development Trends WebinarSandeep Adwankar
 
Are You Underestimating the Value Within Your Data? A conversation about grap...
Are You Underestimating the Value Within Your Data? A conversation about grap...Are You Underestimating the Value Within Your Data? A conversation about grap...
Are You Underestimating the Value Within Your Data? A conversation about grap...Neo4j
 
Scalable and Efficient Algorithms for Analysis of Massive, Streaming Graphs
Scalable and Efficient Algorithms for Analysis of Massive, Streaming GraphsScalable and Efficient Algorithms for Analysis of Massive, Streaming Graphs
Scalable and Efficient Algorithms for Analysis of Massive, Streaming GraphsJason Riedy
 
Webinar - Fighting Bank Fraud with Real-time Graph Database
Webinar - Fighting Bank Fraud with Real-time Graph Database Webinar - Fighting Bank Fraud with Real-time Graph Database
Webinar - Fighting Bank Fraud with Real-time Graph Database DataStax
 
Webinar: Fighting Fraud with Graph Databases
Webinar: Fighting Fraud with Graph DatabasesWebinar: Fighting Fraud with Graph Databases
Webinar: Fighting Fraud with Graph DatabasesDataStax
 
Map r chicago_advanalytics_oct_meetup
Map r chicago_advanalytics_oct_meetupMap r chicago_advanalytics_oct_meetup
Map r chicago_advanalytics_oct_meetupAlan Iovine
 
DataStax | Network Analysis Adventure with DSE Graph, DataStax Studio, and Ti...
DataStax | Network Analysis Adventure with DSE Graph, DataStax Studio, and Ti...DataStax | Network Analysis Adventure with DSE Graph, DataStax Studio, and Ti...
DataStax | Network Analysis Adventure with DSE Graph, DataStax Studio, and Ti...DataStax
 
Open Source North - MongoDB Advanced Schema Design Patterns
Open Source North - MongoDB Advanced Schema Design PatternsOpen Source North - MongoDB Advanced Schema Design Patterns
Open Source North - MongoDB Advanced Schema Design PatternsMatthew Kalan
 
Solving Cybersecurity at Scale
Solving Cybersecurity at ScaleSolving Cybersecurity at Scale
Solving Cybersecurity at ScaleDataWorks Summit
 
The Hive Think Tank: Rendezvous Architecture Makes Machine Learning Logistics...
The Hive Think Tank: Rendezvous Architecture Makes Machine Learning Logistics...The Hive Think Tank: Rendezvous Architecture Makes Machine Learning Logistics...
The Hive Think Tank: Rendezvous Architecture Makes Machine Learning Logistics...The Hive
 
Machine Learning logistics
Machine Learning logisticsMachine Learning logistics
Machine Learning logisticsTed Dunning
 
Finding Changes in Real Data
Finding Changes in Real DataFinding Changes in Real Data
Finding Changes in Real DataTed Dunning
 
Sharing a Startup’s Big Data Lessons
Sharing a Startup’s Big Data LessonsSharing a Startup’s Big Data Lessons
Sharing a Startup’s Big Data LessonsGeorge Stathis
 
The Amino Analytical Framework - Leveraging Accumulo to the Fullest
The Amino Analytical Framework - Leveraging Accumulo to the Fullest The Amino Analytical Framework - Leveraging Accumulo to the Fullest
The Amino Analytical Framework - Leveraging Accumulo to the Fullest Donald Miner
 
Using Machine Learning to Understand and Predict Marketing ROI
Using Machine Learning to Understand and Predict Marketing ROIUsing Machine Learning to Understand and Predict Marketing ROI
Using Machine Learning to Understand and Predict Marketing ROIDATAVERSITY
 
Webinar - The Agility Challenge - Powering Cloud Apps with Multi-Model & Mixe...
Webinar - The Agility Challenge - Powering Cloud Apps with Multi-Model & Mixe...Webinar - The Agility Challenge - Powering Cloud Apps with Multi-Model & Mixe...
Webinar - The Agility Challenge - Powering Cloud Apps with Multi-Model & Mixe...DataStax
 

Similar to Everything is not a graph problem. But, there are plenty. (20)

ML Workshop 2: Machine Learning Model Comparison & Evaluation
ML Workshop 2: Machine Learning Model Comparison & EvaluationML Workshop 2: Machine Learning Model Comparison & Evaluation
ML Workshop 2: Machine Learning Model Comparison & Evaluation
 
Fighting financial fraud at Danske Bank with artificial intelligence
Fighting financial fraud at Danske Bank with artificial intelligenceFighting financial fraud at Danske Bank with artificial intelligence
Fighting financial fraud at Danske Bank with artificial intelligence
 
Streaming Architecture including Rendezvous for Machine Learning
Streaming Architecture including Rendezvous for Machine LearningStreaming Architecture including Rendezvous for Machine Learning
Streaming Architecture including Rendezvous for Machine Learning
 
STING: Spatio-Temporal Interaction Networks and Graphs for Intel Platforms
STING: Spatio-Temporal Interaction Networks and Graphs for Intel PlatformsSTING: Spatio-Temporal Interaction Networks and Graphs for Intel Platforms
STING: Spatio-Temporal Interaction Networks and Graphs for Intel Platforms
 
Application Development Trends Webinar
Application Development Trends WebinarApplication Development Trends Webinar
Application Development Trends Webinar
 
Are You Underestimating the Value Within Your Data? A conversation about grap...
Are You Underestimating the Value Within Your Data? A conversation about grap...Are You Underestimating the Value Within Your Data? A conversation about grap...
Are You Underestimating the Value Within Your Data? A conversation about grap...
 
Scalable and Efficient Algorithms for Analysis of Massive, Streaming Graphs
Scalable and Efficient Algorithms for Analysis of Massive, Streaming GraphsScalable and Efficient Algorithms for Analysis of Massive, Streaming Graphs
Scalable and Efficient Algorithms for Analysis of Massive, Streaming Graphs
 
Webinar - Fighting Bank Fraud with Real-time Graph Database
Webinar - Fighting Bank Fraud with Real-time Graph Database Webinar - Fighting Bank Fraud with Real-time Graph Database
Webinar - Fighting Bank Fraud with Real-time Graph Database
 
Webinar: Fighting Fraud with Graph Databases
Webinar: Fighting Fraud with Graph DatabasesWebinar: Fighting Fraud with Graph Databases
Webinar: Fighting Fraud with Graph Databases
 
Map r chicago_advanalytics_oct_meetup
Map r chicago_advanalytics_oct_meetupMap r chicago_advanalytics_oct_meetup
Map r chicago_advanalytics_oct_meetup
 
DataStax | Network Analysis Adventure with DSE Graph, DataStax Studio, and Ti...
DataStax | Network Analysis Adventure with DSE Graph, DataStax Studio, and Ti...DataStax | Network Analysis Adventure with DSE Graph, DataStax Studio, and Ti...
DataStax | Network Analysis Adventure with DSE Graph, DataStax Studio, and Ti...
 
Open Source North - MongoDB Advanced Schema Design Patterns
Open Source North - MongoDB Advanced Schema Design PatternsOpen Source North - MongoDB Advanced Schema Design Patterns
Open Source North - MongoDB Advanced Schema Design Patterns
 
Solving Cybersecurity at Scale
Solving Cybersecurity at ScaleSolving Cybersecurity at Scale
Solving Cybersecurity at Scale
 
The Hive Think Tank: Rendezvous Architecture Makes Machine Learning Logistics...
The Hive Think Tank: Rendezvous Architecture Makes Machine Learning Logistics...The Hive Think Tank: Rendezvous Architecture Makes Machine Learning Logistics...
The Hive Think Tank: Rendezvous Architecture Makes Machine Learning Logistics...
 
Machine Learning logistics
Machine Learning logisticsMachine Learning logistics
Machine Learning logistics
 
Finding Changes in Real Data
Finding Changes in Real DataFinding Changes in Real Data
Finding Changes in Real Data
 
Sharing a Startup’s Big Data Lessons
Sharing a Startup’s Big Data LessonsSharing a Startup’s Big Data Lessons
Sharing a Startup’s Big Data Lessons
 
The Amino Analytical Framework - Leveraging Accumulo to the Fullest
The Amino Analytical Framework - Leveraging Accumulo to the Fullest The Amino Analytical Framework - Leveraging Accumulo to the Fullest
The Amino Analytical Framework - Leveraging Accumulo to the Fullest
 
Using Machine Learning to Understand and Predict Marketing ROI
Using Machine Learning to Understand and Predict Marketing ROIUsing Machine Learning to Understand and Predict Marketing ROI
Using Machine Learning to Understand and Predict Marketing ROI
 
Webinar - The Agility Challenge - Powering Cloud Apps with Multi-Model & Mixe...
Webinar - The Agility Challenge - Powering Cloud Apps with Multi-Model & Mixe...Webinar - The Agility Challenge - Powering Cloud Apps with Multi-Model & Mixe...
Webinar - The Agility Challenge - Powering Cloud Apps with Multi-Model & Mixe...
 

Recently uploaded

AntColonyOptimizationManetNetworkAODV.pptx
AntColonyOptimizationManetNetworkAODV.pptxAntColonyOptimizationManetNetworkAODV.pptx
AntColonyOptimizationManetNetworkAODV.pptxLina Kadam
 
Uk-NO1 Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Exp...
Uk-NO1 Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Exp...Uk-NO1 Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Exp...
Uk-NO1 Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Exp...Amil baba
 
Module-1-Building Acoustics(Introduction)(Unit-1).pdf
Module-1-Building Acoustics(Introduction)(Unit-1).pdfModule-1-Building Acoustics(Introduction)(Unit-1).pdf
Module-1-Building Acoustics(Introduction)(Unit-1).pdfManish Kumar
 
Triangulation survey (Basic Mine Surveying)_MI10412MI.pptx
Triangulation survey (Basic Mine Surveying)_MI10412MI.pptxTriangulation survey (Basic Mine Surveying)_MI10412MI.pptx
Triangulation survey (Basic Mine Surveying)_MI10412MI.pptxRomil Mishra
 
Javier_Fernandez_CARS_workshop_presentation.pptx
Javier_Fernandez_CARS_workshop_presentation.pptxJavier_Fernandez_CARS_workshop_presentation.pptx
Javier_Fernandez_CARS_workshop_presentation.pptxJavier Fernández Muñoz
 
Module-1-(Building Acoustics) Noise Control (Unit-3). pdf
Module-1-(Building Acoustics) Noise Control (Unit-3). pdfModule-1-(Building Acoustics) Noise Control (Unit-3). pdf
Module-1-(Building Acoustics) Noise Control (Unit-3). pdfManish Kumar
 
STATE TRANSITION DIAGRAM in psoc subject
STATE TRANSITION DIAGRAM in psoc subjectSTATE TRANSITION DIAGRAM in psoc subject
STATE TRANSITION DIAGRAM in psoc subjectGayathriM270621
 
22CYT12 & Chemistry for Computer Systems_Unit-II-Corrosion & its Control Meth...
22CYT12 & Chemistry for Computer Systems_Unit-II-Corrosion & its Control Meth...22CYT12 & Chemistry for Computer Systems_Unit-II-Corrosion & its Control Meth...
22CYT12 & Chemistry for Computer Systems_Unit-II-Corrosion & its Control Meth...KrishnaveniKrishnara1
 
Indian Tradition, Culture & Societies.pdf
Indian Tradition, Culture & Societies.pdfIndian Tradition, Culture & Societies.pdf
Indian Tradition, Culture & Societies.pdfalokitpathak01
 
Immutable Image-Based Operating Systems - EW2024.pdf
Immutable Image-Based Operating Systems - EW2024.pdfImmutable Image-Based Operating Systems - EW2024.pdf
Immutable Image-Based Operating Systems - EW2024.pdfDrew Moseley
 
Katarzyna Lipka-Sidor - BIM School Course
Katarzyna Lipka-Sidor - BIM School CourseKatarzyna Lipka-Sidor - BIM School Course
Katarzyna Lipka-Sidor - BIM School Coursebim.edu.pl
 
Curve setting (Basic Mine Surveying)_MI10412MI.pptx
Curve setting (Basic Mine Surveying)_MI10412MI.pptxCurve setting (Basic Mine Surveying)_MI10412MI.pptx
Curve setting (Basic Mine Surveying)_MI10412MI.pptxRomil Mishra
 
Theory of Machine Notes / Lecture Material .pdf
Theory of Machine Notes / Lecture Material .pdfTheory of Machine Notes / Lecture Material .pdf
Theory of Machine Notes / Lecture Material .pdfShreyas Pandit
 
Gravity concentration_MI20612MI_________
Gravity concentration_MI20612MI_________Gravity concentration_MI20612MI_________
Gravity concentration_MI20612MI_________Romil Mishra
 
priority interrupt computer organization
priority interrupt computer organizationpriority interrupt computer organization
priority interrupt computer organizationchnrketan
 
Guardians of E-Commerce: Harnessing NLP and Machine Learning Approaches for A...
Guardians of E-Commerce: Harnessing NLP and Machine Learning Approaches for A...Guardians of E-Commerce: Harnessing NLP and Machine Learning Approaches for A...
Guardians of E-Commerce: Harnessing NLP and Machine Learning Approaches for A...IJAEMSJORNAL
 
Artificial Intelligence in Power System overview
Artificial Intelligence in Power System overviewArtificial Intelligence in Power System overview
Artificial Intelligence in Power System overviewsandhya757531
 
Robotics-Asimov's Laws, Mechanical Subsystems, Robot Kinematics, Robot Dynami...
Robotics-Asimov's Laws, Mechanical Subsystems, Robot Kinematics, Robot Dynami...Robotics-Asimov's Laws, Mechanical Subsystems, Robot Kinematics, Robot Dynami...
Robotics-Asimov's Laws, Mechanical Subsystems, Robot Kinematics, Robot Dynami...Sumanth A
 
Turn leadership mistakes into a better future.pptx
Turn leadership mistakes into a better future.pptxTurn leadership mistakes into a better future.pptx
Turn leadership mistakes into a better future.pptxStephen Sitton
 

Recently uploaded (20)

AntColonyOptimizationManetNetworkAODV.pptx
AntColonyOptimizationManetNetworkAODV.pptxAntColonyOptimizationManetNetworkAODV.pptx
AntColonyOptimizationManetNetworkAODV.pptx
 
Versatile Engineering Construction Firms
Versatile Engineering Construction FirmsVersatile Engineering Construction Firms
Versatile Engineering Construction Firms
 
Uk-NO1 Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Exp...
Uk-NO1 Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Exp...Uk-NO1 Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Exp...
Uk-NO1 Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Exp...
 
Module-1-Building Acoustics(Introduction)(Unit-1).pdf
Module-1-Building Acoustics(Introduction)(Unit-1).pdfModule-1-Building Acoustics(Introduction)(Unit-1).pdf
Module-1-Building Acoustics(Introduction)(Unit-1).pdf
 
Triangulation survey (Basic Mine Surveying)_MI10412MI.pptx
Triangulation survey (Basic Mine Surveying)_MI10412MI.pptxTriangulation survey (Basic Mine Surveying)_MI10412MI.pptx
Triangulation survey (Basic Mine Surveying)_MI10412MI.pptx
 
Javier_Fernandez_CARS_workshop_presentation.pptx
Javier_Fernandez_CARS_workshop_presentation.pptxJavier_Fernandez_CARS_workshop_presentation.pptx
Javier_Fernandez_CARS_workshop_presentation.pptx
 
Module-1-(Building Acoustics) Noise Control (Unit-3). pdf
Module-1-(Building Acoustics) Noise Control (Unit-3). pdfModule-1-(Building Acoustics) Noise Control (Unit-3). pdf
Module-1-(Building Acoustics) Noise Control (Unit-3). pdf
 
STATE TRANSITION DIAGRAM in psoc subject
STATE TRANSITION DIAGRAM in psoc subjectSTATE TRANSITION DIAGRAM in psoc subject
STATE TRANSITION DIAGRAM in psoc subject
 
22CYT12 & Chemistry for Computer Systems_Unit-II-Corrosion & its Control Meth...
22CYT12 & Chemistry for Computer Systems_Unit-II-Corrosion & its Control Meth...22CYT12 & Chemistry for Computer Systems_Unit-II-Corrosion & its Control Meth...
22CYT12 & Chemistry for Computer Systems_Unit-II-Corrosion & its Control Meth...
 
Indian Tradition, Culture & Societies.pdf
Indian Tradition, Culture & Societies.pdfIndian Tradition, Culture & Societies.pdf
Indian Tradition, Culture & Societies.pdf
 
Immutable Image-Based Operating Systems - EW2024.pdf
Immutable Image-Based Operating Systems - EW2024.pdfImmutable Image-Based Operating Systems - EW2024.pdf
Immutable Image-Based Operating Systems - EW2024.pdf
 
Katarzyna Lipka-Sidor - BIM School Course
Katarzyna Lipka-Sidor - BIM School CourseKatarzyna Lipka-Sidor - BIM School Course
Katarzyna Lipka-Sidor - BIM School Course
 
Curve setting (Basic Mine Surveying)_MI10412MI.pptx
Curve setting (Basic Mine Surveying)_MI10412MI.pptxCurve setting (Basic Mine Surveying)_MI10412MI.pptx
Curve setting (Basic Mine Surveying)_MI10412MI.pptx
 
Theory of Machine Notes / Lecture Material .pdf
Theory of Machine Notes / Lecture Material .pdfTheory of Machine Notes / Lecture Material .pdf
Theory of Machine Notes / Lecture Material .pdf
 
Gravity concentration_MI20612MI_________
Gravity concentration_MI20612MI_________Gravity concentration_MI20612MI_________
Gravity concentration_MI20612MI_________
 
priority interrupt computer organization
priority interrupt computer organizationpriority interrupt computer organization
priority interrupt computer organization
 
Guardians of E-Commerce: Harnessing NLP and Machine Learning Approaches for A...
Guardians of E-Commerce: Harnessing NLP and Machine Learning Approaches for A...Guardians of E-Commerce: Harnessing NLP and Machine Learning Approaches for A...
Guardians of E-Commerce: Harnessing NLP and Machine Learning Approaches for A...
 
Artificial Intelligence in Power System overview
Artificial Intelligence in Power System overviewArtificial Intelligence in Power System overview
Artificial Intelligence in Power System overview
 
Robotics-Asimov's Laws, Mechanical Subsystems, Robot Kinematics, Robot Dynami...
Robotics-Asimov's Laws, Mechanical Subsystems, Robot Kinematics, Robot Dynami...Robotics-Asimov's Laws, Mechanical Subsystems, Robot Kinematics, Robot Dynami...
Robotics-Asimov's Laws, Mechanical Subsystems, Robot Kinematics, Robot Dynami...
 
Turn leadership mistakes into a better future.pptx
Turn leadership mistakes into a better future.pptxTurn leadership mistakes into a better future.pptx
Turn leadership mistakes into a better future.pptx
 

Everything is not a graph problem. But, there are plenty.

Editor's Notes

  1. What I hope you get out of this talk: Life in the trenches Target audience: architects, PMs, CTOs. 2017 was the year of the graph, what have we learned?
  2. This talk: Life in the trenches What problems have gone wrong How I classify ”graph problems” About me…
  3. Describe: non-graph problems, graph analytics problems, graph database problems Include: -- overlooked architectural issue -- recommendation
  4. Has anyone in here been tasked with building this? The new singular system that will have it all? Story #1: This is my story of failure: tasked to build the one graph that ruled them all.
  5. The world makes so much sense like this. Obviously a graph. Grand vision: “the one graph that rules it all”
  6. -- overlooked architectural issue – these are the types of questions I was being asked. -- recommendation Entity resolution: ---> workload doesn’t match the data model -- huge swiss army knife and only opening cans of beans. (graph for bar charts) got a graph, only need bar charts - needed the simple and we had the powerful ---> I lost site of what my manager really needed
  7. Is this really what you problem needs? Remember the one graph that I had to build? At the end of the day, my c-suite wanted this.
  8. Is this really what you problem needs?
  9. Is this really what you problem needs?
  10. Essentially – We were letting the process of “how we think about the data” Drive the Actual implementation details.
  11. my ask: use the right tool for the right problem
  12. Ok. You have graph data. But do you need to analyze it or query it?
  13. Let’s do a ”for – instance”
  14. Essentially --
  15. Query the audience: 4 sessions? 3? 2? Graph problem?
  16. Build a conflict graph
  17. Arya’s and Sansa’s conference talk requires the same room, as does Bran’s and Rob’s.
  18. Mr. Lender does not want to have Bran’s and Sansa’s talk at the same time.
  19. Arya and Bran are helping each other with their presentations and need to attend each other’s talks. … WTF where are you going with this Denise??
  20. Arya and Bran are helping each other with their presentations and need to attend each other’s talks. … WTF where are you going with this Denise??
  21. Describe: non-graph problems, graph analytics problems, graph database problems Include: -- overlooked architectural issue -- recommendation
  22. Describe: non-graph problems, graph analytics problems, graph database problems Include: -- overlooked architectural issue -- recommendation
  23. This duality exists with any problem – not just graph problems: algorithmy vs storage/retrivaly Describe: non-graph problems, graph analytics problems, graph database problems Include: -- overlooked architectural issue -- recommendation What are the warning signs that showed us that you were going down the wrong path? --- aka --- what to look for to know that you are about to get fired before you get fired trade offs: --- all in 1 system and have bad SLAs Or -– data copied in multiple places in the system and the complexity of (1) maintaining data duplication and (2) properly routing queries to their clever data model for performant answers Life in the trenches Entity resolution: ---> workload doesn’t match the data model -- huge swiss army knife and only opening cans of beans. (graph for bar charts) got a graph, only need bar charts - needed the simple and we had the powerful ---> I lost site of what my manager really needed Entity resolution with a mysql database: - I answered today’s question, but didn’t look far enough down the road to answer the next question I’ve got a hammer, and I am hammering in nails – ok, now remove the nails Now open the can of beans with the hammer
  24. Is there a situation in which graph databases have been useful for resolving entities? 2 examples
  25. Is there a situation in which graph databases have been useful for resolving entities? 2 examples
  26. First story: 2012
  27. The churn problem: a special version of ER problem. After trying SVMs, ANNs, linear models.. You name it – this is what we arrived at
  28. Sub graph from time t – 1. (aka last month_)
  29. Induced subgraph for time t
  30. Induced subgraph for time t
  31. Induced subgraph for time t
  32. Induced subgraph for time t
  33. This is the identity which in production we would label with confidence as being associated to the identity from August
  34. How good was this? See red, you are dead.
  35. Is there a situation in which graph databases have been useful for resolving entities? 2 examples
  36. How do we use a graph database to make something like this happen?
  37. We don’t get to skip that step just because we are in a graph db anymore
  38. WHY the blue vertices? WHAT becomes a property? ---> WHAT ARE YOU NEEDING TO READ FROM. YOUR DB? We don’t get to skip that step anymore. 1 QUESTION: WHAT DO YOU WANT OUT
  39. What is best? It all depends. Vertices typically become things that require edges between master identities Before you hammer away at the key board, do some planning. What do you need back out? That is what will define a solid graph architecture for your application.
  40. This duality exists with any problem – not just graph problems: algorithmy vs storage/retrivaly trade offs: --- all in 1 system and have bad SLAs Or -– data copied in multiple places in the system and the complexity of (1) maintaining data duplication and (2) properly routing queries to their clever data model for performant answers
  41. This duality exists with any problem – not just graph problems: algorithmy vs storage/retrivaly trade offs: --- all in 1 system and have bad SLAs Or -– data copied in multiple places in the system and the complexity of (1) maintaining data duplication and (2) properly routing queries to their clever data model for performant answers
  42. if you take anything away, it is that the graph community as a whole is at a critical point. if we continue to approach graphs a "CAN SOLVE EVERYTHING", we need to reconsider. This could have a wave of ramifications for the whole community Use the right tool for the right problem.