From Monitoring to
Domain-Oriented
Observability
What’s the difference between monitoring and
observability and why does it matter?
2. About me
● Took part in developing a microservice architecture based on
Event Sourcing and CQRS
● Became a Tech Lead. Implemented canary releases and Feature
Toggles (aka Feature Flags), and migrated a microservice from
REST to an event-driven design
● Gave a webinar on testing microservices.
● I am trying to apply best engineering practices to the Safeguard Cyber project
● I want to build a process in a company (team) that makes it pleasant to work and grow
professionally, where ideas are heard and where nobody has to sacrifice family or
health for professional growth.
3. Parts of the presentation
● Why you should start thinking about monitoring and our case
● Theoretical minimum about monitoring
● How to switch from monitoring to Domain-oriented Observability
4. Part 1
● Why you should start thinking about monitoring and our case
5. About our product
6. Threat Detection
7. Cyber Defense
8. Machine Learning and AI
9. Fire on Production
10. Fire on production
11. Fire on production
12. Fire on production
13. Fire on production
14. Problems we faced
● What is the cause of a performance drop?
● What can lead to poor system performance?
● How can certain changes influence the system?
● It is difficult to keep our product within its SLA
15. Part 2
● Theoretical minimum about monitoring
16. Monitoring
1) Logging
2) Tracing
3) Metrics
4) Alerts
“A log is an immutable,
timestamped record of events
describing what happened over
time”
17. Logging
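As a minimal sketch of the definition above, one log record can be written as a single timestamped JSON line that is never modified after it is emitted. The function name make_log_record and the fields service and duration_ms are assumptions made for illustration; they do not come from the slides.

```python
import json
from datetime import datetime, timezone


def make_log_record(level: str, message: str, **fields) -> str:
    """Serialize one event as a single JSON line with a UTC timestamp."""
    record = {
        "timestamp": datetime.now(timezone.utc).isoformat(),
        "level": level,
        "message": message,
        **fields,  # extra structured context, e.g. service name or duration
    }
    return json.dumps(record, sort_keys=True)


# Hypothetical event; the field names are illustrative only.
print(make_log_record("ERROR", "request failed", service="api", duration_ms=5120))
```

Structured fields like these are what make filtering in a tool such as Kibana practical, because each field can be queried on its own instead of being parsed out of free text.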
18. Kibana filters
19. Kibana filters result
20. Errors frequency analysis
“A trace is a representation of a series
of causally related distributed events
that encode the end-to-end request
flow through a distributed system”
21. Tracing
22. Tracing
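The idea of causally related events can be sketched with spans that share one trace ID and point to their parent span. This is an in-memory illustration only; the Span and Tracer classes and the operation names are hypothetical, and a real setup would use a tracing library and export spans to a backend.

```python
import time
import uuid
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Span:
    trace_id: str             # shared by every span of one request
    span_id: str              # unique per operation
    parent_id: Optional[str]  # causal link to the calling operation
    name: str
    start: float
    end: Optional[float] = None


class Tracer:
    """Collects spans in memory; a real tracer would export them."""

    def __init__(self) -> None:
        self.spans: List[Span] = []

    def start_span(self, name: str, parent: Optional[Span] = None) -> Span:
        trace_id = parent.trace_id if parent else uuid.uuid4().hex
        parent_id = parent.span_id if parent else None
        span = Span(trace_id, uuid.uuid4().hex, parent_id, name, time.monotonic())
        self.spans.append(span)
        return span

    def finish(self, span: Span) -> None:
        span.end = time.monotonic()


# Hypothetical request flow: one incoming request that calls a database.
tracer = Tracer()
root = tracer.start_span("handle_request")
child = tracer.start_span("query_database", parent=root)
tracer.finish(child)
tracer.finish(root)
```

Grouping the collected spans by trace_id and following parent_id reconstructs the end-to-end flow of a single request.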
“Metrics are a numeric
representation of data
measured over intervals of
time”
23. Metrics
24. Metrics
25. Metrics
26. Metrics
● throughput
● success
● error
● performance
27. Metrics subtypes
● Rate - the number of requests per second your services are serving.
● Errors - the number of failed requests per second.
● Duration - distributions of the amount of time each request takes.
28. Three key metrics by RED methodology
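The three RED metrics can be computed from raw request records, as in the sketch below. The Request type, the helper names, and the sample numbers are assumptions for illustration; in practice a metrics system would usually aggregate these values for you.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Request:
    duration_ms: float
    failed: bool


def percentile(sorted_values: List[float], p: float) -> float:
    """Nearest-rank percentile of an already sorted, non-empty list."""
    rank = max(1, -(-int(p * len(sorted_values)) // 100))  # ceiling division
    return sorted_values[rank - 1]


def red_summary(requests: List[Request], window_seconds: float) -> dict:
    """Rate, Errors and Duration for one observation window."""
    durations = sorted(r.duration_ms for r in requests)
    return {
        "rate_per_s": len(requests) / window_seconds,
        "errors_per_s": sum(r.failed for r in requests) / window_seconds,
        "p50_ms": percentile(durations, 50),
        "p95_ms": percentile(durations, 95),
    }


# Hypothetical sample: 10 requests observed in a 5-second window, 2 of them failed.
sample = [Request(duration_ms=10.0 * i, failed=(i > 8)) for i in range(1, 11)]
summary = red_summary(sample, window_seconds=5.0)
print(summary)
```

Duration is reported as percentiles rather than an average because the slide describes it as a distribution, and a single slow request can hide behind a healthy-looking mean.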
29. Trends are very important
30. Trends are very important
Automated alerts are essential to
monitoring. They allow you to spot
problems anywhere in your
infrastructure, so that you can
rapidly identify their causes and
minimize service degradation and
disruption. Alerts draw human
attention to the particular systems
that require observation,
inspection, and intervention.
31. Alerts
● Every alert should require a human reaction
● Every alert should have a priority
● It should be possible to disable notifications
● Every alert should provide further instructions
32. Alert rules
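The alert rules above can be reflected directly in how an alert is modelled. The following is a sketch under assumptions: the AlertRule and Priority names, the threshold semantics, and the runbook text are all hypothetical and are not tied to Kibana or any other tool.

```python
from dataclasses import dataclass
from enum import IntEnum
from typing import Optional


class Priority(IntEnum):
    LOW = 1
    HIGH = 2
    CRITICAL = 3


@dataclass
class AlertRule:
    name: str
    threshold: float
    priority: Priority   # every alert has a priority
    runbook: str         # further instructions for whoever reacts
    muted: bool = False  # notifications can be disabled

    def evaluate(self, value: float) -> Optional[str]:
        """Return a notification text, or None when nothing should be sent."""
        if self.muted or value <= self.threshold:
            return None
        return (
            f"[{self.priority.name}] {self.name}: {value} > {self.threshold}. "
            f"Next steps: {self.runbook}"
        )


# Hypothetical rule; the threshold and runbook text are illustrative only.
rule = AlertRule(
    name="errors per second",
    threshold=5.0,
    priority=Priority.CRITICAL,
    runbook="check the latest deploy, then roll back if needed",
)
print(rule.evaluate(7.5))
```

An alert that nobody is expected to act on is better expressed as a dashboard metric than as a rule of this kind.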
33. Alerts in Kibana
34. Part 3
● How to switch from monitoring to Domain-oriented Observability
Definition:
“In control theory, observability is a measure of how well internal states of a
system can be inferred from knowledge of its external outputs. The observability
and controllability of a system are mathematical duals.”
- Wikipedia
In English:
Can you understand what’s happening inside your code and system, simply by
asking questions using your tools? Can you answer any new question you think
of, or only the ones you prepared for?
35. Observability
36. Black and White box Component View
37. Observability code example
38. Observability code example
39. Clean up the mess
40. Clean up the mess
41. Moving code to class
42. Domain Probe: DiscountInstrumentation
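The code from these slides is not preserved in the transcript, so the following is only a sketch of what a Domain Probe might look like. The class name DiscountInstrumentation comes from the slide title; ShoppingCart, the method names, the in-memory counters, and the discount table are assumptions for illustration.

```python
import logging
from typing import Dict

logger = logging.getLogger("discounts")


class DiscountInstrumentation:
    """Domain Probe: the only place that knows about logging and metrics."""

    def __init__(self) -> None:
        self.counters: Dict[str, int] = {}  # stand-in for a real metrics client

    def _count(self, name: str) -> None:
        self.counters[name] = self.counters.get(name, 0) + 1

    def applying_discount_code(self, code: str) -> None:
        logger.info("attempting to apply discount code %s", code)

    def discount_code_lookup_failed(self, code: str) -> None:
        logger.warning("discount code lookup failed: %s", code)
        self._count("discount_lookup_failure")

    def discount_applied(self, code: str, amount: float) -> None:
        logger.info("discount %s applied, amount %.2f", code, amount)
        self._count("discount_applied")


class ShoppingCart:
    """Domain code talks to the probe in domain terms only."""

    def __init__(self, total: float, discounts: Dict[str, float],
                 instrumentation: DiscountInstrumentation) -> None:
        self.total = total
        self._discounts = discounts  # hypothetical table: code -> fraction off
        self._instrumentation = instrumentation

    def apply_discount_code(self, code: str) -> float:
        self._instrumentation.applying_discount_code(code)
        fraction = self._discounts.get(code)
        if fraction is None:
            self._instrumentation.discount_code_lookup_failed(code)
            return 0.0
        amount = self.total * fraction
        self.total -= amount
        self._instrumentation.discount_applied(code, amount)
        return amount
```

Because the probe is passed in through the constructor, a test can hand the cart a probe instance and assert on its counters, and the business logic stays free of logger and metrics calls.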
● AOP
● DECORATOR
● LAMBDA
43. Other opportunities
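Of the options listed, the decorator approach is the easiest to sketch briefly: instrumentation wraps the function instead of living inside it. The decorator name instrumented, the in-memory timings and failures stores, and the apply_discount function are assumptions for illustration, not the code from the talk.

```python
import functools
import time
from typing import Callable, Dict, List

# In-memory stand-ins for a real metrics backend.
timings: Dict[str, List[float]] = {}
failures: Dict[str, int] = {}


def instrumented(name: str) -> Callable:
    """Record duration and failure count of the wrapped function under `name`."""

    def decorator(func: Callable) -> Callable:
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            started = time.perf_counter()
            try:
                return func(*args, **kwargs)
            except Exception:
                failures[name] = failures.get(name, 0) + 1
                raise  # instrumentation must not swallow the error
            finally:
                timings.setdefault(name, []).append(time.perf_counter() - started)

        return wrapper

    return decorator


@instrumented("apply_discount")
def apply_discount(total: float, fraction: float) -> float:
    """Plain business logic with no logging or metrics calls inside."""
    if not 0 <= fraction <= 1:
        raise ValueError("fraction must be between 0 and 1")
    return total * (1 - fraction)
```

The trade-off is that a generic wrapper only sees arguments, results, and exceptions, so it suits technical metrics such as duration better than domain-specific events.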
44. Testing + Monitoring
45. Testing + Monitoring
Start being proactive
Don’t be firefighters
● The RED Method
● Monitoring Distributed Systems
● Domain-Oriented Observability
● Distributed Systems Observability by Cindy Sridharan
● Testing in Production, the safe way
● Deploy != Release part1 and part2
● SRE: Observability: Metric Namespaces and Structures
● Observability: Metric, Logging, and Tracing
● Decorator
● Monitoring in the time of Cloud Native
● https://www.elastic.co/learn
46. Resources:
Questions
