There are several factors that will make your Splunk implementation a success. This presentation covers why our organisation implemented Splunk for log management and the steps you can take to make your implementation successful.
2. 2
Paul Gilowey
Foundation Technology Specialist
paul.gilowey@santam.co.za
@paulcgt
Sustainable Logging:
SUCCEEDING WITH SPLUNK
Words and thoughts expressed herein are my own, and not those of Santam.
25. 25
•have influence across departments
•act as product owner
•be fanatical
•be hands-on
•have a development background
•be an architect
Dave Keeshan - https://www.flickr.com/photos/spudmurphy/
Your champion should…
32. 32
Demonstrate thoughtful logging
[DEBUG] TxId=328, Counting invoice line items…
[INFO ] TxId=328, Invoice LineItemsTotal=420
[DEBUG] TxId=328, Calling remote service JDE…
[TRACE] TxId=328, JDE Request: {“TxID”:”328”, “Items”[{“desc”:”Motor Vehicle”,”prem”:305.24},…
[WARN ] TxId=328, Timed out while calling remote service JDE… target system may be down. Will retry in 30s.
33. 33
Show the benefit of structured log events [INFO] Purchase complete - total=42 currency=ZAR language=en_ZA priority=13 “Purchase complete” priority<4 | stats sum(total) as currencyTotal by currency | table currency, currencyTotal
36. 36
Formalise stacktrace logging policy
Function call ->
Function call ->
Function call ->
Function call
<- Log stacktrace
<- Log stacktrace
<- Log stacktrace
<- Log stacktrace
37. 37
Avoid filtering events.
[DEBUG] TxId=328, Real important debug statement.
[INFO ] TxId=328, This would have been useful to see...
[DEBUG] TxId=328, Useful when we really need it.
[TRACE] TxId=328, Oh man, I need this event so bad.
[DEBUG] TxId=328, Flippin’ important debug message.
[INFO ] TxId=328, This would have been useful to see...
[WARN ] TxId=328, Why am I logging at all?
38. 38
Avoid filtering events.
[WARN ] TxId=328, Real important debug statement.
[WARN ] TxId=328, This would have been useful to see...
[WARN ] TxId=328, Useful when we really need it.
[WARN ] TxId=328, Oh man, I need this event so bad.
[WARN ] TxId=328, Flippin’ important debug message.
[WARN ] TxId=328, Cummon, I *really* wanna see this!
[WARN ] TxId=328, Why am I logging at all?
52. 52
Learn
Implement
Stabilise
Spread the word
Refine
Take a
smart approach
3
53. 53 Dashboards are pretty, alerts are king Reactive becomes proactive Register defects (ERROR = defect) Filter, don’t flood mailboxes
Build alerts
and
set policy
4
54. 54 Get a feel for the pain Make sure filtering is working Police false positives
Receive
all alerts
yourself
5
55. 55 Mine their data yourself
–Find what’s difficult to show
–Build dashboards to showcase their solutions Broaden their minds – complement traditional BI by using log events
Help
managers
look good
6
56. 56
“Not too hot, not too cold, just right!”
“Meh – too sloooow…”
“Too expensive!”
Apply the Goldilocks Principle
7
57. 57
Monitor licence usage by source or source type
index=_internal source=*metrics.log
group="per_sourcetype_thruput"
| stats sum(kb) as KB by series
| where KB > 20000
8
59. 59
Encourage thoughtful logging
Promote good logging practices
Police bad behaviour
Be intimately involved
Adopt a helpful attitude
Make sure you show value
To be successful:
60. Thanks for listening!
Paul Gilowey
Foundation Technology Specialist
paul.gilowey@santam.co.za
@paulcgt