Professional Documents
Culture Documents
Observability
Observability
Demystified
Cory Watson
Technical Evangelist | Splunk
© 2019 SPLUNK INC.
Forward-Looking Statements
During the course of this presentation, we may make forward-looking statements regarding future events or
the expected performance of the company. We caution you that such statements reflect our current
expectations and estimates based on factors currently known to us and that actual events or results could
differ materially. For important factors that may cause actual results to differ from those contained in our
forward-looking statements, please review our filings with the SEC.
The forward-looking statements made in this presentation are being made as of the time and date of its live
presentation. If reviewed after its live presentation, this presentation may not contain current or accurate
information. We do not assume any obligation to update any forward-looking statements we may make. In
addition, any information about our roadmap outlines our general product direction and is subject to change
at any time without notice. It is for informational purposes only and shall not be incorporated into any contract
or other commitment. Splunk undertakes no obligation either to develop the features or functionality
described or to include any such feature or functionality in a future release.
Splunk, Splunk>, Listen to Your Data, The Engine for Machine Data, Splunk Cloud, Splunk Light and SPL are trademarks and registered trademarks of Splunk Inc. in the
United States and other countries. All other brand names, product names, or trademarks belong to their respective owners. © 2019 Splunk Inc. All rights reserved.
© 2019 SPLUNK INC.
Cory Watson
Why listen to me?
▶ Remote, from Nashville, TN
▶ Technical Director, Office of the CTO @ SignalFx
▶ Principal Engineer, Observability Lead @ Stripe
▶ SRE, Engineer Manager, Observability @ Twitter
▶ 7 years in observability, > 20 years experience
© 2019 SPLUNK INC.
Making Things
With the best of intentions!
Complexity Happens
That escalated quickly
▶ Diverse technologies
▶ Interaction of protections
Credit: Linh Ha
© 2019 SPLUNK INC.
“Human practitioners
“Complex systems are are the adaptable “Failure free operations
intrinsically element of complex require experience
hazardous…” systems” with failure”
© 2019 SPLUNK INC.
Now, Observability
What is observability?
© 2019 SPLUNK INC.
Monitoring vs Observability
Monitoring is a subset
Monitoring
Knowns
Known Unknowns
Unknown Unknowns
Observability
© 2019 SPLUNK INC.
Events
Maybe logs? Definitely actions.
© 2019 SPLUNK INC.
Tools
What you’ll actually use
Digression, Risk
Everything is terrifying
© 2019 SPLUNK INC.
Systems Change
Complexity is added, etc
▶ Growth
▶ New operators
▶ New customers
▶ Improvements
▶ etc
© 2019 SPLUNK INC.
“All
practitioner
actions
are
gambles”
Richard I. Cook, MD
© 2019 SPLUNK INC.
▶ Failure is intrinsic
Ok, Deployment
How do we do this?
© 2019 SPLUNK INC.
Common Advice
Applicable to all the things
▶ Use what you have
▶ Leverage common frameworks, libraries, middleware
▶ Publish conventions and guidance
▶ Monitor usage and control output
▶ Offer help
© 2019 SPLUNK INC.
Logs
Low effort, high volume
▶ Consider as a product
▶ Structure, organize
▶ Add levels and criticality
Metrics
Medium effort, medium volume
▶ Relate to logging
▶ Use common patterns like RED
and USE
Tracing
High effort, maximum awareness
▶ High correlation
▶ CI/CD Pipeline
▶ Feature flags
▶ Internal company tools
▶ ???
© 2019 SPLUNK INC.
Visualize
Descend as needed
High Level Indicators
Ad Hoc Investigation
© 2019 SPLUNK INC.
Dashboard Design
More than just boxes
© 2019 SPLUNK INC.
Chart Design
Every chart is a story
© 2019 SPLUNK INC.
Monitor Symptoms
Causes Change
Flag Unsafe Situations
▶ (Again) Start with RED, USE, etc
▶ Deal with actionable problems
▶ Give aid to human, support
adaptability (RTO, SLA, comms)
Support Humans
▶ Learn and improve
Wait, Learning
Some guidance on that
© 2019 SPLUNK INC.
Teach
Get the basics
Practice
Humans learn by doing
▶ Use often
▶ Gamedays, chaos, experiments
▶ Hypotheses
▶ After-action, etc
Credit: Mídia
© 2019 SPLUNK INC.
Failure
Humans do it too
Learning
Diverse Studies
▶ Diverse cases
▶ Successes and failures
▶ Halo and horn effects
▶ Never stop
© 2019 SPLUNK INC.
Wrap Up
Some guidance on that
© 2019 SPLUNK INC.
I believe in you!
This is an investment in people.
© 2019 SPLUNK INC.
Thank
You!
Go to the .conf19 mobile app to
RATE THIS
SESSION