Chaos Engineering
Diane Glazman will never fly British Airways (BA)
again. Glazman and her husband were among
the 75,000 people affected by the three-day BA
system failure in summer 2017. On their way from San
Francisco to their son’s college graduation in Edinburgh,
they were stranded in London, without their luggage, at
the beginning of what was to be a three-week dream
tour of Scotland. “Listening to the excuses was
frustrating because nothing explained why BA was so
unprepared for such a catastrophic failure,” says
Glazman.
Chaos Engineering: Breaking Your Systems for Fun and Profit
[Figure: downtime cost statistics. Values shown: $1M – $5M+; 98%, $100,000+; 80%, $300,000+]
Break your systems on purpose. Find out their weaknesses
and fix them before they break when least expected.

It’s called chaos engineering, and it’s being adopted by
leading financial institutions, internet companies, and
manufacturing firms throughout the world. Such
businesses understand that the trillions of dollars lost
annually due to downtime are not acceptable to their
customers, their stockholders, and their employees.
A more complex, distributed world
[Figure: cloud adoption. Labels: Already Moved; Planning to Move; In Cloud by End of 2017. Values shown: 86%, 16%]
Chaos engineering, a primer
... computers have limits, and possible points of failure. By
injecting a system with something that has the potential
to disrupt it, you can identify where the system may be
weak, and can take steps to make it more resilient.

That covers the systems under your purview. Problems
that occur with your cloud service providers are out of
your control, and you can’t resolve outages by adding
extra boxes or power supplies.
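
The idea of injecting a disruption and watching how the system copes can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the method of any particular chaos engineering tool: `FaultInjector`, `fetch_price`, `price_with_fallback`, and `run_experiment` are hypothetical names, and the "dependency" is a stand-in dictionary lookup rather than a real service.

```python
import random


class FaultInjector:
    """Wraps a callable and makes a fraction of calls fail on purpose."""

    def __init__(self, failure_rate, seed=None):
        if not 0.0 <= failure_rate <= 1.0:
            raise ValueError("failure_rate must be between 0 and 1")
        self.failure_rate = failure_rate
        self._rng = random.Random(seed)  # seeded so runs are repeatable

    def wrap(self, func):
        def wrapped(*args, **kwargs):
            # Simulate an outage of the dependency on some calls.
            if self._rng.random() < self.failure_rate:
                raise ConnectionError("injected fault")
            return func(*args, **kwargs)
        return wrapped


def fetch_price(item):
    """Hypothetical stand-in for a call to a downstream dependency."""
    return {"widget": 10}[item]


def price_with_fallback(fetch, item, default=None):
    """Behavior under test: degrade gracefully when the dependency fails."""
    try:
        return fetch(item)
    except ConnectionError:
        return default


def run_experiment(failure_rate, calls=1000, seed=0):
    """Inject faults and count how many calls fell back to the default."""
    flaky_fetch = FaultInjector(failure_rate, seed=seed).wrap(fetch_price)
    results = [price_with_fallback(flaky_fetch, "widget") for _ in range(calls)]
    return sum(1 for r in results if r is None)
```

Calling `run_experiment(0.3)` reports how many of the 1,000 calls had to fall back. An unhandled exception at this point would be exactly the kind of weakness such an experiment is meant to surface before customers find it.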
Benefits of chaos engineering
Best practices in chaos engineering
No “kill switch” or safety valve to stop out-of-control experiments from taking down production systems
Kill switch to avoid impacting users
Figure 3: The debate between building and buying chaos engineering tools
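
To make the kill-switch comparison concrete, here is a minimal Python sketch of an experiment loop that halts itself once an observed error rate crosses a threshold, and that an operator can also stop by hand. It is a simplified assumption of how such a safety valve might work, not a description of any vendor's product: `Experiment`, `max_error_pct`, `inject`, `observe_error_pct`, and `demo` are hypothetical names, and the error rate is simulated.

```python
import threading


class Experiment:
    """Runs injection steps until done or until the kill switch trips."""

    def __init__(self, max_error_pct):
        self.max_error_pct = max_error_pct
        # An Event can be set from another thread, e.g. by an operator.
        self.kill_switch = threading.Event()
        self.steps_run = 0

    def run(self, steps, inject, observe_error_pct):
        for _ in range(steps):
            if self.kill_switch.is_set():
                break  # stop before doing any further damage
            inject()
            self.steps_run += 1
            # Trip the switch automatically if users are being impacted.
            if observe_error_pct() > self.max_error_pct:
                self.kill_switch.set()
        return self.steps_run


def demo():
    """Simulated system whose error rate rises 2 points per injection."""
    state = {"error_pct": 0}

    def inject():
        state["error_pct"] += 2

    def observe():
        return state["error_pct"]

    experiment = Experiment(max_error_pct=5)
    return experiment.run(10, inject, observe)
```

In `demo()`, the simulated error rate passes the 5 percent limit on the third injection, so the remaining seven steps never run. A production-grade safety valve would also need to roll back whatever was injected, which this sketch leaves out.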
Conclusion: eschew complacency