Professional Documents
Culture Documents
Hollnagel - 2012 - A Bird's Eye View of RE (2011)
Hollnagel - 2012 - A Bird's Eye View of RE (2011)
engineering
www.functionalresonance.com
E-mail: hollnagel.erik@gmail.com
What can
SAFETY = FREEDOM FROM UNACCEPTABLE RISK go wrong?
Accidents, incidents, …
© Erik Hollnagel, 2012
INES definitions
ACCIDENT:
“Any unintended event, including
operating errors, equipment failures or
other mishaps, the consequences or
potential consequences of which are not
negligible from the point of view of
protection or safety”
SAFETY:
INES does not give a definition of safety.
Industrial safety can be defined as the ability to manage the risks inherent to
operations or related to the environment. Industrial safety is not a dislike of risks;
rather it is a commitment to clearly identify them in relation to production
operations, assess them in terms of quality and quantity, and manage them.
Age of technology
1893
1769 Railroad Safety 1961
Industrial Appliance Act 1931 Fault tree
Revolution Industrial analysis 1979 1986
accident IT Three Mile Challenger
prevention Revolution Island Чернобыль
From Safety-I
© Erik Hollnagel, 2012
Theories and models of the negative
No surprises
Small surprises
Big surprises
Unexampled events
6σ 3σ 3σ 6σ
Everyday performance
Things that go right
(Safety-II)
Things that go
wrong (Safety-I)
© Erik Hollnagel, 2012
A formalism for system characterisation
Comprehensibility
The system functions
can range from being Difficult
easy to understand to
being difficult to
understand – either in
part or in whole.
(‘clockwork’) to being
irregular and disorderly.
gh
Hi
Tractable
Standardized industrial
production is stable,
relatively simple to
describe, and well Easy
understood. Descriptions
Simple Elaborate
w
Lo
Homogeneous
processes
gh
Hi
Heterogeneous
processes
gh
Hi
AVOID
anything that may have
negative consequences
for yourself, your group,
or organisation
COMPENSATE FOR
MAINTAIN/CREATE
unacceptable conditions
conditions that may be so that it becomes
of use in case of future possible to do your work.
problems.
© Erik Hollnagel, 2012
The ETTO principle
LPCS: Low
Pressure
Core Spray
H11: Transition
H4: 323 P1 & P2 H7: Re-establish from STF K7 till
blocked (switches containment STF K3. Switch
open). settings, except 323 positions not noted.
P1 & P2
The decision not to restore the LPCS system due to person safety aspects was
correct. But as a consequence the last barrier in preventing the LPCS to be
inoperable was broken.
The operators thought that the closing of the disconnecting switches for the LPCS
pumps were included in the startup procedure, but never checked this assumption.
(The evidence shows) that for a quarter of a century or more, the practice of liners
when in the vicinity of ice at night had been in clear weather to keep the course, to
maintain the speed and to trust to a sharp look-out to enable
them to avoid the danger. … I accept the evidence as to the
practice and as to the immunity from casualties which is said
to have accompanied it. But the event has proved the practice
to be bad. Its root is probably to be found in the competition
and in the desire of the public for quick passages rather in the
judgement of the navigators.”
© Erik Hollnagel, 2012
Safety-II when everything goes right
Safety-II: Safety is defined as a condition where the number of successful outcomes
(meaning everyday work) is as high as possible. It is the ability to succeed under
varying conditions.
Safety-II is achieved by trying to make sure that things go right, rather than by
preventing them from going wrong.
Expected
outcomes
Individuals and organisations must (success)
adjust to the current conditions in Everyday
everything they do. Everyday performance
performance must be variable in order variability
for things o work. Unexpected
outcomes
(failure)
To Safety-II
From Safety-I
© Erik Hollnagel, 2012
From the negative to the positive
Negative outcomes are All outcomes (positive
caused by failures and and negative) are due to
malfunctions. performance variability..
Eliminate failures
and malfunctions as Improve resilience.
far as possible.
Actual
Common and
cost-
LO Irrelevant effective, but
are they
relevant? Ease of
measurement
LO HI (availability)
© Erik Hollnagel, 2012
The ‘real’ indicators
Safety is defined as that which
Unplanned Automatic Scrams is ‘measured’ by the indicators.
per 7,000 Hours Critical Availability is more important
than meaningfulness.
Industrial Safety Accident Rate
Actual
HI
Everyday
Frequency
performance Accidents
In Incidents
ci
de
Ac nt Near misses
ci s
LO de
nt
s
LO HI
Similarity
Learning should focus on the frequency of events rather than on the severity.
Actual
Although such maps may seem authoritative, a model is just a model until the
methods used to produce it have been verified. The regions assessed as most
dangerous are the zones of three hypothetical ‘scenario earthquakes’ (Tokai, Tonankai
and Nanka). However, since 1979, earthquakes that caused 10 or more
fatalities in Japan actually occurred in places assigned a relatively low probability.
This discrepancy … strongly suggests that the hazard map and the methods used
to produce it are flawed and should be discarded.
Know what to do, Know what to Know what has Know what
be able to do it. look for happened to expect
The goal of Safety-II is to increase the number of things that go right, by enhancing
the abilities to respond, monitor, learn, and anticipate.
Failures
Efforts to maintain or improve safety focus on what can go
wrong and result in adverse outcomes.
Many theories, models, and methods explain or predict how
things can go wrong - with varying degrees of success.
Some also propose solutions, focusing M, T, and O issues –
also with varying degrees of success. Effort