Professional Documents
Culture Documents
58A All Slides
58A All Slides
LECTURE NOTES
ON
RISK ANALYSIS & MANAGEMENT
Prof. Dr. İlhan OR
Industrial Engineering Department, Boğaziçi University,
Istanbul
Spring 2021
PART 1
INTRODUCTION TO RISK CONCEPTS
u Examples of Natural
Hazards:
• Earthquakes;
• Tsunami;
• Hurricanes (Cyclones, Typhoons);
• Tornadoes;
• Floods (flash, coastal, urban);
• Wildfires;
• Drought/Famine;
• Thunderstorms, Lightning, Hail;
• Blizzards, Avalanches;
• Mudslides, Landslides.
Level of
Perceived
Risk
Real Risk
Level
Realization of Time
Catastrophic Event
Receive Information
Acceptance
10 % 5% 10 % 60 % 12-14 % 1-3 %
Evacuate Attack Warm/ Wait for
instruct others’ Paralyzed Panic
themselves the threat
others initiatives
u (Istanbul/Sultanbeyli) A young man sprayed “shelltox” into his mouth in order to kill the fly
that he accidentally swallowed. Unfortunately, he was “accidentally” and fatally effected by
the chemical and died.
Hazard Likelihood, Hazard Consequence; Inadequate Safety Measures; Unsafe Practices; Unsuccessfull Risk Communication
u (Dilovasi Iskelesi Kocaeli) An unfortunate event on the tanker “Gaziantep”: The third officer
decided to inspect the steam boiler and went inside without notifying anybody; then someone
else saw the open boiler hatch and just closed it. The officer died as a result of a “horrible
accident” when the ship took anchor before anybody noticed his absence.
Hazard Likelihood, Hazard Consequence; Inadequate Safety Measures; Unsafe Practices; Unsuccessfull Risk Communication
u (Hendek - Adapazarı) Under the influence of alcohol, 5 young man, travelling on the TEM
expressway, pulled their car into the emengency lane, got out and started dancing on the
expressway, to the tune of the lively belly-dance music paling on the car’s radio.
Unfortunately, the oncoming cars did not notice them and 3 of them got killed in the ensuing
“accidents”.
Hazard Likelihood, Hazard Consequence; Inadequate Safety Measures; Unsafe Practices; Unsuccessfull Risk Communication
PART 2
INTRODUCTION
TO
RISK MANAGEMENT
• “Bouncing back” aims at only maintaining the status-quo and lacks the component of
“adaptation”, without which, the system does not learn to better manage such shocks
and remains potentially vulnerable to disaster should another disturbance occur.
• Additionally, the disturbance may have changed the system so that the status it falls
back to is either no longer possible or has other implications.
• In the left image, the system
behaves like the engineering
resilience definition, until a
certain disturbance threshold
is reached.
• Then, through adaptation, the
system is pushed into a new
equilibrium state residing on
Line A.
• Line A demonstrates a more
resilient system than line B,
which is more easily forced
past the threshold and into an
undesirable regime.
◆ In the 90s, in Honduras, a new road was planned for the city, so a new bridge was needed. The
new 484 meter Choluteca Bridge, was built by a Japanese company between 1996-98.
◆ Since the region is hurricane prone, the authorities insisted on very high technical specifications.
So, the contractor built a strong, solid bridge, designed to withstand extreme weather conditions.
◆ Soon after the bridges opening, Honduras was hit by
a devistating Hurricane. Choluteka river flooded the
entire region. Many bridges were were destroyed; but
the Choluteca Bridge survived with minor damage.
◆ While the bridge itself was in good condition,the roads
on its either end of were totally swept away. Moreover,
the Choluteca River had carved itself a new channel
during the flood and it now flowed beside the bridge,
not beneath it.
◆ So, while the bridge was strong enough to survive the
hurricane, it became a bridge over nothing, spanning
just dry ground. It became known as “The Bridge to
Nowhere”.
The lessons from the Choluteca Bridge are very much relevant to Risk Management.
i.A serial system is only as strong as its weakest component (the swept away roads);
ii.The world may change in ways we never imagined; this bridge is an excellent
methaphore for what can happen to us (our careers, our businesses, our lives) as the
world around us gets transformed – adapt to change, or else …;
iii.Be careful when in your career you aim to become an expert in some specific, narrow
area; that expertise might soon become redundant;
iv.We get focused on creating the best solution for a given problem, while ignoring the
possibility that the problem itself might change;
v.We focus on building the strongest, most sophisticated product or service, without
thinking of the possibility that the market could change and the need could vanish.
Hondurans focused on the bridge and ignored the possibility that the river below could
change course.
vi.“Built to Last” migth have been a popular mantra, but “Built to Adapt” could be the
way to go.
New risks
Step 3: Risk Response Development
• Develop strategy to reduce occurence likelihood;
• Develop strategy to reduce impact;
• Develop contingency plans.
Risk management plan
Step 4: Risk Response Control & Impl.
New risks
• Implement risk strategy;
• Monitor & adjust plans for new risks;
• Trainings and Exercises.
Activities that are performed in the system may be inherently risky (e.g.
mining, hazardous material transportation, air transportation);
The technology used may have inherent risks, or exacerbate risk in the
system (e.g. heavy equipment);
Physical environment may be inherently risky,
• Susceptibility to natural disasters;
• Proximity to and nature of populated areas and other businesses;
• Supporting infrastructure (power, telecommunications, water, transportation)
The product or service provided may have inherent risks,
• Potentially dangerous materials, and products or services;
• Demographics of customers;
• Liability for defective services and products;
• Quality issues.
Human and organizational errors can be propagated by organizations and
individuals executing/coordinating tasks, or using/coordinating technology.
Sources: World Economic Forum, the Intergovernmental Panel on Climate Change, the
Chicago Actuarial Association, the Global Challenges Foundation, NASA
Resources Available?
Category Quality
Catastrophic Jeopardizes the existence of the organization.
Major No longer possible to achieve strategic objectives.
Minor Some short term/tactical disruptions but little effect on strategic objectives.
◆ An example description
of “Consequence Levels”
in a P-I Table.
Medium 5 4, 9 1
Low
V. Low 11 7 3
V. Low Low Medium High V. High
Probability
High severity
3. Medium 4 5 6 7 8
2. Low 3 4 5 6 7 Medium severity
1. V. Low 2 3 4 5 6 Low severity
1.V. Low 2. Low 3. Medium 4. High 5. V. High
Probability
• If a risk has “k” possible types of impact, they can all be combined into a singe score as,
é k Pi +Ii ù
S = log10 ê å 10 ú
ë i=1 û
• In this equation “Inv” refers to the amount of investment (of people, time, money
etc.) necessary to pull the “S” score from Sold level to Snew level.
Periodically, “Inherent Risks” (i.e. risk estimates before the
execution of any risk mitigation efforts) may be plotted together with
“Residual Risks” (i.e. risk estimates after the execution of risk
mitigation efforts) in order to monitor the risk management efforts.
4
A D
3
G C F E
Impact
2
B
F
1
H
1 2 3 4 5
Probability
Ilhan Or - Boğaziçi University 104
Analyzing and Assessing Risk
P-I Table Implementation: Torino Asteroid Impact Scale
◆ It is a risk-assessment scale assigning values to celestial objects
moving near Earth (http://neo.jpl.nasa.gov/risk/).
• It takes into account the object's size and speed, as well as the probability
that it will collide with Earth.
• The scale runs from zero to 10. An object with value 0-1 has virtually no
chance of causing damage on Earth; 10 means certain global catastrophe.
• Close encounters, assigned values 2-7, could be categorized as ranging
from "events meriting concern" to "threatening events”.
• “Certain collisions” merit values 8-10, depending on whether the impact
energy is large enough to cause local, regional or global devastation.
◆ It is difficult to figure out what level of anxiety we should have
about an approaching asteroid.
• Torino scale puts in perspective whether a Near-Earth Object merits
public concern, just as the Richter Scale does with earthquakes.
5
• Some types of Risk Assessment 4 B I
I: Interface Prob
(Severity) Matrices used to provide 3
S: System Freeze
a basis for prioritizing which risks B: User Backlash
2 F
to address, resemble P-I tables. H: Hardware Malf.
1 H
1 2 3 4 5
Exposure
2- Serious Injury 2- Unlikely 2- Infrequent (1-3 month)
3- Major Injury 3- Likely 3- Frequent (weekly)
4- Multiple casualties 4- Very likely 4- High (daily)
5- At least one fatality 5- Inevitable (imminent) 5- Constant
high
Introduce Measures
To Avoid Scenario
Ignore
low (Accept Risk)
low high
Level of Impact
u The Contingency Plan implemented by the Paris Fire Department during the
2019 fire at the Notre Dame Cathedral was prepared 160 years ago (following
the original building’s demise and rebuilding after the French Revolution).
u Its key clauses regarding priorities in any fire intervention operation are:
1. First save human beings trapped.
2. Next save the artworks in the Cathedral.
3. Next save the Altar (the big Cross).
4. Next save the furniture.
5. Next try to save the building.
u Notice the highest priority is given to irreplacable human life and artwork;
lower priority is given to semi-replacable Altar and furniture; while lowest
priority is given to the building, which was considered replacable.
u Additionally, an oak forest was initiated at the Versailles Palace Gardens to be
cut and used for the church roof if it ever became necessary.
•Accordingly, the oak trees at the Versailles Gardens provided a crucial resource during the
2019 rebuilding campaign.
Ilhan Or - Boğaziçi University 122
Risk Assessment and Response Matrix
Detect Accept/Reduce Contingency Trigger
Risk Event Chance Severity When
Diffty Share/Transfer Plan Event
Transfer/Accept: Having old
Delay
Pre- Better contracts with machine on
Late Delivery Low Medium Low exceeding 5
Instltn penalty clauses; better standby for
customs agencies days
backup
Reduce: Having experts Prod rate
Operators’ Post- flown in to 10% below
Adaptation Low Low Medium On site training before
Instltn delivery; Better training support local planned after
Problems
procedures team 5 days
Reduce: Having old
Machine not
Post- Wide communication machine on Acceptance
Confirming to Low High Medium standby for
Instltn with order;Pre delivery on tests negative
Specifications backup
site inspection
Reduce: Order
Order Influence top mngmt Having a
Financial delayed by 5
Low High Low Placmn priorities; obtain self leasing plan
Problems financing from ready days because
t
manufacturer of financing
u Rehabilitation
• Of losses and functionalities.
u Monitor & Adjust Plans (for old and new risks) based on the
experience gained.
• Update the related risk data (incidents, risk factors etc.)
• Revise the related risk assessments;
• Documentation;
• Revise basic operational procedures, equipment, communication,
information and material needs (with more flexibility & resiliance in
mind);
• Suggestions for new regulations and organizational framework;
• Reconsider strategies for preventive and mitigation measures;
• Revise contingency plans;
• Revise training and exercise programs.
u Example: Revised Fire Precautions in BU after Galatasaray fire.
Ilhan Or - Boğaziçi University 128
Risk Response Control & Implementation
Keys to Success
Availability of the Refuge (as a part of the Contingency Plan) which
provided the trapped miners with crucial physical needs and morale.
• Food and water supplies;
• Medical Supplies;
• Communication & other equipment (fire extinguishers, drilling equipment);
• Clothing and Hygiene material.
Excellent Training of the Men and their Leader.
• Organization for extended (unknown duration) stay;
• Excellent physical conditions of men;
• Team spirit and high morale.
Immediate,decisive, professional, creative actions of the Rescue Team.
Good luck: 2 km undamaged galleries, underground water availability.
Aspects that could have led to Failure (if Luck was not there)
Emergency ladders to scale up the ventilation shafts were not
operational.
The ventilation of the refuge was very poor.
• If it were not for the undamaged galleries, the conditions would have been
much worse.
There were no toilets.
• If it were not for the undamaged galleries, the conditions would have been
much worse.
The maps of the mine-shafts were out of date.
• This slowed down the rescue operations considerably.
Available supplies at the Refuge could have been better
• They used the batteries of the mining trucks around to power their lamps.
Keys to Success
Availability of a sound, serious and well designed Contingency Plan.
• All craft in the immediate neighborhood of the crash landing area were directed to
participate at the rescue operations (some of the first arriving boat were tourist
carrying ferries);
• Port authority had divers on call that were able to participate at the rescue
operations immediately;
• Helicopters were made immediately available to coordinate and support rescue
operations.
Good luck: good weather and water conditions; no mishap during landing.
PART 3
QUANTITATIVE
RISK ASSESSMENT
• Standard Deviation (σ): It is the square root of the Variance and has the
advantage of featuring the same units as the data to which it refers.
– For a set of n data points (x1, …xn), the standard deviation (σ) is defined as,
• Mean Deviation: The average of the absolute differences between the data
points and their mean.
P æçè x - mx ³ ks x ö÷ø £ 12
k
– (0 <SP <1) indicates a longer left) tail; (1 <SP) indicates a longer right tail.
Skewness Examples
0 xi
Ilhan Or - Boğaziçi University 180
Quantitative Modeling & Assessment of Risk
Key Statistical Analysis Tools
Regression Analysis – Logistic Regression
◆ The portion is called the “Linear Predictor”;
◆ The “Odds Ratio”, which is the likelihood of success divided by the
likelihood of failure is an important concept in Logistic Regression;
Hypothesis Testing
Hypothesis Testing is a key component of classical statistics.
• Test of simple hypotheses: Hypotheses in which the distnibution of the random
variable is fully specified.
The standard appoach in Hypothesis Testing is using Test Statistics.
• A real valued function of the data whose distribution, under the hypothesis is
known.
• If the value of the test statistic based on the data is extreme, then the hypothesis is
rejected.
Hypothesis Testing involves the idea of Significance Level.
• A hypothesis is rejected at m% significance level, if the value of the test statistic
calculated from the datais in the upper m% tail of the distribution.
Key Problem: The exact distribution of the test statistic is usually unknown.
• Often, asymptotic (limiting) distribution of the test statistic is known, which is
reliable when number of observations is large.
Kolmogorov-Smirnov
Distribution Fitting
Empirical Cumulative Frequency
versus
Theoretical Distribution Function
Hypothesis Testing
If the issue is important, the observed trend suspicion (statistical conjecture)
should not be just attributed to randomness and ignored even when
hypothesis testing fails.
• Suppose 2, 4, 3, 5, 6 accidents leading to injuries have been observed in a
company in five consecutive years and say we are concerned about an increasing
trend in accidents.
• This example data set would not pass a statistical significance test (in this case
regarding increasing trends).
Correlation Analysis
The investigation of dependence relationships among random factors.
• Recognition and investigation of the interdependencies between uncertain
components is crucial in most risk studies.
• Such as the possible dependence relationships among road congestion, highway
quality and speed limit in the above example.
• Such as the dependence between interest rates and mortgage rates.
– A strong positive correlation is expected: if the interest rate turns out to be at the
high end of the distribution, the mortgage rate is also expected to feature a high
value.
• If interdependency between two random variables is ignored, the joint
probabilities of these random variables will be incorrectly modeled.
– While a low interest rate in concert with a high mortgage rate is not very likely in
reality, under a false independence assumption this situation would be quite routine.
Example
◆ Suppose X = Uniform (-1, 1) ; Y = X2
Then, Cov(X,Y) = E(X Y) – E(X) E(Y) = E(X3) – E(X) E(X2) = 0
Since, E(X3) = E(X) = 0
◆ This is one reason Scatter Plots are as important as the numerical correlation
statistics.
• Independent variable is plotted on the x-axis, and the dependent on the y-axis.
• They provide a simple way of visualizing form of a correlation or dependency.
Statistical Inference
Given a set of observations regarding some random phenomena, estimation
(inference) of the probability distribution describing it.
• Usually, it is assumed that the distribution is one of a family of distributions f(t/θ)
parameterized by θ, and the assessment of the likely values of θ is endeavored.
• It is generally agreed that Statistical Inference should be based on the likelihood
of a parameter given the data.
• The Likelihood Function L(θ/x) for given data x, is equal to the Probability
Density Function of x given θ, f(x/θ).
In Risk Analysis and/or Assessment random phenomenon of interest are
usually the occurrence of undesirable events and their impact levels.
The realizations and/or levels of factors that may be triggering undesirable
events or their consequences may also be additional random phenomenon of
interest.
Bayesian Inference
The key principle behind Bayesian Inference is the revision of probabilistic
estimates based on current data.
• As such, Bayesian Inference allows the use of prior knowledge, which is very
important when there is little data (as is often the case).
• This “prior knowledge/expertise” of the modeler regarding the random
phenomena is contained in his estimate of a “prior distribution”.
The Bayesian viewpoint is popular in the risk analysis community.
• The parameter is considered to be stochastic in order to represent all the various
sources of uncertainty affecting incident occurrence.
• The process of updating a prior distribution to obtain a posterior distribution
gives an important role to the analyst/engineer.
– He/she can use their expertise to decide on the form of the prior distribution, then
experiments/observations/data can be deployed to update the prior.
• This combination of giving weight to experts, but still allowing for the scientific
evidence makes the approach popular.
A1 A2 A3 A4 A5 A6
Imagine an urn filled with marbles; half red, half white. Next,
imagine blindly drawing 4 marbles from the urn, recording the
number of reds in the sample, throwing the balls back into the urn.
If such trials are repeated many times, we will find the outcome (2 rd, 2 wh)
occuring (almost) 6 times as often as the outcome (4 red or 4 white).
Suppose two samplings are done from the same urn. In the first, 4
marbles, in the second 7 marbles are drawn on each trial. In both
cases the occurance of extreme samples (all white or all red) are
recorded.
If this is repeated long enough, it will be observed that extreme outcomes
occur 8 times more often in the first sampling.
• Expected percentages being 12.5% and 1.56%.
This statistical fact is relevant to the cancer example.
Lessons Learned
There is a strong tendency to believe that small samples resemble the
population from which they are drawn.
• We are prone to exaggerate consistency and coherence of what we see.
Statistics produce many observations that appear to beg for causal
explanations; but do not lend themselves to such explanations.
Many facts of the world are due to chance, including accidents of sampling.
Causal explanations of chance events are inevitably wrong.
Probabilistic assessments on key issues are biased regarding “cost of being
wrong” and “publicity” considerations.
Difference between Individual & Community Variances regarding impacts of
Risk Mitigation Measures might create a natural conflict of interest.
Unbiasedness of the sample data is very important.
Fault Trees
All root, intermediary & basic events of a Fault Tree have binary character.
• The corresponding indicator variable Xi= 1 in case of success, Xi= 0 otherwise.
The system being represented must be Coherent.
• The system as a whole cannot improve when one or more
subsystems/components fail.
Hospital System Power Failure System Power
Case Failure
Priority AND Gate (output if and only if the inputs occur in a given order)
m Out of n Gate (output if and only if at least m of the n inputs are active)
m
Transfer In (the tree is developed further elsewhere)
Inhibit (output occurs if and only if the single input occurs in the presence of
a conditioning event)
External Event
Operator pushes
wrong switch when
alarm sounds
Operator pushes wrong
Alarm Sounds
Alarm Sounds switch when alarm sounds
E1
P1
P2
A
Reactor Tank
S1
S2
E2
G1
G2 G3
P1 No signal to E1 P2 No signal to E1
fails P1 fails fails P2 fails
G4
G5
A No signal to E1
fails A fails
G6 G7
S1 E2 S2 E2
fails fails fails fails
Ilhan Or - Boğaziçi University 235
Quantitative Modeling & Assessment of Risk
Fault Trees - Waterway Example
Op.
Building
North
WLM WLM
WLM WLM
WMP WMP
Downstream Waterway Upstream
WLM WLM
Op.
Building
South
G1
G2 G3
G4 G5 G6 G7
No signal from North bank No signal from North bank No signal from South bank No signal from South bank
G8 G9 G10 G11
NU C SU ND C SD SU C NU SD C ND
Structure Functions
The Root Event of a Fault Tree can be represented by an indicator variable XR which
is a Boolean Function of the Boolean Variables X1, …, Xn describing the states of the
n events of the system.
XR = Φ(X1, …, Xn )
This function is called a Structure Function and incorporates all the causal
relationships leading to the root event.
•The Structure Function of a Series System:
•The Structure Function of a Parallel System:
Formal definition of Coherent Systems
i.Φ(X1=1, …, Xn=1) = 1 (when all components are in success state, system is
successful);
ii.Φ(X1=0, …, Xn=0) = 0 (when all components are failed, system is failed);
iii.Φ(X) ≥ Φ(Y) for X ≥ Y
Usually it is assumed that the basic events in a Cut Set occur independently.
• Probability of the Cut Set is the product of the probabilities of the Basic Events.
System Power
Failure
Generato
r
Failure
Switch Battery
Failure Failure
Advantages
Modeling via few, simple logic operations;
Directing the analysis to ferret out failures;
Focusing on one Root Event of interest at a time;
Pointing out the aspects of the system important to failure;
Providing a graphical communication tool whose understanding is easy and
analysis is transparent;
Providing an insight into system behavior;
Through minimal cuts sets, providing a synthetic result enabling the
identification of critical components.
Event Trees
Event Trees follow a “forward logic”: They begin with an initiating event
(an abnormal incident) and “propagate” this event through the system under
study by considering all possible ways in which it can effect the system
behavior of the (sub)systms.
Nodes of an Event Tree represent the possible functioning or
malfunctioning of a (sub)system.
• The intervention (or not) of protection systems which are supposed to take
action for the mitigation of the abnormal incident (accident/failure);
• The fulfillment (or not) of safety functions;
• The occurrence (or not) of physical phenomena (fires, dispersion etc.).
A path through an Event Tree resulting in an accident is called an accident
sequence.
• Accident sequences are quantified in terms of their probability of occurrence.
• Different endpoints of a tree can give the same consequences.
Success
IF1S2S3
Success Failure
IF1S2F3
Failure
IF1F2S3
Failure Success
Failure
IF1F2F3
Ilhan Or - Boğaziçi University 248
Quantitative Modeling & Assessment of Risk
Event Trees
Event Trees
Event Trees begin with a defined accident/failure (initiating) event
• There is one Event Tree for each different accident/failure (initiating) event
considered.
• Similar initiating events may be grouped and only one representative event in
each group may be investigated in detail.
Once an initiating event is defined, all the safety functions that are required
to mitigate the accident must be defined and organized according to their
time of intervention.
The logic order of the required functions must also be accounted for.
• If the successful fulfillment of a given function is dependent on the fulfillment of another
one, the tree needs to be oriented such that the dependent functions follow those upon
which they depend;
• System dependencies can be Functional - failure of intervention of a system renders
helpless (or increases failure likelihood) regarding the intervention of a successive one;
• Or Structural - if the systems share some common parts or flow so that failure of that
part makes them both fail.
Initiating
event IS1F2S3: P(S2/F1,I) x P(F1/I) x P(I)
Failure Success
Failure
IS1F2F3: P(F2/F1,I) x P(F1/I) x P(I)
Failure
IS1F2F3: P(F2/F1,I) x P(F1/I) x P(I)
◆ Çöllolar open coal mine in the Afşin-Elbistan region is one of the largest such mines in Turkey,
annually supplying around 10 million tons of lignite to the nearby thermal power plant.
◆ On February 10, 2011, a major landslide occured on the eastern wall of the Çöllolar mine. 10 miner
perished in this disaster, which occured along the full 1150 meter lenght and 140 meter height of
the eastern wall and carried 50 million m3 of material.
◆ Experts examining the disaster site came up with the following findings:
▪ The coal layers were not continous and featured almost vertical discontinuities.
▪ There were soft clay layers sandwiched between coal & other layers, and sloped towards the mine.
▪ There were a number of drainage wells in the mine area to drain the underground water; but they
were not well maintained.
▪ As the excavations contiued, many stress cracks appeared in the high grounds around the mine;
the management had the cracks filled with ash.
▪ In the days right before the landslide, the ash level in some of the cracks displayed around 20 cm.
depressions.
▪ The experts believe that the above factors all contributed to the landslide disaster.
No Stress Cracks
No/Non Effective IS1F2F3S4
Drainage Wells
Steep Open Mine Significant Stress Cracks IS1F2F3F4
Walls
Effective Drainage No Stress Cracks
Wells
IF1S2S3S4
No Soft Clay Significant Stress Cracks IF1S2S3F4
Layers
No/Non Effective No Stress Cracks
IF1S2F3S4
Drainage Wells
Significant Stress Cracks IF1S2F3F4
Non-continous
Coal Layer No Stress Cracks
Effective Drainage
Wells
IF1F2S3F4
Soft Clay Significant Stress Cracks IF1F2S3F4
Layers
No Stress Cracks
No/Non Effective IF1F2F3S4
Drainage Wells
Significant Stress Cracks IF1F2F3F4
◆ The survivors of the accident said that they did not notice
anything unusual until the crash.
◆ Some indicated that the landing speed seemed to be faster
than usual, but there was no loss of control.
◆ Some claimed that the aircraft had stalled while flying.
◆ One survivor said he felt a shaking then a slight sense of
uplifting but then came the sudden and unexpected crash.
◆ The crash angle was very small (as if the aircraft was flying
parallel to ground at an extremely low altitude).
◆ The Dutch authorites initiated rescue efforts, including 100
emergency personnel, 60 ambulances and 5 helicopters.
◆ Civil aviation experts acknowledged a defective altimeter and pilots’ realization of this being too late. Auto-pilot
landing was also confirmed with low visibility conditions most probably being the primary reason for this choice.
◆ Holland Department of Aviation Safety declared that the aircraft was on auto-pilot at the time of the incident and
that a faulty altimeter led to loss of speed right before the incident.
◆ Deployment of the auto-pilot system at landing is quite routine in many airlines and is decided upon by the pilots.
◆ According to the black box, at a height of 1950 ft the left altimeter suddenly indicated a change in altitude (from
1950 to 8 ft) and passed this onto the auto-pilot. This change had a direct impact upon the auto-throttle system
which provides engine power.
◆ Due to the low visibility conditions and their steep descent angle, the crew did not notice the flawed actions of the
auto-throttle.
◆ Meanwhile, a relatively inexperienced second pilot was in the controls of the auto-pilot (since the captain did not
see the altimeter defect, nor the low visibility conditions as a serious problem).
◆ When the aircraft started stalling, the pilots finally acknowledged the gravity of the situation; the second pilot
tried to apply full power to break away from this unusual and dangerous situation.
◆ Unfortunately, (for some reason) the auto-throttle was still active; so, it pushed back the controls to the IDLE
position, probably preventing a last minute speed-up.
◆ Airspeed was reduced to 170 km (way below the minimum necessary 260 km) and the plane crashed.
“Autopilot Flight & Cross-check Circuits Effective” Branch of the Event Tree
◆ If the plane were on auto pilot and cross-check circuits were effective, then the autopilot would try
to confirm the altitude information from the (faulty) altimeter system with other related status
information from the cross-check system (such as speed of aircraft and status of the landing gear).
◆ In case of conflicting feedback, the autopilot system would warn the pilot before taking any action
and suggest switching to manual flight.
◆ From that point onwards the manual flight conditions would prevail.
Altimeter Accident
Malfunction /Undesirable Event
Autopilot Experienced Sequences
Possible Event Flight Pilot
Good Visibility
Tree of the THY Ineffective Conditions Success IF1F2S3S4
Tekirdağ Cross-check
Success Failure IF1F2S3F4
Circuits
Accident
Failure Success IF1F2F3S4
Failure IF1F2F3F4
“Autopilot Flight & Cross-check Circuits Ineffective” Branch of the Event Tree
◆ If the plane were on auto pilot & cross-check circuits were not effective, but visibility conditions
were good, then, presumably, the pilot would notice that something is amiss the instant the autopilot
cut-off power and attempted to reduce aircraft speed to that of immediate landing speed.
◆ Then, pilot would have ample time to override/shut-off the autopilot system properly and proceed
with manual landing. The pilot’s success or failure in properly switching-off the autopilot and then
proceeding with manual landing would be slightly influenced by his experience.
◆ If the visibility conditions were bad, it could take some time for the pilot to notice that something is
wrong. Even then, an experienced pilot might immediately and properly switch to manual flight,
while an inexperienced pilot might panic and do not properly switch to manual flight.
u Example: We would like to assess time and cost overruns risks in Oil
Prospecting/Drilling Operations in a certain area, given,
• Probability distribution regarding existance of an oil reserve;
• Probability distribution regarding size and quality of the oil reserve;
• Probability distribution regarding depth of the field and soil characteristics;
• Probability distribution regarding marketability & sale price of the extracted oil.
u “Assessing time and cost overrun risks” may involve,
• Estimating drilling costs and durations;
• Estimating amount and duration of oil extraction;
• Estimating the revenue to be gained from the sales of the extracted oil.
u In the ensuing MC simulation,
• A series (set) of realizations for each of the random factors (existance, size and quality of
oil, depth of oil field and soil quality, market profile and oil prices) are generated based
on the assumed probability distributions (one iteration);
• Related extraction time, extraction costs and oil sales revenues are determined based on
the generated realizations;
• Thousands of simulation iterations accumulate thousands of such determined extraction
time, extraction cost and oil sales revenue realizations;
• Various statistical studies can then be done based on the accumulated data set.
◆ Çöllolar open coal mine in the Afşin-Elbistan region is one of the largest such mines in Turkey,
annually supplying around 10 million tons of lignite to the nearby thermal power plant.
◆ On February 10, 2011, a major landslide occured on the eastern wall of the Çöllolar mine. 10 miner
perished in this disaster, which occured along the full 1150 meter lenght and 140 meter height of
the eastern wall and carried 50 million m3 of material.
Experts examining the disaster site came up with the following findings:
▪ The coal layers were not continous and featured almost vertical discontinuities.
▪ There were soft clay layers sandwiched between coal & other layers, and sloped towards the mine.
▪ There were a number of drainage wells in the mine area to drain the underground water; but they
were not well maintained.
▪ As the excavations contiued, many stress cracks appeared in the high grounds around the mine;
the management had the cracks filled with ash.
▪ In the days right before the landslide, the ash level in some of the cracks displayed around 20 cm.
depressions.
▪ The experts believe that the above factors all contributed to the landslide disaster.
◆ The survivors of the accident said that they did not notice
anything unusual until the crash.
◆ Some indicated that the landing speed seemed to be faster
than usual, but there was no loss of control.
◆ Some claimed that the aircraft had stalled while flying.
◆ One survivor said he felt a shaking then a slight sense of
uplifting but then came the sudden and unexpected crash.
◆ The crash angle was very small (as if the aircraft was flying
parallel to ground at an extremely low altitude).
◆ The Dutch authorites initiated rescue efforts, including 100
emergency personnel, 60 ambulances and 5 helicopters.
◆ Civil aviation experts acknowledged a defective altimeter and pilots’ realization of this being too late. Auto-pilot
landing was also confirmed with low visibility conditions most probably being the primary reason for this choice.
◆ Holland Department of Aviation Safety declared that the aircraft was on auto-pilot at the time of the incident and
that a faulty altimeter led to loss of speed right before the incident.
◆ Deployment of the auto-pilot system at landing is quite routine in many airlines and is decided upon by the pilots.
◆ According to the black box, at a height of 1950 ft the left altimeter suddenly indicated a change in altitude (from
1950 to 8 ft) and passed this onto the auto-pilot. This change had a direct impact upon the auto-throttle system
which provides engine power.
◆ Due to the low visibility conditions and their steep descent angle, the crew did not notice the flawed actions of the
auto-throttle.
◆ Meanwhile, a relatively inexperienced second pilot was in the controls of the auto-pilot (since the captain did not
see the altimeter defect, nor the low visibility conditions as a serious problem).
◆ When the aircraft started stalling, the pilots finally acknowledged the gravity of the situation; the second pilot
tried to apply full power to break away from this unusual and dangerous situation.
◆ Unfortunately, (for some reason) the auto-throttle was still active; so, it pushed back the controls to the IDLE
position, probably preventing a last minute speed-up.
◆ Airspeed was reduced to 170 km (way below the minimum necessary 260 km) and the plane crashed.
Been Designed
Probably Designed
“Autopilot Flight & Cross-check Circuits Effective” Branch of the Event Tree
◆ If the plane were on auto pilot and cross-check circuits were effective, then the autopilot would try
to confirm the altitude information from the (faulty) altimeter system with other related status
information from the cross-check system (such as speed of aircraft and status of the landing gear).
◆ In case of conflicting feedback, the autopilot system would warn the pilot before taking any action
and suggest switching to manual flight.
◆ From that point onwards the manual flight conditions would prevail.
Altimeter Accident
Malfunction /Undesirable Event
Autopilot Experienced Sequences
Possible Event Flight Pilot
Good Visibility
Tree of the THY Ineffective Conditions Success IF1F2S3S4
Tekirdağ Cross-check
Success Failure IF1F2S3F4
Circuits
Accident
Failure Success IF1F2F3S4
Failure IF1F2F3F4
“Autopilot Flight & Cross-check Circuits Ineffective” Branch of the Event Tree
◆ If the plane were on auto pilot & cross-check circuits were not effective, but visibility conditions
were good, then, presumably, the pilot would notice that something is amiss the instant the autopilot
cut-off power and attempted to reduce aircraft speed to that of immediate landing speed.
◆ Then, pilot would have ample time to override/shut-off the autopilot system properly and proceed
with manual landing. The pilot’s success or failure in properly switching-off the autopilot and then
proceeding with manual landing would be slightly influenced by his experience.
◆ If the visibility conditions were bad, it could take some time for the pilot to notice that something is
wrong. Even then, an experienced pilot might immediately and properly switch to manual flight,
while an inexperienced pilot might panic and do not properly switch to manual flight.