Professional Documents
Culture Documents
Failure Klde, Effect and Criticality Analysis
Failure Klde, Effect and Criticality Analysis
D.J. Lawson,
ABSTRACT
1. INTRODUCTION
The examples given in this paper contain fictitious system data which is used
solely to demonstrate the method.
1ST LEVEL
2ND LEVEL
LEVEL
4TH LEVEL
INPUT
FAILURE-
MODE
INTERNAL
FAILURE
MODE
1,1. Definitions
INTERNAL FAILURE A failure mode that has originated from within the unit being
MODE considered.
LOCAL EFFECT The effect of a failure mode on the unit under consideration.
END EFFECT A failure effect that has no further effect at the current
level.
2. APPLICATION
An FMECA may be carried out to any level of assembly and so the bounds of the
analysis can vary. For example the analysis might consider the failure modes of
individual Line-Replaceable-Units (LRU's) and trace their effects up to complete
system level. On the other hand, the analysis might consider a single LRU (perhaps
shown to be critical by some other criteria) and examine the effects of component
failures on the performance of the LRU. A 'partial' FMECA, where some elements of
the system are ignored, is not recommended since the omitted items may contain
failure modes which produce unenvisaged critical effects. An FMECA is potentially
one of the most beneficial and productive tasks in a well structured reliability
programme. It serves to verify design integrity, assess system safety, identify and
quantify sources of undesirable failure modes and document the reliability risks.
It should be applied with the following objectives in mind:
3) to categorise all effects with respect to their severity and compare their
criticality. Critical effects are those with both high severity and high
failure rate.
Objectives (2) and (3) are sometimes omitted and the analysis called a
Failure Mode and Effect Analysis (FMEA).
i) List all actions needed to locate and correct each failure mode,
In general, therefore, an FMECA can most usefully be performed when the system
is in its simplest state (i.e. comprises the minimum number of items necessary to
perform the system function) to determine those areas where redundancy, or some
other reliability improvement technique, should be applied.
However, if a system has redundant items they must be analysed to verify that
their failure modes cannot degrade system performance and so that their failure
mechanisms can be established from the lower level results of the system FMECA.
60
3. REQUIREMENTS
The requirements summarised below will help to ensure that the potential of an
FHECA is realised.
i) the analysis is completed at the lowest level where design engineers have
extensive knowledge,
iii) it eliminates the need to trace each effect through the system
individually.
61
6) The severity and failure rate of each failure should be stated so that its
criticality can be determined.
1. Define the function and purpose of the system and establish all
differing modes of operation.
NO - not detected.
4.2. Procedure
Once the system breakdown has been established each unit must be
identified by a unit number as suggested in Figure 2. Levels may be
typically defined as:
0 MSA Level
I
I I
2 3 MS Level
I
I I I I I
1.1 1.2 1.3 3.1 3.2 SS Level
I
1.2.1
I
1.2.2 1.2.3 3.2.1
ILRU Level
3.2.2
I
1.2.2.1 1.2.2.2 MOD Level
UNIT NAME: Con t ro1 and Decode T PARTS: 819262, 819259 TOT. FR:
MS 1 55 4 LRU 3 MOD - COMP NEXT LEVEL UP: Synchro Interface Unit
SOURCE/ ~
No. I FAILURE MODE F.R. No. FAILURE EFFECT DEST. « BITE
u
~
I
Loss of 3.4 V reference EXT Loss of attitude rate data 1. 2.5 .6 I A I YES
2 No chip select to RAM 4.6 121 Angular data not updated on 1.2.5.7 A YES
all channels 1.3.1.2
INTERNAL
GMf 203101944
CONVENTION
INPUT EFFECTS
MODES
RESULTANT EFFECT
OF MODE A
INPUT MODE 1.2 RESULTANT EFFECT
yINTERNAL
MODES
B-----~
I
~---- ..
OF MODE B
- ---'
INTERNAL MODE A
Internal modes at the lowest level are given a failure rate instead of
a source number.
For example:
FAlL~E MDDE a A aA
Diode D6 short circuit 0.76 10 7.6
Diode D6 intermittent circuit 0.18 10 1.8
Diode D6 open circuit 0.06 10 0.6
However, time, labour and cost usually prohibit a full system analysis
being taken to component level. If this is the case, failure modes at the
lowest level could be restricted to circuit functions, for instance, loss of
power supply regulation. The failure rate of a failure mode at this higher
level of assembly may either be determined by summing the failure rates of all
its contributory components or by the apportionment of the total failure rate
of the unit, which is generally available.
must be taken not to isolate these simultaneous failures during the analysis.
If they are allowed to become isolated it will be possible to record
impossible partial failure effects.
5. COMPUTER AIDS
ANAL IS
EDIT
A flexible editing, and data handling program designed to facilitate the
updating of information as the system design and analysis progresses.
MODE
This program will produce a list of all input and internal failure modes, for
all specified units, together with their source or failure rate.
EFFECT
The output from MODE and EFFECT provides a useful index to the output from the
following three programs.
TRACEUP
This traces all internal failure modes to their resulting effects at any
specified level. The route taken by the trace is indicated by listing all units at
intermediate levels Which are affected by the failure. Category and detectability
are also indicated Where applicable. Any sensitive failure modes Which result in
numerous, high category effects will be immediately highlighted. The output data
from TRACEUP is shown in Figure 5.
TRACED
This program will trace down from all effects at any specified unit to their
contributory failure modes at any specified level. The failure rate of each
contributory internal mode is given together with the total failure rate of each
effect. It is therefore simple to determine Where an improvement in reliability
would be most effective should the failure rate of the effect prove unacceptably
high. An example of the output data is shown in Figure 6.
It is also possible to trace all effects of a given category and calculate the
total failure rate for that category. This will not necessarily be the sum of the
failure rates of the individual effects since the program avoids inaccuracies by
enauring that the failure rate of a mode is not summed more than once. This
facility enables the probability of, say, a catastrophic system failure to be
determined by excluding the failure rates of modes not contributing to this category
and thus avoiding a pessimistic result.
FMEA nF FLIGHT CONTROL SYSTE~ PRO GRAM TRACEUP
CAT BI TE
1.1 SS CONTROL
7 LOSS OF SYNCHRO OUTPUTS ~1 AND Q2 ON CHANNEL 2 A YES
•
1.3 SS MONITOR
• 7 VALIDITY DISCRETES ON CHANNEL 2 FAILED B YE S
SOURCE FR/I0E6HR
CATEGORY A EFFECT
FAllU~E RUE
re~ Mill! O~ HOURS ••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••• INCREASING CRITICALITY
I 30J.~0 I
' ~74 .CO I
, 2~~ .CO I
I "
' I
I I
I
I 42
I
,
I
A
I
I
m:~~ : : I I I / I
~r; :~~ \ I I 84 I 62 I / \
m:g
ZIt~.I)O
I' 1'0 II 111// I,'
:4).0) I I J I
:)4.('0 I 59 I /1 I
;g:~g I I I 27 / : :
: 1~. co I I I 3 I I
:10.00 I I ?3 I I I
i~~ :~g I I 29 : /31 I :
:;;:~~ : : H : / " :
1)3.00 I I 94 V 70 I
17' .,,) I I 1 I LINES OF EQUAL
:~~:::g I : 44 : / : 99 : CRITICALITY
I~~ .uO I I I I 68 I
1~ 3.") I I I I 63 I
144 ••'0 I : I.~ I I
g~:~~ ! ~~ I / ~~ :
ig:~~: II 72 I
gj:~~ : 81 II 92 I
I;~:~~ I / I I
:;':~5 I 5ri IOJ H I
~~:~g : n r
:)3: /
I 52
~;: ~~ 70 71
~~:~~ t 95 " 91
4Z.(," : 87 / : .-
~~:g / II '-0 H
~~:~g A3 : 00 I
1 l;:~~ I ~ 1: 1 : 66 I I I
!...~:~..! .......... !.L •••••••• !.....=....!..........!..........!........_.!
u C ~ S CATEClRY
INCR:AS1t:G S;VtRITY .. )
6
BOARD NA~E ORA.INC NUMBER LOCATION F.R.1I10 HRS.I OTHER POSSIBLE EFFECTS
-------- ------------- ----------------------
BUFFER STORE BOARD IC062C/IIOI 12 32.60 19. 41
6
BOARD NAME ORA 'INC NUMBER LOCATION F.R. [/10 HRS.' OTHER POSSIBLE EFFECTS ~
DIAGNOSE
A further program has been developed to produce a diagnostic manual from the
FMECA data as an aid to maintenance engineers. Here failure mode descriptions are
replaced by the board (or unit) identifiers, and locations, of each board that might
be responsible for the effect in question. The effect number of any additional
effect that may be observed if a given board failed is inserted against that board
identifier together with the board failure rate.
METHOD OF USE
Once a failure effect has been observed an index is searched for the
corresponding effect description. The associated effect number is read off and the
relevant page located in the diagnostic manual. (See Figure 8.) This states the
board, or boards, whose failure would result in the observed effect. If more than
one board is given, the effects listed in the OTHER POSSIBLE EFFECTS column can be
checked to help reduce the number of possible boards responsible. When it is
impossible to reduce the list to a single board the remaining boards should be
replaced in decreasing order of failure rate until the fault is rectified.
74
REFERENCES