Professional Documents
Culture Documents
07.IAM SSG - Reliability - Engineering - v1.1 - 16
07.IAM SSG - Reliability - Engineering - v1.1 - 16
Lifecycle Delivery
Reliability Engineering
Managing the
Organisation
Asset
Management
Asset
Management
System
16
Asset
Portfolio
© Copyright The Institute of Asset Management 2016. All rights reserved www.theIAM.org
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
ii © Copyright The Institute of Asset Management 2016. All rights reserved.
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
Asset Information
©C
Copyright
i ht 2014 IInstitute
tit t off A
Assett M
Managementt ((www.theiam.org/copyright)
th i / i ht)
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
© Copyright The Institute of Asset Management 2016. All rights reserved. iii
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
Acknowledgements ii
Contents
1 INTRODUCTION TO SUBJECT SPECIFIC GUIDANCE 1
1.1 Purpose of the SSGs 1
1.2 The SSGs in context 1
1.3 SSGs and the issue of Complexity versus Maturity 1
1.4 Further reading 2
2 INTRODUCTION 3
2.1 The Purpose, Intended Use, and Intended Audience of this SSG 3
2.2 Aligning this Document 5
2.3 Complexity versus Maturity 6
2.4 Navigating & Using this document 7
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distributio
iv © Copyright The Institute of Asset Management 2016. All rights reserved.
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
5.6 HAZOP 28
5.6.1 What is it? 28
5.6.2 Limitations 28
5.6.3 When is it used? 29
5.6.4 Complimentary techniques 29
5.6.5 Example procedure steps 29
5.6.6 Prerequisites 30
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
© Copyright The Institute of Asset Management 2016. All rights reserved. v
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
6 CASE STUDIES 48
6.1 Case Study 1 50
6.1.1 Introduction 50
6.1.2 Challenge 50
6.1.3 Approach 50
6.1.4 Deliverables 51
6.1.5 Results 51
6.3.1 Introduction 54
6.3.2 Challenge 54
6.3.3 Approach 54
6.3.4 Deliverables 55
6.3.5 Results 55
6.4 Case Study 4 56
6.4.1 Introduction 56
6.4.2 Challenge 56
6.4.3 Approach 56
6.4.4 Deliverables 56
6.4.5 Results 56
6.5 Case Study 5 58
6.5.1 Introduction 58
6.5.2 Challenge 58
6.5.3 Approach 58
6.5.4 Deliverables 60
6.5.5 Results 60
6.6 Case Study 6 61
6.6.1 Introduction 61
6.6.2 Challenge 61
6.6.3 Approach 61
6.6.4 Deliverables 61
6.6.5 Results 62
6.7 Case Study 7 62
6.7.1 Introduction 62
6.7.2 Challenge 62
6.7.3 Approach 62
6.7.4 Deliverables 63
6.7.5 Results 64
6.8 Case Study 8 64
6.8.1 Introduction 64
6.8.2 Challenge 64
6.8.3 Approach 65
6.8.4 Maintenance Analyses 65
6.8.5 Re-usable Maintenance Support Processes 65
6.8.6 Innovative Technologies for Condition Monitoring 65
6.8.7 Deliverables 65
6.8.5 Results 66
7 REFERENCES 67
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
© Copyright The Institute of Asset Management 2016. All rights reserved. vii
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
The SSGs cover the 39 Subjects in The Anatomy directly as a ‘one to one’ (where a
subject is very broad), or grouped (where subjects are very closely related).
1.1 Purpose of the SSGs practical approaches and solutions that are
This document provides guidance for good asset economic, viable, understandable and usable.
management. It is part of a suite of Subject Specific The underlying requirement for continual
Guidance documents that explains the 39 subject areas improvement should drive progress.
identified in “Asset Management – an Anatomy”, also
published by the Institute of Asset Management. These 1.2 The SSGs in context
subject areas are also acknowledged by the Global The SSGs are a core element within the IAM Body
Forum for Maintenance and Asset Management as the of Knowledge and they have been peer reviewed
“Asset Management Landscape”. and assessed by the IAM Expert Panel. They align
fully with the IAM’s values and beliefs that relate
PAS 55 and ISO 55001 set out requirements which to both the development of excellence in the asset
describe what is be done to be competent in asset management discipline and provision of support to
management, however they don’t offer advice on those who seek to achieve that level of excellence.
how it should be done. The SSGs are intended
to develop the next level of detail for each subject 1.3 SSGs and the issue of Complexity
in The Anatomy. They should therefore be read versus Maturity
as guidance; they are not prescriptive, but rather It is important to understand and contrast these
intended to help organisations by providing a terms. Put simply:
consolidated view of good practice, drawn from • The complexity of the business will drive the
experienced practitioners across many sectors. complexity of the solution required; and
• The maturity of the organisation will determine its
The SSGs include simple as well as complex solutions, ability to recognise and implement an appropriate
together with real examples from different industries solution.
to support the explanatory text because it is
understood that industries and organisations differ A very mature organisation may choose a simple
in scale and sophistication. In addition, they are at solution where a naive organisation may think
different stages of asset management; some may be that a complex solution will solve all its problems.
relatively mature while others are at the beginning of In truth, there is no universal best practice in Asset
the journey. Management – only good practice that is appropriate
for the operating context of any particular
Accordingly, there is flexibility for each organisation organisation. What is good practice for one
to adopt their own ‘fit for purpose’ alternative organisation may not be good practice for another.
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
1 © Copyright The Institute of Asset Management 2016. All rights reserved.
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
© Copyright The Institute of Asset Management 2016. All rights reserved. 2
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
2 Introduction
2.1 The purpose, intended use, and the organisations within them. To provide additional
intended audience of this SSG context this SSG provides case study examples from
PAS 55 and ISO 5500x1 are the formal specification different sectors to demonstrate the key points of
and standard for the implementation of an Asset guidance. However, any document generic enough
Management System, setting out the minimal to be applied to multiple industry sectors must be at
requirements an organisation would need to a relatively high level of detail.
meet to gain accreditation to that specification or
standard. For any organisation or individual wanting Those familiar with BSI PAS 55:2008 will be aware
to master the discipline, knowledge of PAS 55 and that this specification itemises 28 requirements for
ISO 5500x is not the whole picture. As well as the organisations seeking to demonstrate good Asset
standard and management system aspects, they Management practices.2 These requirements are a
need to understand the full breadth and depth of clear foundation for implementing and operating
the component parts that make up the landscape of an Asset Management System. They are, however,
Asset Management and this is supported through the distinct from the capabilities such organisations
SSGs. need – these are the 39 Subjects described in
the Anatomy.3 The areas for requirements are
Standards could therefore be regarded as ‘what’ is summarised in the following figures from PAS 55 and
required for an Asset Management System. This ISO 5500x.
SSG, as one of many being developed by the IAM,
supports the ‘how’ to deliver the component parts The Asset Management Anatomy has been built
and in its development has tried to cover the range around 6 Subject Groups and 39 subjects and now
of industry sectors currently associated with the IAM provides a stable platform on which the IAM can
and recognise that differences in levels of maturity develop SSGs. These six subject groups and 39
and operating contexts exist within those sectors and subjects are also aligned with The Asset Management
Figure 1: Requirements for good asset management practices - PAS 55 (left) ISO 55002 (right)
1. The term ISO 5500x is used generically in this document to refer to the family of standards comprised on ISO 55000, 55001, and 55002 unless specific reference is made
to a section within one of those standards.
2. Clause 4, PAS 55
3. Asset Management – an Anatomy Version 2, July 2014, Institute of Asset Management
4. Second Edition, March 2014
Sell, Recycle
Install and Operate and
Identify need Select Purchase and/or
Configure Maintain Replace
Manage
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
Landscape4 (published by The Global Forum on Note that in ISO 5500x the explicit identification of
Maintenance and Asset Management) to facilitate different life cycle activities (such as create/acquire,
the exchange and alignment of maintenance and operate, maintain, renew/dispose) has been dropped
asset management knowledge and practices. and instead it accommodates more diverse life cycle
stages of different asset types. The figure above
The six Subject Groups5 are: shows some examples of lifecycle variations.
• Strategy and Planning.
• Asset Management Decision-Making. We recognise that you will spend more time and
• Lifecycle Delivery. money if you don’t have the right assets in the first
• Asset Information. place, and we recognise that prior stages are also
• Organisation and People. important by getting an asset design right in the
• Risk & Review. first place. However, many of the earlier and later
lifecycle phases are covered by other areas within
This SSG specifically pertains to Reliability Engineering the Lifecycle Delivery subject group of the Asset
which sits within the Lifecycle Delivery area of Management Landscape, as listed below, and this
the Asset Management Subject Groups. It will SSG focuses more on the Operate / Utilise & Maintain
become part of a full series of SSGs covering all / In Service area of the asset lifecycle.
39 Subjects and a smaller series of Sector Specific
Guidelines (where these are desired by a particular Lifecycle Delivery:
sector). These are not designed as text books or • Technical Standards & Legislation.
course material but as reference documents for • Asset Creation & Acquisition.
professionals working in or requiring guidance in • Systems Engineering.
this field. We would expect everybody involved in • Configuration Management.
asset management to have a working knowledge of • Maintenance Delivery.
the 39 Subjects, but the degree to which they might • Reliability Engineering (this SSG).
need deep or specialist knowledge will depend on • Asset Operations.
the job or task they perform. • Resource Management.
• Shutdown & Outage Management.
Reliability Engineering has a broad scope and as such, • Fault & Incident Response.
the group developing this SSG considers lifecycle • Asset Decommissioning & Disposal.
as a central component of this document, and this
document is based on the assumption that most people In applying this SSG, the reader is requested to view
who read this SSG will be people who have existing Reliability Engineering as a toolbox. It has many
assets and are looking at better ways to manage the sections with many tools that apply across the whole
reliability associated with them. For this reason this SSG lifecycle. But some tools are used more often than
focuses on the “in service” phase of the asset lifecycle. others and some tools are more general whereas
5. Updated to reflect Version 2 of the Anatomy
6. An Anatomy of Asset Management Issue 3 July 2014
others have very specific applications. Given the • Alignment (‘line of sight’) of organisational
assumption that most use of the SSG will probably be in objectives feeding clearly into asset management
the “in service” stage and be related to mostly physical strategies, objectives, plans and day-to-day activities.
asset management this SSG describes the applicability • Whole life cycle asset management planning and
of techniques to be used for these purposes. cross-disciplinary collaboration to achieve the
best value combined outcome.
The following diagram illustrates typical priorities and • Risk management and risk-based decision-
concerns for asset managers. This topic is covered in making.
other IAM documents but aligning the organisational • The enablers for integration and sustainability;
and strategic goals with work carried out under the particularly leadership, consultation,
asset management system is critical for maintaining communication, competency development and
clear line of sight. This line of sight is maintained by information management.
using the right tool for the right job.
An in depth comparison of these two publications
Typical priorities & ‘values’
is outside the scope of this document but some key
differences are listed below9:
Keeping stakeholders happy Corporate/
Organization • PAS 55 primarily relates to the management of
Management
Portfolio return on investment physical assets, whereas ISO 55001 is intended to
n
Systems performance,
lue
effectiveness
eC
impact an organisation looking to improve its asset to assess capabilities and a roadmap for improving
management performance, specifically within the them. In its simplest form, a maturity model is a set
area covered by this SSG. of characteristics, attributes, indicators, or patterns
that represent progression and achievement in a
In simple terms: particular domain or discipline. The artefacts that
• The complexity of the business will drive the make up the model are typically agreed upon by
complexity of the solution required. the domain or discipline and are validated through
• Organisational maturity will determine its ability application and iterative recalibration11.
to recognise and implement an appropriate
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
10. Maturity Models 101: A Primer for Applying Maturity Models to Smart Grid Security, Resilience, and Interoperability, Software Engineering Institute, 2012, Caralli, Knight,
Montgomery
11. Ibid
12. Ibid
13. https://theiam.org/products-and-services/pas55-methodology
14. https://theiam.org/products-and-services/Self-Assessment-Methodology
Introduction This section. Introduction to the purpose of the SSGs and how they relate
to PAS 55 and ISO 5500x.
What Does “Reliability Overview of scope and key terminology.
Engineering” Mean?
Concepts, Principles and High level summary of the important considerations, concepts and
Key Factors principles of Reliability Engineering.
Reliability Engineering Description and overview of several helpful techniques for Reliability
Toolbox Engineering.
Guidance for Reliability Builds on the previous section and looks more at key inputs, outputs and
Engineering required experience.
Case Studies Examples to illustrate the use of some Reliability Engineering Tools.
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
7 © Copyright The Institute of Asset Management 2016. All rights reserved.
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
The objectives of Reliability Engineering can be Such commercial evaluation can / should consider
summarised as: the following:
1) To apply engineering knowledge and • Reduced production losses.
specialist tools/techniques to prevent or • Lower production unit cost.
reduce the likelihood and/or consequence • Reduced maintenance costs.
of failure, recognising that you should • Improved employee safety.
determine the consequences and probability • Better process stability.
of failure and then decide what, if anything • Extended equipment life.
to do. You can decide to reduce or accept • Reduced spare parts inventory.
the consequences. You can decide to reduce • Improved sense of employee ownership.
or accept the probability of failure, or do • Reduced risk of environmental issues.
nothing. • Reduced overtime.
2) To identify and correct the causes of failure • Continuous improvement.
that do occur, despite the efforts to prevent
them. The key aspects of reliability management include:
3) Determine ways of coping with failures that • Corporate level involvement.
do occur, if their causes have not been • Integral part of product development not parallel.
identified or corrected. • Reliability procedures integrated into design process.
• Built into programme plan and production of a
On a case-by-case basis, the above objectives may reliability plan.
need justifying commercially if a proposed Reliability • Ownership of the reliability plan within the design team.
Engineering solution is likely to be ‘resource hungry’.
©C
Copyright
i ht 2014
2014 IInstitute
titt t off A
ti Assett M
Managementt ((www.theIAM.org/copyright)
th IAM
IAM / i ht) 7. Support
7.1 Resources 7.4 Communication
7.2 Competence 7.5 Information requirements
7.3 Awareness 7.6 Documented information
Sell, Recycle
Install and Operate and
Identify need Select Purchase and/or
Configure Maintain Replace
Manage
Identify need Design Construct Commission Operate and Decommission Residual
Maintain Liabilities
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
9 © Copyright The Institute of Asset Management 2016. All rights reserved.
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
these constraints as it aims to provide optimal asset asset systems where individual components may
performance. In order to achieve this goal the following fail without necessarily impacting system reliability,
four key elements16 in Figure 4 provide a concise focus of and the system may still perform the primary
Reliability Engineering activities: function for which it was intended. Even if no
individual part of the system has failed the system
• Reliability deals with potential events (Upper as a whole may eventually cease to perform the
Left) primary function for which it was intended because
This means that failure is regarded as a it has been in a failing state; when this is the case
probabilistic phenomenon: and the likelihood of then system reliability will obviously be impacted.
failure may be random or may vary over time The system requirements specification is the
according to a distribution (probability density criterion against which reliability is measured.
function). Reliability Engineering is concerned There are processes to be considered at each
with delivering a specified probability of not failing, lifecycle stage in order to meet required functionality.
16. Asset Management – an anatomy Version 3, July 2014, Institute of Asset Management
• Reliability applies to a specified period: usually The concept of Reliability Engineering techniques as a
time (Lower Right) toolbox is described in the following sections of this
Reliability Engineering seeks to ensure that document, but the figure below from An Anatomy
components and materials will meet the of Asset Management shows how an organisation’s
requirements during the specified period. Note objectives (e.g. improvement of reliability, elimination
that units other than time may be used, for of particular problems) “requirements” need
example distance or number of cycles. Applying techniques specific to the asset life cycle “Reliability
specified time periods to reliability introduces the Engineering tools” and capabilities to apply those
basis for reliability comparisons and allows the techniques and learn from the results “capabilities”.
use of time as a normalising denominator for
reliability comparisons. These metrics are often 3.3 Terminology
applied to maintenance strategies and the focus Reliability is a word that is often used in many
of this document is on “in service” stages of the contexts. Since this is the foundation for Reliability
lifecycle. Engineering we have presented several related terms
here so that the reader can understand the context
• Activities are restricted to operation under in which these terms are used in this guidance
stated conditions (Lower Left) document and understand what is covered and what
This constraint is necessary because it is is not covered.
impossible to design a system for unlimited
conditions. Since there have been many Resilience is a topic that is gaining more attention
approaches developed over time for different in recent years internationally, particularly in regard
aspects of Reliability Engineering this SSG to the management of natural disasters and many
attempts to relate the tools covered by this SSG countries in Europe, Australia, USA, Japan to name
to when they are applicable. a few have or are developing a critical infrastructure
resilience strategy.
AM Strategies
Continual improvement
AM Plans
AM Capabilities
Processes, resources,
competencies & technologies
Life Cycle Activities
Aquire/create
Reliability Portfolio
Engineering Tools Utilize
Asset Systems
Maintain
Dispose/Replace Assets
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
11 © Copyright The Institute of Asset Management 2016. All rights reserved.
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
Reliability Reliability is the probability that an item can perform its intended function for a specified
interval under stated conditions. A single word definition for reliability is dependability17.
From IEC 60050 – 191-12-01: probability that an item can perform a required function under
given conditions for a given time interval18.
Resiliency Infrastructure resilience is the ability to reduce the magnitude and/or duration of disruptive
events. The effectiveness of a resilient infrastructure or enterprise depends upon its ability to
anticipate, absorb, adapt to and/or rapidly recover from a potentially disruptive event19.
Redundancy Duplication of components in electronic or mechanical equipment so that operations can
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
For example, in the context of critical infrastructure, fails it impacts its reliability. When a system fails it
resilience refers to: impacts its reliability. But when a system withstands
• Coordinated planning across sectors and an event without failing how do you measure that?
networks to assess failure probability, And how severe does the event need to be? And
consequence and post-failure activities. how do you measure that? In the end resilience is
• Responsive, flexible and timely recovery measures. something that affects different industries differently
• The development of an organisational culture that and has different approaches dependent on local
has the ability to provide a minimum level of factors so metrics will likely be tied to appropriate
service during locally agreed standards.
• Interruptions, emergencies and disasters, and
return to full operations quickly. Resilience fits within the overall concept of reliability
• In this way, resilience results in the building of but is particularly valuable for dealing with severe
capacity in organisations to be agile, adaptive and and non-traditional hazards. The measurement
to improve by learning from experience22. of resilience is another topic that this SSG cannot
address but the list of topics outside of the scope
While there are metrics to measure reliability, of this SSG only serves to illustrate the breadth of
resilience is trickier to measure. When an asset Reliability Engineering as a topic. For readers who
17. Practical Reliability Tools for Refineries and Chemical Plants (Barringer)
18. Due to be superseded soon by IEC 60050-192
19. US National Infrastructure Advisory Council 2010, pg. 15.
20. IAM Knowledge Center
21. Ibid
22. Australia, Critical Infrastructure Resilience Strategy, 2010
are interested in delving deeper into the topic of The following illustrations from the University of
resilience, the University of Kansas and the European Kansas use two axes that show operational state
Network and Information Security Agency (ENISA) against service levels seen by a customer. The
have both done some interesting work on the topic first diagram shows the impact on the state of the
of resilience metrics23 24. network when an event occurs and how the event
is first remediated and then recovered from. The
second diagram compares the impacts of an event
on two networks, one that is not resilient to events
and one that is.
Operational State N
Normal Partially Severely with resilience
Operation Degraded Degraded without resilience
Si aggregate network state
Imparied Unacceptable
Unacceptable
S’3
Remediate
Imparied
S2
S3
Detect Sr
Acceptable
Defend S’1
Recover
Acceptable
S2
S0 S0 S1
Figure 6: Resilience state space and strategy Figure 7: University of Kansas framework to
inner loop25 quantify network resilience26
23. https://wiki.ittc.ku.edu/resilinets/Main_Page#ResiliNets_Wiki
24. Measurement Frameworks and Metrics for Resilient Networks and Services: Technical report, ENISA, February 2011
25. Modelling and Analysis of Network Resilience, IEEE COMSNETS, Bangalore, India, 2011
26. https://wiki.ittc.ku.edu/resilinets/Main_Page#ResiliNets_Wiki
4.2 Principles The P-F curve shows the time taken for the onset of
The principles of Reliability Engineering were failure to be detected; point ‘P’ (Potential Failure) and
introduced in the previous section, and this SSG uses which will be variable dependent on the detection
Figure 4 to relate these concepts to PAS55 and to the method that is used and point ‘F’ (Functional Failure)
tools discussed in this document. These concepts are which occurs when the system no longer meets the
simple and foundational to this topic. user requirements. The time range between P and
F, commonly called the P-F interval, is the window of
Reliability Engineering: opportunity during which an appropriate (originating)
• Deals with potential events. task to detect the onset of failure can be conducted;
• Is predicted on required function. once completed the (remedial) task can be planned
• Applies to a specified interval: usually time, but and completed and the system returned to its
may be linked to operational cycles or throughput. required performance state.
• Applies to operation under stated conditions.
P-F Interval
Condition
(Required
Point of Functional Failure
Performance)
Minimum Acceptable Performance
Failed Condition
Operating Age
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
© Copyright The Institute of Asset Management 2016. All rights reserved. 14
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
A hidden failure where an undetected failure does probabilities and consequences and part of managing
not lead to a loss of primary function in normal failures is deciding when to prevent them and when
circumstances warrants further discussion and is to tolerate them. If the cost of preventing a failure is
also covered in the section on Reliability Centred greater than the cost of the consequence then failure
Maintenance (RCM) on page 33, and specifically in may be a viable option. This is not a trivial topic
Figure 16: Decision Logic for functional failures NOT however and the timeframes for pre-failure actions
evident to operations personnel during normal duties need to be considered against the timeframes for
on page 37. A ‘hidden’ failure will not be noticed post-failure actions as well as any potential additional
under normal operating conditions and requires a impacts that a failure may cause. It is also important
second failure to occur before it is noticed e.g. you to note that in critical situations or where safety
will not know that a fire detector has failed until a is an issue there may be a zero tolerance policy to
defect that causes a fire has occurred. failure and/or injury which will dictate the way this is
managed.
Understanding that an asset failure that impacts
reliability may have its cause anywhere from design Whatever the balance of risk and reward in deciding
to a point in time immediately before (functional) the right approach the reader is urged to remember
failure, the P-F curve provides another way to view that the rates of failure over time may vary for many
the applicability of this SSG. This SSG provides an reasons and the following failure shapes provide
overview of tools that can be applied27 during the examples of temporal variations that may occur.
lifecycle of an asset including at any point on the
P-F curve. For instances where an organisation has
Initial break-in period
multiple similar assets or where replacement assets Bathtub Pattern D = 7%
27. Note that the P-F curve is drawn in relation to operating age and some of the tools discussed will be used in design i.e. before the operating phase commences.
28. Practical Reliability Engineering, Chapter 9, O’Connor
Identify need
Create or Operate and Dispose an/or • An incident has occurred or a situation exists
Acquire Maintain Replace
where action needs to be taken in the short term
Identify need Select Purchase
Install and Operate and
Sell, Recycle
and/or
that may include giving instructions to largely
Configure Maintain Replace
untrained staff so as to meet immediate
Operate and
Manage objectives.
Identify need Design Construct Commission Decommission Residual
Maintain Liabilities • The organisation is looking at how to meet longer
term objectives in terms of improvement processes.
Figure 10: Variations of asset life cycle stages
The seven basic RCM questions are:
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
29. http://www.reliabilityweb.com/art08/7_questions_rcm.htm
30. For those used to looking at risk as the product of probability and consequence/severity this is covered in the ‘CA’ of the FMECA and will likely have an associated
criticality matrix.
imagine if this was the third time in a few weeks that • Identify critical assets – the failure of these assets
a similar failure had occurred. Clearly there is reason has the most significant impact – what are the
to do a deeper investigation and look at whether critical parts, components, functions analysed?
there is a common cause affecting all of the incidents • Assess asset inter-dependability (one asset or
and if so what can be done to proactively avoid component failure can influence others) – what
them in the future. interdependencies if any exist between these
parts, function, components?
Whichever tool or combination of tools is selected, • Components classification hierarchy (e.g.
it or they must be implemented in the context of a critical, potentially critical, run to failure),
Reliability Plan which should contain the followings31: differentiate between critical, potential critical, or
• Statement of reliability requirement. run to failure parts, components or processes.
• Organisation for reliability. • Source the information and data required for
• Reliability activities to be performed and why. performing the Reliability Engineering analysis and
• Timing of major activities. ensure its quality (relevance, currency accuracy)
• Management of suppliers. when looking to events that had already occurred.
• Standards and company procedures to be used. • Data can include: components and their hierarchy
• Lesson learned feedback. and criticality; components condition; age;
• Risk Analysis/risk register. relationship between components.
• Data collection and analysis procedure. • Develop and apply a consistent convention/
• Reliability monitoring plan. terminology for defining the components and the
failure modes.
Additionally, although each Reliability Engineering
tool may target a specific type of issue, the
application of most tools will involve consideration
of the elements listed below. The intention is to give
the reader a flavour of some of the elements that
could be considered and we recognise that this list
and the approaches suggested are not exhaustive
and that several elements could be quantified as a
compound or multidimensional measure showing
e.g. quantified effects on risk to life, product quality,
company reputation etc. As an introduction to this
topic the SSG does not try to offer ways in which to
combine techniques to solve complex or interrelated
issues and this is why there is a “Derived Benefits”
section for each tool so that a decision on which tool
to select could be influenced by what else a reader
could benefit from:
• Determine the scope of the Reliability Engineering
program (Functions, condition, timeframe, why is
the Reliability Engineering tool applied, what is
the expected outcome and in what timeframe?).
• Identify the asset reliability criteria – what are the
asset reliability criteria that need to be achieved or
preserved? e.g. safety criteria, energy reduction;
level of service, regulatory conformance etc.
• Identify asset system boundaries, interface issues
– to what process or system functions or
components/parts is the Reliability Engineering
analysis tool applied to?
31. O’Connor, Chapter 15, Reliability management, Practical Reliability Engineering, John Wiley
This is the main body of the guidance. It consists of It should be assumed that different examples of good
a number of sub-sections with each section covering practice will be appropriate for different business
one of the following ‘tools’. The key factors considered contexts. For example an asset register for a ‘simple’
are the drivers for needing a Reliability Engineering company can be a spreadsheet and be fit for purpose
approach so, whereas this section includes an overview / good practice but for a major utility good practice
of each tool, it also addresses how specific tools can for an asset register will be very different.
help to address individual factors.
This section provides a high level description of the
The tools discussed in this SSG are: prerequisites, objectives and ‘spin-offs” that can be
expected from the use of each tool. In this section
FMEA Failure Modes and Effects Analysis See the following descriptions also focus on the “how”
FMECA Failure Modes, Effects and p24 as opposed to the “what” when it comes to applying
Criticality Analysis these tools. The discussion of the “how” includes:
FTA Fault Tree Analysis See • People (roles, responsibilities).
p27 • Process (inputs/outputs, key steps, controls, etc.).
• Information or knowledge.
HAZOP Hazard and operability study See • Pitfalls or challenges and how to avoid them.
p28 • Examples of good industry practice.
• Success factors are and how these can be
ETA Event Tree Analysis See
p31 achieved.
• Suitable benchmarks.
RCM32 Reliability Centred Maintenance See
p33 5.1 Selecting the right tool for the right
CRO Cost Risk Optimisation See job
p38 Table 4 lists some of tasks frequently carried out
by asset managers/engineers. The purpose of this
RCA Root Cause Analysis See table is to act like a guide to this document. It takes
p39 the tools listed in Table 3: Reliability Engineering
“Tools” and maps them to a number of business
FRACAS Failure Reporting and Corrective See
Action System p41 tasks (requirements). In this way the reader can look
for a situation that is similar to a situation they may
DRACAS Defect Reporting and Corrective See have seen and then look up which tools are most
Action System p41 suited to adding value to that task. The suggested
‘suitable tools’ are not exclusive to a specific task and
TPM Total Productive Maintenance See
each task may be suited to one or more tools and
p44
may well be used together in order to arrive at the
OEE Overall Equipment Effectiveness See desired result.
p45
When selecting which tool to use sometimes there is
LCVR Life Cycle Value Realisation See a good fit and sometimes there isn’t and this makes
p46 it difficult to prescribe which tools you should use.
Some are forward looking, some look at historical
Table 3: Reliability Engineering “tools”
32. NOTE: A search for RCM and related publications on the Internet reveals appearances in a plethora of guises - hyphenated and not hyphenated, Centered and Centred,
capitalised and lower case. In this SSG we have adopted RCM as the generic term to cover all of the above variations.
can be a powerful aid. Different tools may also following a potentially hazardous event and
provide qualitative or quantitative findings but one suited to design stage and/or pre-commissioning
thing that is always important to bear in mind is the Reliability Engineering studies.
extent of the problem being addressed. Whether
what is optimal for a single asset analysed in isolation • RCM
is still optimal for the wider system or the company A methodology used to identify the strategies
as a whole needs to be considered if line of sight which can be implemented to manage the failure
is to be maintained. Line of sight is defined33 by modes causing the functional failure of any
the IAM as “the clear connectivity between the physical asset in a given operating context
organization strategic plan (commonly called the through asking the seven basic questions of RCM.
business plan) and the on-the-ground daily activities
of individual departments (planning, engineering, •CRO
procurement, operations, maintenance, performance Often used for long term investment planning
management, etc.).” activities to model the impacts of multiyear
factors and look for optimal solutions as to when
The importance of eliminating barriers such as silos to repair, refurbish, or replace assets while
for P&L and KPIs, in addition to the sharing of data, maintaining an acceptable risk profile and
are equally essential so that business processes are balancing multiple constraints.
viewed holistically. The sharing of data provides more
input for tools and often requires support from top • RCA
management but this also helps develop line of sight. An umbrella term for various reactive
methodologies used to identify the root cause of
Ultimately the choice of tool depends on the failures after an event has occurred. The objective
challenges and/or objectives within each organisation is to make recommendations to prevent reoccurrence
but as a starting point for which tools to consider the recognising that there may be more than one root
following points provide a brief overview of each tool: cause for failure and several effective actions.
• FTA • TPM
A ‘top down’ highly-structured deductive analysis More often applied to a manufacturing rather
tool which enables defined potential events to be than services environment this is a method that
evaluated by looking backward (top down) at brings together both design and operational
the potential causes, predicting probability of aspects of professional maintenance to deliver
each contributing cause happening. real improvement in an operational environment.
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
21 © Copyright The Institute of Asset Management 2016. All rights reserved.
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
where multiple simulations are run many times in which the probability estimate for a hypothesis is
order to obtain the distribution of an unknown updated as additional evidence is acquired. In order
probabilistic entity. They are often used in physical to evaluate the probability of a hypothesis, it is
and mathematical problems and are most useful required to specify some prior probability, which is
when it is difficult or impossible to obtain a closed- then updated in the light of new data. Bayesian
form expression, or infeasible to apply a deterministic methods also include uncertainty resulting from lack
algorithm. They are particularly useful for assessing of information.
how the failure probabilities of individual items
interact to affect the failure probability of a complex The Weibull distribution is widely used in Reliability
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
system made of these items. Engineering and elsewhere due to its versatility and
relative simplicity. Depending on the values of the
Probabilistic risk assessment is a systematic and parameters, the Weibull distribution can be used to
comprehensive methodology to evaluate risks model a variety of life behaviours and a graphical
associated with a complex engineered system representation of a Weibull distribution is used in
and is used in Event Tree Analysis and Fault Tree An Anatomy of Asset Management in Section 5.3.6
Analysis (described later in this section). Probabilistic along with depictions of lognormal distribution and
algorithms, such as used in Monte Carlo analysis, exponential distributions to represent Reliability
which depend on random input have a chance Engineering.
of producing an incorrect result. Monte Carlo
simulations address this by running the simulations And the list goes on. The purpose of the last few
multiple times to obtain a distribution of probable paragraphs is to reinforce the need for strong
results. modelling skills when applying Reliability Engineering
techniques. There are several books on this topic
Bayesian inference has found application in a which get very deep into mathematical proofs and
range of fields and is a method of inference in details of various techniques.
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
© Copyright The Institute of Asset Management 2016. All rights reserved. 22
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
23 © Copyright The Institute of Asset Management 2016. All rights reserved.
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
Like any other Reliability Engineering method, the Not all failures and failure modes are critical. The
application of FMEA and FMECA is applied to: “criticality” of a failure mode is determined with
• Operations under stated conditions. consideration to the level of risk it poses if failure was
• Required system/product functions and not prevented. “Criticality” Failures are prioritised
specifications. according to how serious their consequences are,
• A specified period, which is not always time how frequently they occur; they are normally derived
(e.g. it can be km, cycles etc.) using an agreed criticality matrix.
The understating of the following factors is critical Risk can be determined in regard to safety, social,
in the application of FMEA and FMECA: loss of reputation, environment, financial considerations
• Failure35 can be defined as the point at which an and any combination of the above and more.
asset stops performing the functions for which it
was designed or the inability of an item to meet The purpose of the FMEA or FMECA is to take
a desired standard or performance. actions to eliminate, mitigate or reduce failures,
• Partial Failures36 lead to a reduced level of starting with the highest-priority ones. Failure
performance that can be considered as separate modes and effects analysis also documents current
functional failures since a reduced level of knowledge and actions about the risks of failures, for
functionality may be acceptable for a period of use in continuous improvement.
time.
• Failure mode can be defined as the way, or 5.4.3 Complimentary Techniques
mode, in which something might fail or a single Cost Risk Optimisation (CRO), RCM, HAZOP
event that may cause a functional failure.
• Failure Effects can be defined as the physical 5.4.4 Example Procedure Steps
manifestation, if any, that result from the The following table shows the steps used when
occurrence of a failure mode. conducting a FMEA and FMECA.
• Criticality can be defined as the relative measure
of the consequence of a particular failure mode Steps Description FMEA FMECA
and its frequency of occurrence.
1 Functions: Identify and
Broadly, the failure modes of an asset can be defined record what the asset
in terms of: does (not what is),
• Capacity/utilisation, refers to the effect of over-
include required standards
or under-capacity on the required level of service. of performance.
• Reliability and levels of service, where the asset 2 Functional Failures:
fails to meet required level of service. Document the way in
• Structural, refers to the physical condition of the which the asset can fail
asset and its components, as a measure of its to fulfil its Functions.
deterioration, remaining life and service potential. 3 Failure Modes:
• Cost or economic impact, where the cost to Determine what causes
maintain or operate an asset is likely to exceed each Functional Failure.
the economic return expected, or the customer’s 4 Failure Effects: Write
willingness to pay, to retain an asset. what happens if nothing
• Obsolescence, where technological changes or
were done to predict or
lack of replacement parts can render assets prevent each Failure Mode.
uneconomical to operate.
• Operator error.
5 (Failure) Criticality:
Determine how each
Each of these types of failure has distinct attributes Failure Mode matters.
that require consideration and evaluation. Specific
Table 5: FMEA / FMECA steps
information needs to be collected in order to ascertain each
mode of failure and its magnitude and consequences.
35. the system has no output at all
36. the system is operating, but below the required standard
mechanism).
• When there is an intended change to the output 5.4.8 Derived Benefits
required from a system (i.e. increase in flow or • Captures collective knowledge of team.
pressure). • Provides more visibility of reliability performance
• When there is a need to analyse consequence(s) problems.
of a failure event in an existing system. • Increases repeatability and reproducibility across
the system.
Data • Provide documented evidence of cause and effect
• System design documents – to include system and proposed solutions.
boundary definition (i.e. Piping and • Identifies the critical to quality characteristics of a
Instrumentation (P&I) diagram). system.
• Historical failure rate data for system components • Provides a record of the analysis of the logic and
(i.e. In Service Data, NPRD 2011 etc.)37. basic causes leading to a top event.
• Operating statements/user operating requirements. • Helps solve problems at their root rather than just
fixing the obvious.
Experience • Generates recommendations for reducing risk.
• Knowledge of operation and maintenance of the • Focuses upon key areas on which to concentrate
process under consideration. quality control, inspection and manufacturing
• Ability to interpret P&I diagrams. process controls.
• Knowledge of the impact of system performance • Provides early identification of design deficiencies.
change outside the process under consideration • Evaluates the probability (or rate of occurrence) of
(i.e. on other processes, on customers, on anomalous operating conditions in preparation
environments). for criticality analysis.
• Familiarity with FMEA/FMECA methodology. • Decomposes large and complex systems into
smaller more manageable parts.
5.4.6 Typical Proprietary Tools
There is benefit from using an appropriate tool 5.4.9 References
that will allow interfacing with other Reliability
Engineering disciplines, employing a write once, read 10 steps to creating an FMEA
many times approach and potentially having links http://blog.gembaacademy.com/2007/06/28/10-
to an appropriate Engineering Breakdown Structure steps-to-creating-a-fmea
(or Bill of Materials or similar). No proprietary tool British Standard BS EN 60812:2006
is necessary for a simple FMECA however, as the NOTE: Transport for London (Underground, Buses,
process can be carried out using a spreadsheet, Overground, Trams etc.) require suppliers to deliver
either produced in-house or from a readily available to this standard as do many others
free template on the Internet. Care should be taken IEC 60812
in this situation that the design of spreadsheet is Analysis techniques for system reliability –
appropriate for the analysis in question and tailored Procedure for failure mode and effects analysis
to suit the particular project. (FMEA)
37. Nonelectronic Parts Reliability Data (NPRD) provides failure rate data for a wide variety of component types including mechanical, electromechanical, and electronic
assemblies. It provides summary and detailed data sorted by part type, quality level, environment and data source.
5.5 FTA - Fault Tree Analysis 3) Continue to break down each element with
additional gates to lower levels. Consider the
5.5.1 What is it? relationships between the elements to help
Fault Tree Analysis (FTA) is another technique you decide whether to use an “and” or an
for reliability and safety analysis. Bell Telephone “or” logic gate.
Laboratories developed the concept in 1962 for the 4) Finalise and review the complete diagram.
US Air Force for use with the Minuteman system. The chain can only be terminated in a basic
It was later adopted and extensively applied by fault: human, hardware or software.
the Boeing Company. FTA is a ‘top down’ highly- 5) If possible, evaluate the probability of
structured deductive analysis tool which enables occurrence for each of the lowest level
defined potential events to be evaluated by looking elements and calculate the statistical
backward (top down) at the potential causes, probabilities from the bottom up.
predicting probability of each contributing cause
happening. It is recommended that this is carried out 5.5.5 Prerequisites
by a trained FTA practitioner if it is to be effective.
Situation
5.5.2 When is it used? Used during the design stage of a system to identify
The deductive analysis begins with a general weaknesses in that system, and to identify probability
conclusion, then attempts to determine the specific of failure. Includes consideration of human errors
causes of the conclusion by constructing a logic (i.e. design of control system for mechanical handling
diagram called a fault tree (top-down approach). equipment)
The main purpose of the fault tree analysis is to
help identify potential causes of system failures Data
before the failures actually occur. It can also be • System design documents – to include system
used to evaluate the probability of the top event boundary definition (i.e. P&I diagram).
using analytical or statistical methods. These • Historical failure rate data for system components
calculations involve system quantitative reliability (i.e. In Service Data, NPRD 2011 etc.).
and maintainability information, such as failure • List of potential failure modes (i.e. fail to start/
probability, failure rate and repair rate. stop, fail to open/shut).
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
27 © Copyright The Institute of Asset Management 2016. All rights reserved.
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
5.5.8 Derived Benefits A HAZOP study may also consider quality issues in
• Provides the timeline of events leading to the proposed design. It is advisable to cover aspects
incident/event. of maintenance operations, including isolation,
• Provides a record of the analysis of the logic and preparation and removal for maintenance since
basic causes leading to a top event. these often create hazards as well as an operability
• Exposes all the different relationships that are problem. Where there are manual operations
necessary to result in a specific (top) event. or activities, it may be necessary to analyse the
• Builds a framework for thorough qualitative and ergonomics of the whole operation or activity in
quantitative evaluation of the top event. detail.
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
© Copyright The Institute of Asset Management 2016. All rights reserved. 28
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
38. http://www.isograph.com/software/hazop/
are taken into account. Where the team considers 5.6.7 Typical Proprietary Tools
the risk to be non-trivial or where an aspect requires Isograph ‘Hazop+’. Note that training by the software
more investigation, a formal record is generated to tool supplier should be preceded by an understanding
allow the problem to be followed up outside the of the terminology, diagrams and calculations involved
meeting. The team then moves on with the analysis. in HAZOP analysis
5.7 ETA - Event Tree Analysis nuclear industry, a package is widely used which is
able to perform both ETA and FTA.
5.7.1 What is it?
ETA is a commonly applied technique used for 5.7.3 Complimentary Techniques
identifying the consequences that can occur ETA compliments other techniques such as FTA.
following a potentially hazardous event. ETA is Where ETA looks at the outcomes of the event, FTA
particularly suited to design stage and/or pre- looks at the causes of the event. Also HAZOP.
commissioning Reliability Engineering studies. Using
ETA design and procedural weaknesses can be 5.7.4 Example Procedures Steps
identified, and probabilities of the various outcomes Steps required to perform ETA:
from events can be determined. The technique
utilises a forward logic approach for the assessment Steps Description
of reliability and safety analysis and is based on
binary logic, in which an event has or has not
happened or a component has or has not failed.
1 Define the system: Define what needs
ETA origins date back to when UKAEA first to be involved or where to draw the
introduced it in its design offices in 1968 initially to boundaries.
try to use whole plant risk assessment to optimise 2 Identify the accident scenarios: Perform
the design of a 500MW Steam Generating Heavy a system assessment to find hazards or
Water Reactor. The methodology used has not accident scenarios within the system
changed much to this day and, although it was first design.
applied for risk management for the nuclear industry, 3 Identify the initiating events: Use a hazard
it is now utilised by a multiple variety of industries analysis to define initiating events.
including: 4 Identify intermediate events: Identify
• Offshore Oil.
countermeasures associated with the
• Transportation.
specific scenario.
• Gas Production.
• Shipbuilding.
5 Build the event tree diagram.
6 Obtain event failure probabilities: If the
5.7.2 When is it used? failure probability cannot be obtained use
ETA begins with an initiating event and works fault tree analysis to calculate it.
towards the final result, with the method providing 7 Identify the outcome risk: Calculate the
information on how failure can occur and the overall probability of the event paths and
probability of occurrence. It allows for multiple determine the risk.
failures to be analysed and highlights system 8 Evaluate the outcome risk: Evaluate
weaknesses. the risk of each path and determine its
acceptability.
This technique may be applied to a system early in
the design process to identify potential issues that 9 Recommend corrective action: If the
may arise, rather than correcting the issues after outcome risk of a path is not acceptable
they occur. With this forward logic process use of develop design changes that change the
ETA, as a tool in risk assessment, can help to prevent risk.
negative outcomes from occurring by providing a risk 10 Document the ETA: Document the entire
assessor with the probability of occurrence. process on the event tree diagrams and
update for new information as needed.
Although ETA can be relatively simple, software
can be used for more complex systems to build the Table 6: Steps for performing ETA
diagram and perform calculations. This ensures
human error is removed from the process. In the
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
31 © Copyright The Institute of Asset Management 2016. All rights reserved.
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
As stated on the previous page an event tree begins The example left can be shown as a flowchart below:
with an initiating event such as: component failure,
increase in temperature/pressure or the release of Initiating
Event
Fire dectected? Fire alarm works? Sprinkler works? Resultant event
N
could be used during the design phase of a fire Possible fatalities
protection system. Extensive damage
the siren. N
4
6) The control panel has identified the signal P = 0.9 Fire contained
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
© Copyright The Institute of Asset Management 2016. All rights reserved. 32
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
At the end of each branch (scenario) the probabilities 5.7.8 Derived Benefits
are multiplied together as so: • Provides a timeline of events leading to incident/
1) 0.1 x 0.3 x 0.5 = 0.015 / per year. event.
2) 0.1 x 0.3 x 0.5 = 0.015 / per year. • Accounts for timing, dependence and domino
3) 0.1 x 0.7 = 0.07 / per year. effects among various accident contributors.
4) 0.9 / per year. • Generates recommendations for reducing risk.
• Performs analysis simultaneously in the failure or
All the final properties should add up to 1 (100%). success domain.
• Facilitates decomposition of large and complex
5.7.5 Prerequisites systems into smaller more manageable parts.
Situation • Identifies end events that would otherwise not be
One or more potential ‘top level’ failure events to foreseen.
analyse (i.e. engine will not start) which could be • Facilitates increase in system dependability by
due to one or more co-existing faults which need using more appropriate activities taking into
identifying and analysing for root cause. consideration of the operating context.
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
33 © Copyright The Institute of Asset Management 2016. All rights reserved.
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
Definition of RCM
Source Description
John Moubray RCM A process used to determine what must be done to ensure that any physical
definition: 1991/1997 asset continues to do what its users want it to do in its present operating
context.
F Stanley Steve Nowlan and The term Reliability-Centered maintenance refers to a scheduled maintenance
Howard F.Heap: Dec 1978 program realise the inherent reliability capabilities of equipment
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
SAE-JA 1011 August 1998 RCM is a specific process used to identify the policies which must be
and SAE-JA 1012 Jan 2002 implemented to manage the failure modes which could cause the functional
failure of any physical asset in a given operating context
MIL-HDBK-338b-1998 – A disciplined logic or methodology used to identify preventive and corrective
Page 58 maintenance task to realise the inherent reliability of equipment at a
minimum expenditure of resources
Dependability management: RCM is a method established for achieving an initial preventive maintenance
BS IEC 60300-3-10-2001 - program, which is intended to ensure that inherent reliability and safety levels
Page 27 for equipment and structures are achieved and maintained
Dependability management: RCM is a method to identify and select failure management polices to
60300-3-11 : 2009 efficiently and effectively achieve the required levels of safety, availability and
:Application of Reliability economy of operation. Failure management policies can include maintenance
Centred maintenance activities, operational changes, design modification, or other actions in order
to mitigate the consequence of failures.
Defence Standard 00-45: The Defence Standard that provides the requirements for deriving
Reliability Centred maintenance programmes within the Ministry of Defence.
Maintenance to Manage
Engineering Failures
S4000P: International Development of preventive maintenance task requirements and a
Specification For Developing methodology to continuously improve preventive maintenance during a
and Continuously Improving product (asset) in service phase therefore being applicable during the whole
Preventive Maintenance product (asset) lifecycle.
1) What are the functions and associated RCM analyses require a multi-discipline team or
desired standards of performance of the multi stakeholder engagement, and are ideally
asset in its present operating context led by a facilitator with current experience in the
(functions)? methodologies (and proprietary tools) to be used.
2) In what ways can it fail to fulfil its functions
(functional failures)? 5.8.2 When is it used?
3) What causes each functional failure (failure Typically RCM implementations focuses on risk
modes)? reduction / mitigation while looking to realise the
4) What happens when each failure occurs inherent reliability that has been designed into
(failure effects)? the asset. The quality of RCM analysis including
5) In what way does each failure matter (failure the knowledge of the various stakeholders
consequences)? i.e. maintainer, operator, Original Equipment
6) What should be done to predict or prevent Manufacturer (OEM) etc. confidence and quality on
each failure (proactive tasks and task intervals)? historic data coupled with data quality of enterprise
7) What should be done if a suitable proactive systems play a significant part and provide the
task cannot be found (default actions)? foundation for making auditable and defensible asset
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
© Copyright The Institute of Asset Management 2016. All rights reserved. 34
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
management decisions. In addition to the failure with the relevant stakeholders. The differing
management strategy, organisations use the RCM interactions can be conducted dependent upon the
analysis for the following: pressures of an organisations resources and ability
• To understand fully the risk levels to which to support the analysis activity in a full time capacity
organisations operate their asset systems. with efficiencies to be gained through the individual
• To define the critical capital and operating engagement. The asset to be analysed can also be
maintenance requirements of all major determined dependent upon the requirement and
operational assets. strategy of the organisation i.e. complete systems,
• To optimise operational and maintenance sub systems or lower i.e. motor or pump set that is
procedures. the subject of reliability issues. They follow the RCM
• To enhance asset knowledge including the process of defining function, functional failure, failure
mechanics, criticality, and frequency of failure. modes, effects, criticality, consequence analysis and
• To identify the opportunity for operational and task analysis. The period and duration of the RCM
capital expenditure savings, by delivering analysis may vary depending on the complexity of
justifiable operational and maintenance regimes. the asset systems, operating context, capability and
• To aid the design of future capital works and the competency of the stakeholders. The selection of
procurement of plant and equipment by the intervention actions is built on decision logic
understanding the full implications of the defined in the standards, which at a high level takes
operation of individual assets and sites. into account:
• To verify and validate the operational and • Will the functional failure become apparent to the
maintenance risk and operating regimes. operator under normal circumstances if the
• To redefine the inspection strategy by asset Failure Mode occurs on its own?
group/ class.
• To identify and support the use of Condition The answer to this question tells you whether the
Monitoring. failure is hidden or evident and takes you down the
• To identify and embed Reliability in the design. appropriate side of the algorithm with more decision
• To define Asset level Inspection strategy. logic questions, for example if the answer to the
• To develop basic and advanced analytics for Asset above question is “no”:
Systems and Assets. • Does the functional failure cause loss or
• To develop the process for application of the RCM secondary damage that could have an adverse
analysis for a wider population of similar Asset effect on operating safety or lead to a serious
groups with similar operating context but with a environmental impact?
varying level of risk. • Does the hidden functional failure in combination
with a second failure /event cause loss or
5.8.3 Complimentary Techniques secondary damage that could have an effect on
Activities that support the RCM process are: operating safety or lead to a serious
• Backfit RCM. environmental impact?
• In Service Maintenance Optimisation Review. • Is there a design change which will make a
hidden functional failure apparent to the crew or
Whilst not necessarily considered a complimentary operator?
technique the development of FMEA/FMECA within
the RCM process precedes the RCM algorithm / Based on the decision logic a range of options
decision process. Also compliments CRO. ranging from:
• On Condition Tasks.
5.8.4 Example Procedures Steps • Hard Time Tasks (can also be known separately as
RCM analysis is typically facilitated / conducted by Scheduled Restoration and Scheduled
experienced practitioners who can either undertake Replacement Tasks).
a series of structured analysis sessions for an asset • Failure Finding Tasks.
system or an asset with multidiscipline teams or • Redesign (Mandatory or Desirable).
develop the analysis through individual engagement • No Scheduled Maintenance.
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
35 © Copyright The Institute of Asset Management 2016. All rights reserved.
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
Scheduled discard Scheduled discard Scheduled discard Scheduled discard Scheduled discard Scheduled discard
or restoration task or restoration task or restoration task or restoration task or restoration task or restoration task
10 Is a combination of 17 Is a scheduled failure- 20 Is a scheduled failure- 23 Is a scheduled failure-
No scheduled No scheduled
tasks technically feasible maintenance maintenance finding task technically finding task technically finding task technically
and worth doing? feasible and worth doing? feasible and worth doing? feasible and worth doing?
Yes No Yes No Yes No Yes No
Redesign may Redesign may
be desirable be desirable
Combination Redesign Failure-finding Redesign Failure-finding No scheduled Failure-finding No scheduled
of tasks compulsory task compulsory task maintenance task maintenance
EVIDENT SAFETY AND EVIDENT OPERATIONAL EVIDENT NON-OPERATIONAL HIDDEN SAFETY AND HIDDEN OPERATIONAL HIDDEN NON-OPERATIONAL
ENVIRONMENTAL CONSEQUENCES CONSEQUENCES ENVIRONMENTAL CONSEQUENCES CONSEQUENCES
CONSEQUENCES Over a period of time, the failure Over a period of time, the failure CONSEQUENCES Over a period of time, the failure Over a period of time, the failure
The failure management policy management policy must cost less management policy must cost less The failure management policy management policy must reduce management policy must reduce
must reduce the risk of the than the cost of the operational than the cost of repairing the failure must reduce the risk of the multiple the probability of a multiple failure the probability of a multiple
failure to a tolerable level consequences plus repair costs failure to a tolerable level (and associated total costs) to failure (and associated total
an acceptable minimum costs) to an acceptable
EFFECTIVENESS CRITERIA minimum
39. SAE JA1012 “A Guide to the Reliability-Centered Maintenance (RCM) Standard,” January 2002
H1
Does the failure mode, in combination with a further failure, cause a
loss of function or secondary damage which could have a direct effect Is the failure evident?
on safety or the environment?
No No No
HS2 HO2 HN2
No No No
HS3 HO3 HN3
Is a scheduled discard Is a scheduled discard Is a scheduled discard
task effective? Do the task at task effective? Do the task at task effective? Do the task at
less than less than less than
Does it reduce the the Life Is the task worth doing? the Life Is the task worth doing? the Life
Yes Yes Yes
potential multiple failure
rate to a tolerable level?
No No No
HS4 HO4 HN4
Is a scheduled failure-finding Is a scheduled failure-finding Is a scheduled failure-finding
task effective? Do the task at task effective? Do the task at task effective? Do the task at
the failure-finding the failure-finding the failure-finding
Does it reduce the Yes interval Is the task worth doing? Yes interval Is the task worth doing? Yes interval
potential multiple failure
rate to a tolerable level?
No No No
Redesign is mandatory
No scheduled maintenance No scheduled maintenance
Redesign may be desirable Redesign may be desirable
Figure 16: Decision logic for functional failures not evident to Operations Personnel during normal duties40
managers which enhances control and direction together based on specific shared characteristics or
of maintenance programs. individual asset characteristics.
• Delivers a clear rationale to the maintenance
organisation of its objectives and purpose and
TOTAL
the reason for which it is performing the 3500 IMPACT
scheduled maintenance tasks. 3000
500 PREVENTATIVE
5.8.9 References ACTION COSTS
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
In depth explanation Maintenance or Life Cycle (months/years)
RCM (3) by John Moubray/The Aladon Network/
J.R.Paul Lanthier, due August 2014. Figure 17: Balancing risk and cost to find the
optimum solution41
IEC 60300-3-11
Dependability management - Part 3-11: Application
guide - Reliability Centred Maintenance
5.9.3 Complimentary Techniques
ASD 4000P RCM, FMEA/FMECA, RCA
International specification for developing and
continuously improving preventive maintenance 5.9.4 Example Procedures Steps
One of the earliest tasks in the creation of a
CRO model is the determination of its Functional
5.9 CRO - Cost Risk Optimisation Specification. This describes the purpose of the
model, and contains high-level specifications which
5.9.1 What is it? should address the following questions:
The goal of CRO like any optimisation problem is to • What assets are to be modelled?
find the values of controllable factors determining • What are the Key Performance Indicators for the
the behaviour of a system that maximises desirable system, and how will they be modelled?
outcomes. There are different techniques that can • What are the likely data requirements, and how
be used to achieve the desired results but the two will issues with the data be handled?
principle methods are probabilistic, and Bayesian. • What costs should be calculated in the model?
While Bayesian methods are also probabilistic they • What are the possible intervention methods, and
follow more of a rules-based approach as opposed (in general) their effects?
to a simulation approach. From the input data, • What scenarios might be run?
a model is formed which predicts how the asset
base will perform into the future, both in terms Intervention is a term used in CRO to represent
of serviceability and cost. CRO analyses require proactive modification of assets or their behaviour to
a multi-disciplinary team. They are essentially manage costs and/or performance. Depending on
led by a facilitator with current experience in the the performance of the assets, varying operational
methodologies (and proprietary tools) to be used. costs will be incurred. The need for reduction of
costs leads to the suggestion of interventions (which • Generates recommendations for work and
will have their own intrinsic, direct costs), which investment based on level of desired risk.
modify the assets (or their performance) in some • Provides a better understanding of uncertainty
way. This modifies the outcomes of the performance and impact of potential risks on key factors.
predictions, and thus the operational costs incurred • Evaluates and compares multiple investment
and the need for further intervention. The loop scenarios.
continues until the optimum balance is determined
between operational and capital expenditure subject 5.9.9 References
to risk and performance constraints.
Overview
5.9.5 Prerequisites Cost/Risk Optimisation (John Woodhouse) http://
Situation www.plant-maintenance.com/articles/ Costriskop.
This tool is ideally suited to where there are pdf
conflicting alternative options (i.e. shutdown Case study
planning on continuous process plants when repair/ SASOL experiences in cost/risk optimization (SSF
renew decisions are involved). This methodology Pty / TWP Ltd) http://www.twpl.co.uk/ _assets/
is based on focusing on the financial impacts of client/case-study/SASOL%20cost%20risk%20
the competing options. Normally used for major optimization%20ERTC%202003v2.pdf
decisions (i.e. repair/replace furnace flare stack), the
methodology can be applied for minor stand-alone
assets (repair/replace pump). 5.10 RCA - Root Cause Analysis
Data 5.10.1 What is it?
• Historical failure rate data for system components RCA is an umbrella term for various different
(i.e. In Service Data, NPRD 2011 etc.). methodologies used to identify the root cause of
• Cost and budget data. failures (as opposed to addressing the symptoms
• Functional requirements (i.e. capacity/emission of the failure) with the objective being to make
limits). recommendations to prevent the reoccurrence of the
• Technical and cost data for alternatives being failure.
considered.
• FMEA/FMECA study findings (if available). RCA is a reactive method, as analysis is completed
after an event has occurred and therefore there may
Experience be more than one root cause for failure, thus there
• Previous experience with CRO process (i.e. may be several effective actions that address the root
SALVO), terminology, diagrams, calculations. cause. RCA can be considered to be an iterative
• Knowledge of required asset functionality and process and a tool of continuous improvement.
reliability.
This tool is simple to use in order to rapidly identify
5.9.6 Typical Proprietary Tools the originating (root) cause of a range of target
• Decision Support Tools (DSTL) APT software suite. actual events (i.e. flooding occurred, power failed). It
• WiLCO from SEAMS. is most often the methodology of choice when rapid
analyses are required.
5.9.7 Objectives
• Better understanding of costs and risks involved 5.10.2 When is it used?
with selected option. The purpose of RCA is to identify the factors
• Optimum analysis of selected options. influencing an asset prior to failure in order to
identify what behaviours, actions, or conditions need
5.9.8 Derived Benefits to be changed to prevent recurrence of a similar
• Screens out potential costly system modifications failure, and to identify the lessons to be learned to
(retrofits). promote the achievement of better consequences.
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
39 © Copyright The Institute of Asset Management 2016. All rights reserved.
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
© Copyright The Institute of Asset Management 2016. All rights reserved. 40
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
5.11 FRACAS/DRACAS - Failure/Data, and manage the data outputs from the individual
Recording, Analysis and Corrective Action RCA activities to allow systematic improvement of
asset or system performance as applicable.
Systems
Fracas Vs Dracas
5.11.1 What is it?
The terms FRACAS and DRACAS are frequently
The purpose of a FRACAS or DRACAS is to provide
confused, misused and interchanged unfortunately
a defined, systematic closed loop process to address
too often to the determent of equipment
non-conformance that occur within an asset or
development. Whereas FRACAS specifically
system, develop and manage a plan of action to
addresses failure, DRACAS embraces failure as well as
correct them, and if applicable ensure that these
other incidents and observations which with failures
have a positive effect on future performance.
if appropriately addressed may lead to an overall
improvement to the equipment capability.
A FRACAS/DRACAS can be seen as a process to
facilitate individual RCA activities to be conducted
Term Description
Incident Incidents are not confined to those ‘known’ faults and failures which affect the ability
of equipment to perform or be operated satisfactorily (this would be to pre-judge an
incident as a fault/failure). Other events, such as observed deterioration, may also be
reported as incidents as well as actions, such as modifications, scheduled maintenance
and the repair/replacement of faulty items. Collecting such ‘data’ in a FRACAS/DRACAS
provides important information on all occurrences and observations which arise during
a reporting period and facilitates sentencing, classification, failure and trend analysis.
Failure An event that prevents an item from performing a required function to the required
specification.
Observation Technical observations from hands-on personnel are used for keeping a record of
problems which are not defects or failures in their own right. For example, an item may
be superficially degraded but still fully capable of operating satisfactorily. These records are
useful in anticipating failures from progressive, worsening conditions. The observation of a
degradation condition may initiate a design review, even though failure has not arisen.
Non-Conformance Failure to conform to accepted standards.
Table 8: FRACAS/DRACAS terminology
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
41 © Copyright The Institute of Asset Management 2016. All rights reserved.
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
5.11.2 When is it used? ensuring that all the responsible parties fulfil their
A FRACAS/DRACAS is by its nature a process driven obligations as detailed in the procedure.
closed loop reporting system. It requires procedures • Program/Project Manager - is responsible for
for reporting, data collection, failure reporting (which ensuring the implementation of corrective actions
may come from disparate defect reporting systems) at the program level as a result of the evaluation
and trend analysis and corrective action management and identification of problems relating to
and can include the formation of an Incident Reliability issues.
Sentencing Committee, made up of the relevant • Reliability Engineer - is responsible for reviewing
stakeholders that review and allocate sentencing collected “Field Data” for quality and
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
codes that start to identify the root cause of the completeness. In addition the engineer will review
non-conformance. FRACAS/DRACAS procedures will and analysis all data from all sources to identify
guide how failures are reported, where information any potential problems and trends.
is stored, which analysis methodologies to use, when • System Maintainers - are responsible for the
they are used and how they are used, therefore actual collection of “Field Data”. The System
a FRACAS/DRACAS procedure would be specific Maintainer may be the organisation implementing
to an individual organisation’s needs. A FRACAS/ the FRACAS/DRACAS or the customer of the
DRACAS procedure should also define the individual system / product.
responsibilities at all levels, and a description of the • Production/QA Staff - would be responsible for
basic elements required. A typical FRACAS/DRACAS the collection of failure data during the
flow is detailed below: production phase.
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
© Copyright The Institute of Asset Management 2016. All rights reserved. 42
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
43 © Copyright The Institute of Asset Management 2016. All rights reserved.
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
Early Equipment
Management
Improvement
Development
Maintenance
Maintenance
Autonomous
Professional
Quality
problems.
• Provides documented evidence of cause and
effect and proposed solutions.
• Screens out potential costly system modifications
(retrofits).
• Focuses upon key areas on which to concentrate
Figure 22: The pillars of TPM43
42. This paper lists further recommended reading as a Bibliography.
43. Note diagrams showing the “pillars of TPM” often show 5,6,7,8 or more pillars depending on the source.
5.12.5 Prerequisites
Situation 5.13 OEE - Overall Equipment
TPM can be used during the development of a Effectiveness
company’s maintenance strategy to structure the
document and make the interactions between 5.13.1 What is it?
key functions more clearly defined. It can be used Overall Equipment Effectiveness (OEE) is a data
alongside the Asset Management Policy in organisations collection and analysis tool that was developed
that manage the complete asset lifecycle. within manufacturing companies to understand the
overall performance of an asset, recognising that
Data downtime is not the only cause of reduced asset
• Details of policy and procedures across the
performance.
organisation.
In simple terms it can be described as the product of
Experience
availability, performance and quality.
• Basic understanding of Asset Management Strategy.
• Understanding of TPM to support development
OEE = Availability x Performance x Quality
and implementation.
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
45 © Copyright The Institute of Asset Management 2016. All rights reserved.
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
© Copyright The Institute of Asset Management 2016. All rights reserved. 46
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
44. Breakdowns of each of these into more granular benefits can be found in the Life Cycle Value Realisation SSG Document
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
© Copyright The Institute of Asset Management 2016. All rights reserved. 48
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
6 Case Studies
A number of case study examples of good practice will be included to supplement the
guidance and to show what “good” looks like. These will be from a range of sectors.
They will give real-life examples of the theory explained in previous sections.
Case studies should recognise different operating Note from the authors: In selecting case studies
contexts, e.g.: for this SSG we essentially used existing published
• Good practice for a small asset management studies and tried to reverse engineer the techniques
organisation might be different than for a large used. We recognise that it would be more useful
organisation. if the case studies explained how/why the chosen
• For asset owners as opposed to asset operators, etc. technique was used since this would be excellent
guidance for others seeking to select an appropriate
The case study should be introduced by stating what technique. If possible it would be useful to know
it is intended to demonstrate and how it links to or what difficulties were encountered and how they
explains the guidance. This section will allow the were overcome and by including these points in
contributors to bring case studies from their own future case studies For those that write them there
organisations or experience. It will be a section would be considerable ‘added value’ if these two
that can live and be easily maintained as common points were documented during execution of a
practice matures in this area. project, and included in case study write-ups. To do
so would be a further step towards best practice.
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
49 © Copyright The Institute of Asset Management 2016. All rights reserved.
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
6.1.1 Introduction
This is a case study taken from Metronet on the,
“Renewing the Tube,” Project. With Metronet in the
process of purchasing the first new fleet of London
Underground trains for 17 years in 2009, whilst Figure 23: London Underground Train
planning to introduce another four fleets from 2012
onwards, the Fleet Asset Managers wished to lay 6.1.3 Approach
the foundations of a long term investment plan to Investigation and assessment was undertaken to aid
maximise the return on their investment and insulate decision making on when to refurbish or replace
that investment from political changes. The design major systems, such as door engines, traction motors,
life of the new fleet would be 40 years, however, wheels and brakes. The case study initially examined
existing fleets had already exceeded this design life the doors as this system failed most frequently at the
based on technology and materials developed in the highest cost, closely followed by traction and brakes.
early 1960s. These three systems accounted for over 40% of
failures by both frequency and cost. An investigation
6.1.2 Challenge was initiated into the differences between the door
Purchases of major long-life assets such as rolling system maintenance regimes on the different fleets to
stock, planes, wind turbines etc., represent identify, and then roll out, best practice.
a substantial investment by companies or
governments, from which the owner needs to ensure As an example of how Reliability Engineering can
maximum return over the asset’s economically impact design during the “in service” part of the
viable operational life. This case study discusses one lifecycle the case study looked at how the design
particular example of how a customer explored the can be made more cost effective by designing for
potential to extend the life of their asset purchase by ease of maintenance or replacement. It considered
approximately 50% and the methods they used to whether or not it would be better to create modular
achieve this. systems that can be removed for service off-line,
than allowing easy access in situ, rather than how
Significant forward planning was required as the to minimise the cost of operating which was already
known time interval for extension would be 20 years. in place, or what the selected supplier delivers after
This is the life of the bogies. After 20 years, the cast interpreting specifications.
iron develops sufficient fatigue cracking to require
replacement, so for an investment to be worthwhile, The work also considered the degree of performance
the fleet would need to achieve 60 years. recovery from planned maintenance and how best
to plan maintenance interventions through the use
Reliability Engineering looks at all aspects of the of tools such as RCM since not all suppliers have
asset lifecycle from concept through specification embraced such working techniques and many
to design, production, commissioning, operating, products were supplied prior to the development
maintaining, decommissioning and disposal/recycling. of RCM, e.g., the 67TS rolling stock was in service
In this instance, it looked at both the fleet and 11 years prior to the release of Nowlan and Heaps’
infrastructure (depot and equipment) due to the landmark study of commercial aircraft reliability and
multiple interfaces between the two. maintenance in 1978.
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
© Copyright The Institute of Asset Management 2016. All rights reserved. 50
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
In order to determine what would be required to service, longer operating hours (24hr tube), PPM
extend the life of a fleet due for delivery 2 years later, it frequency etc. could be demonstrated by the WLCM
was necessary to understand the current situation with • Better understanding of existing fleet performance
the existing fleets and what had been learnt over nearly through improved analysis and Reliability Growth
150 years of operating the London Underground. Plans
• Improved performance through the introduction
The initial stage involved gathering data and lessons of RCM
learnt from the existing fleets. Failure data was • Able to constructively influence the new fleet
gathered from the eight Metronet fleets in order to design, e.g. exchangeable wiring cartridges
identify common failures, which would allow each
fleet to learn from improvements made on other fleets
as well as provide feedback into the design process. 6.2 Case Study 2
1) Defining the problem was an important first Requirement To conduct downstream/ upstream
step to identify issues with current practices design check as part of change
and subsequently identify a root cause. management
2) The second step was to clearly define the
Intended Ensure that technical characteristics
target, i.e., what does success look like when Process of proposed asset replacement
the problem is resolved? There is a gap
are compatible with related asset
between where you are and where you wish
systems
to be that needs to be closed. Closing that
gap was the target. 6.2.1 Introduction
3) The third step was Root Cause Analysis (RCA). To ensure that the technical characteristics of
4) The fourth step was to develop a proposed asset replacement are mutually compatible
countermeasure, planned to address the gap with related asset systems the subject of this
and meet the target. design check concerned the replacement of four
5) The fifth step is to test and refine the vertical spindle horizontal hot strip coilers with two
countermeasure to achieve the target. horizontal mandrel downcoilers. Justification for
6) The final, sixth, step was to assess the replacement included:
lessons learnt and consider where else this • strip surface damage in the water-cooled twist
solution could be applied, or if there are guides no longer met increasing surface quality
similar issues that would benefit from this requirements.
solution. • vertical spindle coiler and associated asset
downtime was identified as a limitation to
6.1.4 Deliverables increasing mill throughput.
• Maintenance practices based on RCM and
condition monitoring This design check concerns the interface between the
• Designs were reviewed for ease of maintenance new equipment and the installed environment. It is
or replacement not a design or reliability check on the as-purchased
• Performance recovery predicted following service equipment or system, nor a dimensional check on
or refurbishment the physical environment. It is a compatibility design
• Decision support tool developed to determine check to ensure that critical operational interfaces
whole life cost of intervention and design options match. For example - cooling water pressure, flow,
temperature, reliability; OHT operation cycle time for
6.1.5 Results mandrel changes; strip speeds and interval times.
The following benefits were identified as a result of
the analysis: 6.2.2 Challenge
• A Whole Life Cost Model was produced as a The downcoiler system design will have been ‘frozen’
Decision Support Tool for the next 65 years against external system data supplied at the time
• The effect of changes of use, i.e. more frequent of placing the purchase and installation contract.
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
51 © Copyright The Institute of Asset Management 2016. All rights reserved.
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
6.2.3 Approach
A boundary diagram [Process Map] was constructed
from the new downcoiler system data to show all
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
To support this, additional data from both Figure 24: Example of a strip mill downcolier
operational records (to establish current system installation
performance at the identified system boundaries),
from the FRACAS system (to identify current
performance anomalies), and from the original Where values were obtained from operational
design P&ID drawings were obtained to provide records for ‘actual operational data’, the margin
design, and current performance data. Against each of error was deduced from the record sample size
‘new system data’ was entered the external system reviewed, and upper/lower boundaries calculated.
data as ‘design data’ and ‘actual operational data’. Where comparison of values (new system, existing
Each data value entered was qualified by reference to system design and existing system actual) gave rise
the source of information. Where the value had to to concern, a risk evaluation was carried out, using
be calculated from base information the calculations the ETA methodology, and included in the report
were included. together with proposals for risk mitigation.
Oxygen
Flow rate Pressure Quality
Hot rolled coils Mandrel changes
Frequency Weight O.D. Frequency Cycle time
I.D Temperature Cycle time
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
© Copyright The Institute of Asset Management 2016. All rights reserved. 52
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
53 © Copyright The Institute of Asset Management 2016. All rights reserved.
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
Triggers:
Technician Sub-part rejection
Problem identified
Engineerng planner Batch slot missed
Repeat failures
Maintenance lead Customer complaints
Customer Engineering cause for miss vs plan
Longer than one shift
Go to the problem
Clarify the
problem Clarify the problem
BREAKDOWN ACTION
(Complete 5W1H)
Identify
PLANNED ACTIVITY
countermeasure
eeting
Review M
Agenda
Review / performance
meeting
Version 1.0 20/9/2012
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
© Copyright The Institute of Asset Management 2016. All rights reserved. 54
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
Why #4
Who found the problem?
Problem statement
Countermeasures
No Countermeasure SAP Who
#1
#2
Notes/Comments
#3
#4
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
55 © Copyright The Institute of Asset Management 2016. All rights reserved.
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
6.4 Case Study 4 To help focus and prioritise activities a Criticality analysis
was performed. This process considers the impact
Requirement To predict failure modes which may to SHE, Quality and Operations in the event of a
seriously affect expected or desired system failure, plotted against the perceived reliability,
operation maintainability and complexity of that system.
Intended Identify:
Process - relevant critical assets The RCM (Reliability Centred Maintenance) studies
- potential failure modes which were completed both on site and at suppliers
would affect operations during the design phase of the project. The
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
6.4.5 Results
Of the 822 Failure Modes identified across 11 critical
process and service systems, 85% of these were
random. Traditional scheduled maintenance would
Figure 28: Sterile Manufacturing Facility have addressed only 15%.
how to work with the design team and equipment 85% 89% RCM has addressed 89%
of failure modes.
suppliers to focus not just on design quality but
considering functional reliability. % Failure modes addressed with traditional (left) and SPP5 approach (right)
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
© Copyright The Institute of Asset Management 2016. All rights reserved. 56
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
reduced by 68%. The project team are confident in We have had excellent feedback from our suppliers,
delivering what is a unique and complex asset with two of whom have adopted the process for their
reliability built in. future project work.
S. O. D.
Severity x Occurrence x Dectection
68%
We found this RCM session very useful and will use
this methodology in future also for our other machines
and especially for new developments.
(The Risk Priority Number)
Best regards,
Udo Baeuerle, Head of Technology
In addition we have trained Engineers, Technicians
and Operators who now have a better understanding
of both the nature of failure and the equipment
itself.
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
57 © Copyright The Institute of Asset Management 2016. All rights reserved.
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
6.5.3 Approach
Each of the four test programmes required a different
approach using a different set of input data.
Cold Tests
The major part of asset-related documentation
provided with the mill was provided by sub-suppliers
to the turnkey contractor, and required analysis and
reformatting to create the Equipment Register (base
Typical long products mill train
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
© Copyright The Institute of Asset Management 2016. All rights reserved. 58
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
Units
Systems
Units
Utilities
Units
Systems
Units
Wire Drawing
Units
Systems Billet Reception Sizes/Qualities
Units
and Reheating
Units Furance
Systems
Units
Round Bar
Units Sizes/Qualities
Systems
Units Mill Train and
Cooling Beds
Units
Systems
Units Re-bar
Sizes/Qualities
Units
Systems
Units Coil and Bar
Units handling
Systems
Units
line for CMMS. The preliminary analysis allowed procedures, pass/fail criteria
grouping into generic equipment types, and a • Responsibilities and the approval process
Cold Test Schedule was developed for each type, • Risks and contingencies
and reviewed against the manufacturer’s technical
documentation. Results of these tests were recorded in the
Equipment Register as part of the equipment history.
An FMEA approach was used to develop the Cold
Test schedule. Not a ‘full-blown’ FMEA as this Hot Tests
would have been conducted during the design and Piping and Instrumentation Diagrams (P&I Ds) were
manufacturing phases of the individual units, but the used to allocate each equipment item (units in the
FMEA results from these earlier phases (Functions, above diagram) into discreet systems, and a Hot Test
Functional Failures, Failure Modes) were used as Schedule developed for each system. For this phase
a basis for developing failure/success criteria for a full FMEA was carried out at ‘System’ level, with
installed equipment. Not all equipment required a emphasis on (Failure) Criticality. Initiation of some
Cold Test, and some of the tests required services system tests were dependent on completion of an
to be available (e.g. power, compressed air). Where ‘upstream’ system test, and the resulting test plan
practical this was achieved by sequential cold test was set out in the form of a dependency diagram.
planning, but in most cases required a temporary Each Hot Test Schedule was clearly documented as
service supply. Each Cold Test Schedule clearly for the Cold Test Schedules, with the addition of:
documented: • Prerequisite equipment Cold Test status
• What would and would not be tested • Prerequisite system Hot Test status (from
• Prerequisite equipment installation status dependency diagram)
• How the test would be conducted, equipment
and materials needed, test configurations and
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
59 © Copyright The Institute of Asset Management 2016. All rights reserved.
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
Results of these tests were recorded in the and Provisional Acceptance, the mill commenced on
Equipment Register as part of the system history. an operational programme under client management
and staff, with technical support by the turnkey
Load Tests contractor, aimed at achieving a steady state of
The mill train contract specification was used as a production over a three month period. There were
basis for designing the Load Tests. The objective no specific Performance Test schedules for this
was to prove that the mill could achieve the speeds, period; the production records being analysed and
tolerances and qualities required in the contract data extrapolated to support negotiation between
for each of the contracted sizes. The mill train the client and the turnkey contractor as regards the
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
was divided into three sections for the purpose of mill’s expectation of reaching the target annual mix.
designing Load Tests. Twelve finished sizes were
listed in the contractual product mix and, although 6.5.4 Deliverables
the initial plan was to carry out the Load Tests for Cold Test schedules, and test results including a
each of the three sections in sequence, they were report on any remedial actions taken.
conducted as one integrated test plan
Hot Test schedules, and test results including a report
A Load Test schedule was developed for each size on any remedial actions taken.
listed, and included upstream tests (billet handling,
reheating furnace, shears) and downstream tests Load Test schedules, and test results including a
(coil and bar handling and storage). The same report on any remedial actions taken.
format was used as for Cold and Hot Tests.
Whilst the FMEA approach was still valid, the 6.5.5 Results
consequence of failure was more aligned to failure Complete test schedules were developed by the
to meet downstream operational and contractual turnkey contractor for Cold, Hot and Load Tests. The
requirements than had been the case for the Cold contractor remained totally responsible for carrying
and Hot Tests. Provisional Acceptance of the mill out these tests, with the client’s engineer witnessing
was dependent on successful completion of all ten their conduct. Client staff, who had been trained
Load Test schedules. externally by the turnkey contractor at other facilities,
were invited to participate in the Load Tests as part
of their continued training.
Performance Tests
With the exception of the 6mm deformed bar, each
of the listed sizes contributed to a target annual
mixed finished product output of 540,000 tonnes.
On successful completion of the Load Test schedules,
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
© Copyright The Institute of Asset Management 2016. All rights reserved. 60
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
6.6.4 Deliverables
• Awareness and understanding about asset care
and reliability within the Operating Team.
Figure 29: Cartoning Machine • Increased level of technical understanding
Study
complete
(% on condition)
400 (% not 16% 10%
addressed)
300
(% traditional
200 20% time based)
100 48%
(% redesign) 6%
0 (% spares)
27/06/2011
27/07/2011
27/08/2011
27/09/2011
27/10/2011
27/11/2011
27/12/2011
27/01/2012
27/02/2012
27/03/2012
27/04/2012
27/05/2012
27/06/2012
27/07/2012
27/08/2012
27/09/2012
27/10/2012
27/11/2012
27/12/2012
27/01/2013
27/02/2013
27/03/2013
27/04/2013
27/05/2013
27/06/2013
27/07/2013
27/08/2013
27/09/2013
27/10/2013
27/11/2013
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
61 © Copyright The Institute of Asset Management 2016. All rights reserved.
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
6.6.5 Results
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
The methodology allows the selection of an Working alongside DML (now Babcock Marine)
appropriate maintenance task that addresses each LSC Group supplied the RCM expertise and
identified failure. RCM provides a fully auditable was responsible for recording the analysis and
decision-making process for determining the reporting the results. DML supplied the appropriate
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
© Copyright The Institute of Asset Management 2016. All rights reserved. 62
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
engineering expertise, allowing the RCM analysis task selection and frequency. The analysis for the
to be conducted on a team basis. This partnership crane identified 66 functions, 132 functional
brought together the following attributes: failures and 694 engineering failure modes. 180
• Knowledge and experience of the RCM process. scheduled maintenance tasks were derived, of
• Detailed knowledge of the appropriate design which 54 tasks were new tasks that had not
features, installation and commissioning. previously been carried out. Many of the existing
• Knowledge of how the crane is used, operated, maintenance tasks were not justified by the RCM
maintained and supported. analysis and auditable rationale was provided to
• Knowledge of the condition of the crane and its support their removal from the maintenance
components at overhaul, including understanding schedule. 2 ‘Mandatory Redesigns’ were
of the actual failure modes, and their effects. recommended to address engineering failure
• Specialist knowledge of constraining influences, modes with safety consequences and a ‘Desirable
e.g. Health and Safety, environmental legislation, Redesign’ was recommended to prevent
regulatory bodies, etc. economic consequences.
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
63 © Copyright The Institute of Asset Management 2016. All rights reserved.
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
short, medium and long term risks. Through a deployed on a range of military operations, and like
number of workshops and interactive working many platforms has a complex set of equipment,
sessions, the LSC Group specialist helped facilitate systems and subsystems on board.
the definition of all existing and future risks such HMS OCEAN, like the majority of Royal Navy
that they were properly defined, measured and platforms, is maintained using Reliability Centred
understood, as well as ensuring all risks had Maintenance (RCM) methodology.
mitigation action owners.
RCM is the UK Ministry of Defence (MOD) mandated
With improvements identified, the team could maintenance policy for future platforms, major
work towards driving these forward, ensuring equipment procurements and selected in-service
that both risk management and scheduling platforms. As a philosophy, RCM focuses on optimising
activity would be more coherently integrated availability through preventative maintenance –
as part of project controls. The assessment of the thereby realising an assets inherent reliability delivering
project risks and the resultant update of the Risk maximum operational availability, safely.
Register enabled senior management to make
better business decisions, adding benefit and 6.8.2 Challenge
increasing levels of delivery success. With more hours of maintenance demanded
than capacity for Ships Staff delivery there was a
6.7.5 Results requirement to reduce the maintenance burden
The RCM Analysis has identified the appropriate whilst maintaining equipment safety and delivering
maintenance tasks to ensure safe, reliable and the necessary operational capability.
economical use of the crane throughout its life.
The analysis has justified the removal of existing For HMS OCEAN, putting the required maintenance
maintenance tasks from the schedule and has theory into practice was resulting in an inability to
justified new tasks to prevent adverse safety, complete all the necessary maintenance. Factors
availability or economic consequences all in a such as training, manning issues and changing ship
structured, auditable manner with the net result operating profiles impacted on the capacity and
being greater equipment availability at a significantly the capability of Suitably Qualified and Experienced
reduced cost. (SQEP) personnel to complete the maintenance.
Furthermore, delays in feeding back experiences
The RCM-derived maintenance schedule identified from defect and repair events to optimise future
reductions of 725 man-hours per year compared to the maintenance were causing the problems to continue
current regime. When applied to a second identical and grow.
crane this gave a potential saving of £1.3 million
through the anticipated life of the cranes, with the RCM
analysis costs being recouped within the first year.
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
© Copyright The Institute of Asset Management 2016. All rights reserved. 64
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
It was recognised that an increasing number of • Operating Context Review – to check that
unreliable and high maintenance systems were far equipment is operated and analysed in the way it
outweighing the capacity of the Ships Company to had been identified to operate.
complete and that this was further impacting on the • FMECA – identifying the failure mode and effect
availability and sustainability of all ship systems. reviewed against the function and functional
failure.
6.8.3 Approach • Task Review – ensuring that the task manages the
A prioritised schedule of maintenance activity failure mode that has been identified. This would
developed using proven RCM methodology to address the issues associated with equipment and
achieve the required reduction in Ships Staff systems not currently being operated in
maintenance hours. accordance with RCM philosophy.
Working with HMS OCEAN since 1999, LSC Group A total of twenty one analyses associated with six
was chosen to help reduce the maintenance burden. high failing systems were identified and then a
The target was to achieve a reduction of >20% of revalidation of the maintenance schedules was
Ships Staff maintenance through the identification completed using the process defined above.
and completion of maintenance reviews for a range
of equipments. 6.8.6 Innovative Technologies for Condition
Monitoring
The purpose of the maintenance review was to During the review, LSC Group identified a number
investigate if the current Ships Staff maintenance of Condition Monitoring techniques and technology
burden could be reduced by addressing and that would reduce the maintenance burden on Ships
improving the process for aligning maintenance Staff. Improvements included:
intervals with anticipated failure modes – whilst • Continuous monitoring of exhaust emissions –
looking at the potential use of innovative technology this can be used to identify potential failures on
and Condition Monitoring techniques. Internal Combustion Engines through trend
analysis of exhaust gases.
6.8.4 Maintenance Analyses • Automated chlorine monitoring to test water –
Working closely with the MOD and Babcock team, reducing the need to complete routine, daily
LSC Group specialists conducted a programme of maintenance.
analyses to identify those tasks that generate high • Fire hydrants on upper deck – reduction in
maintenance load for both the Weapons and Marine lubrication process by changing the maintenance
Engineering departments. process.
It was quickly identified that 83 individual tasks The use of On Condition Tasks and Condition
(some 5% of the overall number of maintenance Monitoring would enable intervention to be planned
tasks) generated 60% of the maintenance load. before a predicted failure could occur. Furthermore
the new technologies would significantly reduce the
6.8.5 Re-usable Maintenance Support Processes time required to complete maintenance tasks.
A critical focus of the project was to ensure that
re-usable processes to support maintenance reviews 6.8.7 Deliverables
were developed using RCM methodology. LSC Group recommended a series of process,
maintenance and condition monitoring improvements
LSC Group looked to incorporate mission profiles that would significantly reduce the man hours required
and readiness states to highlight which systems and to complete maintenance.
equipment should be reviewed. • A total of twenty one analyses associated with six
high failing systems were identified.
Targeting a reduction in ship staff hours and tasks, • 83 high loading maintenance tasks identified and
LSC Group developed a rolling Maintenance Review proposed changes would reduce from ~19000 to
process for In-Service systems including: 8800 hours per year.
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
65 © Copyright The Institute of Asset Management 2016. All rights reserved.
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
• 213 changes were recommended to maintenance (RN) and Royal Fleet Auxiliary (RFA) Platforms. This
schedules. study has proven that intelligently and rigorously
• Average maintenance hours per year were applied RCM provides optimised and defensible and
anticipated to reduce by 46% (Marine Engineering maintenance to maximise equipment and platform
Department) and 14% (Weapons Engineering reliability.
department) respectively.
• Introduction of Condition Monitoring techniques
would enable intervention to be planned before a
predicted failure could occur:
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
6.8.8 Results
The overall objective of the maintenance review
was to address the balance between capacity and
capability to undertake the necessary maintenance
activity. A predicted reduction of 20% in Ships Staff
man hours required to maintain key equipment
onboard HMS OCEAN, whilst continuing to meet
the required availability levels through optimised
maintenance schedules. These maintenance
schedules continue to deliver the necessary
defensibility and auditability and meet the evolved
operational context.
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
© Copyright The Institute of Asset Management 2016. All rights reserved. 66
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
7 References
1. The term ISO 5500x is used generically in this document to refer to the family of standards comprised on
ISO 55000, 55001, and 55002 unless specific reference is made to a section within one of those standards.
2. Clause 4, PAS 55
3. Asset Management – an Anatomy Version 2, July 2014, Institute of Asset Management
4. Second Edition, March 2014
5. Updated to reflect Version 2 of the Anatomy
6. An Anatomy of Asset Management Issue 3 July 2014
7. An Anatomy of Asset Management Issue 3 July 2014
8. ISO 55000, John Woodhouse, Chairman, Experts Panel, Institute of Asset Management
9. Ibid
10. Maturity Models 101: A Primer for Applying Maturity Models to Smart Grid Security, Resilience, and
Interoperability, Software Engineering Institute, 2012, Caralli, Knight, Montgomery
11. Ibid
12. Ibid
13. https://theiam.org/products-and-services/pas55-methodology
14. https://theiam.org/products-and-services/Self-Assessment-Methodology
15. Source - Warwick Manufacturing Group
16. Asset Management – an anatomy Version 3, July 2014, Institute of Asset Management
17. Practical Reliability Tools for Refineries and Chemical Plants (Barringer)
18. Due to be superseded soon by IEC 60050-192
19. US National Infrastructure Advisory Council 2010, pg. 15.
20. IAM Knowledge Center
21. Ibid
22. Australia, Critical Infrastructure Resilience Strategy, 2010
23. https://wiki.ittc.ku.edu/resilinets/Main_Page#ResiliNets_Wiki
24. Measurement Frameworks and Metrics for Resilient Networks and Services: Technical report, ENISA,
February 2011
25. Modelling and Analysis of Network Resilience, IEEE COMSNETS, Bangalore, India, 2011
26. https://wiki.ittc.ku.edu/resilinets/Main_Page#ResiliNets_Wiki
27. Note that the P-F curve is drawn in relation to operating age and some of the tools discussed will be used
in design i.e. before the operating phase commences.
28. Practical Reliability Engineering, Chapter 9, O’Connor
29. http://www.reliabilityweb.com/art08/7_questions_rcm.htm
30. For those used to looking at risk as the product of probability and consequence/severity this is covered in
the ‘CA’ of the FMECA and will likely have an associated criticality matrix.
31. O’Connor, Chapter 15, Reliability management, Practical Reliability Engineering, John Wiley
32. NOTE: A search for RCM and related publications on the Internet reveals appearances in a plethora of
guises - hyphenated and not hyphenated, Centered and Centred, capitalised and lower case. In this SSG
we have adopted RCM as the generic term to cover all of the above variations.
33. Asset Management – an anatomy, IAM
34. Adapted from International Infrastructure Management Manual V2, 2002
35. The system has no output at all
36. The system is operating, but below the required standard
37. Nonelectronic Parts Reliability Data (NPRD) provides failure rate data for a wide variety of component
types including mechanical, electromechanical, and electronic assemblies. It provides summary and
detailed data sorted by part type, quality level, environment and data source.
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
67 © Copyright The Institute of Asset Management 2016. All rights reserved.
Reliability Engineering Version 1.1 October 2016
The Institute of
Asset Management
38. http://www.isograph.com/software/hazop/
39. SAE JA1012 “A Guide to the Reliability-Centered Maintenance (RCM) Standard,” January 2002
40. Source: NetworkRail
41. Asset Management – An Anatomy V3, IAM, July 2014
42. This paper lists further recommended reading as a Bibliography.
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
43. Note diagrams showing the “pillars of TPM” often show 5,6,7,8 or more pillars depending on the source.
44. Breakdowns of each of these into more granular benefits can be found in the Life Cycle Value Realisation
SSG Document
Purchased by Anita Pharmatrisanti - anitapharma02@gmail.com - For Personal Use Only - Not For Distribution
© Copyright The Institute of Asset Management 2016. All rights reserved. 68
With thanks to our sponsors whose support is greatly appreciated
Sponsorship helps cover the costs of design, check and production processes. Sponsors are not in any way linked to editorial control or input to the IAM’s published content.
Sponsorship of an SSG is not evidence, and must not be used to claim or imply, that the sponsoring organisation is regarded by the IAM as competent or expert in these areas
and the use of the above logos does not imply endorsement in any way by the IAM.