Professional Documents
Culture Documents
De Sanctis, Paciarotti, Giovine - 2016 - An Overview On Reliability, Availability, Maintainability and Supportability (RAMS) Engineering
De Sanctis, Paciarotti, Giovine - 2016 - An Overview On Reliability, Availability, Maintainability and Supportability (RAMS) Engineering
Access to this document was granted through an Emerald subscription provided by emerald-
srm:404576 []
For Authors
If you would like to write for this, or any other Emerald publication, then please use our Emerald
for Authors service information about how to choose which publication to write for and submission
guidelines are available for all. Please visit www.emeraldinsight.com/authors for more information.
About Emerald www.emeraldinsight.com
Emerald is a global publisher linking research and practice to the benefit of society. The company
manages a portfolio of more than 290 journals and over 2,350 books and book series volumes, as
well as providing an extensive range of online products and additional customer resources and
services.
Emerald is both COUNTER 4 and TRANSFER compliant. The organization is a partner of the
Committee on Publication Ethics (COPE) and also works with Portico and the LOCKSS initiative for
digital archive preservation.
efficient maintenance method it is necessary in order to offer methods and criteria to select the rights
maintenance strategies keeping in to account the environmental, safety and production constrains.
Design/methodology/approach – The paper provides an overview of reliability centered
maintenance (RCM) and reliability, availability, maintainability methodologies and an integration of
the two methodologies in a particular case study in the oil and gas sector.
Findings – This paper suggests an improvement of the well-established RCM methodology applicable
to industries with high priority level. It is proposed an integration between a reliability analysis and an
availability analysis and an application on the offshore oil and gas industry.
Practical implications – The methodology provides an excellent tool that can be utilized in
industries, where safety, regulations and the availability of the plant play a fundamental role.
Originality/value – The proposed methodology provides a practical method for selecting the best
maintenance strategy considering the equipment redundancy and sparing, the asset’s performance
over long time scales, and the system uptime, downtime and slowdowns.
Keywords Oil and gas, Reliability availability maintainability, Reliability centred maintenance
Paper type Case study
1. Introduction
An old-fashioned and uneconomic point of view considers maintenance simply a cost
and a necessary evil, rather than a profit contributor and provider of business value.
But, especially in the oil and gas sector, the choice of the right maintenance strategies,
i.e. the best mix of corrective, preventive, condition based and proactive maintenance
for the equipment, is fundamental.
It is true that accidents inevitably accompany offshore development, but when
they occur, they cause environmental, human and economic disasters more severe than
in any other sector.
The causes (equipment failure, personnel mistakes and extreme natural impacts), the
scale and the severity of the accidents’ consequences are extremely variable (Patin, 1999).
Offshore-technology.com (2014) profiles the world’s worst offshore oil rig disasters
(Alexander L. Kielland, North Sea, Norway; Seacrest Drillship disaster, South China
Sea, Thailand; Glomar Java Sea Drillship disaster, South China; Bohai 2 oil rig disaster,
China; Enchova Central Platform disaster, Brazil; Mumbai High North disaster, Indian
Ocean; Usumacinta Jack-up disaster, Gulf of Mexico; C.P. Baker Drilling Barge disaster,
International Journal of Quality &
Reliability Management Gulf of Mexico) and identify the Piper Alpha disaster ( July 1988) in the North Sea,
Vol. 33 No. 6, 2016
pp. 852-880
UK, as the worst offshore oil disaster in history, after 167 people lost their lives
© Emerald Group Publishing Limited
0265-671X
(www.offshore-technology.com/features/feature-the-worlds-deadliest-offshore-oil-rig-
DOI 10.1108/IJQRM-02-2015-0026 disasters-4149812/).
Nevertheless, in the recent April 20, 2010 about 60 km southeast of the Louisiana Integration
coast the largest environmental disaster in US history has occurred due to explosion between RCM
and subsequent fire on the Deepwater Horizon semi-submersible Mobile Offshore
Drilling Unit. Deepwater Horizon was an exploratory drilling and it had been troubled
and RAM
by unusually frequent and forceful contact with explosive natural gas deposits.
The resulting fire could not be extinguished and, on April 22, 2010, Deepwater Horizon
sank, leaving the well gushing at the seabed and causing the largest offshore oil spill in 853
US history. The accident resulted in 11 deaths, huge environmental impact and
significant financial loses (Kaufman, 2010).
The research showed that primary factors that allowed disaster to happen were
poor management and wrong decision making (Høyland and Rausand, 2009). Moreover
that disasters show how rapidly reputation and finances could be lost and therefore
that it is crucially important to invest time and money in assurance of safety of
engineering projects, especially in the oil and gas offshore sector.
Downloaded by KU Leuven Libraries At 00:13 07 June 2019 (PT)
In light of that disasters, this article want to highlight how the importance
of preventive maintenance (PM) increases in the oil industry in case of offshore
installations because failure could bring to environmental disaster, human and huge
capital losses.
Reliability centered maintenance (RCM) is a widely accepted methodology that has
been available in the industry for over 30 years, and has proved to offer an efficient
strategy for PM optimization (Desphande and Modak, 2002; Fernandez et al., 2003).
Indeed the goal of RCM is to identify maintenance types and intervals only in relation
to the actual critical features of the various components, preserving the most
important equipment (as pumps, vessels and valves) functions with the required
reliability at the lowest maintenance cost (SKF Reliability Systems, 2008; Rausand
and Vatn, 2008).
But in the actual context activities levels are increasing, causing sector inflation and
exploration, development and production costs are rising. Complicating these
changes are the rising expectations of regulators, shareholders and communities, all
of which are demanding more from oil and gas companies. Regulations are becoming
increasingly complex and regulators more proactive, requiring a greater level of
attention to ensure compliance (McCreery et al., 2013; Fernandez et al., 2003).
So, in this context oil and gas companies need to focus on operational excellence and
unplanned shutdowns due to unexpected failures, typically longer and more costly
than planned shutdowns because of their complexity and the difficulty in reaching
them, are not permitted. So the availability of the system achieves a fundamental role
(Rajpal et al., 2006; Hussin et al., 2013).
Hence the importance of an analysis that finds out quantitatively system
availability, reliability and equipment maintainability and that is able to identify the
critical subsystems and the equipment which mainly influence system performance.
Indeed the objective of reliability, availability, maintainability (RAM) analyses is to
increase a system’s ability to operate normally, for a long time, without any failures.
Integrating the RAM and RCM analysis could help to reach an high efficiency of the
system and could contribute to optimize maintenance policies in order to reduce critical
issues and consequently increase system operating efficiency and availability. Indeed
in the present paper we integrated the two methodologies in the oil industry in case of
an offshore installations and we paid attention to procedures and legislation followed in
the maritime offshore sector (Carretero et al., 2003; Vatn, 2008; Turan et al., 2011).
Moreover, according to Conn et al. (2010), we supported standardization and provide
IJQRM value to strategic decisions and opportunities for the operational and maintenance
33,6 units to carry out modifications and projects that ensure safe operations and enhance
process capacity and reliability.
The paper is structured in six sections. After this introduction, Section 2 proposes
review of RCM and RAM methodology. Section 3 explains the methodologies adopted
and provides a detailed description of each step. Section 4 presents the case study and
854 Section 5 shows the main results of the study. The paper is concluded in Section 6 with
a brief discussion of the study and future development.
2. Literature review/background
2.1 RCM
The RCM approach is the main tool to improve the cost-effectiveness and control of
maintenance in all types of industries, and hence to improve availability and safety
(Vatan et al., 2011). The basic criteria of the RCM process are three technical standards:
Downloaded by KU Leuven Libraries At 00:13 07 June 2019 (PT)
IEC 60300-3-11 (ICE, 1999), SAE-JA1011 (SAE, 1999) and SAE-JA1012 (SAE, 2002).
These are the RCM standards typically used. The methodology has its origin in
the aircraft and aerospace industry and it was also early applied by the military and the
nuclear industry. The RCM methodology has extensive application also outside these
industries, as in rail work networks (Carretero et al., 2003; Vatn, 2008), metallurgical
industries (Deshpande and Modak, 2002; Bookless and Sharkey, 1999), coal mines
(Deshpande and Modak, 2002), chemical and processing, medical (hospitals), sawmill,
shipping, transportation and water distribution (Selvik and Aven, 2011). In addition to
the above, the RCM has been established in the offshore oil and gas industry too.
Serratella et al. (2007) discussed about RCM applications offshore, for the machinery
and rotating equipment of ships.
Mokashi et al. (2002) also presented the introduction of RCM on board ships and
showed how it can influence the ship management and the work of seafarers.
The OREDA (2009) handbook was another effort originating from a few oil
operators to enhance the maintenance and operation of offshore structures by
collecting data for the topside and subsea equipment.
2.2 RAM
RAM is a concept with a history very similar to that of RCM: its origins are in the 1970s,
and its intent was to assure availability for the military. But in the actual context,
keeping in view the competitive environment, overall operating and production costs
the significance of system’s RAM analysis has greatly been enhanced (Rajpal et al.,
2006). Lots of papers are available in literature which suggest that RAM analysis helps
the plant personnel to plan and adapt suitable maintenance strategies (Rajpal et al.,
2006; Sharma et al., 2008; Saraswat and Yadava, 2008). Indeed RAM is an engineering
tool which can be used to predict and improve the performance of the system. It allows
to identify areas within the system or process where improvement actions can be
initiated. The information obtained from analysis will help the management in the
performance improvement of the system (Calixto, 2012).
Indeed, the RAM model provides a quantitative basis for judging the acceptability
of the design, and the tools needed to facilitate justification and prioritization activities
for all future reliability and availability decision making. Factors that affect RAM of a
repairable industrial system include machinery, operating conditions, maintenance
conditions and infra-structural facilities (Komal Sharma and Kumar, 2010).
3. Methodology Integration
In this paper an integrated methodology between RAM and RCM is illustrated for an between RCM
offshore installation in the oil industry.
Indeed the RCM analysis focalize its attention on the maintenance strategy selection
and RAM
and definition, while the RAM analysis through the reliability block diagram (RBD)
structure defines the logical interaction of failures within a system.
RAM analysis concentrates more on quantitative indexes than RCM, in 855
order to enable engineers to rank various design options and scenarios, evaluate
the impact of future or proposed changes to current operations, allow modeling
of spares and maintenance resource constraints, predict system uptime, downtime
and slowdowns and provide a strategic view of the asset’s performance over longer
time scales.
For that reason a RAM analysis permit a continuous improvement and adaptation
throughout a power plant’s life. This helps to prevent confusion, obsolescence, and,
Downloaded by KU Leuven Libraries At 00:13 07 June 2019 (PT)
ultimately, disregard.
In the present paper an integration of the two methodologies is reported.
In particular here below we will describe in detail the integration of RCM and RAM
analysis. A RAM analysis will be developed in accordance with the results of
maintenance analysis and the criticality analysis (CA). The integration of RAM will
enable engineers to rank various design options and scenarios.
Moreover through the calculation of system uptime, downtime and slowdowns we
will have a strategic view of the asset’s performance over longer time scales that will
influence the maintenance plan definition.
In Figure 1 the entire methodology exploited in the case of study is illustrated.
A data collection is the first step of the analysis, where all the necessary
documentation are collected (pipe and instrument diagrams (P&ID), product flow
diagrams (PFDs), unit functional diagrams (UFDs), main equipment list (MEL), valve
list, instrument list, overall description, emergency shut down (ESD) philosophy and
RAM data collection). To follow, a failure mode and effect analysis and a CA are
implemented in order to identify critical items considering their environmental, security
and production impact. Considering the different level of criticality maintenance
policies could be supplied through a maintenance Strategy selection and a
maintainability analysis (MA) will define their lasts.
As is shown in Figure 1 the RAM analysis carried out downstream the CA and the
maintenance analysis. In fact in big dimension plant it would make sense implement
the RAM only on critical section of the plant and for critical item. Moreover items to
Redesign
RAM
Failure Review Analysis
RAM Data Mode and Criticality
Collection Effect Analysis Figure 1.
Analysis Maintenance Maintenance RCM procedure
Maintenance adopted in the
Strategy Plan
Analysis case of study
Selection Definition
IJQRM include in the RBD will be characterized by their own failure rates and the mean time to
33,6 repair (MTTR) coming from the maintenance analysis.
The pieces of equipment that are the major contributors to the production
unavailability of the project could be identified, to provide, if necessary and applicable,
input and recommendations to the engineering design team in order to improve and
enhance the availability of facilities. Moreover the results of the RAM analysis influence
856 the definition of the maintenance plan, indeed considering the efficiency of the system, it
could confirm the maintenance polices chosen or it will oblige to reconsider the
maintenance strategy of one or more items.
The paragraph below will detail each phase of the analysis explaining the procedure
and the outputs of each steps.
a RAM data collection in the petro chemical field the necessary documents are:
• overall description;
• P&ID;
• PFDs;
• UFD;
• MEL;
• instrument list; and
• ESD philosophy.
A RAM data collection collects all reliability, availability and maintainability data for
all main equipment composing the system under analysis. This activity has to be
continuously modified and integrated when new main equipment types are highlighted
and identified. The reference RAM data bank records data refers to main equipment
templates, that are generic descriptions of the actual main equipment belonging to the
installations object of the study. Reliability is a decisive aspect of equipment installed
on offshore installations. Equipment faults alone, or in combination with other factors,
have been the cause of a number of accidents.
In the oil and gas industries in order to use accurate and reliable data, and being not
always available specific field data, the reliability data used in the analysis will be kept
from a dedicated database available under the heading of OREDA (2009), Offshore
Reliability Data Handbook. OREDA has established a comprehensive data bank with
reliability and maintenance data for exploration and production equipment from a wide
variety of geographic areas, installations, equipment types and operating conditions.
It is considered the reliability of important equipment under “real life” operational
conditions (OREDA, 2009 handbook).
The equipment’s reliability data which are not available in OREDA data bank, will
be referred to European Industry Reliability Data Bank 3rd edition 1998 (Procaccia et al.,
1998), and ANSI/IEEE Std 500-1984 P&V (1984) Reliability data for Nuclear power
generation stations.
Reliability data collection output has to be issued in a table format as in the example
of a centrifugal pump illustrated in Table I, containing defined field format in order to
allow a standard data control/merging/integration.
Item λ MTTF ART Reference data
Integration
Item type Failure mode (×106h)a (h)b (h)c page Note between RCM
and RAM
Centrifugal PA External leakage – utility 5.35 186,916 15 OREDA (2009, –
pump p. 146)
External leakage – process
medium 4.93 202,840 25
Fail to start on demand 4.53 220,751 7.8 857
Spurious stop 9.06 110,375 13 Table I.
Vibration 5.83 171,527 27 Reliability data
Notes: Failure rate; mean time to failure; cactive
a b
repair time collection output
The defined fields of the reliability data collection template will be composed by:
Downloaded by KU Leuven Libraries At 00:13 07 June 2019 (PT)
• Item, it should contain the system where the item is located, the tag number
and the Item description. For privacy reason we summarize that section under
the name of item.
• Item type, where the analyst has to insert the reference labels used in P&IDS and
PFDs for all types of components belonging to the main equipment type category
under investigation.
• Failure mode, where the main failure modes of reference for the main equipment
under investigation are set. In OREDA failure modes are defined for each
equipment class. The severity class, which describes the severity of the failure for
the main function of the equipment unit, is defined in three levels: critical,
degraded and incipient. In order to be more conservative, the failure rate and
active repair time (ART) considered are the mean value of critical ones. OREDA
defines the critical failure mode as a failure which causes immediate and
complete loss of equipment unit’s capability of providing its output(OREDA,
2009 handbook).
• Failure rate, for each single item it is the frequency of failure related to a specific
failure mode. It is expressed in failures per hour.
• Mean time to failure (MTTF), it is the reciprocal of failure rate (explained in the AA).
• ART, it is the portion of downtime during which one or more technicians are
working on a system to effect a repair.
• Reference data page of the used database and note.
The first part of the activity consists in identifying functional relationship between
items, tags and item description, associating to each item the related family listed in the
reliability data collection.
For each failure mode the detection method and some preventive or mitigation
actions are also envisaged.
The approach is based on a “single failure” criteria, i.e. it assesses each failure
considering the others equipment functioning in normal conditions.
The analysis is performed by considering the relevant working phases of the
system. Moreover different operating Phase will be considered for the estimation of
the failure effects will be: start up, production/running phase, ESD, and planned
shut down.
IJQRM 3.2 Failure mode and effects analysis (FMEA)
33,6 FMEA is essentially a systematic brainstorming session aimed at finding out what
could go wrong with a system or process. The essence of the FMEA technique is to list
for each system component the potential effect of each failure on the rest of the system
(Rausand and Vatn, 2008).
The FMEA analysis consists in describing for each failure mode the local effects, the
858 sub-system effects and the system effects in terms of production, environment and
safety and defines for each failure mode the loss of production in terms of percentage of
loss of capacity of the system, not taking into account the duration of the loss.
The purpose of this analysis is to identify potential system weak points (failure modes)
and to determine their effects on the process considering the three aspects of
production, environment and safety.
At the end of the above steps, the FMEA analysis shall be properly documented in a
report summarizing the results.
Downloaded by KU Leuven Libraries At 00:13 07 June 2019 (PT)
FMEA output has to be issued in a table format containing defined field format in
order to allow a standard data control/merging/integration.
The FMEA report will include the following field:
• contractor system;
• main tag number;
• tag number;
• failure mode;
• failure mode description (textual description of failure mode);
• operating phase;
• effects at facility functional unit level;
• effects at contractor system level; and
• percent loss of production (percentage of reduction of capacity at system level,
at this stage the loss of production does not take into account the duration of
the loss).
FMEA regards frequency, consequences and detection keeping a qualitative
approach based on safe policies. A variation to this approach is called FMECA,
failure mode and effects CA.
3.3 CA
We found more useful in our study, separate the FMEA from the CA. Indeed the
criticality is an index that is defined by failure cause, frequency, consequences and
detection (Calixto, 2012).
The aim of the CA is to evaluate the item’s criticality considering each failure mode
in terms of the worst potential consequences on the upper level of the system.
A severity category for each parameter (asset, safety, environment) is assigned
according to the tables depicted in Table II.
Moreover the severity value is assigned taking in consideration the existing
compensating provisions set to prevent the effect of the failure mode and to mitigate its
effects (e.g. The presence of sparing/ redundant equipment having equal capacity shall
be shown).
Severity
Integration
value Description between RCM
and RAM
Safety
0 No effects
1 No effect on work performance but may involve slight injury or health effects (e.g. first
aid case) and employee returns to work the same day
2 Low health effects or slight injury (lost time injury) – affecting work performance, such 859
as restriction to activities (restricted work case or occupational illness) or a need to take a
few days to fully recover (lost work case), e.g. food poisoning
3 Major injury or health effects (including permanent partial disability and occupational
illness) – affecting work performance in the longer term, such as a prolonged absence
from work. Irreversible health damage without loss of life, e.g. noise induced hearing
loss, chronic back injuries, sensitisation, hand/arm vibration syndrome and repetitive
strain injury
4 Permanent total disability or one to three fatalities – from an accident or occupational
Downloaded by KU Leuven Libraries At 00:13 07 June 2019 (PT)
illness. Irreversible health damage with serious disability or death, e.g. corrosive burns,
heat stroke, cancer (small exposed population)
5 Multiple fatalities – from an accident or occupational illness (e.g. chemical asphyxiation
or cancer) large exposed population
Production
0 No effects
1 Slight Damage – No disruption to operation (cost less than 100,000 USD)
2 Minor damage – brief disruption (cost less than 1,000,000 USD)
3 Local damage – partial shutdown (can be restarted, costs up to 10,000,000 USD)
4 Major damage – partial operation loss (2 weeks shutdown, cost up to 100,000,000 USD);
major inquiry for the cost of damage
5 Extensive damage – substantial or total loss of operation (costs in excess of 100,000,000
USD); Extensive inquiry for the cost of damage
Environment
0 No effects
1 Slight effect – minor environmental damage. No financial consequences
2 Low effect – sufficiently large contamination or discharge to damage the environment,
but no lasting effect. Single breach of statutory or prescribed limit, or single complaint
3 Localized effect – limited discharges affecting the neighborhood and damaging the
environment. Repeated breaches of statutory or prescribed limit, or many complaints
4 Major effect – severe environmental damage. The company is required to take extensive
measures to restore the damage environment. Extended breaches of statutory or Table II.
prescribed limits, or widespread nuisance Classifications of
5 Massive effect – persistent severe environmental damage or severe nuisance extending severity values in
over a large area. Loss of commercial or recreational use or nature conservancy resulting terms of safety, asset
in major financial consequences for the company. Ongoing breaches well above and environment
The detection method of a failure event describes the existing means and methods by
which the effects of a failure can be detected. Here below a non-exhaustive list of the
detection methods is provided:
• Functional testing: failure discovered by activating the intended function and
comparing the response against a predefined standard. Includes both functional
and capacity tests. Most relevant for failures on stand by/passive systems.
• Continuous condition monitoring: failures revealed during a continuous, automatic
checking for a predefined failure mode or on-line process monitoring. The failure is
IJQRM revealed by some instrumental monitored value being exceeded, normally
33,6 observed in the control room either by automatic detection (alarm or trip) or by
manually observed deviation.
• Test: performed while the system is operational.
• Inspection: failure discovered during periodic internal or external inspection
860 (e.g. visual inspection, non-destructive testing, etc.).
• Casual observation: casual observation during routine or casual operator checks
mainly by senses (noise, smell, smoke, leakage, appearance, local indicators, etc.).
• Observation method unknown or not specified.
• Corrective maintenance: failure observed during expected or unexpected
corrective maintenance (initiated by another failure).
• Production interference: failure discovered by production upset, reduction, etc.
Downloaded by KU Leuven Libraries At 00:13 07 June 2019 (PT)
Happens Happens
Safety
Assets
Could happen Has occurred Has occurred
Severity
Environ.
Reported for several several
in E&P at least once several times
E&P industry times/y in times/y in
industry in Company in Company
Company one location
approach
Figure 2.
and RAM
between RCM
Risk matrix
861
Integration
IJQRM They are located in one of the three different colored area of the matrix and they could
33,6 be classified as:
• Tolerable risk: no further impact reduction measures are necessary. Monitor
performance and manage for continuous improvements.
• Compulsory impact reduction required: risk is tolerable when appropriate control
862 measures have been identified and implemented.
• Intolerable risk without detailed justification: it does not satisfy initial screening
criteria. Additional risk reduction measures are required. It is justifiable if compliance
with ALARP principle and individual and societal risk criteria are demonstrated.
The overall criticality (CFM) related to each item failure mode will be estimated as the
worst of the three criticality’s indexes set above: CFM ¼ max (ICS; ICA; ICE).
It represents the overall impact of the failure mode considering all types of effects
(safety, asset damage and environmental damage).
Downloaded by KU Leuven Libraries At 00:13 07 June 2019 (PT)
The criticality assigned to each item depends on the impact of their failure modes so
the criticality shall be assumed to be the worst effect of the pertaining failure modes:
CI ¼ min (CFM).
The overall criticality index can assume three values:
• Criticality 1: intolerable without detailed justification.
• Criticality 2: compulsory impact reduction required.
• Criticality 3: tolerable risk.
Once the CA is completed, the maintenance strategy selection and availability analysis
could be started.
3.4 Maintenance strategy selection
Efficient maintenance of offshore structures is one of the priority development areas of
the offshore industry world-wide (Kam, 1989). The maintenance strategy selection is a
crucial point and it is performed in order to reduce the routine maintenance tasks to
sustainable minimum, without affecting the plant performance, product quality, safety
or environmental integrity. Indeed the traditional view of maintenance is based on
belief that all failures are bad and must be prevented. However, this is practically
unrealistic from two angles. First, from purely technical point of view it is often not
feasible to prevent failure. Second, even if one could prevent or anticipate all failures it
would be cost prohibitive (Desphande and Modak, 2002). The maintenance strategy
Selection is a decision-making procedure where a suitable flow chart allows to
determine which maintenance strategy is applicable, starting from item criticality
index (FMEA and CA output).
3.5 MA
The aim of MA is to estimate the time spent to repair and maintain the Item considering
the normal operating scenario and the maintenance’s strategy selected through MSS
activity. The MA aims to determine the total maintenance time (MT) required by each
Item. The MA’s outputs allow calculating the item’s MTTR in order to complete the
reliability data used as input in the RAM study. The MTTR is a measure of how long,
on average, it will take to bring the equipment back to normal service ability when it
does fail (Carazas and Souza, 2010).
The purpose of maintenance management is to reduce the adverse effects of
breakdown and to maximize the production system availability at a minimum cost
(Bazrafshan and Hajjari, 2012).
MT is identified by two main categories:
(1) corrective maintenance time (CMT); and
(2) preventive maintenance time (PMT).
3.5.1 CMT. Corrective maintenance (CM) is a part of maintenance carried out after
fault recognition and intended to put an item into a state in which it can perform a
required function. CMT, expressed in the following equation, is the time required to
perform a CM action and, taking into consideration experts’ field experience, it is
composed by the following contributes:
CMT ¼ FLDT þ LD þ TOST þ ART þ BIST þ AOT (4)
where:
(1) FLDT, fault location/diagnosis time. Due to the offshore equipment location, the
FLDT is assessed to be 4 hrs considering the time spent to access to the item, to
IJQRM locate where the failure has occurred and to identify the causes. FLDT
33,6 is assumed to be equal to 0 hr for continuous monitoring or if the failure is
detectable by the control system. FLDT is considered negligible in case of
failures occurring on loops not provided of valves (failures are only related
to the electronic elaboration of measure).
(2) LD, logistic delay. Considering the high criticality of gas production, it can be
864 assumed that each failure is considered as critical failure for the system,
therefore the logistic delay due to crew intervention (located in nearest
platforms) comprehensive of the time necessary to retrieve material (from
on-site warehouse), has been considered to be 4 hrs for offshore .
(3) TOST, take out of service time, the TOST time, expressed in the following
equation, is evaluated taking into account the following contributes:
• FT ¼ fluid temperature: 2 hrs for hot ( W 50°C) fluid; 0 hr for cold fluid;
Downloaded by KU Leuven Libraries At 00:13 07 June 2019 (PT)
(4) ART: active repair time. The items’ ARTs comes from the referenced reliability
database and reported in the reliability data collection report.
(5) BIST: bring into service time (in the following equation). The items’ BIST is
evaluated considering the following contribute:
• ITR ¼ installation, test and restoring: reconnecting, testing and ramping-up
the equipment to operating conditions, 8 hrs for large equipment and 4 hrs
for small ones;
• TI ¼ tracing and insulation: restoring insulation and tracing, 4 hrs; and
• EC ¼ electric connection: 1 hr to reconnect the item’s instrumentation:
BIST ¼ ITR þ TI þ EC (6)
(6) Activities overlapping time (AOT), activities performed at the same time.
3.5.2 PMT. PM is that part of maintenance periodically carried out in order to preserve Integration
the good functioning and prevent the failure occurrence. between RCM
PMT is the time required to perform a PM action and it is composed by the
contributes reported in the following equation:
and RAM
The following fields will be included in the MA report: contractor system, main tag
number, tag number, failure mode code, failure mode description, LD, FLDT (only for
CM), TOST, BIST, ART (only for CM), APMT (only for PM), CMT (only for CM), PMT
(only for PM) and notes.
Rt
ðlÞdt
R ðt Þ ¼ e 0 (8)
The MTTF, which is the mean value of the failure time, when the failure rate is
constant (λ) can be expressed by the the following equation:
Z 1 Z 1
1
M TTF ¼ Rðt Þdt ¼ elt dt ¼ (9)
0 0 l
Particular attention should be pose on how system reliability are represented. Indeed
series or parallel configurations are used for dependent and independent effects.
When the system reliability includes a high number of blocks the system reliability
is represented by the the following equation. In case of a series configuration:
Y
n
R i ðT Þ ¼ Ri ðt Þ ¼ Rðt Þn (10)
i¼1
Potential production
The items to include in the RBD will be characterized by their own failure rates and MTTR.
It is assumed that MTTR values, which represent the maintainability of a system,
are equal to the results preventive or CMT defined in maintainability report.
So, inherent or technical availability (UNI EN ISO 14224) of the system is given by
the following equation:
M TTF
A¼ (13)
M TTF þ M TTR
In a simple way, availability is controlled by the two parameters MTTF and MTTR.
Moreover the software will provide the mean time between failure of every single
item and of the units, through the following equation:
M TBF ¼ M TTF þ M TTR (14)
It is used to identify the most critical section and the most critical item of the system.
The detailed results provided by software, as the availability percentage trend
through the years and the yearly downtime days, are graphically displayed, giving a
lifecycle screening perspective. In the worst case RAM results recommend a redesign,
in order to improve and enhance the availability of facilities. Otherwise it would bring
to the acceptance or review of the selected maintenance strategy.
4. Case study
The proposed approach illustrated in Section 3 is applied to a context of a particular
offshore installation in the oil industry. RCM is implemented in order to ensure that the
maintenance strategies on company assets deliver the process uptime, safety assurance
and environmental integrity required by the business. The aim is reducing the total
production downtime and the actual cost of the maintenance activities achieving the
best-planned maintenance policies for equipment. We suppose that the analysis is
implemented on a project in phase of detailed design.
Indeed RCM has a particular relevance in the case of offshore installations because,
unplanned shutdowns due to unexpected failures are typically longer and more costly
than planned shutdowns. Because of the complexity of an offshore installation and the
fact that maintenance is generally outsourced, and hence requires considerable up-front
planning, unplanned maintenance activities normally result in much more production
loss than a similar planned one. Additionally, because of contracts with oil and gas
clients, unplanned shutdowns can imply financial penalties (Conn et al., 2010).
The plant considered in the analysis produces hydrocarbons from the offshore field;
gas will be dedicated to feed an onshore power plant in order to achieve its full electric
power production. Associated hydrocarbons liquid will be valorized by means of
dedicated facilities and they will be delivered to an existing oil export terminal.
Moreover the onshore plant, the multiphase sea line to onshore plant, pipeline
for hydrocarbon liquids export are excluded from the study, because the aim of the
study is to focus on offshore installations. The electric distribution sections
are modeled as always available as the architectures of these systems are completely
redundant and the probability of failure that can lead to the complete unavailability
of electrical feeding is negligible. Hence our case of study will consider only the
offshore installation.
The platform is designed for unmanned operation in normal operating conditions,
consequently the amount of facilities installed on board is reduced at least. Separation,
gas treatment and compression facilities will be installed onshore.
Three different production scenarios, with the related average production, will be
considered during the next 30 years:
• Scenario A: six gas wells producing: it is referred to the period starting from 2014
to 2024, only the gas wellheads are considered available, corresponding to the full
load of both the generation station and the gas turbine installed onshore.
The average production used in the calculation is 2.1 MSm3/day of gas coming Integration
from offshore plus overdesign factor. between RCM
• Scenario B: six gas wells+two oil wells producing: it is referred to the period and RAM
starting from 2025 to 2038, gas and oil wellheads are considered totally available;
the gas sent to onshore is equal to 2.1 MSm3/day of gas coming from gas
wellhead plus 73,080 Sm^3/day of associated gas coming from the two oil wells,
plus the overdesign factor. 869
• Scenaraio C: three gas wells+two oil wells producing: it is referred to the period
starting from 2038 to 2045, gas and oil wellheads are considered available but the
gas production coming from gas wellhead is reduced to 1.05 MSm3/day.
A spare philosophy will be adopted for the platform and the following equipment will
be installed with 100 percent spare capacity: pumps and machinery, related to gas and
condensate process, safety valves and oil export pumps.
Downloaded by KU Leuven Libraries At 00:13 07 June 2019 (PT)
5.2 CA
The CA foresees the evaluation of item criticality through a risk matrix approach as
illustrate in the methodology. The compensating provisions installed on the plant as
the automatic activation of spare system reduce or cancel the severity value in terms of
asset production because they guarantee the regular operating of the plant. Hence a
Downloaded by KU Leuven Libraries At 00:13 07 June 2019 (PT)
33,6
870
Table V.
IJQRM
FMEA result
valve located
separation unit
for a shut down
Shut down Fail to close on 0.38 2,631,579 2 OREDA na No effect No effect No effect 0.00%
valve demand (2009, p. 574)
Shut down Fail to open on 0.76 1,315,789 1.8 OREDA Production No oil flow No oil flow to unit 220 Shut down 100% oil
valve demand (2009, p. 574) crude oil transport delivered oil to
onshore plant
Shut down Fail to open on 0.76 1,315,789 1.8 OREDA No gas flow to unit 360 Reduction of 2.86% gas
valve demand (2009, p. 574) associated gas delivered gas to
compression onshore plant
Shut down Fail to regulate 1.14 877,193 2.7 OREDA na No effect No effect no effect 0.00%
valve (2009, p. 574)
Shut down External 0.38 2,631,579 10 OREDA Production Gas and oil No oil flow to unit 220 Shut down 100% oil
valve leakage (2009, p. 574) leakage crude oil Transport delivered oil to
process onshore plant
medium
Shut down External 0.38 2,631,579 10 OREDA No gas flow to unit 360 Reduction of 2.86% gas
valve leakage (2009, p. 574) associated gas delivered gas to
process compression onshore plant
medium
Notes: aFailure rate; bmean time to failure; cactive repair time
deep and attentive analysis of the plant functioning and its P&IDs are necessary to Integration
implement the CA. between RCM
and RAM
5.3 Maintenance strategy selection and maintenance analysis
MSS is a decision-making policy through whom it will be defined the maintenance
policy for each item part of the offshore platform. For items which results Criticality 1
or 2 from CA, the MSS flow chart addresses user toward a preventive strategy choice, 871
in order to assure risk lowering. For Criticality 3 items, the default strategy is CM if
allowed by laws or constraints.
For the specific case of study it had been developed the decision diagram shown
in Figure 3. Through various “yes or no” questions, it navigates to the optimal
maintenance task for the item assessed.
Using the results obtained by MSS, through the MA methodology it will be
calculated PMT for each item and CMT related to each failure mode. During their
Downloaded by KU Leuven Libraries At 00:13 07 June 2019 (PT)
33,6
872
IJQRM
Figure 3.
MSS flow chart
Is There An Incipient Are There Vendor’s
Is the Aging
Criticality 1 and 2 Signal For This Failure NO NO Maintenance
Predictable?
Mode? Instructions?
YES YES NO
NO
Predictive or On
Condition
Maintenance
NO
Maintenance
Does the Maintenance Plan YES NO
Plan NO YES Approved
satisfy Laws and/or Constraints?
Upgrading
Cyclical
Maintenance
Corrective
Maintenance
YES
REDESIGN
Is There Any Law or Constraint
Criticality 3 (assurance/warranty) asking for NO
preventive Maintenance?
Downloaded by KU Leuven Libraries At 00:13 07 June 2019 (PT)
873
Integration
scenario A
IJQRM Stand-by group are modeled as in normal operation, one item is active and the other one
33,6 is in stand by. The same structure has been used for all production safety valves,
pumps, electric heaters and fuel gas filters.
In the simulation model referred to Scenario A only six gas wells have been
considered available. In Scenario B and C the Unit 100 has been structured considering
the respectively operating scenario, six gas wells and two oil wells producing and three
874 gas wells and two oil wells producing.
The results obtained by the RAM in the three production scenarios is summarized
by the Table VI.
Considering the three different scenarios, which includes the plant behavior from
2014 to 2044, in Figure 5 it is shown the ideal productivity and the loss of production
among the years compared with the plant availability. The normal production it is
assumed to be equal to 100 percent.
Moreover it is compared the plant downtime with the plant availability.
Downloaded by KU Leuven Libraries At 00:13 07 June 2019 (PT)
It had been considered every five years planned shutdown, it is possible to notice
from the Figure 6 how they increase the days shutdown and have a severe impact on
the plant availability.
Moreover, in order to identify the most critical units of the plant, which give the
major contribution to the plant production unavailability, the MTBF of each section for
the three different scenarios is calculated.
From the Figure 7 it is possible to extract the most critical sections of the plant
because of their lowest MTBF value. The oil wellheads seems to be the most critical
units of the plant because of their series structure of the block and the MTTF and
MTTR of the items enclosed in the unit. Moreover a logarithmic scale has been chosen
because of the high variability of the dates related to different section and scenarios.
Indeed Unit 500, which contains sea water system is the least critical unit of the plant
and has and high MTBF value.
Through their analysis it is possible to evaluate the most critical items of unit. In the
Figure 8 is shown the results obtained from the analysis.
The redundant items have not been considered in the most critical items assessment
because their failure do not cause an unavailability of the unit.
Taking into account the unit related to gas production wellhead, the two flow
control valves are considered the most critical items of the considered unit.
As for gas production wellhead, the same diagram has been calculated for the other
sections of the plant, analyzing the bottlenecks object of possible redesign and
modifications. But, considering the acceptable value of the availability of the system no
process redesign and maintenance review are recommended.
λ Operating Plant
Item Failure mode description (106h)a phase MSSb CMTc PMTc Redundancy Maintenances tasks status Frequency
Flow control Fail to close on demand 0.38 Maintenance PMT 23.75 22.75 n Functional verification 1 1Y
valve Fail to open on demand 0.76 Start up PMT 23.55 Functional verification 1 1Y
Fail to regulate 1.14 na PMT 24.45 Functional verification 1 1Y
External leakage process 0.38 Production PMT 31.75 Tightening shaft 1 1Y
medium packing nut
External leakage process 0.38 Production PMT 31.75 Tightening shaft 1 1Y
medium packing nut
c
Notes: aFailure rate; bmaintenance strategy selection; corrective maintenance time; dpreventive maintenance time
definition template
Maintenance plan
and RAM
between RCM
Table VI.
875
Integration
IJQRM 3
Loss in production Ideal productivity Availability
100
33,6 99.5
2.5 99
98.5
2 98
Productivity MSm3
97.5
Availability
876 1.5 97
96.5
1 96
95.5
Figure 5. 0.5 95
Ideal productivity- 94.5
loss production- 0 94
availability
2014
2016
2018
2020
2022
2024
2026
2028
2030
2032
2034
2036
2038
2040
2042
2044
Downloaded by KU Leuven Libraries At 00:13 07 June 2019 (PT)
Availability
97.5
Days
3 97
96.5
2 96
95.5
1 95
94.5
Figure 6. 0 94
2015
2017
2019
2021
2023
2025
2027
2029
2031
2033
2035
2037
2039
2041
2043
2045
Downtime days
100,000
10,000
1,000
100
10
1
0
0
S
0
0
0
0
S
20
22
23
45
46
47
19
50
55
42
D
N
EA
EA
LI
IT
IT
IT
IT
IT
IT
IT
IT
IT
IT
W
LH
N
N
N
N
O
L
U
U
U
U
EL
EL
FL
W
D
O
AS
IL
PR
O
G
Figure 7.
AS
G
MTBF of different
0
13
0.000015
0.00001
0.000005
0
877
1
03
03
04
21
7
00
00
02
01
02
00
00
00
00
00
V0
V0
V0
V0
LT
LT
PT
PT
PT
VN
VN
LT
PT
PT
BD
SD
SD
SD
00
00
00
00
00
00
00
00
00
00
Figure 8.
00
00
00
00
42
42
42
42
42
42
42
42
42
42
42
42
42
42
36
36
36
36
36
36
36
36
36
36
MSS flow chart
36
36
36
36
30
30
30
30
30
30
30
30
30
30
30
30
30
30
01
01
01
01
01
01
01
01
01
01
01
01
01
01
fixed plant shut down foreseen every two year for the lines pigging as shown in
Downloaded by KU Leuven Libraries At 00:13 07 June 2019 (PT)
6. Conclusion
In process industries, where production generally operates continuously, the
availability of equipment is very important. In this study an integrated RCM and
RAM approach is defined and it is applied to a context of a particular offshore
installation in the oil industry.
The present work started understanding the relationships between the different
process section and auxiliary section of the plant and establishing the failure mode
belonging to each item and its effect on the system. All that allows us to calculate the
items criticality that has been useful for the best maintenance strategy selection.
At that point the integration of the reliability study with the availability study,
based on the CA and maintenance analysis results, makes the process analysis more
sensible to the efficiency of the system and focussed on quantitative point of view.
A software simulation allows us to minimize the contract losses due to production
failures, to avoid excessive attention to areas that are non-critical, and instead focus on
areas where improvements and modifications will make a larger impact. Indeed
performing the analysis at an early stage, the study will allow for enhancement
suggestions that would have been much more expensive if carried out at a later stage.
Moreover it would be possible evaluating the impact of future or proposed changes to
current operations, for example through the modeling of spares; predicting system
uptime, downtime and slowdowns; and providing a strategic view of the asset’s
performance over longer time scales.
References
ANSI/IEEE Std 500-1984 P&V (1984), IEEE Standard Reliability Data for Pumps and Drivers,
Valve Actuators, and Valves, The Institute of Electrical and Electronics Engineers, Inc.,
New York, NY.
Aoudia, M. and Belmokhtar, O. (2008), “Economic impact of maintenance management
ineffectiveness of an oil and gas company”, Journal of Quality in Maintenance Engineering,
Vol. 14 No. 3, pp. 237-261.
Bazrafshan, M. and Hajjari, T. (2012), “Diagnosing maintenance system problems: theory and a
case study”, Quality and Reliability Engineering International, Vol. 28 No. 6, pp. 594-603.
Bookless, C. and Sharkey, M. (1999), “Streamlined RCM in the nuclear industry”, Maintenance
and Asset Management, Vol. 14 No. 1, pp. 27-30.
Calixto, E. (2012), Gas and Oil Reliability Engineering: Modeling and Analysis, Gulf Professional
Publishing, Waltham, MA, pp. 119-167.
Carazas, F.G. and Souza, G.F.M. (2010), “Risk-based decision making method for maintenance
policy selection of thermal power plant equipment”, Energy, Vol. 35 No. 2, pp. 964-975.
Carretero, J., Pérez, J.M., Garcia-Carballeira, F., Calderon, A., Fernandez, J. and Garcia, J.D. (2003),
“Applying RCM in large scale systems: a case study with rail way networks”, Reliability
Engineering and System Safety, Vol. 3 No. 82, pp. 257-273.
Conn, A.R., Deleris, L.A., Hoskinga, J.R.M. and Thorstensen, T.A.A. (2010), “Simulation model for
improving the maintenance of high cost systems, with application to an offshore oil
installation”, Quality and Reliability Engineering International, Vol. 26 No. 7, pp. 733-748.
Desphande, V.S. and Modak, J.P. (2002), “Application of RCM to a medium scale industry”,
Reliability Engineering and System Safety, Vol. 77 No. 1, pp. 31-43.
Deshpande, V.S. and Modak, L.P. (2002), “Application of RCM for safety considerations in a steel
plant”, Reliability Engineering and System Safety, Vol. 78 No. 3, pp. 325-334.
Fernandez, O., Labib, A.W., Walmsley, R. and Petty, D.J. (2003), “A decision support maintenance
management system”, International Journal of Quality & Reliability Management, Vol. 20
No. 8, pp. 965-979.
Høyland, A. and Rausand, M. (2009), System Reliability Theory, John Wiley & Sons, Hoboken, NJ.
Hussin, H., Hashim, F.M., Ramli, O.H. and Ghazali, S.M.A. (2013), “Maintainability analysis of an
offshore gas compression train system, a case study”, International Journal of Quality &
Reliability Management, Vol. 30 No. 5, pp. 495-510.
IEC (1999), “Dependability management, part 3-11: application guide reliability centred Integration
maintenance”, IEC 60300-3-11/NEK IEC 60300-3-11, Norsk elektroteknisk komite (NEK).
between RCM
Kam, J.C.P. (1989), “The efficient maintenance of offshore structural integrity using reliability and RAM
analysis”, Quality and Reliability Engineering International, Vol. 5 No. 3, pp. 221-228.
Kaufman, L. (2010), “Oil rig sinks in the Gulf of Mexico”, The New York Times, April 22, p. A14.
Komal Sharma, S.P. and Kumar, D. (2010), “RAM analysis of repairable industrial systems
utilizing uncertain data”, Applied Soft Computing, Vol. 10 No. 4, pp. 1208-1221. 879
McCreery, J., Phillips, E. and Cigala, F. (2013), “Operational excellence: the imperative for
oil and gas companies”, available at: www.bain.com/Images/BAIN_BRIEF_
Operational_excellence_The%20imperative_for_oil_and_gas_companies.pdf (accessed
December 15, 2014).
Mokashi, A.J., Wang, J. and Vermar, A.K. (2002), “A study of reliability-centered maintenance in
maritime operations”, Marine Policy, Vol. 26 No. 5, pp. 325-335.
Offshore-technology.com (2014), “The world’s worst offshore oil rig disasters”, available at: www.
Downloaded by KU Leuven Libraries At 00:13 07 June 2019 (PT)
offshore-technology.com/features/feature-the-worlds-deadliest-offshore-oil-rig-disasters-
4149812/ (accessed November 2014).
OREDA (2009), Offshore Reliability Data Handbook, 5th ed., DNV – Det Norske Veritas, Oslo.
Patin, S. (1999), Environmental Impact of the Offshore Oil and Gas Industry, Ecomonitor Pub,
East NorthPort, NY.
Procaccia, H., Arsenis, S.P. and Aufort, P. (1998), EIReDA, 1st ed., Crete University Press, Iraklion
Crete.
Rajpal, P.S., Shishodia, K.S. and Sekhon, G.S. (2006), “An artificial neural network for modeling
reliability, availability and maintainability of a repairable system”, Reliability Engineering
and System Safety, Vol. 91 No. 7, pp. 809-819.
Rausand, M. and Vatn, J. (2008), “Reliability centred maintenance”, in Kobbacy, K.A.H. and Murthy,
D.N.P. (Eds), A Complex System Maintenance Handbook (Springer Series in Reliability
Engineering), Chapter 4, Springer-Verlag Limited, London, pp. 79-108.
SAE (1999), “Evaluation criteria for reliability-centered maintenance (RCM) processes”,
SAE-JA1011, Society of Automotive Engineers.
SAE (2002), “A guide to the reliability-centered maintenance (RCM) standard”, SAE-JA1012,
Society of Automotive Engineers.
Saraswat, S. and Yadava, G.S. (2008), “An overview on reliability, availability, maintainability
and supportability (RAMS) engineering”, International Journal of Quality & Reliability
Management, Vol. 25 No. 3, pp. 330-344.
Selvik, J. and Aven, T. (2011), “A framework for reliability and risk centered maintenance”,
Reliability Engineering and System Safety, Vol. 96 No. 2, pp. 324-331.
Serratella, C.M., Wang, G. and Conachey, R. (2007), “Risk-based strategies for the next generation
of maintenance and inspection programs”, International Symposium on Maritime, Safety,
Security and Environmental Protection (SSE), Athens, September 20-21.
Sharma, R.K., Kumar, D. and Kumar, P. (2008), “Performance modeling in critical
engineering systems using RAM analysis”, Reliability Engineering and System Safety,
Vol. 93 No. 6, pp. 891-897.
SKF Reliability Systems (2008), “Methodology SRCM: compliant with SAE JA 1011 standard SKF
aptitude exchange”.
Turan, O., Lazakis, I., Judah, S. and Incecik, A. (2011), “Investigating the reliability and criticality
of the maintenance characteristics of a diving support vessel”, Quality and Reliability
Engineering International, Vol. 27 No. 7, pp. 931-946.
IJQRM Vatn, J. (2008), Complex System Maintenance Handbook, Springer-Verlag, London, pp. 509-532.
33,6 Vatan, J., Heggset, J. and Hameed, Z. (2011), “Challenges in the reliability and maintainability data
collection for offshore wind turbines”, Renewable Energy, Vol. 36 No. 8, pp. 2154-2165.
doi: 10.1016/j.renene.2011.01.008.
For instructions on how to order reprints of this article, please visit our website:
www.emeraldgrouppublishing.com/licensing/reprints.htm
Or contact us for further details: permissions@emeraldinsight.com
This article has been cited by:
KumaraswamyK.G., K.G. Kumaraswamy. 2018. Reduction of life cycle costs for a contemporary
helicopter through improvement of reliability and maintainability parameters. International Journal
of Quality & Reliability Management 35:2, 545-567. [Abstract] [Full Text] [PDF]
5. TsarouhasPanagiotis, Panagiotis Tsarouhas, BesserisGeorge, George Besseris. 2017. Maintainability
analysis in shaving blades industry: a case study. International Journal of Quality & Reliability
Management 34:4, 581-594. [Abstract] [Full Text] [PDF]