Professional Documents
Culture Documents
Substations and Electrical Installations: Asset Health Indices For Equipment in Existing Substations
Substations and Electrical Installations: Asset Health Indices For Equipment in Existing Substations
Substations and
TECHNICAL BROCHURE
electrical installations
Members
Corresponding Members
R. CORNELL US
M. VERRIER AU
A. KURZ DE
Copyright © 2021
“All rights to this Technical Brochure are retained by CIGRE. It is strictly prohibited to reproduce or provide this publication in any
form or by any means to any third party. Only CIGRE Collective Members companies are allowed to store their copy on their
internal intranet or other company network provided access is restricted to their own employees. No part of this publication may
be reproduced or utilized without permission from CIGRE”.
Disclaimer notice
“CIGRE gives no warranty or assurance about the contents of this publication, nor does it accept any responsibility, as to the
accuracy or exhaustiveness of the information. All implied warranties and conditions are excluded to the maximum extent permitted
by law”.
WG XX.XXpany network provided access is restricted to their own employees. No part of this publication may be
reproduced or utilized without permission from CIGRE”.
Disclaimer notice
ISBN : 978-2-85873-563-1
“CIGRE gives no warranty or assurance about the contents of this publication, nor does it accept any
responsibility, as to the accuracy or exhaustiveness of the information. All implied warranties and
TB 858 - Asset Health Indices for Equipment in Existing Substations
Executive summary
Satisfactory and reliable performance of substation equipment is critical for any utility company. During
their service-life assets transition from being new to ones that are aged in terms of having one or more
developing failure modes occurring. This may be a gradual deterioration or a step change after a
damaging incident. Eventually failure would follow unless an appropriate corrective action is taken. As
such this deterioration presents a risk exposure affecting key business objectives unless it is identified
and managed. It is a situation that requires each asset, or in some cases a functional group of assets,
to have individual care plans to ensure their future ability to perform.
This process would be used to identify when to intervene with maintenance, refurbishment or
replacement. It creates a focus onto estimates of a “failure-free” period for each asset. It all begins
when the asset is newly commissioned, and it has ongoing revisions as the years in service pass. In
many situations these estimates in turn can be based on periodic condition assessments. To achieve
this the process begins with identification of relevant failure modes, to apply corresponding diagnostic
indicators and coordinate the outcomes within an Asset Health Indexing (AHI) methodology.
The specific aim for such work is two-fold. One is to develop processes to identify intervention
priorities applicable at the individual asset level. The second is to identify processes to aggregate
these priorities for different asset types to produce a score for a circuit end, bay or substation.
The development of this brochure draws upon related experience of working group members together
with some work undertaken mainly within CIGRE A2, A3 and B3 study committees.
1. IN-SERVICE ASSET FAILURES
In this context an in-service failure is a failure to perform a network duty and it does not necessarily
mean an event that creates the end of life. The deteriorated condition might be rectifiable with
maintenance, refurbishment or repair. In this way the aim is more to identify failure-free “life periods”
and this is not necessarily the same as re-defining asset life. In some types of asset such deterioration
is normally addressed with timely maintenance. More catastrophic damage is repaired. The end point
for asset life is when these tasks cease to be effective. This could be when the damage is too great,
one or more fundamental functions have irreversible limiting deterioration or when the costs of repair
or refurbishments outweighs the benefits.
Historically HV power equipment has been specified and then designed with ratings that optimised the
expectation of 40- or 25-year lifetime to match that for the civils and mechanical structures in the
substation or power station. Over time it has become apparent that these are lasting much longer – in
many cases. Experience also shows that even within comparable assets the rates of deterioration are
much more varied. SC A2 and A3 reliability studies have attributed the causes for the greater range in
asset performance to the quality of the design, manufacture, commissioning, maintenance and
variations in use, see their references [B1] and [B2].
One issue is then to have the capability to identify the time frames where assets are most likely to fail,
either because they require maintenance and repair, or due to irreversible causes. This range of
failure-free periods and consequential failure mechanisms require a different management practice
from one that follows a simple time-based assignment applied to the whole asset class. One way of
managing the assets is by using condition assessment if it can be linked to both the range and rate of
development of failure modes found to occur. These are critical provisos for the approach adopted in
this document. The outcomes may then be a failure free expectancy based upon asset health indexing
systems that link the condition to a failure time frame. The process is, therefore, one moving away
from broad time base criteria to one that has asset-specific decisions based upon condition. It may
migrate further into decisions based upon risk.
2. THE ROLE OF ASSET HEALTH INDICES
The role of an AHI is, therefore, to divide the asset register into several categories such as the five
shown in Table 1.1. This example is the one developed within WG B3.48 for primary assets and based
upon experience of members and of publications elsewhere. The time scales need to be user defined
and the active failure modes identified. Both format and methodology will depend on the intended
application. In order to aggregate scores across the various asset classes in a substation it is
important, however, to preserve the same definitions for scores for all asset types, at least within a
company. Most assets assessed in this way will be in groups 1 and 2 and require no specific remedial
action at the time of the assessment. It is part of a condition-based regime where activities only occur
3
TB 858 - Asset Health Indices for Equipment in Existing Substations
when diagnostics indicate a need. It is necessary, therefore, with such an approach to continue with
an ongoing programme to capture condition data with an ongoing process to re-define and re-assess
activities and their time scales. Those in groups 3, 4 and 5 would be assessed and given their
individual action plan based upon the failure mode identified, its general rate of progression, its
condition-based rate of progression, and its criticality exposure.
Table 1.1 – Example of AHI definitions for expressing remaining failure free years
Very low likelihood of failure over Continue with inspect More than 10 years likely before additional
many years. This would be in the and test schedule. maintenance and refurbishment is
1 Very good original factory condition or after undertaken. Timing of interventions is asset
extensive refurbishment. specific and indicated by the inspection and
test results.
Low likelihood of failure over a Continue with inspect and 5-10 years likely before additional
long period. General deterioration test schedule. maintenance and refurbishment is
is consistent with its time in undertaken. Timing of interventions is asset
service. specific and indicated by the inspection and
test results. Subcategory bands can be
2 Good introduced based upon failure mode and
rate of change in diagnostics.
The impact of any life-limiting irreversible
deterioration is expected to be beyond this
time frame. If not, introduce extra column
with 5-year replacement bands.
Low risk defect or life-limiting Investigate the issue 2-5 years before interventions. Timing and
deterioration has been detected. and plan any scope are indicated by investigations,
Performance may be adversely intervention. Continue together with changes in inspect and test
affected long term unless remedial with a revised inspect results.
3 Fair action is carried out. and test schedule. Subcategories introduced in yearly bands
Revise life expectancy based upon failure mode and rate of change
planning into likely 5- in diagnostics.
year bands.
Progressive deterioration has been Remedial action to be 3- 24 months before interventions. Planning
detected, with high likelihood of carried out and/or the action and its timing is determined by
failure in the short term. The unit increased condition failure mode analysis and operational
can remain in service, but short- monitoring implemented. practicalities. This is managed using
4 Poor term reliability is likely to be De-rating and risk increased surveillance.
reduced. Subcategories are useful management zones may
to define urgency of repair or be needed.
replacement timeframes.
High likelihood of immediate failure Any exception would 0-3 months determined by risk assessment.
exists and the unit should not require intensive risk
remain in service. management actions. If
5 Critical returned to service
decision points and time
frames need to be
defined.
In category 1 the term “as new condition” is avoided since some newer assets can have a higher
failure rate.
Mostly assets will be allocated into categories 2 and 3 and this will not aid prioritisation in terms of
remedial actions. It is likely that there will be several subsets – 2.1, 2.2, etc. to provide this granularity
and allow prioritisation. It will be important however to make these subsets relate to time scales for
action consistent with category 2, or 3 etc.
3. CREATING THE AHI METHODOLOGY
Creating a condition linked methodology usually starts by working from the asset register – which is a
simple list identifying the company assets. The register identifies the asset type, design information,
location and function. The AHI modification includes an assessment of condition and implied likelihood
of an in-service failure within a timescale for each asset in the register. In this context, the modified list
is then referred to as an Asset Health Index (AHI). By associating a future time scale alongside the
detected deterioration, the outcome can then be used as the best tool to identify timing for
4
TB 858 - Asset Health Indices for Equipment in Existing Substations
interventions to reduce the likelihood of such an in-service failure. Such interventions include asset
replacement, repairs, refurbishments and maintenance.
It is important that the interpretation of the test data needs to be done within the terms of the failure
mode. Test results are not failure modes and always need to be interpreted within their relationship to
failure modes and predicted rates of progression. For example, some tests (such as Dissolved Gas
Analysis (DGA) for transformer oils) can be indicators of several failure modes, but the analyses need
to be interpreted severally and relative to each failure mode.
The AHI is built from assessments of each of the critical failure modes. The result may be
encapsulated into the assessment of just one failure mode (that with the poorest assessment.)
Illustration of the approach on each of the many substation asset types is described in Chapter 4. This
worst score can cascade up levels for individual components, single assets, bays or substation, at
each level carrying forward the worst most urgent score. Alternatively, some granularity can be
achieved, at least at component or asset level either by having sub-categories based upon either the
failure mode and its detected rate of progression, or by aggregating scores for all failure modes.
However, this is not without its problems when results from many modes and where different assets in
a bay or substation are aggregated. This is an aspect discussed at length in Chapter 5 of the brochure
and concludes with the appreciation that there is no single approach to AHI, many will work and need
to be chosen to suit the application.
4. CONCLUSIONS
The AHI approach is applicable where there is a link through condition indicators of symptoms of
failure modes and the timescales for the transition from being sound to being likely to cause an
in-service failure.
Creating an AHI approach is to produce a listing of each asset in terms of its likelihood to fail in
service in a user selected time interval. This likelihood would be used with a criticality analysis to
form a risk assessment register.
Any AHI process should repeat and follow each asset through its life by identifying changing
likelihoods of failure with their associated time periods and by creating an action plan for an
intervention – maintenance, repair, or replacement.
The resulting set of AHIs should be calibrated for time. The AHI must uniformly reflect the same
urgency of intervention. All assets with the same score should have the same timescale for
intervention, irrespective of failure mode or asset type, otherwise there is confusion in applying
AHIs consistently.
A ‘poorer’ AHI should always reflect a more urgent condition. This means that where several
failure modes are being assessed and the scores aggregated the method of aggregation should
not produce any violation of this principle.
The AHI methodology can be used at component, asset, bay and substation levels, incorporating
a wide variety of asset types and for a range of outcomes – maintenance and replacement
planning. With such a range there is no single “correct” method for developing and applying the
AHI process.
5
TB 858 - Asset Health Indices for Equipment in Existing Substations
Contents
Executive summary ............................................................................................................. 3
Tables ................................................................................................................................. 13
1. Introduction.............................................................................................................. 16
Goal of this working group and the technical brochure ...................................................................... 16
The role of health indices within asset life planning ............................................................................ 16
Drivers for the development of an AHI process .................................................................................... 18
Societal impact of in-service failures .................................................................................................. 18
Regulatory impact .............................................................................................................................. 18
AHI within an asset management process ............................................................................................ 19
AHI and the ageing asset base ............................................................................................................... 19
Dealing with Unexpected Failures.......................................................................................................... 21
Experience developing AHI .................................................................................................................... 22
6
TB 858 - Asset Health Indices for Equipment in Existing Substations
7
TB 858 - Asset Health Indices for Equipment in Existing Substations
8
TB 858 - Asset Health Indices for Equipment in Existing Substations
9
TB 858 - Asset Health Indices for Equipment in Existing Substations
10
TB 858 - Asset Health Indices for Equipment in Existing Substations
11
TB 858 - Asset Health Indices for Equipment in Existing Substations
12
TB 858 - Asset Health Indices for Equipment in Existing Substations
Tables
Table 1.1 – Example of AHI definitions for expressing remaining failure free years ............................... 4
Table 2.1 – Example of AHI definitions for expressing remaining failure free years ............................. 24
Table 2.2 – Log and Linear condition scale codes ................................................................................ 30
Table 2.3 – Converting condition indicators (observations or measured values) to condition scale
codes ..................................................................................................................................................... 31
Table 2.4 – Aggregating summed scores with linear and logarithmic scoring ...................................... 33
Table 2.5 – Effect of weighting linear scores [B15] ............................................................................... 33
Table 2.6 – Aggregating scores............................................................................................................. 35
Table 3.1 – Asset data ........................................................................................................................... 40
Table 3.2 – First level assessment – example of a susceptibility review .............................................. 42
Table 3.3 – Detectability ........................................................................................................................ 43
Table 3.4 – Indication of restricted data and limited confidence ........................................................... 45
Table 3.5 – Example showing the relation between score and AHI ...................................................... 46
Table 3.6 – The compiled AHI – example based upon Log base 3 scoring .......................................... 47
Table 4.1.1 – Asset register information example ................................................................................. 48
Table 4.1.2 – Consequences of Failure ................................................................................................ 48
Table 4.1.3 – Diagnostic indicators in use and failure modes ............................................................... 49
Table 4.1.4 – Common asset data ........................................................................................................ 50
Table 4.1.5 – Scoring Historic data ....................................................................................................... 50
Table 4.1.6 – Level assessment – example of review of failure mode susceptibility factors ................ 51
Table 4.2.1 – Common faults and indicators (simplified list) ................................................................. 54
Table 4.2.2 – Scale code assignment ................................................................................................... 56
Table 4.2.3 – Visual Inspection ............................................................................................................. 56
Table 4.2.4 – Survey test results ........................................................................................................... 57
Table 4.2.5 – On-line monitoring ........................................................................................................... 58
Table 4.2.6 – Offline and investigative testing ....................................................................................... 58
Table 4.2.7 – Common faults, indicators and scoring for AHI ............................................................... 59
Table 4.3.1 – Distribution of CB failures per cause ............................................................................... 61
Table 4.3.2 – MaF modes ...................................................................................................................... 61
Table 4.3.3 – Examples of condition indicators related to components and failure modes .................. 64
Table 4.3.4 – Example of condition indicator estimation for circuit breakers ........................................ 65
Table 4.3.5 – Review Level, Grid Integrity and C/P for condition indicators ......................................... 66
Table 4.3.6 – Typical condition indicators and scoring methodologies ................................................. 67
Table 4.3.7 – Comprehension about health indices .............................................................................. 69
Table 4.4.1 – Main tasks of the equipment ........................................................................................... 71
Table 4.4.2 – Review level .................................................................................................................... 71
Table 4.4.3 – DS and ES: Failure mode of drive only by type of drive (Sum MaF + MiF) [B21] ........... 72
Table 4.4.4 – DS and ES: Failure mode excluding drive (Sum MaF + MiF) (Table 3-60; Table 3-59 in
[B21]) ..................................................................................................................................................... 72
13
TB 858 - Asset Health Indices for Equipment in Existing Substations
Table 4.4.5 – Effects and root causes of several Failure Modes .......................................................... 73
Table 4.4.6 – Deciding diagnostic strategy ........................................................................................... 75
Table 4.4.7 – Example assessment and comparison of 3 different disconnectors ............................... 75
Table 4.5.1 – Component, failure mode and indicators ........................................................................ 78
Table 4.5.2 – Visual Inspection [B28] .................................................................................................... 80
Table 4.5.3 – Non-invasive in-service test results ................................................................................. 81
Table 4.5.4 – Offline and investigative testing [B28] ............................................................................. 81
Table 4.5.5 – On-line monitoring ........................................................................................................... 82
Table 4.5.6 – Detectability of diagnostics .............................................................................................. 82
Table 4.5.7 – Example oil results .......................................................................................................... 84
Table 4.5.8 – Example of translation of the C2H6 condition from DGA to a condition indicator index .. 85
Table 4.5.9 – Reduction of dielectric withstand capability ..................................................................... 85
Table 4.5.10 – Example AHI scores ...................................................................................................... 86
Table 4.6.1 – GIS components, sub group of components, subsystems, function of subsystems, and
key parts ................................................................................................................................................ 88
Table 4.6.2 – The condition indicators in subsystems of GIS ............................................................... 91
Table 4.6.3 – Summary of norm for humidity content for 150 kV GIS from a specific manufacturer as
generated from different approaches .................................................................................................... 94
Table 4.6.4 – Example of condition scores and their descriptions ........................................................ 95
Table 4.6.5 – Condition scores of primary conductor subsystem in GIS .............................................. 96
Table 4.6.6 – Example of Condition Score (CC), interpretation, and bay index ................................... 98
Table 4.6.7 – Summary of Condition Scores of Subsystems in CB (G0) from each line of GIS ........... 99
Table 4.6.8 – Summary of Bay Index of GIS example .......................................................................... 99
Table 4.6.9 – Failure susceptibility indicator index of GIS example .................................................... 100
Table 4.6.10 – Summary of Bay Health Index & Failure Susceptibility Indicator index of GIS example
before and (expected after) mitigation action ...................................................................................... 101
Table 4.7.1 – Diagnostic indicators in use and failure modes ............................................................. 107
Table 4.7.2 – Dielectric dissipation factor analysis for capacitor banks .............................................. 109
Table 4.7.3 – Capacitance analysis for capacitance banks ................................................................ 110
Table 4.7.4 – Data and scale codes .................................................................................................... 111
Table 4.7.5 – Scale code assignment ................................................................................................. 112
Table 4.8.1 – Identifying assets and diagnostics................................................................................. 113
Table 4.8.2 – failure mode analysis ..................................................................................................... 113
Table 4.9.1 – Auxiliary Equipment and Roles ..................................................................................... 116
Table 4.9.2 – Review levels ................................................................................................................. 116
Table 4.10.1 – Components ................................................................................................................ 121
Table 4.10.2 – Failure mode detection indicators ............................................................................... 122
Table 4.10.2 – Classification rules for buildings according to their condition [B5] .............................. 124
Table 5.1 – Condition scale code examples ........................................................................................ 126
Table 5.2 – Example with alphabetical codes ..................................................................................... 127
Table 5.3 – Example with numeric codes ............................................................................................ 128
Table 5.4 – Combining AHI for 3 assets, alphanumeric codes ........................................................... 128
14
TB 858 - Asset Health Indices for Equipment in Existing Substations
15
TB 858 - Asset Health Indices for Equipment in Existing Substations
1. Introduction
Goal of this working group and the technical brochure
Working group B3.48 was created to produce a technical brochure describing a process to classify
substation assets in terms of their changing likelihood of having an in-service failure. This would be
achieved by producing guidelines for companies to build credible Asset Health Indices (AHIs). These
AHIs should be the first step towards more focused outcomes such as plans for maintenance, asset
refurbishment, asset replacement and risk management.
16
TB 858 - Asset Health Indices for Equipment in Existing Substations
towards a set of plans for managing each asset, a process consistent with international asset
management practice [B6], [B7], [B8]. Within this context, company asset managers will be most
concerned with going further than only assessing condition and likelihood of failure. They will be
assessing the risk of failure where this is defined as:
𝑅𝑖𝑠𝑘 𝑜𝑓 𝑓𝑎𝑖𝑙𝑢𝑟𝑒 𝑜𝑓 𝑎𝑛 𝑎𝑠𝑠𝑒𝑡 = 𝑙𝑖𝑘𝑒𝑙𝑖ℎ𝑜𝑜𝑑 𝑜𝑓 𝑓𝑎𝑖𝑙𝑢𝑟𝑒 𝑜𝑓 𝑡ℎ𝑒 𝑎𝑠𝑠𝑒𝑡 × 𝑐𝑜𝑛𝑠𝑒𝑞𝑢𝑒𝑛𝑐𝑒𝑠 𝑜𝑓 𝑖𝑡𝑠 𝑓𝑎𝑖𝑙𝑢𝑟𝑒
Here the consequences of failure will be assessed in terms of the asset's role in the network, business
impact, safety, environmental damage etc. This creates a much more bespoke assessment between
companies depending upon their risk tolerance. Conversely the "likelihood" factor should be more
amenable to creating a common methodology across companies. For this reason, the terms of
reference for CIGRE Group B3.48 is being limited to this aspect. The work will be followed by a later
group B3.61 to complete the task of building the methodology for the broader risk-based decisions.
Diagrammatically this two-stage approach is shown in Figure 1.1.
17
TB 858 - Asset Health Indices for Equipment in Existing Substations
Regulatory impact
Much of the industry has changed focus to become performance driven organisations. The most
significant facilitator has been the development of an asset manager model as a single business
function in the utility. Such a function is empowered by the company executives to implement their
asset related strategies. This involves the control of costs to achieve the stated business objectives for
network performance, risk exposures and return on investment. It has led to the need to identify what
assets exist, where they are in the network and what is their role relative to these business objectives.
Risk management is a fundamental role in an asset management company, and it is a legal
requirement in some.
Regulators have been keen to see that utilities have processes in place to manage the competing
demands of cost reduction, network performance and the range of business risks. It was this that led a
range of utility sector stakeholders to create firstly the BSI-PAS 55 document [B6], [B7] and more
recently in February 2014 the first international asset management standard, ISO 55000 [B8]. These
have been used to change organisations which had been founded as service providers into ones that
are asset focused, achieving business returns on invested capital whilst defining and managing risk
exposures. One important feature of this asset management model has been the "line of sight" which
is a direct link between the role for every asset within the system and how it meets the objectives set
by the utility executives. This means that each asset has its own life plan. This includes its
receiving ongoing investment to meet its business goals. Thus, from a completely different standpoint
the requirement has been created to evolve an asset register to include cost evaluated mitigation
plans to address identified asset related risks relating to loss of load. This is what creating asset health
index is attempting to achieve when allocating each asset into a band reflecting likelihood of failure
within a time period.
18
TB 858 - Asset Health Indices for Equipment in Existing Substations
19
TB 858 - Asset Health Indices for Equipment in Existing Substations
environmental damage, system disturbances switching duty and load levels. Failures that have been
seen tend to be those in early life units that have design limitations and so unsuitable for predictive
analysis for applying to the whole asset class. The recent CIGRE survey of transformer failures [B1],
for example, concluded that there was no evidence of a bathtub characteristic for transformer
population as a whole. A separate study of UK transformer and reactor failures concluded that only by
separating out the population into its design groups, each with its own dominant failure mode, could
any sign of a predictive time based failure pattern be identified [B10]. Similarly, German studies of
failures in over 4000 SF6 circuit breakers showed an infant mortality followed by random failure rate
[B11] and [B12]. It is only when specific functions such as failures in hydraulic drives are separated out
can a predictive wear out pattern be seen. It is only visible when the data is not related to service age
but to years since an earlier intervention (repair, rebuild or maintenance). These results are not
surprising for the generation and transmission sectors in particular which have relatively few assets
with the same design, OEM and operational environment. But the converse can also be true – in
distribution or with cable systems there may well be large populations of identical equipment all with a
single dominant failure mode. Predictive failure modelling might then be more relevant [B13].
An important perspective is to be able to introduce key performance indicators subsequent to audits of
the use and role of condition based AHI when assets are removed from service (step 6 of Figure 1.3).
A2 session papers [B10] and [B14] have described the AHI process and how it was used to identify
replacement schedules over a 20 year period. The population included around 800 transformers, all
over 100 MVA at 400 kV and 275 kV and installed since 1952. Forensic examinations were made
during scrapping and the individual results compared with the AHI created when the unit was in
service. Reference [B14] describes how the correlation was generally good. The age of transformers
selected for replacement is shown in Figure 1.4 [B1] and [B14]. The figure also shows actual failures
still occurring in service (lower line). In this work failure was defined as a situation requiring complete
removal and replacement. In many cases these failures were random in time and mainly ones that
followed a system event and so not predicted to fail as a result of assessments of longer-term
deterioration based upon selected indicators. The outcome is a hazard rate for these unexpected but
“actual” in-service failures. The second important point is that where replacement was justified by
condition indicators (upper line) these indicators are more likely to worsen with age. The third point is
that investigation of real failures together with their forensics provide the basis for ongoing continuous
improvement of the methodology.
Hazard
Figure 1.4 – Failure hazard and replacement hazard for TSO population [B1] and [B14]
For power transformers at least, such conclusions confirm the development of condition-based AHI
and away from decision rules based upon simple time/age and duty relationships. The AHI
methodology seeks to manage within a context of recognising the more diverse range of failure modes
and applying asset-specific risk and condition-based tools. It then leads to the individual asset life
plans within an asset health review that link not to the past time in service but forward looking, towards
a time left in service before a particular failure mode becomes terminal. This could be a rolling
estimate that starts when an asset is newly commissioned and on through stages during service. It is
not to say that the number of previous service years (age), duty and exposure are not relevant. But
these are influencing factors and not failure modes themselves. Their relevance is in their possible
influence on timing of the onset and rate of progression of a particular failure mode. The tenet of a
20
TB 858 - Asset Health Indices for Equipment in Existing Substations
condition-based approach is one where the time/duty-based development of a particular failure mode
is identified through related diagnostic indicators which do relate to deterioration of a particular failure
mode, its onset and rate of progression.
21
TB 858 - Asset Health Indices for Equipment in Existing Substations
Figure 1.5 – Hoop buckling on common winding (left) and crush damage on the tertiary (right).
The problem here had been the optimum diagnostic strategy selected had been reliant upon only
levels for each combustible gas after dissolved gas analysis (DGA) in the oil. Yet DGA is a poor
indicator when the transformer failure mode is by mechanical movement. Secondly DGA was being
used as per a common practice, relying upon an adverse laboratory report initiated only when based
on IEEE and CIGRE guidelines for exceeding stated levels for one or more of the gases. In this case
study a close-up short circuit had taken place but the change in combustible gas levels following it was
not sufficient to exceed levels in the condition 1 of the IEEE C57.104 guidelines for normal units.
However, at this time the utility changed its practice with specialist engineers assessing consequences
from any damaging events. They then looked for changes in relative concentration of the key gases as
per IEEE C57.104-2008. This identified that there had been a change in percent hydrogen content
after the short circuit. This is indicative of partial discharge (PD) damage. But confirmatory evidence of
significant damage only came following out-of-service diagnostics. Since the DGA indicators had
appeared to show a unit with a low increased failure risk it took a further 2 years for the unit to be
allocated a circuit outage to allow such investigative testing to take place. Only then did the
assessment change. Winding capacitance and sweep frequency response results both gave very
unambiguous indications of severe winding movement with hoop buckling. Internal inspection from the
top of the tank revealed a broken clamping plate. The tear down confirmed hoop buckling and showed
the hydrogen was coming from PD at crushing damage to a tertiary winding.
22
TB 858 - Asset Health Indices for Equipment in Existing Substations
However, many of those developing AHI methods have realised some of the basic issues, including:
To begin by deciding the purpose. It could be an internal document for prioritising tasks such
as maintenance or replacement. Similarly, it may be to indicate likelihood of in-service failure and
so to plan the means to address this, through replacement or repair. Different purposes will lead
to a set of differently prioritised lists and different AHI as described in TB 761 [B3].
To have a clear understanding about failure modes and asset life. Some have used “current
age relative to a defined asset life” as their starting position and recalibrating it by factoring in
both the presumed effect from the operational environment and results of diagnostic tests.
Others, however, start with a FMEA approach and by seeking to identify the onset of a failure
mode. It is then this that indicates future failure free lifetime. The need is then to identify these
modes, their causes and apply diagnostic indicators which then assess the future time frame to
failure. Individual indicator results are not failure modes – nor is age.
To have a method of aggregating the results in a way that does not dilute and so hide a
bad score. This is a major issue where adding to aggregate a set of linear scores from individual
failure modes, or made worse when using weighted scores, and can lead to incorrect decisions
[B15].
The technical reasoning for an AHI outcome should be clear and reasonable. The
interpretation of data should be through published standards and guides (such as IEEE, IEC and
CIGRE) which relate asset condition to defined failure modes and identify the presence and
severity of those failure modes. The action should be clear, and the evidence provided to justify
the decisions to be made- and not lost within a convoluted and/or multifactored assessment
process. The AHI output should be time calibrated so that resulting actions can be prioritised.
An ongoing improvement process needs to be built in. There is little evidence that assets
removed from service are then forensically examined to re-assess the approach.
23
TB 858 - Asset Health Indices for Equipment in Existing Substations
the maximum periods before maintenance is carried out to avoid an in-service failure. It assumes that
maintenance is being undertaken as indicated by time intervals derived from type approval tests, or as
part of a condition-based assessment and creation of an asset maintenance index. This example in
Table 2.1 was constructed to reflect the timescales appropriate for an intervention which would itself
be used to identify the type of action – maintain, refurbish or replace. In this way it is generic, reflecting
the needs of most asset types and their components. Decisions relating to the timing of end of life will
reflect a time when future maintenance and refurbishment interventions will no longer delay onset of
an end of life failure. Identifying a time to this end of life will depend upon the asset type, the quality of
its design and manufacture, duty cycle and operational environment. To convert the table into an asset
replacement schedule would require a modification to the time scales with many more subcategories
in each of the five categories. However, although this final column will be a variable depending upon
the purpose of the AHI, it is important that time frames are stated in an AHI.
All AHI should reflect a condition, an action and a time scale for the action.
Table 2.1 – Example of AHI definitions for expressing remaining failure free years
Very low likelihood of failure over Continue with inspect More than 10 years likely before additional
many years. This would be in the and test schedule. maintenance and refurbishment is
1 Very good original factory condition or after undertaken. Timing of interventions is asset
extensive refurbishment. specific and indicated by the inspection and
test results.
Low likelihood of failure over a Continue with inspect and 5-10 years likely before additional
long period. General deterioration test schedule. maintenance and refurbishment is
is consistent with its time in undertaken. Timing of interventions is asset
service. specific and indicated by the inspection and
test results. Subcategory bands can be
2 Good introduced based upon failure mode and
rate of change in diagnostics.
The impact of any life-limiting irreversible
deterioration is expected to be beyond this
time frame. If not, introduce extra column
with 5-year replacement bands.
Low risk defect or life-limiting Investigate the issue 2-5 years before interventions. Timing and
deterioration has been detected. and plan any scope are indicated by investigations,
Performance may be adversely intervention. Continue together with changes in inspect and test
affected long term unless remedial with a revised inspect results.
3 Fair action is carried out. and test schedule. Subcategories introduced in yearly bands
Revise life expectancy based upon failure mode and rate of change
planning into likely 5- in diagnostics.
year bands.
Progressive deterioration has been Remedial action to be 3- 24 months before interventions. Planning
detected, with high likelihood of carried out and/or the action and its timing is determined by
failure in the short term. The unit increased condition failure mode analysis and operational
can remain in service, but short- monitoring implemented. practicalities. This is managed using
4 Poor term reliability is likely to be De-rating and risk increased surveillance.
reduced. Subcategories are useful management zones may
to define urgency of repair or be needed.
replacement timeframes.
High likelihood of immediate failure Any exception would 0-3 months determined by risk assessment.
exists and the unit should not require intensive risk
remain in service. management actions. If
5 Critical returned to service
decision points and time
frames need to be
defined.
In category 1 the term “As new condition” is avoided since some newer assets can have a higher
failure rate and consequently not in a “Very good condition”.
It would be normal for the user to define future time-related terms to suit their application. These times
may then be inserted into the table for use within the company. For example, it might be to include into
the timescale column an estimate of remaining time before irreversible deterioration and asset
replacement is due. This may need to align replacement timing with their regulatory review periods, as
24
TB 858 - Asset Health Indices for Equipment in Existing Substations
was the case with the utility reference earlier [B10] and [B14]. Equally the timeframes for
reassessment may differ according to experience with the design and operational environment.
The actual health of an asset is at best an estimate based on selected indicators used at a certain
time. The confidence in the assigned index will improve if more comprehensive diagnostic strategies
are used. The range of possible diagnostic strategies, from basic to advanced are here referred to as
“Review Levels”. See section 2.3 for further explanation.
AHI Applications
AHI can be used for likelihood to fail, as a replacement index and maintenance prioritisation indicator.
A health index is a result of a condition assessment that leads to a value, whether it be a letter, a
code, a number or some other indicator, that has to be consistent in terms of timescales to make
sense. For example, if the AHI is an expression of generic likelihood of failure, as described above, it
may be turned into AHI for replacement and maintenance actions:
A health index for asset replacement may give results as numeric codes, say 1-5. The timescale
for action for code 3’s may be “maintain normally (as per manufacturer instructions) but asset still
has a likely need to be replaced in 5-15 years”. All code 3 assets, of any type, should be in the
plan for replacement in 5-15 years. If the index is given as a percentage then we would also
expect monotonicity – if 100% indicates a “very good condition’’ asset, then an asset at 60%
should always be more urgent than those at 70% in terms of action timescales.
A maintenance index based on monitoring data, say for OLTC’s or bushings, may be based upon
interventions with far shorter timescales than a replacement index for a transformer: hours to
days to weeks rather than decades. This can be confusing if a maintenance scale is 1-5 where 5
means intervene ‘immediately, while 4 is ‘within 24 hours, and 3 is ‘within a week’ etc. In this
case we can identify these as 5.1, 5.2, and 5.3 so the main code is consistent with the
replacement index and likelihood to fail index. The sub-code reflects the urgency within that time
period. But, remembering A2.49 advice, if a monitor indicates an urgent ‘do it now’ condition, do
not wait for a new review of the index to confirm or deny.
Subcomponent elements of a health index, say an OLTC or a winding, may have a code of their
own; this code should also be consistent with the overall replacement/ maintenance index. It
would be confusing to have a breaker mechanism needing intervention in 3-5 years, but the
asset replacement index indicating a need to replace in 2 years.
We must be aware of time passing and when analyses are performed. If a replacement index
says ‘replace in 3-5’ years, then, in 2 years’ time, that should become ‘replace in 1-3 years’ so
we would expect the asset to be in the plan and ahead of those which had entered the ‘replace in
3-5 years’ code during those two years. That said – review and checking of condition is required
to make sure that the code still applies to the asset.
The criticality aspect usually involves an analysis of the consequences of this mode to its loss of
functionality and its likelihood of occurring within the time periods of interest.
25
TB 858 - Asset Health Indices for Equipment in Existing Substations
Reliability is the likelihood that an asset will perform its specified function under specified conditions
for a specific period of time. This definition is in line with that of the IEC 60050-191.
Probability theory is a branch of mathematics where reliability and age-related failures of a
population of assets are expressed using terms and equations - such as hazard function, probability
density function, and survival function. It is, however, only a model and only as good a model as its
inputs and assumptions allow. The relevance of this approach to a whole asset class is debatable
since asset failures that are being experienced relate to a range of differing failure modes, the installed
population shows various internal design limitations, and installations are subject to random external
events to a greater or lesser extent. It would have greater application when asset classes are
separated into groups where some had a single failure mode and a single age-related distribution. But
even then, random system events would limit it. TB 761 [B3] devotes the whole of its Chapter 7 with a
useful discussion of the problems and dependency of the outcome on the significant, and often
unsubstantiated, assumptions that have to be made when assigning probability of failure rates.
Likelihood of failure is a less specific term than the above where applied to a single asset. The
likelihood is assessed in terms of its predicted failure modes, effects, condition indicators and
expected rate of deterioration. It is, therefore, “only a model” relying on how comprehensive the inputs
are and how effective is the expertise available to translate measured values to a time scale for the
rate at which the asset will deteriorate to the point of failure.
Failure free period is a useful estimate for managing possibilities of an in-service failure. It is the
outcome of a result that could be based upon probability theory or an estimate of likelihood of failure.
26
TB 858 - Asset Health Indices for Equipment in Existing Substations
Intervention
These are activities to remedy a malfunction by maintenance, repair, refurbishment or replacement.
27
TB 858 - Asset Health Indices for Equipment in Existing Substations
Timescales are important when creating and using health indices. The response time to changes in
condition can vary between milliseconds up to foreseeable future (up to 15 years). Aspects that need
to be considered include:
Asset condition
Failure modes
Data and its relevance
Condition assessments
Intervention planning
Figure 2.1 – Achieving AHI with 5 identified strategies, each with staged activities
The confidence level depends not only on the range of activity, but also will depend upon the extent
each activity is adequately comprehensive. Important is the age of the data – how often and how
recent are the inspections and out-of-service testing. This can be overcome with permanent on-line
monitoring, but presently only a restricted number of failure modes can be monitored in this way.
28
TB 858 - Asset Health Indices for Equipment in Existing Substations
identifying the significant failure modes. If no more is undertaken its limitation will be that the
diagnostic condition related data will be restricted and aged. It is likely to relate to fewer failure modes.
Some low voltage assets have a value and impact on failure where it may not be worth investing much
more effort than using this Level 1 strategy alone. There may not be any diagnostic data. However, it
is commonly the case that at these lowest voltages there are sufficient numbers in each design family
to use a more statistical approach. This could link design group, separating in terms of its failure
modes and applying Weibull statistics to identify a lifetime in terms of onset of failure for each of these
categories. Service-life may then be related to this lifetime estimate. This is, however, quite a different
assessment from a condition-based health index.
29
TB 858 - Asset Health Indices for Equipment in Existing Substations
fed back and combined with operational data extracted from Historian servers to allow a dynamic
indication of AHI. Over recent years the reliability and longevity of monitoring systems have improved.
Site data management and hardware have improved, with fibre networks and IEC 61850 protocols
enabling greater access between vendor systems. Access to operational data is improving with an
asset management data file being incorporated into the Common Information Model (CIM) by
IEC TC 57.
Keep it simple Use Log Use Log Each are linked to descriptions in Table 2.1
base 3 base 10
Alphanumeric
B 2 3 10 Good condition
The starting point is the description of each of the failure modes and how the condition relating to
extent of deterioration for the mode is reflected from the measured values. This relationship is specific
to the failure mode. The user could utilise the relationships obtained from custom and practice within a
company or rely upon a set of standards. Important is the level of expertise available to undertake this
task. For example relating a dissolved gas value to a dielectric failure mode would involve measuring
specific combustible gases and relating the assessment to the value (ppm of each gas), the rate of
change, and an indication of the type of problem using tools such as the IEEE key gas method or
Duval’s triangles. The failure mode assessment could be improved with additional indicators from
directly measuring the partial discharge activity, by using a UHF probe inserted into the tank for
example. From such an assessment the numerical values may be translated into one of the 5 scale
codes. It is for SC A2 and A3 to identify the link between measured diagnostic values and condition.
SC A2 does do this in the appendix for TB 761 [B3].
More difficult is where the assessment is subjective – how likely is a leak to lead to a failure, for
example. Here failure may relate to a functional failure following a low oil level alarm, but equally relate
to when pollution and its environmental impact become unacceptable. It is then for the user to define
30
TB 858 - Asset Health Indices for Equipment in Existing Substations
the relationship as per Table 2.3. The answer requires past experience within or outside the company.
The key is always to relate it to the likelihood of failure, as given in the text of column 2 of Table 2.1.
For both linear and exponential scoring situations the specialist engineer has a key role ensuring the
sanity of the outcomes. At very least the outcomes must be sufficiently transparent to be audited by an
expert, particularly when significant investment is being indicated from an automated or semi-
automated system for translating observational data into scale codes.
Table 2.3 – Converting condition indicators (observations or measured values) to condition scale codes
OBSERVATIONS – Examples
An important factor when deciding the indication system is how the codes from different indicators are
to be aggregated as a single number or as a summation. This is described later in Section 2.6.
It is important that the scale code selected in such as Table 2.3 is based upon the FMEA analysis to
indicate how the measured values can be associated with specific failure modes and their time scales.
Noteworthy here is that in this simple example (relating to the case study in Chapter 1) most of the
measured test values indicate normal deterioration when they are related to their failure modes. Each
would then be assigned to a condition code 2 (linear) or 3 in a log scale. However, when the
interpretation of gas results changed to use the IEEE Key gas method the higher hydrogen levels after
the fault indicated a dielectric fault had initiated and the assessment changed with that mode changing
its scale code to 3 (linear) or 10 (log). It was not until out of service testing was done, which indicated
a higher risk from a second failure mode (a mechanical failure in the event of a future close up short
circuit) and that this mode required an increase in condition scale code to 4 (linear) or 30 (log).
31
TB 858 - Asset Health Indices for Equipment in Existing Substations
Figure 2.2 – Linking test data to failure modes and to a linear condition scale code
32
TB 858 - Asset Health Indices for Equipment in Existing Substations
aggregate many assets with varying numbers of scale codes in the summation the reader would be
unsure if this value 9 related to adding 9 codes of 1, or just three scale codes of 3, 3, 3 being used.
With a base 10, the sum would again be 9 which is less than the next advanced scale code score
of 10. Put another way, if attempting to combine scores for a bay analysis, the number of scale codes
needs to be less than the number to promote the sum to the next AHI band. This is most relevant
when attempting to aggregate many assets with large numbers of scale codes and varying numbers of
scale codes across the asset types. This is a topic to be pursued later in Chapter 5.
Table 2.4 – Aggregating summed scores with linear and logarithmic scoring
Weighting of scores
One of the ways some have tried to make units with high risk of failure stand out when using linear
scoring has been to use a weighting factor on each of the criteria. An example is shown in Table 2.5
[B15].
Table 2.5 – Effect of weighting linear scores [B15]
Here the very methodology pre-supposes the answer as to what criteria are most likely to cause
failure, and any manipulation of scores loses the conceptual appreciation of what is going wrong.
Some readers might argue why this user gave acetylene value the lowest rating, or why ethylene be
three times more important than the LTC oil, and so on. The assessment will be subjective and not
readily useable as a generic method: the preferred alternative is not to get involved with approaches
involving weighting of data.
Similarly, translating aggregated weighted systems into a likelihood of failure (LoF) is not simple or
direct, since higher (or lower) scores do not represent a higher LoF. This system might be relatively
easy to understand, but the dilution effects of the aggregated weighting rob the system of meaning.
However, the result will not be directly relatable to LoF. Units at the top of the list with the “worst”
scores may not be the units that fail and so invalidate the usefulness of this approach.
33
TB 858 - Asset Health Indices for Equipment in Existing Substations
The problem comes when there are more than 9 values in any category. It is not clear in TB 761 [B3],
however, what experience exists in the use of its recommended methodology. This method is better
than using the highest code alone since it indicates that there are several other aspects that could
cause an early failure. Critically it also allows linear scores to be used for single assets without losing
bad results when aggregating. Again, the weakness comes when there are so many scale codes that
there could be a double-digit score in one or more category, thereby destroying the five-digit combined
score approach. As with the log score approach it is most suited to single assets with a modest
number of scoring codes.
The method suggested in TB 761 overcomes many of these limitations because it does not attempt to
sum – just to record the number in each score code. It does require confidence that the indicators that
you have are sufficient to cover all relevant failure modes and that all indicators are properly calibrated
with respect to each other (same scoring indicator means same probability of failure). Also, you need
to assume then that all indicators are independent. If these requirements are met, then even in case of
different number of indicators the enumeration system should work properly.
There is further discussion of this scoring and aggregation topic in Chapter 5.
The conclusion there is that both the log scoring system and the TB 761 method may be useful
when creating AHIs at a single asset class level but not when trying to aggregate outcomes of
many assets with many failure modes and where the assets differ in number of failure modes
and scale codes.
34
TB 858 - Asset Health Indices for Equipment in Existing Substations
Method Comment
Adding linear scores Badly scoring codes are averaged out. See Table 2.4.
A
Condition indices are in a linear set of condition scores, NOT RECOMMENDED
(1-5 or 1-10) and aggregated by adding all individual
scores to reach an AHI as a sum or averaged to
normalise assets with different inputs.
Using worst score Will identify presence of WORST defect that could
C cause failure. But where several failure modes with the
Condition indices have a linear score, 1-5 or 1-10 as
above but then using only the score of the attribute same score are present this will not be apparent. It will
having the highest (worst) value as the AHI. not reflect the overall condition of the asset or provide
any granularity for prioritisation within the asset class.
In the example in Table 2.4 it means only the single
scores of “3”, “4” and “5” are used and all with lower This can work and is most useful when
scores are ignored. consolidating bay or substation wide scores.
Adding log scores Gives single number and reflects general condition. Will
D also identify presence of weak attribute that could
Each condition index has a logarithmic score, say 1, 3,
10, 30, and 100, and then the scores are summed. cause failure.
This clearly identifies the worst failure mode scores as But it will have problems when aggregating both with
with (C) above and also gives a numerical appreciation of single assets each with many modes with same score,
other higher scores. and where there are many different assets in a Bay or
In both references [B14] and [B15] a sum with a base 3 substation wide AHI – see chapter 5.
logarithmic score was used. It can be used in a prioritised This will be most suitable for scoring single asset
table display for a single asset type. types when creating prioritised actions for that
asset class.
TB 761 approach Loses unique single number but will reflect the range of
E scale scores for an asset. Will also have problems
Use linear scores, say 1-5, and include all in each
category as per A2 TB 761. The score can be used in a when aggregating many assets in a bay or substation
league table display of outcomes. This will allow wide AHI.
prioritisation within the asset population. This also will be more suitable for scoring single
asset types when creating prioritised actions for
the asset class.
35
TB 858 - Asset Health Indices for Equipment in Existing Substations
36
TB 858 - Asset Health Indices for Equipment in Existing Substations
37
TB 858 - Asset Health Indices for Equipment in Existing Substations
Review levels
The review level would be decided for each asset as illustrated in Figure 2.1. At its simplest a review
could be only office based and analysing data as obtained and identified in preceding sections. More
detailed strategies will provide more comprehensive activities and results to assess the current
condition. In particular, they will relate to the capability of the chosen diagnostic strategy to identify all
significant failure modes that are developing. This point was made in Chapter 1 with an example of a
transformer failure mode requiring an out of service test to assess winding movement.
Deciding just how comprehensive the AHI process should be can be addressed through a cost benefit
analysis. The degree of rigour will vary, but inevitably will identify the cost of impact from an in-service
failure and relate it to the cost of undertaking and implementing an analysis.
The FMEA process is part of Step 2 where the aim is to identify relevant failure modes and to link
them to diagnostic indicators. This also has a requirement that all diagnostic data being used has the
same quality relative to the speed of development of each failure mode. For example, the decision
might have been to work to Level 4, but for some assets out of service testing had been delayed. The
assignment would then be judged on a restricted basis of Level 3. This could also happen if the time
from the last measurement had been too long. Restricted data means that some failure modes are not
being assessed and a decision is required as to the consequences. What should happen is that the
assessment is made against all failure modes for which the data exists. It would then be given a
restrictive marker identifying modes missing or very aged. How this is handled is described later in
Step 5.
38
TB 858 - Asset Health Indices for Equipment in Existing Substations
Documentation
This information provides historic documentation held within company databases and other data
storage systems. In addition, the information relating to the performance of the particular design in
both factory evaluations and service experience should provide the basis for identifying failure modes
and their diagnostic indicators.
39
TB 858 - Asset Health Indices for Equipment in Existing Substations
Documentation from the manufacturer and held by the utility Documentation on policies in utility
Copies of manuals Maintenance policies and practices
Original factory test results and standards applying now relevant – to acquire actual
activities identified for each unit
Relevant design standards used for this build (IEC/ IEEE etc.)
Policies and practices relevant to
Specification requirements (BIL, short circuit, etc.) diagnostic testing
Maintenance tasks, intervals, materials identified both when new by the Operational policies and practices,
manufacturer and subsequently by the company factors that affect the mode of
Failure investigation reports on this and similar design of units operation of the apparatus
Service advisories from the manufacturer
Details from manufacturer/repairer relating to any rebuild/refurbishment
done on this specific design group
Standards
The standards used at the time of manufacture, which now may be considered to have been
inadequate, are identified; examples are withstand tests such as the Basic Insulation Level (BIL).
Designs also have changed over the years that improve or reduce performance and expected lifetime.
Identifying the design practice at build is important, therefore, to predict future performance in these
areas.
Factory Information
Original test data for the unit and accessories should be available for comparison with the specification
and also in-service results. Problems found either in manufacture or factory test with these units can
be identified and assessed against service experience. This might include poorer performance in
tests, such as for heat run results that could affect normal thermal rating and any consequence of
overloads. Details of any major rebuild should be identified, and related reports often give a good
indication of the rate of aging generally to be found.
Failure information
This comes from the malfunction reports of the asset:
failure effect (major, minor failure),
failure mode failure rates.
40
TB 858 - Asset Health Indices for Equipment in Existing Substations
In case the failure information is not available, or the quality of the information is not good enough an
alternative can be to use the service experience of in-house technicians and technical engineers
to retrieve some information on the failures.
Number and duration of forced outages caused by the asset
Maintenance man days used and material cost for the repairs
Disruptive consequences (failure effect) of past outages of the unit on the system
Failure modes and investigation reports on this and similar design units
Event records from SCADA, which may indicate frequency of system faults
Sometimes useful information can come from publications – such as CIGRE technical brochures such
as TB 642 (transformers) [B1] and TB 509-513 (substation equipment) [B2], [B21], [B27], [B29]. These
provide general outcomes which are helpful but do not provide the above specific information for
particular designs. Utility trade organisations and some service providers do, however, compile design
specific failure information.
Maintenance policy
The differences between the manufacturers' original maintenance plan, what activity has actually been
done over the years, and opinion as to best practice maintenance policy and practice should be
identified. This may indicate shortcomings that could affect future reliability. A review of maintenance
work undertaken indicates the problems encountered with the unit, their extent, and cost. This can be
used to indicate integrity and likely future performance risks, as well as yielding key performance
indicators for reliability and cost.
41
TB 858 - Asset Health Indices for Equipment in Existing Substations
Here the five criteria are colour scored with the worst colour being carried through to the summary
column as a colour and with the criteria having this worst score indicated as shown.
42
TB 858 - Asset Health Indices for Equipment in Existing Substations
43
TB 858 - Asset Health Indices for Equipment in Existing Substations
44
TB 858 - Asset Health Indices for Equipment in Existing Substations
Data is available but it is low data quality – Data quality is an important point in the asset
health index process. If some data is poor in quality, a data cleaning campaign could be
necessary to augment the data quality before using it within an AHI process. Using data of poor
quality will result in a wrong asset health index.
Data unit’s conversion – For the same data, it is sometimes possible to use different units
(example pressure in MPa vs bar, SF6 dew point at nominal pressure vs at atmospheric
pressure, etc.). In order to correctly interpret the measurement results, al results must be using
the same units within the whole company.
Missing time stamps – In order to have the latest information and decide on the overruling,
each data point must have a valid time stamp.
There needs to be some form of marker indicated in the AHI tabulation that identifies that the AHI
includes a restriction associated with data quality – as per Erreur ! Référence non valide pour un
signet..
Table 3.4 – Indication of restricted data and limited confidence
Colour Meaning
GREEN Figure 2.1 is correctly assigned and all data matching failure modes exists and has been used.
ORANGE Data as identified above is missing or poor quality. However, the effect on the AHI is not
considered drastic.
RED Data as identified above is missing or poor quality. The AHI is not reflecting all failure modes
effectively.
This restriction on AHI was also considered in a more complex way, with scoring of quality levels –
see Table 4.3 of TB 761.
Before buying or developing new applications for data collection and storage, a proof of concept in a
spreadsheet for a limited number of data points is advised. This will help the utility later on in
describing the business requirements for the software required.
45
TB 858 - Asset Health Indices for Equipment in Existing Substations
A transformer for example could be considered as comprising the main unit with tap changer and
bushings. Alternatively, the AHI could be computed for each category.
It is advised to set up the model by grouping different condition indicators with relation to the same
sub-component or the same failure mode into one sub-AHI. Each sub-AHI is determined by analysis of
the underlying condition indicators. These sub-AHIs will later on be combined to one overall Asset
Health Index (AHI). The important advice from this Working Group is that simple linear or weighted
scores from individual failure modes or subcomponents should not be added – see Chapter 2. Adding
would average out the scores and mask the badly scoring element. (With logarithmic scores 1, 3, 10,
30, and 100 masking when adding is far less likely). The important step is to use a method of
aggregation capable of identifying the highest scoring failure modes and its indicators.
Step 7.1: Aggregate condition scale code scores to a sub-health score and
asset health score
Once a score has been given to all condition indicators, a consolidated score by sub-component or by
failure mode can be produced. It is advised in this step to continue working with the condition indicator
score and not with the condition indicator index in order to keep enough detailed information of the
potential problems.
This conversion from measured values to 1-5 scores in the likelihood of failure by a particular failure
mode indicated in Table 2.1 is done by using pre-agreed functions defined by the user in order to
calculate a score by sub-component or failure mode. This is the most difficult part and where the
subject expert has a role, as do international standards such as IEC and IEEE, best practice guides,
etc.
The sub-AHI and AHI can be used to visualize in an easier way the estimated condition of the sub-
component and/or the asset. In order to achieve this, a relation between the final score and the AHI
must be given by the user. This relation depends on the scoring method and function(s) used.
Table 3.5 – Example showing the relation between score and AHI
Here the dominant score is in the OLTC but can be improved with remediation. Nevertheless, the
reportable AHI score for this asset would be 4 should only the highest (worse) score of a linear scoring
system be used. With a base 3 log scale the total would be 10 + 30 + 10 = 50. This indicates that a
score of 30 has been given to one subcomponent, but that there are other areas with a score
indicative of significant deterioration.
46
TB 858 - Asset Health Indices for Equipment in Existing Substations
From step 1 From step 5 Based only on 5 FMs in this example Alternative AHI approaches
Col 1 Col 2 Col 3 Col 4 Col 5 Col 6 Col 7 Col 8 Col 9 Col 10 Col 11 Col 12
etc. Note example with no data for one FM NOTE R in score of example unit 2 to denote
restricted assessment.
47
TB 858 - Asset Health Indices for Equipment in Existing Substations
4. Applied methodology
The following chapter describes the application of the generic methodology described in Chapter 3 to
a range of substation assets. The structure again follows the eight steps of Figure 3.1 in Chapter 3. In
each of the sections 4.1 to 4.10 the asset is considered as a functional unit. Each chapter has been
prepared based on experience and also to illustrate differences in scoring systems and if aggregation
is used. Such a ranking is based on a single asset/functional unit basis and would be the source
information for prioritising maintenance, repair, refurbishment or replacement. How outcomes are
combined into AHI for bays, circuits or substations is considered within Chapter 5.
Cost implications
An early decision is to decide upon the diagnostic strategy as per Figure 2.1, and this involves a
simple cost benefit. The first step is to identify the consequences of an in-service failure and typical
inputs are listed in Table 4.1.2.
The next step is to review the extent to which diagnostic data is already being gathered within the
company, and the cost implication of implementing more comprehensive data. Also included is
provision to indicate the type of failure mode that may be identified.
Table 4.1.2 – Consequences of Failure
48
TB 858 - Asset Health Indices for Equipment in Existing Substations
Diagnostic indicator data available in utility Failure modes being Failure modes not
assessed being assessed
Visual surveys Y/ N
Survey diagnostics Y/ N
Oil analyses
Oil Quality
Dissolved gases
Paper ageing
OLTC oils
Gas analyses
Gas leakage
Infra – red surveys
UHF – PD surveys
UV Corona surveys
Timing
Other
49
TB 858 - Asset Health Indices for Equipment in Existing Substations
Documentation from the manufacturer and held by the utility, often at the substation Availability – Y/N
Copies of manuals
Original factory test results and standards applying
Relevant design standards used for this build (IEC/ IEEE etc)
Specification requirements (BIL, short circuit etc)
Maintenance tasks, intervals, materials identified both when new by the manufacturer and
subsequently by the company
Failure investigation reports on this and similar design units
Service Advisories from the manufacturer
Details from manufacturer/ repairer relating to any rebuild/ refurbishment done on this specific design
group
50
TB 858 - Asset Health Indices for Equipment in Existing Substations
Table 4.1.6 – Level assessment – example of review of failure mode susceptibility factors
51
TB 858 - Asset Health Indices for Equipment in Existing Substations
For most substation transformers and reactors it would be a normal requirement to migrate quickly
away from an asset health index having a simple Level 1 confidence to one of the more
comprehensive Levels, 4 and 5 in Figure 2.1. Which level would depend on cost benefit factors
included in the following list:
Transformer and reactor purchases in transmission and generation utilities have a big impact on
capital expenditure (CAPEX) budgets and it is important to identify likely life expectancies for the
utility stock in order to justify investment planning requirements.
Unlike switching assets, many of the failure modes in large transformers are not repairable on
site and usually lead to asset replacement. However, operating in a colder climate a network unit
in a shared load N-1 configuration can last up to 80 years before these occur [B10].
GSU and interconnector failures present a risk to the continuity of the grid and its income earning
capability. An unexpected early life failure can have a significant impact. For network units the
risks are less when in N-1 parallel operation.
Some outage times can be considerable, depending upon the spares holding and logistics of
transporting the replacement to site.
Some failure modes are catastrophic with safety and environmental implications from fires,
explosions and release of insulating fluids into the environment. This is particularly the case
where bushing failures are often catastrophic and destroy the transformer in the process. On-line
diagnostics is valuable where this is a perceived risk.
Some important failure modes require offline diagnostics – this includes mechanical deformation
of the winding, clamping and core movement. Ageing of the DDF/capacitance measurements. Oil
testing to reveal bushing degradation can be effective, but some utilities have restrictions on oil
sampling of factory sealed equipment [B5]. Gaining an outage to allow oil sampling and/or offline
testing is a problem for some.
Purchase files
The purchase documents should include design information such as the winding configuration,
materials and manufacturing processes. It should also identify the oil preservation system, tap
changer, and cooling system. The duty should be identified – network, GSU, interconnector etc.
Performance of the unit or design group in factory acceptance tests. This would include poorer
performance in tests, such as for heat run results that could affect operating at both normal
thermal ratings and any as a consequence of overloads.
Original specifications define the user’s requirements for ambient temperatures, load, voltage,
power factor, source impedance, lightning levels, short circuit withstand, acceptable losses,
source impedance and load power factor. Some of these factors may now have evolved such
that these requirements are inadequate for the current and future operating environments. The
purchase specification is, therefore, an important document to review in light of current
manufacturing standards, and against the actual operating environment.
52
TB 858 - Asset Health Indices for Equipment in Existing Substations
In light of these specifications discussions and documentation within the utility should indicate
any significant requirement to operate the unit above specified conditions, exposure to unusual
levels of short circuits or switching transients, DC carry through, harmonics, extent of reverse
power flow, etc. together with historic relocations, whether system voltages are at the top of the
voltage range and over-fluxing is a possibility, and whether fault frequency or levels are higher
than specified. The latter could occur if source impedance or earth impedance have changed, or
protection is slower than specified.
Standards used at the time of manufacture which now may be considered to have been
inadequate are identified; examples are withstand tests such as BIL. Designs also have changed
over the years, using Roebel conductor transpositions, interleaving, winding in discs rather than
layers, directed oil flow, belted cores rather than core bolts are all examples of changes that
improve thermal performance and impulse withstand. Identifying the design practice at build is
important, therefore, to predict future performance in these areas.
53
TB 858 - Asset Health Indices for Equipment in Existing Substations
Winding issues – Thermal, Thermal degradation by depolymerisation of Oil sampling for DGA.
Hotspots, poor thermal design the insulation leading to inter turn failures Furan measurements
Overloading Connections Off-line DDF/C
Operating at ambient temperatures higher Winding resistance for connections
than used for rating calculations. Infra-red scan for pumps and fans
Malfunction of pumps, fans and control
systems
Bushing failures due to ageing, design Overheating, PD, loss of insulation quality Visual inspection of Oil level gauge for
issues, moisture ingress. Often explosive with safety and loss of oil
Connection failures environmental impact Broken sheds
Loss of oil May lead to transformer fire On-line scanning UHF PD and IR
External connection overheating On-line monitoring for DDF/C and PD
Out of service DDF/C tests required
Tap changer issues from oil ageing, Overheating and partial discharge Position indicators, temperature
corroded contacts, stuck mechanisms Explosion risk sensing, DGA, UHF-PD and IR scans
Winding resistance
Protection- water ingress and corrosion, Malfunction and failure to perform Visual inspection, testing
service outstanding, aged system
Pumps and fans not working or inefficiently Winding overheating Visual inspection, Infra-Red
Oil containment concrete cracks, leaks, Environmental impact if oil spilt Visual inspection
water blockage
54
TB 858 - Asset Health Indices for Equipment in Existing Substations
inspection. Included within this assessment is the effect of moisture estimated from oil testing. The first
stage is to review the load history over the life of the units. From these representative periods are
selected and the loss of life calculated using equations from relevant standards (e.g. IEEE C57.91)
and from temperature data. Ideally, the hotspot temperatures are available.
55
TB 858 - Asset Health Indices for Equipment in Existing Substations
Scale code Scale code Description Fault free time Remaining life
log linear
1 1 Very good condition >25 years
5-10 years before maintenance on OLTC, cooling Or 40-80y in network
3 2 Good condition system, oil etc. 15-40 years
Visual inspection
Confirmation of the asset register data and the equipment actually in each bay is recorded correctly.
Results from an external visual assessment, Figure 4.2.1 may be used to relate to failure modes and
relationships between observed data and risk of failure as per Erreur ! Source du renvoi
introuvable.
Table 4.2.3 – Visual Inspection
Step 5 Step 6
56
TB 858 - Asset Health Indices for Equipment in Existing Substations
Figure 4.2.1 – Inspection example findings – Tank rusting and a stuck WTI
Step 5 Step 6
57
TB 858 - Asset Health Indices for Equipment in Existing Substations
Installed monitoring
In addition, there are several permanently installed systems now available.
Table 4.2.5 – On-line monitoring
Outage investigations
Unusual results from online survey methods are best investigated further with offline testing, targeted
at specific failure modes. Outage testing is also appropriate for detecting some failure modes.
Table 4.2.6 – Offline and investigative testing
Result of Tests – based on level and rates of change AHI Score 1-5 or log 1-100
Winding Dielectric Dissipation Factor (DDF)/capacitance detecting deteriorated primary
insulation, mechanical deformation
Bushing DDF/capacitance detecting deteriorated primary insulation
Dielectric frequency response giving some indication of moisture content
Turns ratio detecting winding conductor issues
Leakage reactance and excitation current detecting issues in the magnetic circuit
Sweep Frequency Response Analysis (SFRA) detecting winding movement
Winding DC resistance for connection issues in winding and tap changer
Core earth resistance (for core form designs) for single point or loss of earthing
58
TB 858 - Asset Health Indices for Equipment in Existing Substations
The scoring in Table 4.2.7 is used to compile the AHI Table 3.6 with the ten failure modes. Whilst
there may be six bushings each with several failure modes, it is their collective assessment that is
used. If assembled using a log score the summation column 9 can be displayed as a list from most
likely to least likely to fail. Displaying as a spreadsheet also has the added value over a number
spewed out of an algorithm. In this way the engineer can see how the AHI numbers are derived.
59
TB 858 - Asset Health Indices for Equipment in Existing Substations
Circuit breakers
Step 1: Identify assets and decide review level
Figure 4.3.1 – Live tank (left) and dead tank (right) circuit breakers
Figure 4.3.1 shows two of the main types of breaker in use today in air insulated substations [B9]. The
interrupters are either at high voltage and the operating mechanism is at the bottom of the insulated
stack (i.e. “live tank”), or, all is at earth potential (i.e. “dead tank”). A third category is the GIS breaker
which is fully enclosed within the GIS trunking.
Over the years the insulating medium and arc extinction principle has changed from oil and air blast to
SF6. The SF6 breaker has been the preferred choice at transmission voltages for many years, but all
types are still found in many countries. There is now increased interest to develop alternative or mixed
gases and also to use vacuum. A circuit breaker has two states; to be either fully insulating and
withstanding the system voltage across the terminals, or fully conducting and passing system current
through the contacts. The breaker needs to be able to pass from one state to the other in a few
milliseconds and to close circuits without generating excessive overvoltages on the system. Selecting
the number of circuit breakers to install at a substation is important to allow sections to be isolated with
minimal disruption.
The major application is essentially passive, to be there to operate and protect primary assets (and
hence provide system security) should the system be subjected to an abnormal fault current. The
breaker then operates and the fault current in the circuit is interrupted. In contrast, there is second
application where the breaker is operated hundreds of times a year, and here it is to manage system
current and voltage by switching in reactors and capacitor banks. In the former application operation is
initiated by the protection system; the latter is by a control command. The command to open is made
by sending tripping signals to the circuit breaker mechanism trip coils. Each interrupter consists of a
fixed part and a moving part. When the interrupter is opened, the SF6 gas is propelled through the
created gap forcing the arc to be extinguished.
There are a number of different types of mechanism that store the energy required for driving the
moving contact. The most common today is the spring type where the opening and closing strokes of
the circuit breaker are performed by releasing charged springs. Earlier designs used hydraulic or
pneumatic drives or charged capacitors.
Since high voltage circuit breakers provide critical protection of high value primary assets such as
generators and transformers, as well as facilitating correct operation of the system, there must be one
of the highest levels of investment in creating AHI analysis.
60
TB 858 - Asset Health Indices for Equipment in Existing Substations
these critical failure modes can be a sensible option to choose for health index calculation. TB 510
reports on the major failures of 840 circuit breakers and provides a list and distribution of 25 major
(MaF) and minor (MiF) circuit breaker failure categories. (See Table 4.3.1 au-dessous.)
Table 4.3.1 – Distribution of CB failures per cause
Additionally, TB510 provided the Table 4.3.2 identifying the MaF modes:
Table 4.3.2 – MaF modes
61
TB 858 - Asset Health Indices for Equipment in Existing Substations
Both Tables could assist the assets managers in evaluating the performance of the circuit breakers
and identifying the condition indicators which would be essential for the specific failure mode.
Also, TB 167 [B18] provides detailed information about relationship between condition indicators and
components’ functions, which can be associated with failure modes. Bear in mind that though these
references are highly informative, asset managers still have to consider specifics associated with their
assets in order to determine suitable condition indicators.
Table 4.3.1 shows multiple failure causes that could be attributed to FSI’s and its impact on the major
and minor failures of the circuit breakers.
If there is an obvious FSI to influence the performance of the circuit breakers, asset managers should
keep an eye on these matters and take proper action to improve the situation. With the specific
selected model to predict value of the health indices, these factors can be reflected in a quantitative
way in the calculation of health index.
62
TB 858 - Asset Health Indices for Equipment in Existing Substations
63
TB 858 - Asset Health Indices for Equipment in Existing Substations
IEEE C37.10 Guide includes twelve different tables, which describes the diagnostics of various types
of circuit breakers and characteristics for these assets [B19]. Based on this information and various
previously established failure modes, the condition indicators for each failure mode should be
determined (see Table 4.3.3 au-dessous). The table contains the typical failure modes for the
components which contribute to the primary functions and the related condition indicators. These are
mainly based on the structure presented in the TB 167 and the TB 737 [B18] and [B17].
Note that the listed condition indicators are the ones regarded as essential/high rank in the
maintenance/life cycle assessment in the TB 167, or ones described as standard practice in the field
in the TB737 respectively.
The brackets in the Condition indicators column indicate the respective component from the
Components column.
Table 4.3.3 – Examples of condition indicators related to components and failure modes
Vibration (C)
Charged/Discharged indicator (S)
Does not operate on command Main contacts velocity (C)
Mechanical drive Change in mechanical functional characteristics Opening/Closing/Arcing Time (C)
Common (C) Mismatched (unexpected) operation Number of fault operations (C)
Hydraulic operated (H) Loss of pressure in hydraulic mechanism Mechanical operations counter (C)
Pneumatic operated (P) Fluid leakage Contact travel curve (C)
Spring operated (S) Hardened lubricant Coils current (C)
Springs (close/open) not fully charged Number of motor pump starts (H)
Motor current (C)
Recharging (running) time (C)
64
TB 858 - Asset Health Indices for Equipment in Existing Substations
checking more frequently by online monitoring. In this case, cost benefit analysis should be done by
considering the cost of monitoring.
The following Table 4.3.4 shows a list of condition indicators for circuit breakers estimated with
regards to their criticality, i.e. correlation to primary function and detectability as an example. The TB
167 [B18] describes “Rank” for each condition indicator with regard to life assessment, which is
informative to estimate the correlation to the circuit breakers’ primary functions of making, carrying and
breaking currents under normal and specified abnormal circuit conditions. Also, the TB 737 introduces
“Degree of maturity” for each condition indicators, which is useful to estimate the detectability [B17].
The detectability level scores based on the example Table 4.3.4 where score 1 indicates easy to
detect and score 10 indicates it is hard to detect a failure.
Table 4.3.4 – Example of condition indicator estimation for circuit breakers
Effectiveness or
No. Condition Indicator Correlation to primary function detectability of diagnostic
(score 1 – 10)
65
TB 858 - Asset Health Indices for Equipment in Existing Substations
66
TB 858 - Asset Health Indices for Equipment in Existing Substations
In service P Specification,
In service P OEM design and insulation type,
In service P Operating mechanism,
In service P Nameplate information,
1 Office study
In service P Switching application (overhead line, transformer, cable, shunt reactor,
capacitor, bus coupler, others),
In service P Spare assets/parts available,
In service P Maintenance records
Table 4.3.4 shows examples of scoring condition indicators. Some condition indicators are scored
intermittently because they are not to keep track of its trend, but to indicate just “good”, “bad” and so
forth at the moment the indicator was measured. Note that normal values are scored as “0” to avoid
masking abnormal indicators. On the other hand, other condition indicators are scored consecutively,
because their trend is important to catch abnormality. It is essential to consider the trace of these
condition indicators to catch abnormal symptoms at an early stage for appropriate actions. Typical
condition indicators are described in Table 4.3.3 in terms of these classifications.
67
TB 858 - Asset Health Indices for Equipment in Existing Substations
a) Intermittent scoring
b) Consecutive scoring
Figure 4.3.4 – Aggregate condition scale scores into an asset health score
68
TB 858 - Asset Health Indices for Equipment in Existing Substations
Thanks to the consideration in Step 3, the relations between condition indicators and key failure
modes are easy to understand.
Table 4.3.7 shows an example of overall comprehension about health indices according to AHI
definition described in Chapter 2. In this example from Japan a more intensive analysis is used so
allowing the AHI to be scored as a percentage (column 2).
Table 4.3.7 – Comprehension about health indices
1 Very good condition 0-5 Continue with inspect and test schedule.
Mitigation actions would include replacement, refurbishment, inspection, do nothing etc. What
counts is to predict future AHI value assuming possible actions in order to make rational
decisions based on objective criteria.
69
TB 858 - Asset Health Indices for Equipment in Existing Substations
This section focuses on disconnectors and earthing switches. This equipment is used in both Air-
Insulated and Gas-Insulated Switchgear (AIS and GIS). The chapter describes the principles in
the applied methodology for generating an AHI. Focus is given to AIS. For GIS specifics please
refer to Chapter 4.6.
An overview of the main tasks of disconnectors, earthing switches and fast earthing switches is
presented in Table 4.4.1.
70
TB 858 - Asset Health Indices for Equipment in Existing Substations
Hot-spots via infra-red image, PD (UV camera, UHF-PD or ultrasound locator), anti-
3 Diagnostics - in service
condensation heaters
Supply voltages, mechanical chain (e.g. friction, alignment), operating times, power
4 Diagnostics - out of service
consumption of drives, operation times, interlocks
Operation times, contact wear, power consumption of drives, operation times, temperatures
5 Online monitoring - in service
(inner/outer), power consumption of auxiliaries e.g. heaters
Within the distribution network (typically Un = 110 kV) an AHI Level 2 is recommended, since the
network is designed redundantly (n-1) and the centre break type is not used. For the transmission
network the AHI Level 3 may be advisable (if AHI Level 2 does not apply). Due to certain conditions or
applications such as Un >> 380 kV, as well as extremely critical nodes and power circuits for the
connection of transformers and power plants, AHI Level 3 or 4 is applied.
The report contains the conclusion that most of the MaF are associated with the drive and kinematic
chain.
71
TB 858 - Asset Health Indices for Equipment in Existing Substations
Based on this report the data shown in the following table displays the summed values for the minor
and major faults (drive only) regarding the three main types of DS and ES.
Table 4.4.3 – DS and ES: Failure mode of drive only by type of drive (Sum MaF + MiF) [B21]
Failure Mode of Drive only (DS and ES) Classification Electric Motor Pneumatic Manual
Change in functional characteristics of control of auxiliary systems MiF 34% 16% 22%
The evaluation leads to the conclusion that independent of the investigated type of drive (electric
motor, pneumatic, manual) the data main failure mode is "does not operate on command". An
important note is that none of the values presented exceeded 50%.
Table 4.4.4 displays the failure mode excluding drive for different types of DS and ES. For this display
the absolute values are summed up for MiF and MaF and transformed into percentages by dividing
them by the total number of reported failures (see [B21]).
Table 4.4.4 – DS and ES: Failure mode excluding drive (Sum MaF + MiF) (Table 3-60; Table 3-59 in [B21])
Change in mechanical
MiF 28% 31% 52% 15% 29% 26% 33%
functional characteristics
Total of reported failures MaF + MiF 839 381 135 92 14 140 196
The failure modes with the highest relative values are depending on the type. The "Change in
mechanical functional characteristics" is the leading cause for failure regarding the types centre break
DS, knee type DS and ES. For the double break DS as well as vertical break DS and pantograph the
main failure mode is "Change in electrical functional characteristics".
Figure 4.4.2 displays the distribution by failed subassemblies separated in MaF and MiF. In general,
the drive is the main cause for failure.
72
TB 858 - Asset Health Indices for Equipment in Existing Substations
Malfunction in the
Does not operate on command Broken/ rusted parts
mechanical chain
Change in mechanical functional Broken drive rod/ partly broken insulator (by
Mechanical overload
characteristics optical inspection)
Wire rupture / failed Change in functional characteristics of Open connection; auxiliary switch does not
component control of auxiliary Systems open/close accordingly
73
TB 858 - Asset Health Indices for Equipment in Existing Substations
The components of disconnectors and earthing switches can be divided into the following groups:
Current path and contact system
Insulating system
Operating mechanism and mechanical chain and
Control and auxiliaries
The following sections describe the typical deviation, wear and defects separately for each group. With
regard to AIS none of the following defect necessitate that the switch reached the end of its usable life.
Generally, repairs are possible and cheaper if spare parts are available and not more than one
component parts/component unit must be replaced.
74
TB 858 - Asset Health Indices for Equipment in Existing Substations
Relevance
Failure Mode Condition Indicators Diagnostic Strategy
(FMEA)
Increased pump-starts
DS/ES with pneumatic or Does not operate on Level 3 as standard, higher
High leakages (oil, air)
hydraulic operated drives command 4 on request
lack of aux. voltages
Visible damages
Redundancy:
Surface conditions (rust, pollution)
Level 2 as standard
Visible misalignment
Knee type, Centre break Change in Check contact alignment in open and
or Semi-pantograph mechanical functional High
design characteristics close position (visible)
No redundancy:
Check contact alignment in open and
Level 3 as standard,
close position (metrological)
higher on request
Contact resistance
DS 1 DS 2 DS 3
Surface clean,
Surface clean, no Surface dusty, no Some shields broken,
Insulators 1 1 2 3 4 30
damages damages Slight cracks in
cement
Increased contact
resistance
Contacts Visible OK 1 1 Slight burn marks 3 10 4 30
Contacts burnt
Slight misalignment
Aux. supplies OK 1 1 OK 1 1 OK 1 1
75
TB 858 - Asset Health Indices for Equipment in Existing Substations
DS 1 DS 2 DS 3
Anti-
condensation
Anti-condensation heating Ok Anti-condensation
heating Ok Aux. switch in heating Ok
Control place and fixed
Aux. switch in place 1 1 3 30 Aux. switch in place 1 1
circuits
and fixed Loose wires and fixed
No loose wires found No loose wires
Motor increased
noise
Some loose
Corrosion Some loose paint, no Lot of loose paint,
2 3 paint, getting 3 10 4 30
protection rust rusty
rusty
Replace asap
No special action required.
Step 8 Plan maintenance/repair (In this example repair technically and
Next inspection acc. schedule
economically not meaningful)
The AHI should indicate time scales for action to deal with any adverse score. It may involve cost
benefit analysis to decide between repair, refurbish or replace.
76
TB 858 - Asset Health Indices for Equipment in Existing Substations
Instrument Transformers
Step 1: Identify the assets and decide on review levels
Instrument Transformers (IT) – The most common usage of instrument transformers is to access
instruments or metering from high voltage or high current circuits, safely isolating secondary control
circuitry from the high voltages or currents. The primary winding of the transformer is connected to the
high voltage or high current circuit, and the meter or relay is connected to the lower voltage secondary
circuit. Non-conventional ITs are not covered in this TB as there is not enough experience with their
failures yet.
Current Transformer (CT) – An instrument transformer intended to have its primary winding
connected in series with the conductor carrying the current to be measured or controlled.
Voltage Transformer (VT) – An instrument transformer intended to have its primary winding
connected in shunt with a power supply circuit, the voltage of which is to be measured or controlled by
a secondary winding where the signal is proportional to the actual prevailing value on the primary.
Inductive Voltage Transformer (VT) – A voltage transformer that uses a transformer to step
down the voltage
Capacitor Voltage Transformer (CVT) – A voltage transformer that uses a capacitive potential
divider, inductive element, and an auxiliary transformer to step down the voltage.
Combined Current and Voltage Transformer (CCVT) – An instrument transformer that combines a
magnetic voltage transformer and a current transformer in the same device.
Safety
High impact
Most of the failure modes are catastrophic with safety and environmental implications, which can
damage surrounding parts and cause high follow-up costs.
Cost
Low to Medium impact
The CAPEX cost for the preventive replacement of instrument transformers are low.
The OPEX costs for the inspection and maintenance of instrument transformers are low. Online
diagnostics on instrument transformers are not common and quite expensive in terms of OPEX
costs.
The repair costs after a breakdown can be quite high due to the collateral damage caused by an
exploding instrument transformer.
77
TB 858 - Asset Health Indices for Equipment in Existing Substations
Advanced Asset health review Level 4 The reduction of dielectric withstand capability as well
as other failure modes, can be evaluated from the results of outage testing. Periodic outage
testing would be appropriate where these failure modes are considered a concern.
Advanced Asset health review Level 5 The reduction of dielectric withstand capability failure
modes can initiate and progress in a short time inconsistent with routine survey or out of service
testing. Then online diagnostics with permanently installed diagnostic systems is appropriate
where catastrophic failure could lead to safety or environmental consequences.
Main internal insulation Reduction of dielectric withstand capability Deterioration of bellows (O)
Common (C) Internal dielectric failure (explosion) Oil leakage (O)
Oil (O) SF6 leakage (S)
SF6 (S) Moisture content of the asset (O)
Resin (R) Dissipation factor/capacitance of the asset (C)
DGA (O)
Oil DDF (O)
Oil breakdown voltage (O)
Oil moisture content (O)
SF6 quality (S)
Partial discharges (C)
Insulator (porcelain, External dielectric failure (flashover) Dissipation factor/capacitance of the asset (C)
composite, or resin) Loss of mechanical integrity (mechanical Insulator cleanliness (C)
damages of different parts like insulators, Thermal hot spots (C)
etc.)
Partial discharges (C)
Primary terminals Loss of electrical connections integrity in Thermal hot spots (C)
primary Partial discharges (C)
Secondary terminal board Loss of electrical connections integrity in Voltage comparison between units (Secondary
(secondary terminals and secondary false readings) (C)
reconnection taps included) Thermal hot spots (C)
78
TB 858 - Asset Health Indices for Equipment in Existing Substations
Capacitors in CVT Accuracy out of tolerances Voltage comparison between units (Secondary
Windings (short turns) false readings) (C)
Winding ratio (C)
Damping circuits in VT and Damping circuit failure Damping circuit resistance (C)
CVT
Monitoring device (SF6 Monitoring device failure Functioning of the SF6 monitoring device (S)
density meter)
Purchase files
The purchase documents should include design information such as the primary arrangement,
materials, and manufacturing processes. It should also identify the instrument transformer type of
sealing.
Performance of the unit or design group in factory acceptance tests. This would include poorer
performance in tests, such as for heat run results that could affect normal thermal rating and any
consequence of overloads.
Specifications – User requirements defined in the original specifications for ambient
temperatures, load, voltage, power factor, lightning levels, short circuit withstand, and acceptable
losses. Some of these factors may now have evolved such that these requirements are
inadequate for the current and future operating environments. The purchase specification is,
therefore, an important document to review considering current manufacturing standards, and
against the actual operating environment.
In light of these specifications discussions and documentation within the utility should indicate
any significant requirement to operate the unit above specified conditions, exposure to unusual
levels of short circuits or switching transients, DC carry through, harmonics, extent of reverse
power flow, etc. together with historic relocations, whether system voltages are at the top of the
voltage range and over-fluxing is a possibility, and whether fault frequency or levels are higher
than specified. The latter could occur if source impedance or earth impedance have changed, or
protection is slower than specified.
Standards used at the time of manufacture which now may be considered to have been
inadequate are identified; examples are withstand tests such as BIL. Designs also have changed
over the years, Identifying the design practice at build is important, therefore, to predict future
performance in these areas.
The aim is to gather performance data relating to specific assets. This may require only a restricted
activity for low cost units.
79
TB 858 - Asset Health Indices for Equipment in Existing Substations
Service experience
How the design has performed in the general utility experience is important. Often poor in-service
performance can be related back to design limitations described above. Sometimes they may be due
to poor standard or specification not meeting actual conditions. Industry or trade association reports of
failures on similar designs (HV tank, insulator, main internal insulation, primary winding, capacitors in
CVT, sealing, and secondary winding) are a valuable source of information – being able to relate
trends in test data for all units to the failure and rates of aging revealed in the failed unit.
Operating History
The differences between the manufacturers’ original maintenance plan, what activity has been done
over the years, and opinion as to best practice maintenance policy and actual work undertaken should
be identified. This may indicate shortcomings that could affect future reliability.
A review of maintenance work undertaken indicates the problems encountered with the unit, their
extent, and cost. This can be used to indicate integrity and likely future performance risks, as well as
yielding key performance indicators for reliability and cost.
The dielectric withstand capability of the main internal insulation system is often affected by the
ingress of moisture due to loss of hermeticity. Risk assessment requires information on sealing
integrity, lightning events, dissipation factor and the capacitance. This may then be used with
diagnostics such as dielectric response analysis and DGA to assess the condition of the main internal
insulation.
Operating condition history, including load history, and ambient temperatures are important to the life
of insulation. Included within this assessment is the effect of moisture estimated from oil testing. The
first stage is to review the load history over the life of the units. From this, representative periods are
selected, and the loss of life calculated using IEEE C57.91 [B25] equations and from temperature
data.
Operating Costs
Some assets involve more ongoing activity to prevent in service failures.
Some will have this data captured in a CMMS (Computerized maintenance management system). The
amount spent may have relevance to assessing ongoing life.
Historical test data
A review of this data will identify normal significance of deviations for values found in the test group.
Data for individual units will also be reviewed for changes throughout life, and the current rates of
change.
Data from visual inspection
Confirmation of the asset register data and the equipment in each bay is recorded correctly.
Results from an external visual assessment, Table 4.5.2, may be used to relate to failure modes
and relationships between observed data and risk of failure.
Feature
Verify that adequate clearances exist between primary and secondary circuit wiring.
80
TB 858 - Asset Health Indices for Equipment in Existing Substations
Feature
Verify that all required earthing and shorting connections provide contact.
Verify correct primary and secondary fuse sizes for voltage transformers.
Test
Infra-red surveys – a simple survey method using an infra-red camera to detect high
temperatures indicative of overheated joints.
Partial discharge (PD) detection: a simple survey method using a UHF antenna and
scanner to identify partial discharge. Detecting above 100 MHz escapes the unwanted
effects from corona and surface discharge.
Outage investigations
Unusual results from on-line survey methods are best investigated further with off-line testing, targeted
at specific failure modes. Outage testing is also appropriate for detecting some failure modes. Table
4.5.4 shows offline test applicable to instrument transformers.
Table 4.5.4 – Offline and investigative testing [B28]
Oil testing: oil quality for acidity and consequences of contamination or deterioration. It
should include testing for contaminating material such as PCB.
Oil testing for paper ageing: This is a furanic compound test, although some research is
indicating other tests may be additionally useful.
Insulation-resistance
Polarity
81
TB 858 - Asset Health Indices for Equipment in Existing Substations
Verify that current transformer secondary circuits are earthed and have only one earthing
point
Installed monitoring
In addition, there are some permanently installed systems now available that could be applied to
instrument transformers. Table 4.5.5 shows some systems that could be applicable to instrument
transformers.
Table 4.5.5 – On-line monitoring
Relative saturation sensors can give a reliable indication of moisture content in the windings
Criticality Detectability
Indicator Related failure mode(s) (10 = high, Inspection Method (1 = high,
1= low) 10 = low)
Sealing failure
Reduction of dielectric withstand
SF6 leakage 10 Online monitoring (alarm)
capability
Internal dielectric failure (explosion) 1
Sealing failure
Reduction of dielectric withstand
Oil leakage 10 Visual inspection every 3 months
capability
Internal dielectric failure (explosion) 3
Insulator Cleanliness External dielectric failure (flashover) 8 Visual inspection every 3 months 2
82
TB 858 - Asset Health Indices for Equipment in Existing Substations
Criticality Detectability
Indicator Related failure mode(s) (10 = high, Inspection Method (1 = high,
1= low) 10 = low)
Reduction of dielectric withstand
Deterioration of capability
10 Visual inspection every 4 years
bellows Internal dielectric failure (explosion)
Sealing failure 5
Reduction of dielectric withstand
capability Oil analysis every 8 years starting
DGA 10
from an age of 25 years in service
Internal dielectric failure (explosion) 5
Reduction of dielectric withstand
capability Oil analysis every 8 years starting
Oil DDF 10
from an age of 25 years in service
Internal dielectric failure (explosion) 5
Reduction of dielectric withstand
Oil breakdown capability Oil analysis every 8 years starting
10
voltage from an age of 25 years in service
Internal dielectric failure (explosion) 5
83
TB 858 - Asset Health Indices for Equipment in Existing Substations
Criticality Detectability
Indicator Related failure mode(s) (10 = high, Inspection Method (1 = high,
1= low) 10 = low)
Reduction of dielectric withstand
capability
Internal dielectric failure (explosion)
Partial discharges 10 Continuous PD online monitoring
External dielectric failure (flashover)
Loss of electrical connections integrity
in primary 2
Loss of electrical connections integrity
in secondary
Daily Infrared scans by infrared
Thermal hot spots Loss of electrical connections integrity 8
camera's
in primary
External dielectric failure (flashover) 1
Functioning of the
Functional test of the alarms every
SF6 monitoring Monitoring device failure 1
6 years
device 4
% SATURATION
Temperature
WATER
Water
C2H4
C2H2
C2H6
Date
TAN
CO2
CH4
CO
O2
H2
N2
ID
20/07/2016 15680 34,40 5,00 3.754,25 50.000,00 82,68 1.533,02 46,63 13,14 1,00 478,68 0,01 0,08 44,00 37,52 35,00
20/07/2016 15681 25,00 5,00 500,00 50.000,00 97,46 1.950,14 51,02 14,52 1,00 489,62 0,01 0,08 45,00 38,28 35,00
20/07/2016 15682 27,20 5,00 3.617,47 50.000,00 90,29 1.419,30 40,66 11,07 1,00 387,38 0,01 0,07 42,00 35,82 35,00
28/07/2016 26891 35,10 5,00 14.787,80 50.000,00 218,02 1.664,86 9,05 1,02 1,00 5,32 0,01 0,01 28,00 45,31 20,00
28/07/2016 26889 22,00 5,00 999,02 50.000,00 178,73 1.614,78 14,56 3,27 1,00 19,11 0,01 0,02 33,00 56,24 20,00
28/07/2016 26890 26,60 5,00 2.521,11 50.000,00 156,00 2.081,02 14,09 3,16 1,00 18,38 0,01 0,03 33,00 49,61 23,00
02/08/2016 8418 35,30 15,70 20.000,00 48.966,70 304,68 1.264,20 8,48 1,00 1,00 1,00 0,01 0,00 14,00 36,68 14,80
02/08/2016 8420 41,50 25,44 20.000,00 50.000,00 421,23 1.412,29 13,12 1,00 1,00 1,00 0,01 0,00 13,00 34,06 14,80
02/08/2016 8419 40,20 17,66 20.000,00 50.000,00 365,52 1.314,38 12,11 1,00 1,00 1,00 0,01 0,01 16,00 41,92 14,80
The next step is to assess these data points in terms of criticality to failure modes and likelihood of
failure.
84
TB 858 - Asset Health Indices for Equipment in Existing Substations
200
180
160
140
C2H6 1
120
C2H6 2
C2H6
100
80 C2H6 3
60
C2H6 4
40
20 C2H6
0
0 0,5 1 1,5
Score
Figure 4.5.1 – Translating the set of condition indicator scores to a condition indicator index
Example of translation of the C2H6 (ppm) value from DGA to a condition score (0-infinity %) is given in
Table 4.5.8.
Table 4.5.8 – Example of translation of the C2H6 condition from DGA to a condition indicator index
Condition indicator 1 2 3 4 5
C2H6 (Condition indicator score) <=50% 50%-70% 70%-90% 90%-100% > 100%
C2H6 (Condition indicator Index) Green Yellow Orange Red Black
Condition indicator 1 2 3 4 5
DDF
C2H2
C2H6
Sub-HI Reduction of dielectric withstand capability <=50% 50%-70% 70%-90% 90%-100% > 100%
85
TB 858 - Asset Health Indices for Equipment in Existing Substations
dielectric withstand
Sub-HI Reduction of
capability (%)
Equipment ID
36278 57,14 % 0,00 % 103,58 % 103,58 %
23231 19,05 % 0,00 % 97,20 % 97,20 %
24117 29,64 % 96,00 % 1,25 % 96,00 %
25027 93,38 % 0,00 % 5,00 % 93,38 %
50272222 92,23 % 0,00 % 3,03 % 92,23 %
42723 42,86 % 92,00 % 0,00 % 92,00 %
28072 91,96 % 0,00 % 5,75 % 91,96 %
The Table 4.5.10 shows a Sub-Hi Reduction of dielectric withstand capability based on the condition
indicators DDF, C2H2, and C2H6. As explained in the following 3 examples shown above, the dominant
AHI determines the overall AHI score for the asset.
1. Equipment ID 36278: DDF HI is 57.14% which is a good condition and indicates a low
likelihood of failure over a long period. Similarly, C2H2 HI is 0% and also has a very good
condition, indicating very low likelihood of failure over many years. However, the C2H6 HI is
103.58% which is critical condition and indicates a high likelihood of immediate failure existing.
This means that the unit should not remain in service. In this case, therefore, the C2H6 HI
prevails over the others and the Sub-HI reduction of dielectric withstand capability takes its
value of 103.58% as the general condition of the equipment.
2. Equipment ID 24117: DDF HI is 29.64% and C2H6 HI is 1.25% and both are a very good
condition and indicate very low likelihood of failure over many years, C2H2 HI is 96% which is a
poor condition and indicates that progressive deterioration has been detected, with high
likelihood of failure in the short term, therefore, the C2H2 HI prevails over the others and the
Sub-HI reduction of dielectric withstand capability takes its value of 96% as the general
condition of the equipment.
3. Equipment ID 28072: DDF HI is 91.96% which is a poor condition and indicates that
progressive deterioration has been detected, with high likelihood of failure in the short term,
C2H2 HI is 0% and C2H6 HI is 5.75% and both are a very good condition and indicate very low
likelihood of failure over many years, therefore, the DDF HI prevails over the others and the
Sub-HI reduction of dielectric withstand capability takes its value of 91.96% as the general
condition of the equipment.
86
TB 858 - Asset Health Indices for Equipment in Existing Substations
GIS
Step 1: Identify assets and decide review level
GIS installations typically house several main substation components, depending on actual
configuration encompassing busbar sections, switchgear, earthing switches, disconnectors, measuring
devices (VTs, CTs) and cable or line bushings. Furthermore, GIS may be combined with AIS
components (hybrid substations).
GIS plays a critical role in electricity transmission, as it may serve as a node in the network or as a
main substation next to a power plant. Some utilities may operate GIS from several manufacturers and
different years of manufacturing with numerous kinds of designs and technologies. As a consequence,
it might be possible that the number of condition indicators is not the same among the GIS types.
Utilities commonly apply comprihensive Review Level 4 and 5 (see Figure 2.1) when assessing the
health index of HV/EHV GIS. Commonly utilities apply comprehensive Levels, 4 and 5; according to
Figure 2.1; when assessing the health index of HV/EHV GIS.
Note, that overlap may occur in these analyses as sub-components may for instance have shared gas
compartments.
87
TB 858 - Asset Health Indices for Equipment in Existing Substations
Voltage Transformer
CB Driving Mechanism
Arcing Contacts Compartment
Current Transformer
CB Compartment
Termination ES
Termination Compartment
Termination Disconnector
Cable Termination
Busbar Disconnector
Busbar Compartment
Figure 4.6.1 – An example of a feeder bay in GIS. The components are placed inside different enclosures
of GIS
Functions of components located inside the enclosures
Based on their functionality, there are seven groups of components, namely:
1. Fault and load interrupters, i.e. Circuit Breaker (CB)
2. No-load switches including limited-fault interrupter, i.e. Disconnector Switch (DS), Earthing
Switch (ES), High-Speed DS (HSDS)
3. The main path for current distributions in GIS and interconnection among GIS feeders, i.e.
Busbar, Bus Segment (BS)
4. Link the GIS with the incoming and outgoing feeders, i.e. Terminations (TE)
5. Voltage and current sensing devices, i.e. Instrument Transformers (IT), including the Current
Transformer (CT) and the Voltage Transformer (VT)
6. Transient overvoltage limiter, i.e. Surge Arrester (SA)
7. Local Control Cabinet (LCC) housing the auxiliary wiring and control and protection
functionality
Each component consists of subsystems. Table 4.6.1 gives an example of how to make divisions of a
GIS.
Table 4.6.1 – GIS components, sub group of components, subsystems, function of subsystems, and key
parts
Subgroup of
Component Subsystem Function of subsystem Key parts
component
Main and arcing contacts,
Primary Conduct the current at its rating
conductor
Sending a command to driving
Wiring, auxiliary contacts,
Secondary mechanism either from remote control or
relays
from local control cubicle.
88
TB 858 - Asset Health Indices for Equipment in Existing Substations
Subgroup of
Component Subsystem Function of subsystem Key parts
component
Local Control Installed on the GIS or Connects with the rest of the control and Bay control unit, contactors,
Secondary
Cabinet (LCC) free standing protection auxiliary switches
GIS consists of bays, enclosures, components, and parts. Therefore, when performing FMEA on GIS,
its “hierarchical layers” should be taken into consideration. Figure 4.6.2 shows how a GIS can be seen
based on its physical layers (in the horizontal direction), and by its functionality layers (in the vertical
direction). Typically, a GIS can be divided into four layers of functionalities, namely (from top to bottom
hierarchy), substation, bay, enclosure, and component. The lower layer becomes a subsystem of the
higher layer. The component-layer consists of subsystems based on sub-functionalities. Physically, a
component consists of parts.
89
TB 858 - Asset Health Indices for Equipment in Existing Substations
Substation
Substation functionality
Bays
Bay functionality
Enclosures
Enclosure functionality
Components
Parts
Component functionality
Figure 4.6.3 – Example of some failure modes of the dielectric subsystem of GIS from a case study [B16]
90
TB 858 - Asset Health Indices for Equipment in Existing Substations
AND
Figure 4.6.4 – Example of some failure modes of the construction and support subsystem of GIS [B16]
CB and
Static contact resistance μΩ
Switches
CB, Switches,
Contact resistance of primary °C and
and Primary Hot spot in the enclosure
conductor joint pattern
Conductor
Dielectric Subsystem
Gas Pressure
Bar, MPa
(Leakage Rate)
Density of SF6
Gas Density
kg/m3
All Components (Density reduction)
91
TB 858 - Asset Health Indices for Equipment in Existing Substations
Mechanical integrity
Contact
CB Contact travel record position vs
time
Electric
Electric motor readiness Motor current A
Switches
Secondary Subsystem
Corrosion level -
Corrosion on enclosures
All Deposited Pollutants -
92
TB 858 - Asset Health Indices for Equipment in Existing Substations
manufacturers’ concepts, etc. Note that the FSI is not a failure mode, it is also not a condition
indicator, but it makes one aware that the likelihood of a certain failure mode is becoming more
relevant. This may also aid in selecting/favouring certain condition indicators above others between
different sets of equipment, conditions, etc. However, since FSI is only an expectation, it functions as
a “warning flag” for the decision making regarding an asset. It stands as “side notes” of the AHI.
Step 6: Evaluate Current Condition relative to key failure modes and Norms
Generation
The result from inspections (i.e. condition indicators) needs to be interpreted to justify the health status
of the subsystems in GIS components. It is achieved by setting the limit, or the boundary values,
known as the “norm.” The norm uses the measured values of the condition indicators to decide on
a health status which is further translated into a condition score. In case of quantitative condition
indicators (e.g. a gas pressure level) numerical values are to be derived/calculated to reflect good, fair,
poor and/or severe levels/classifications. In case of qualitative condition indicators some guidance
must be given in making proper classifications, for example for a visual inspection on the presence of
rust, presence of leakages, etc.
Several approaches to develop such norms exist, for example:
By using statistical analysis on the condition indicators taken from field inspections on numerous
assets, to determine expected values and what deviations from expected values are deemed
fair, poor or severe deviations. (this is also includes trending analysis, information from failure
investigations, and comparison with sister components)
By using recommendations from literature, like, GIS manuals, international standards,
publications.
By deterministic analysis, for example, from failure modes observed during a forensic
investigation, or by a laboratory test.
By expert’s judgement (can be through discussion with the maintenance expert group or by a
Delphi test [B30]).
By a combination of two or more of the above approaches.
93
TB 858 - Asset Health Indices for Equipment in Existing Substations
Figure 4.6.5 – Boundary values for humidity content in the CB enclosure for GIS from a manufacturer.
The fitted distribution is the Gamma distribution.
From the example, the boundary values for the four-condition status are as follows:
1. Very Good : humidity-content ≤ 135 ppmV
2. Deteriorate : 135 < humidity-content ≤ 277 ppmV
3. Bad : 277 < humidity-content ≤ 336 ppmV
4. Very Bad : > 336 ppmV
Approach 2: Setting the norm based on recommendation from standards and manufacturer’s
recommendation.
The maximum humidity limit from the literature usually can only be interpreted as “Good,” if the
measured value is below the recommended limit, and “Bad” if the measured value is above the limit.
These recommendations are as follow:
1. Maximum humidity content from a specific manufacturer’s guide:
a) CB enclosure : 350 ppmV
b) Non-CB enclosure : 840 ppmV
2. Limit from the IEC60227-1 Ed1 : 804 ppmV
3. Limit from the CIGRE TB 234 and 567 : 200 ppmV
Example:
The norm for the humidity content in GIS from a specific manufacturer as derived from Approach 1
and Approach 2 have been summarized in Table 4.6.3.
Table 4.6.3 – Summary of norm for humidity content for 150 kV GIS from a specific manufacturer as
generated from different approaches
Deteriorate/
Very Good Good Bad Very Bad
No. Approach Moderate
Manufacturer N/A ≤ 350 ≤ 840 N/A > 350 > 840 N/A
94
TB 858 - Asset Health Indices for Equipment in Existing Substations
It is up to the user of the system to ultimately decide on which norm-scheme to use. General
recommendation is to carefully consider (for example in the FMEA analysis) what the best basis is for
the norm schemes applied. In some cases, more generalized recommendation from for instance
international standards may be less applicable to a specific situation, requiring a more localized
analysis. In the absence of such local information, it may be best to utilize norms derived from
standards. It is recommended to also involve the OEM.
Condition scores and their description
Condition scores are used to represent the condition status of subsystems in GIS (based on the
measured/ observed condition indicators). Table 4.6.4 gives an example of condition scores and their
definition as used in the case study. In this example a log base 3 scoring system is used.
Table 4.6.4 – Example of condition scores and their descriptions
Qualitative
Score Description Likelihood of a failure mode to occur
meaning
High
Bad Severe deterioration/aging has been observed.
30 The GIS performance is possibly reduced in short-
Condition Intervention is required in short-term
term.
Another example for primary conductor subsystem is shown in Table 4.6.5. Meanwhile, Figure 4.6.6
gives an example of badly deteriorated main contact of CB in GIS.
95
TB 858 - Asset Health Indices for Equipment in Existing Substations
Condition Score
Component Condition Indicator Unit
1 3 10 30 100
of design limit
Δ Rst-contact ≤ 5%
Static Contact Resistance Rst-contact
Switches
N/A
Hot
No Hot Spot
All Hot Spot on the Enclosure (Pictorial) -
Spot
Found
Figure 4.6.6 – A carbonized female-main contact in one of circuit breaker in GIS in the case study. The
measurement before opening the enclosure had shown the increase of the static contact resistance
above 20% of the value during commissioning.
96
TB 858 - Asset Health Indices for Equipment in Existing Substations
Figure 4.6.7 – Failed fragments of an epoxy disconnector drive tube (left) that exploded out through
bursting disc into the bay and created a significant safety risk. A similar unit is on right. Defects in the
casting considered to be the cause.
97
TB 858 - Asset Health Indices for Equipment in Existing Substations
Finally, Table 4.6.6 gives an example of a method for defining the health index scoring for the GIS
bay. The condition score range in the table comes from steps as explained in the previous paragraph.
Table 4.6.6 – Example of Condition Score (CC), interpretation, and bay index
Example:
An HV GIS with double busbar configuration. The bay configuration is shown in Figure 4.6.8. The GIS
consists of 8 bays: 4 transmission feeders, 3 transformer feeders, and 1 bus coupler. The surge
arresters are located outdoor connected to an overhead line.
Bus Coupler
Figure 4.6.8 – The single line diagram of the GIS example from the case study
The configurations of enclosures in the three types of bays in the GIS example includes:
the line feeder,
the transformer feeder,
the bus-coupler.
98
TB 858 - Asset Health Indices for Equipment in Existing Substations
Figure 4.6.9 – The configuration of enclosures in three types of bays in GIS example
Table 4.6.7 shows the “scores of subsystems in Circuit Breaker compartment (G0) in each bay.” The
score shown in the column follows the definition in Table 4.6.4. In each compartment there are five
subsystems showing the worst condition indicator score. It can be seen that, in the current example,
the worst score (i.e. 100) comes from the dielectric subsystem of Line 1B, due to high moisture
content in it.
Table 4.6.7 – Summary of Condition Scores of Subsystems in CB (G0) from each line of GIS
Condition score
Subsystem
Bus
Line1A Line1B Line2A Line2B Trx01 Trx02 Trx03
Coupler
Primary 30 30 1 1 1 10 1 1
Dielectric 10 100 10 10 10 10 30 1
Driving
10 10 10 10 10 10 10 10
mechanism
Secondary 30 30 30 30 30 30 30 30
Construction
1 1 1 1 1 1 1 1
& Support
There are tables for other compartments from the same GIS. The bay index comes from the
aggregation of condition scores of compartments (by summation of worst subsystem scores).
The condition scores are then translated into the health index of the bay using Table 4.6.6. Table 4.6.8
provides the summary of bay Index of GIS in the example. Table 4.6.7only provide condition scores of
subsystem in CB (G0) enclosure from each of bay lines, bay transformers and bus coupler of GIS.
While Table 4.6.8 provide the “bay index” of which in one bay consists not only G0 but also G1/G10,
G2/G20 and G9 (see Figure 4.6.9)
Table 4.6.8 – Summary of Bay Index of GIS example
Besides the sub health index, the susceptibility indicators (FSI, Failure Susceptibility Indicators) have
been added for each bay. In the example, two susceptibility indicators have been added, i.e. related to
the environmental parameters and the lightning density.
99
TB 858 - Asset Health Indices for Equipment in Existing Substations
HI
BAY FSI – pollutants FSI – lightning density
Bay
100
TB 858 - Asset Health Indices for Equipment in Existing Substations
level “5” to “3” (Moderate). Table 4.6.10 summarizes the revised bay index and failure susceptibility
indicator of GIS example, before and (expected) after mitigation action.
Table 4.6.10 – Summary of Bay Health Index & Failure Susceptibility Indicator index of GIS example
before and (expected after) mitigation action
Line-1B 5 2 5 3
Line-2A 5 2 5 3
Line-2B 5 2 5 3
T1 5 2 5 3
T2 5 2 5 3
T3 5 2 5 3
Bus Coupler 4 2 5 3
101
TB 858 - Asset Health Indices for Equipment in Existing Substations
Capacitor banks
These are sets of individual capacitor units connected in series and parallel combinations to achieve
the required voltage withstand and bank capacitance. Within each capacitor can is a stack of folded
rolls made from a high-grade polymer sandwiched between two aluminium foils, all being impregnated
and encased in an insulating fluid. Critical are the connections between individual rolls and to two
small solid bushings and the seal of the latter to the can. They are protected with fuses (internal or
external) or they can be fuse-less with relay protection. Trays of arrays are usually assembled within
their own compound to protect the workforce. Often the trays are at height. The capacitors could be
tuned with an adjacent inductor so as to create a filter. Their role is also to provide power factor
correction and voltage regulation, as well as removing harmonics.
102
TB 858 - Asset Health Indices for Equipment in Existing Substations
synthetic housing. The first-generation devices consisted of an array of small sealed gas filled
spark gaps. This design was followed with one incorporating spark gaps with silicone carbide
blocks. Spark gaps were eliminated when a column of metal oxide blocks became standard.
Cable sealing ends
These components are be used as interconnectors within a substation or as an incoming cable
connection. The cables typically consist of cores with an insulation made from extruded cross-linked
polyethylene (XLPE) contained within a protective sheath. Older types of cables are oil-filled.
Critical is the removal of the core screen within the termination and the stress at the cable end is
capacitively controlled with a pre-formed stress relieving cone connected to the end of core screen. All
are encased within a dielectric fluid inside a porcelain or synthetic housing. This may then be mounted
on a gantry prior to connecting to the incoming overhead line or connected directly into a cable box on
a transformer.
Insulators
These are used throughout a substation and its incoming feeders. They may not be identified as
individual assets.
They could be solid insulators supporting open conductors. Or they could be cap and pin individual
units supporting an incoming connection at the tower, and then ongoing along the circuit external to
the substation. Normally these are made from porcelain or glass. Porcelain is typically ten times
stronger in compression than in tension, and designs reflect this. Polymeric insulators are now
emerging in recent years as well, but their strength can be limited.
Capacitor banks
These would be treated like any other primary asset. Typical data would include:
Manufacturer and factory location of both the bank and individual capacitors
Rated voltage of the bank
Rated reactive power at nominal voltage
Maximum short-circuit withstand
Short circuit withstand time duration
Neutral connection
Earthing arrangement
Service location – outdoor/indoor
Cooling – air natural forced ventilation or internal
Degree of protection of the enclosure
Maximum sound pressure level
Support structure material
Portable earth lead connection points
The individual capacitor unit has a rating 2-25 kV. It should have its own nameplate data; one
important detail is the materials of construction. Earlier capacitors contained PCB, and this has an
environmental significance if the can fails and there is a leak.
Failure is most likely to be on a can-by-can progression, until some critical stress on remaining units
leads to a complete failure. The time frame is likely to be fairly long and could be tracked with a Level
4 analysis.
103
TB 858 - Asset Health Indices for Equipment in Existing Substations
Bushings
Bushings can be used on transformers, circuit breakers and for through wall exits. They are as well
used on a hybrid switchgear between the GIS and AIS parts. The bushing is a critical part of the asset
to which it is fitted and would have an analysis level determined by the primary asset. The nameplate
should contain ratings and factory test data.
This and other information include:
Bushing manufacturer
Factory and date of manufacture
Voltage and current ratings
Standards applying at manufacture and factory test.
Design and materials used (e.g. OIP or RIS)
Surge arresters
According to IEC 60099-4 [B32], surge arresters are identified by the following nameplate information:
Designation of arrester
Continuous operating voltage Uc
Rated voltage Ur
Rated frequency fr
Nominal discharge current In
Rated short-circuit current Is
Manufacturer's name or trademark, type and identification of the complete arrester
Identification of the assembling position of the unit (for multi-unit arresters only)
Year of manufacture
Serial number
Repetitive charge transfer rating Qrs
Contamination withstand level of the enclosure (IEC TS 60815-1)
104
TB 858 - Asset Health Indices for Equipment in Existing Substations
These systems are very reliable, and failures are usually related to manufacturing defects or incorrect
assembly on site.
Insulators
A Level 3 diagnostic system is all that is likely to be required. In exposed sites pollution levels may be
measured using a sample insulator. This may indicate a need for cleaning or adding booster sheds to
improve flashover performance.
Capacitor banks
The most common failure mode is a dielectric failure of the capacitor units. This may follow from
system transients, external faults (animal and bird impact being important), loose terminations (hot
joint) and manufacturing defects.
Failure of a can increases the current in connected units allowing other cascaded failure causes such
as overheating. Both dielectric and thermal modes are affected by leakage of the fluid. Earlier designs
used paper as the dielectric, and these had a significantly higher failure rate than the currently used
polypropylene. Other problems relate to damaged or polluted can insulators, as well as flashovers
following animal or bird contact.
Surge arresters
With all designs their failures may be due to:
Damage to the housing affecting voltage withstand
Moisture ingress into the housing
Overstress (voltage, current, temperature)
Operations
Poor manufacture, installation or selection of an inappropriate rating.
105
TB 858 - Asset Health Indices for Equipment in Existing Substations
Additionally, with gap arresters a shorting out of some blocks will increase the stress on others and will
lead to progressive failure. With metal oxide material, the degree of ageing depends on the
nature/quality of the granular outer layer. Experience reported by a system operator in India identified
90% of failures due to moisture ingress, i.e. associated with poor manufacture [B33]. The remaining
10% were due to deterioration of the metal oxide blocks. Debris around the base of a failed arrester
can be seen in Figure 4.7.2.
These are problems arising from manufacture and specified acceptance tests, perhaps including
immersion. Many find that their diagnostic strategy relates more to the manufacturer and duty factors.
Survey methods for condition assessment should include:
Visual examination to detect surface damage and/or pollution
Surge counter data, monitoring system damage
Gas pressure, tightness (in case of GIS)
Arrester disconnector, functionality (if present in case of medium voltage equipment)
Measuring compensated third harmonic resistive current and looking for increasing or elevated
levels, followed up with out of service tests measuring capacitance/power factor to identify block
deterioration and insulation resistance changes to detect moisture leakage (see [B33] and [B35]).
The failure scenario described above (low failure rate) applies to surge arresters used in transmission
and distribution networks. It is quite possible that for special applications, e.g. in industrial networks
(electric arc furnaces, capacitors), higher stresses and thus higher failure rates may occur, but these
applications are not part of this consideration.
Bushings
Failure causes may be attributed to mechanical, dielectric and thermal stresses, as well as surface
damage. A recent A2 brochure has described the various aspects covering bushing reliability [B36].
Most importantly failures of bushings are often explosive with the outer insulator ejected several
hundred metres. An electric arc can pass through the asset itself and in the case of a transformer lead
to failure of the latter and poses a fire and environmental impact from lost transformer oil. Assessing
the likelihood of failure is important, therefore. A common failure mode is where the paper, or synthetic
material used in RIS bushings, has deteriorated leading to PD at foil ends. Another is where dielectric
losses and conductivity have increased. This may be due to penetration of moisture, the inks used as
foils in some designs, or corrosive sulphur. Thermal degradation of the dielectric can also increase
losses. Contamination of the oil can lead to failure. This might be caused by moisture ingress,
contaminants from oil-gasket interaction and oil ageing.
Cables
Most failures are associated with sealing end issues rather than in the cable; and there they can be
explosive with debris sent over 100 m. The cable itself is extruded at the factory and tested there on a
drum. The remaining risks occur when rolling out the cable, pulling in, bending and creating the
termination. The latter involves paring back the core screen with a tool and sliding a tightly fitting pre-
formed stress cone to mate with the end of the core screen. Problems can arise if the end of the
screen is damaged, the paring creates stress raisers or the whole system is damaged during fitting.
Other installation issues can arise if there is an error in earthing the core screen. (Circulating currents
can be reduced if the cable is earthed at only one end. If the design calls for just one end and both or
neither are earthed, then discharge and overheating can occur.) Further, over time in service the
insulation medium, e.g. viscous silicone-based fluid or XLPE can itself deteriorate if moisture enters
the housing.
106
TB 858 - Asset Health Indices for Equipment in Existing Substations
Insulators
Failure causes include damage from impacts, generally or from rifle bullets. Failure can follow
flashover in heavily polluted sites. Known as dry-banding surface currents can lead to localized
erosive discharge, shown in Figure 4.7.5. Another mechanism follows cracking at the cement area.
With age the cement will improve its mechanical strength but as it does so it can expand. The latter
transfers stresses to create local tensional forces. Where moisture can enter hollow housings, PD can
develop tracking down this inner surface.
Condition indicators
With each asset in this section performance data will be assembled, indicating past failures and
outages, identified by site and manufacturer.
Table 4.7.1 – Diagnostic indicators in use and failure modes
107
TB 858 - Asset Health Indices for Equipment in Existing Substations
108
TB 858 - Asset Health Indices for Equipment in Existing Substations
109
TB 858 - Asset Health Indices for Equipment in Existing Substations
There is a long tradition of DDF/ capacitance testing on HV bushings, this being done in an outage
and at 10 kV. As a result, there are extensive utility shared databases detailing most major suppliers,
designs and voltage levels. Individual results can be compared against similar designs. Particularly
important are changes relative to the nameplate values. Changes used for alert/action levels depend
upon the general aspects of design, materials used, known design-specific failure modes and voltage
levels. More focussed is a dependency upon past experience with particular designs. Changes from
nameplate DDF vary due to these factors, but changes from 20 to 100% indicate a significant concern
and the need to measure more frequently. At that point, the rate of increase becomes critical.
Capacitance changes may be due to dielectric changes, but more likely are indicative of shorting of
adjacent foil sections. Concerning levels of change depend upon the number of foil sections.
One way of assessing change is to compare values with benchmarks of the same design group within
the company, or with an international database. In the case of the set of three bushings. H1, H2 and
H3 the company had only 100 similar bushings, whereas a better benchmark is possible using a
collaborative industry wide data set of 2700 similar bushings.
Figure 4.7.7 – Benchmarking DDF data with international database resource [B38]
110
TB 858 - Asset Health Indices for Equipment in Existing Substations
Figure 4.7.8 – Bushing tap modified for PD and PF measurements, and typical results
Step 5 Step 6
111
TB 858 - Asset Health Indices for Equipment in Existing Substations
Scale code log Scale code linear Description Expected fault free life
1 1 Very good condition >25 years with
Normal test and inspect schedule
3 2 Good condition 15-25 years with
Normal test and inspect schedule
10 3 Fair condition 5 - 15 years with
More regular test and inspect schedule
30 4 Poor condition <5 years
Advanced test and inspect, plan remediation or
replacement.
100 5 Critical condition 0-6 months
Immediate remediation or replacement.
112
TB 858 - Asset Health Indices for Equipment in Existing Substations
Unable to operate
Mis-operation
Power supply failure
Digital relays Mechanical damage
Overheating
Internal battery failure
Contamination of relay
113
TB 858 - Asset Health Indices for Equipment in Existing Substations
Unable to operate
Mis-operation
Electromechanical relays Mechanical damage
Overheating
Contamination of relay
Unable to operate
Mis-operation
Power supply failure
Control terminals
Mechanical damage
Overheating
Contamination
Unable to function
Power supply failure
Fault recorders Mechanical damage
Overheating
Impaired ability to record
Unable to function
Power supply failure
Mechanical damage
RTU
Overheating
Unable to connect to
network
Unable to function
Power supply failure
Condition monitoring Mechanical damage
system Overheating
Reporting wrong data
Impaired ability to record
Unable to function
Power supply failure
Station computer
Mechanical damage
Overheating
Unable to function
Power supply failure
Mechanical damage
PMU
Overheating
Reporting wrong data
Impaired ability to record
Unable to function
Power supply failure
Mechanical damage
Metering system
Overheating
Reporting wrong data
Impaired ability to record
Failure of insulation
Open wire
Control cables
Critical attenuation (fibre)
Contact high resistance
114
TB 858 - Asset Health Indices for Equipment in Existing Substations
115
TB 858 - Asset Health Indices for Equipment in Existing Substations
Auxiliary systems
This section focuses on the auxiliary systems of the substation and application of applied methodology
for generating an AHI.
From TB300 [B40] 3.12 – The most important components of AC and DC station services are the
station services transformers, automatic and/or manual AC and DC transfer schemes, battery
chargers, batteries, and AC and DC distribution panels.
An overview of the main tasks of the auxiliary substation components is described in Table 4.9.1.
Table 4.9.1 – Auxiliary Equipment and Roles
Station Service Transformer Provides AC power to operate equipment/control systems in the substation
AC/DC Transfer Scheme Switch between primary, backup AC sources and the DC power system in the substation
Keeps DC system powered and charging - source comes from station service or backup AC power
Battery Charger
source
Typical Analysis
Asset Component Priority Comments
Review Level
Station service Priority is very high for this asset, because it can
AC Supply 1 2
transformer damage the main power transformer
Generators 2 3
Distribution backup 2 3
116
TB 858 - Asset Health Indices for Equipment in Existing Substations
As an example, for the DC supply asset, battery component, the FMEA procedure leads to the
following failure modes:
Insufficient electrolyte level
High temperature
Contact corrosion
Over or under voltage
Cell failure
Figure 4.10.1 – Sample image of substation where we can name all assets under consideration
117
TB 858 - Asset Health Indices for Equipment in Existing Substations
Site issues
Typical components of a substation are shown in Figure 4.10.1. To perform a condition assessment
there should be a strategic review covering the following:
Is the site of adequate size for the substation installed on it?
Is there satisfactory access for loaders, cranes, vehicles, and equipment? In particular, access
for transformer floats should be level, provide for unloading on site, not in the street, provides
adequate road turning radius, and provide adequate crane height.
Have there been any changes that affect access?
Are the ownership and tenure of the site satisfactory? Freehold title is the preferred tenure
method. Leases and easements are less satisfactory and if they are used should not be limited in
time; even 99-year leases run out.
Is there satisfactory access for cables, duct lines, and overhead lines?
Is there a buffer zone around the substation?
Are security issues adequately covered?
Are there noise issues?
Are there nuisance to neighbours’ issues?
Are there intrusive lighting issues?
Are the earth conditions stable?
Are there drainage or subsidence issues?
Is the site clean, tidy, and adequately landscaped, grass mowed, etc?
Are there any risks imposed by adjacent properties, such as fuel storage, petrol stations, and
trees?
Is the site in a flood plain and if so, is it sufficiently elevated?
Is the site at risk from bushfires?
Substation fences
General condition of substation fences can be assessed based on their location and their purpose.
Increasingly fences are being made to more secure designs and require more attention and
maintenance (Figure 4.10.2).
Support poles must be plumb and capable of supporting fence loads for all design conditions.
Corrosion of posts should be repaired and inhibited against further rust. Fences shall be checked
regularly for excessive gaps between the bottom of the fence and finish grade. Large gaps have to be
fixed to prevent animals, unauthorized personnel and children gaining access into a station. These
gaps have to be filled and any other holes should be repaired.
Connections to the earth grid and bonding straps should be inspected to prevent touch potential
hazard to the public. Integrity of the station fence earthing grid should also be checked periodically. It
is not unusual to find that thieves have removed entire pieces of fence earthing. To mitigate this
situation, replacement of copper conductors with aluminium or aluminium-clad steel wire is an
alternative.
118
TB 858 - Asset Health Indices for Equipment in Existing Substations
Buildings
There is a wide range of building types used in substations, from the large fully enclosed building in a
city business district housing all equipment, to the control building in an outdoor type substation. Even
in outdoor substations the building will house critical protection and control equipment, without which
the substation cannot operate. The design and condition of the building can therefore be a critical item
in the assessment of the substation (it is often overlooked).
Points to address in assessing a substation building are as follows:
What type of building is it? Reinforced concrete, brick or block work, steel framed, steel clad,
light construction, etc.
What is the design life of the building and its current age?
What type of foundations are installed and what are the earth conditions? Is it built on rock or on
piers to rock or solid earth? If not, it may be subject to movement, subsidence, cracking, or even
collapse.
Is the construction strong enough to withstand stress, e.g. from earthquake, impact, pressure
load from electrical faults and similar?
Is the structural design such that a failure of one part of the building will not cause a cascade
failure of other parts?
Condition of the structure. Is there evidence of subsidence, movement, cracking, damage?
Roof. A key requirement for a substation is that the roof should prevent water entry into any
electrical area as this could cause failure of the electrical equipment.
Type of roof e.g. concrete, steel sheet, fibre cement sheet, aluminium sheet, tiled.
A concrete slab roof is sound provided it is tied into the wall structure. However, if it
depends on a membrane for waterproofing this is poor as the membrane has limited life.
A pre-cast beam roof will usually not be tied into the wall structure and will depend even
more on a membrane for waterproofing.
A fibre cement sheet roof will be easily damaged by large hailstones and any damage will
let water in.
A tiled roof will also be damaged by large hailstones or similar objects.
A sheet steel roof if well-constructed will be resistant to damage and water ingress but at
best will require replacement at about 25 years.
Structure of roof
A timber framed roof will be a fire risk, from internal equipment failure, from external
sources such as a transformer fire or a bushfire, or even from wiring in the roof space. Once
119
TB 858 - Asset Health Indices for Equipment in Existing Substations
fire takes hold in the roof space it will generally burn out completely, collapse, and render
the whole substation unusable. Fire rated ceilings provide only partial protection.
A steel framed roof is a good option for a substation provided there is no fuel source below
it or nearby. Exposed Steel framing does not have a fire rating.
The roof structure should be adequately tied down to resist extreme wind loadings.
Gutters and down pipes.
All roofs should overhang the walls. Internal gutters will invariably lead to water ingress as they
will block up with debris or hail and flood. All gutters should be external to the walls so that
overflow will not let water into the building.
Similarly, all down pipes should be external. Internal down pipes will inevitably block up or leak
and let water into the building. The most suitable arrangement is to have no roof gutters but to
use earth level dish drains below the roof edges. This avoids the inevitable build-up of leaves
and debris in roof gutters.
Identify the assets and decide what type of maturity analysis is appropriate to the particular class of
assets. This is likely to link to business need, particularly system reliability.
In a substation, there will be all possible types of buildings and structures whose design may vary from
one manufacturer to another, would depend on voltages, countries etc. However, it is possible to list
most used materials and describe possible problems, their effects and indicators according to that.
There will be different categories of (main) features which independently contribute to the reliability of
the substation:
Buildings
Structures and foundations (steel and concrete)
Substation platform, drainage, roads, earthing, fences and gates.
A substation expertise would require a technical expertise mostly in these three technical areas:
Quality of concrete structures and foundations
Quality of steel structures and their assembly
Quality of the site and the drainage system
According to generic methodology performing FMEA should be categorized into three risk categories:
High risk
Medium risk
Low risk
120
TB 858 - Asset Health Indices for Equipment in Existing Substations
But as we can list four possible consequence scenarios for failures in buildings and structures assets
list, we can use following priorities with explanations.
Priority 1 – If something happens then substation is unable to function properly. For example,
when roof of the control building is leaking heavily then all the equipment inside may suffer from
water damage and therefore stop functioning. Safety of workers.
Priority 2 – If something happens then asset (primary or secondary) related to that is unable to
function properly. For example, when support of disconnector is failing then it is unable to use
that disconnector.
Priority 3 – If something happens then nothing will happen to substation in short term, but it will
affect reliability in long term. For example, when drainage is not functioning properly.
Priority 4 – If something happens then nothing will happen to substation that may affect
reliability but makes service more inconvenient. For example, when road is broken.
121
TB 858 - Asset Health Indices for Equipment in Existing Substations
122
TB 858 - Asset Health Indices for Equipment in Existing Substations
This table is not final and will never be. It gives just an overview how it might be possible to
determine failure mode, methods to detect the failure mode and condition indicators for assets that are
not directly related to high voltage transmission system. How the exact classification is done is based
on individual FMEA analysis.
123
TB 858 - Asset Health Indices for Equipment in Existing Substations
Example of the scoring was presented at CIGRE meeting in Belgium in 2013 and is referenced in
TB 660 [B5].
Table 4.10.3 – Classification rules for buildings according to their condition [B5]
Normal mainte-nance
Repair or refurbish
Include in planned
Trigger to start a
Explanation
projects
project
Code
124
TB 858 - Asset Health Indices for Equipment in Existing Substations
Note that these three potential outcomes, though not strictly the same, are strongly related to each
other and all may be used similarly in further analysis. As already stated in Chapter 2, company asset
managers will typically need to go further than only assessing condition and likelihood of failure. They
will be assessing the risk of failure, which further requires impact of failure estimations, compared to
specific specified risk appetite (thresholds) for the company, more commonly referred to as Key
Performance Indicators (KPIs) which would be defined as part of the company policies and/or
aspirations.
The risk assessments for all assets may then be ranked, yielding a Risk Index (analogous to the
Health Index methodology). Together with cost and portfolio optimization tools these results can then
be used to determine optimized replacement/maintenance programs.
125
TB 858 - Asset Health Indices for Equipment in Existing Substations
To begin with we must identify the characteristics of a health index which is capable of being
combined with other health indices in a meaningful manner. Then we can look at the pros and cons of
different approaches to combination.
In particular we may need to estimate the probability of failure of an asset or component in order to
perform calculations of overall probability which reflects both the individual assets and their
interconnectedness.
Collating health scores for individual assets allows a unified view of a collection of assets: a bay, a
substation or a circuit. The collection could be physical, such as a bay, or logical, such as oil filled
assets, or those of a particular manufacturer. Examples of such collections may be:
Transformers > 300 kV for a specific manufacturer, manufactured after 1990
Circuit breakers connecting lines or reactive compensation
All 33 kV substations bays in a certain region
To build up a single score/index for a collection of assets it is required that for each individual asset:
There is an individual asset health/index score for each asset in the aggregation.
The score for each asset is both calibrated and monotonic ensuring similar scores have similar
ranked ‘urgency’ and/or a time-to-act ranking alongside it. The individual health scores for all
asset have associated and uniform timescales for action; it is up to the health index system
‘operator’ to determine appropriate timescales for their organization. For example, see Table 5.1.
The individual score is an indication of the probability of a need for intervention to prevent failure
– maintenance, replacement, reduced loading etc.
There is no requirement that the code value (1, 2, 3, etc.) relate directly to the urgency or time to
act.
Log base 3 Log base 10 Linear Alpha Description Urgency Time to act
1 1 1 A Very good Negligible > 10 years
condition
3 10 2 B Good condition Low / 5-10 years 5-10 years
10 100 3 C Fair condition Moderate 2-5 years
30 1,000 4 D Poor condition High 1-2 years
100 10,000 5 E Critical condition Extremely high < 1 year
The methods for aggregation/collation of health scores for individual assets will depend on the type of
health scoring system used for the individual assets, where this may be (refer also to Table 1.1):
A Max scoring system, where the final (individual) asset score is defined by the worst scoring
condition indicator.
A logarithmic scoring system, base 3 or 10 for example, or linear system which are summed to
give an overall score.
Examples – Part 1
Note that for any aggregation it is crucial that health index scores for assets within the aggregation are
calibrated and monotonic with respect to each other in order to create a meaningful aggregated score.
In case of an aggregated score for multiple asset types, the user must be cautious of the health
scoring methodology used for the individual assets.
126
TB 858 - Asset Health Indices for Equipment in Existing Substations
Figure 5.1 – Max and Average of Asset Health Indices at a single station
How do we ascribe meaning to the two values? If the scores were generated by a means which
retains the urgency, then the Max will continue to indicate that urgency while the Average will not, as
we do not know, from the score, just how many assets are involved or how the urgency has been
maintained. The idea that the average score gives a ‘general indication’ of the health of the station is,
to a degree, valid – but it provides little in the way of understanding the volume of work/intervention
required, or the urgency thereof.
The overall health index produced by Max and Average are ‘valid’ from a mathematical point of view,
but the meaning we ascribe to them may be different and difficult to justify for the average.
If transformer T1 has a health index code A, and Breaker B1 has a health index code C, what should
their combined asset health index be?
Their Max is code C, and their ‘average’ is somewhere between A and C. What is the urgency of the
combination, and the timescale for action? These are not well defined but are important for action
planning if using an index for the combined assets.
127
TB 858 - Asset Health Indices for Equipment in Existing Substations
What if we have numeric codes, as in Table 5.3, so asset T1 is a code 2 and asset B1 is a code 3.
Table 5.3 – Example with numeric codes
The Max for T1 and B1 is 3, while the average is now 2.5, but there is no Code ‘2.5’. The Max
identifies the urgency of the two assets, but the Average is a mathematical construction and again has
no physical meaning.
What if there are three assets, with overall asset health score codes as per Table 5.4:
Table 5.4 – Combining AHI for 3 assets, alphanumeric codes
The Max is still at C or 3, but the average is somewhere between B and C, with a value of ~2.3. This
average could give false confidence in the viability of the collection. If we compare this set with the set
in Table 5.5, we can see this new set has a Max of E or 5, which reflects the urgency of work required
on the breaker, but the average is, maybe a C? And a mathematical average value of ~2.3 is the same
for Set 1 and Set 2. So, the average will not help identify the urgency within the data set.
In summary – the use of Max identifies the urgency of the worst asset or assets but does not attempt
to combine several codes into a new single code. The average is a mathematical construction with
little meaning in terms of urgency.
Note: if we have a range of ‘Probability of Failure’ (PoF) associated with each category then we can
calculate a range of PoF for the assets in that category.
128
TB 858 - Asset Health Indices for Equipment in Existing Substations
The higher the enumeration, the more urgent the intervention would be note that there is no need to
average or combine individual scores or PoF’s as the enumeration ‘automatically’ ranks more urgent
cases as higher numbers. The ranking does not indicate the PoF but would retain the PoF rank order.
In this example, set 2 has a higher ranking than set 1.
The enumeration approach is one of the methods for collating component scores discussed in the [B3]
CIGRE TB 761 “Condition Assessment of Power Transformers” (Table 2-2). As an alternative
visualisation of the enumerated scores a stacked (colour) bar chart can be used as depicted for this
example in Figure 5.2.
Option 2 – Enumeration of all available condition indicator scores for all assets
In Option1, only the overall asset score was considered. However, we lose information concerning the
number of indicators with a certain score category. It could be argued that an asset that has three
indicators with a certain score category is more likely to fail than one with one indicator in that same
category.
Consider the same two sets of assets, but now with indication of their three individual condition
indicator scores, see Table 5.7 and Table 5.8.
Table 5.7 – Example of condition indicator scores: Asset 1
Asset Set 1 T1 B1 D1
Condition indicator 1 2 3 2
Condition indicator 2 2 1 2
Condition indicator 3 1 3 2
Asset Set 2 T2 B2 D2
Condition indicator 1 1 3 1
Condition indicator 2 1 5 1
Condition indicator 3 1 1 1
The enumeration of all condition indicator scores would give (Table 5.9):
Table 5.9 – Enumeration of Combined Asset Condition Scores for Assets 1 and 2 [B3]
129
TB 858 - Asset Health Indices for Equipment in Existing Substations
Again, the higher the enumeration, the more urgent the intervention would be note that there is no
need to average or combine individual scores or PoF’s as the enumeration ‘automatically’ ranks more
urgent cases as higher numbers. The ranking does not indicate the PoF but would retain the PoF rank
order. In this example, set 2 has a higher ranking than set 1.
As an alternative visualization of the enumerated scores a stacked (colour) bar chart can be used as
depicted for this example in Figure 5.3.
Option 3 – Normalisation of all asset scores into one overall aggregate score
It can be argued that a single aggregate health score for an aggregate of assets is also a feasible way
to show an overall score. For instance, by summing all condition indicator scores for all assets to
create one number to represent the entire aggregate. However, summation of the scores for multiple
assets would prefer all individual asset scores to have an equal number of assessments/indicators.
This will allow a fair comparison between different asset scores within the aggregate and creation of a
balanced final aggregate score.
An example of this is shown in Table 4.6.8 where this is applied to GIS bays each with five
subcomponents that are each scored according to a log base 3 scoring system. The total bay score is
then determined as the sum of all the subsystem scores.
Note that a clear drawback of the method shown in Table 4.6.8 is that it needs to be calibrated to the
number of assets and indicators. For instance, if the number of assets in the bay increases from 5
to 7, the condition ranges (left column) would need to be re-calibrated. A further complicating factor
would be to determine an aggregate score for a group of assets which have a different number of
condition indicators between them.
For example, consider a simplified substation bay consisting out of a transformer, circuit breaker and a
disconnector. Assume that each asset has a health score using a base 10 log scoring system as
depicted in Table 5.10. Note that there is difference in the number of condition indicators for the
transformer (6), circuit breaker (3) and disconnector (2). The overall score for the bay is given as the
sum of the individual scores for the assets.
Table 5.10 – Example of a simplified bay
130
TB 858 - Asset Health Indices for Equipment in Existing Substations
In this case one can easily see that the overall summated score of the bay (42+21+20 = 83) is
dominated by the transformer, in this example largely due to the larger number of indicators for this
asset compared to the others. This clearly indicates that the total bay score is skewed by a difference
in number of condition indicators between assets taken together.
This could be solved using normalisation of scores, i.e. to compensate for the different number of
indicators. In the example of Table 5.10 this is done by normalising to the situation of maximum two
indicators (as is the case for the disconnector) with a maximum possible score of 20 (2 indicators with
max score 10). The normalised scores as shown in Table 5.10 show equal scores for the transformer
and circuit breaker, and the worst score for the disconnector. Since the transformer and breaker have
the same relative number of ’10’ and ‘1’ scores, this is to be expected. Similarly, both indicators of the
disconnector indicate the worst possible score, explaining why this is the asset with the overall worst
normalised score. Whether this actually proves that the disconnector is the worst asset of the
aggregate is quite questionable, since this is based on the judgement of two indicators only. In case
several other condition indicators are possible to assess but were disregarded or there simply is a lack
of data, the above example could give a false feeling of security/trust in the assessment where in fact
it is incomplete.
At which point do a number of ‘lower’ urgency codes add together to become a code of ‘higher’
urgency? And how would we calculate and manage such an effect? This does not affect simple
enumeration scores or simple Max scores, but does apply to those which use summation, and could
be applied to complex enumeration where multiple lower scores combine to a higher score.
Another problem with normalisation of scores is that a single “bad” score can become less visible in
case the remaining indicators score significantly better. The higher the number of indicators, the more
pressing this issue becomes. See for example Table 5.11, where asset 1 has eight condition
indicators, asset 2 has four. By normalising the scores, a skewed image of the actual situation is
created where it seems that asset 1 is in a significantly better condition which is not true.
Table 5.11 – Example of aggregation of scores
Users of log-based scoring systems must remain aware of compound-effects for scores that will
influence total scores of aggregated assets. An example is given in Table 5.12 where a log base 3
summation index is shown, a maximum score of 100 applies. The highest score for the indicators is 30
points. Note that the sum score is 117.
Table 5.12 – Example of a log-3 based scoring system with category promotion
Asset Score Highest score Value Number of condition indices with promotion
Asset 1 1 1st 100 1
nd
Asset 2 30 2 30 0
Asset 3 10 3rd 10 1
Asset 4 3 4th 3 2
Asset 5 10 5th 1 1
Asset 6 30
Asset 7 3
Asset 8 30
Sum score 117
131
TB 858 - Asset Health Indices for Equipment in Existing Substations
Note that again, differences in numbers of indicators for each asset will cause skewing of results. User
must be aware of this characteristic.
In case individual condition indicators are not considered linked in any way such that promotion is not
a preferred characteristic, the summated score is only assessed for the worst score and number of
indicators with this score. Then, similarly to what was discussed in earlier sections, an enumeration
system is best used to count the number of condition score categories to preserve ranking. In the
example shown before this would yield a ranking number 03221 (see Table 5.13).
Table 5.13 – Example of a Combined Score Without Category Promotion
132
TB 858 - Asset Health Indices for Equipment in Existing Substations
If we consider the single-line diagram of a feeder of Figure 5.4, it is apparent that poor health of the
assets which form a single point of failure pose a larger risk than the assets which are in parallel
(assuming that one can take over the load of the other). Based on the substation configuration and the
individual probabilities of failure of the assets in the bay, a total PoF for failure of the aggregate to
supply power to the customer could be derived and also expressed in terms of a health score.
If we assume, for the sake of simplicity, that the busbars have a failure probability of zero, the overall
bay failure probability using standard mathematics is equal to:
This can serve to compare the performance of different bays. Note that it is important to realise that
bays (or other functional aggregates for that matter) with different configurations may have different
overall failure probabilities due to the configuration itself. In some cases the user may only be
interested to rank functional aggregates in relation to the failure probability, regardless of the
configuration, this may be a deliberate choice.
133
TB 858 - Asset Health Indices for Equipment in Existing Substations
Feedback discussion
In this practical example, transformer assessments were performed using analyses of components,
with scores for each component on a log scale 1, 3, 10, 30, 100. An asset score was then calculated
using either an average of component scores or with the maximum of individual component scores. As
shown below, an Average score dilutes and loses the urgency, while a Max score retains the sense of
urgency. The results, in Table 5.16, show the Average Health Index (AHI - Avg) and the Max Health
Index (AHI – Max) based on the component scores. This is listed against the ranking from the field
engineers who reviewed the assessment data (Found) without use of an index and then ranked the
assets in order of intervention priority (Field).
Table 5.16 – Comparing weighted with Max and field engineer assessment
The two approaches show the effect of the different means of combining scores: with the same
approach being available to apply at a components-to-assets level or assets-to-aggregate level. What
was considered to be the most urgent issue by the field team received a very low ranking using the
AHI-Average; further, the AHI-Max does not give much detail for ranking within the code 30’s. Both
methods have clear limitations.
We can calculate the expected number of failures and adjust figures to reflect historic rates – or
whatever rate we choose.
An example is shown in Figure 5.5, where the PoF for different condition codes for a variety of assets
is adjusted to allow an overall target to be met.
134
TB 858 - Asset Health Indices for Equipment in Existing Substations
Note that a health index encapsulates data and analyses and does not provide ‘new’ information. This
is certainly also true for aggregated scores. Four different methodologies for aggregating individual
asset health scores were discussed:
1. Enumeration of single (overall) asset scores
2. Enumeration of all available condition indicator scores for all assets
3. Normalisation of all asset scores into one overall aggregate score
4. Focussed aggregation aimed at a functional set of assets using probability of failure
information
Both enumeration methods (1 and 2) are easy to implement, retain ranking and preserve scoring
information better compared to aggregations aiming to create a single aggregate score (3). Both these
methods are recommended to use.
Furthermore, as was demonstrated with some examples, there are risks in creating a single overall
score by use of summation methods, especially when dealing with different asset types that have
different numbers of condition indicators. Therefore, this method is only to be used when potential
pitfalls are properly addressed.
The focused aggregation is quite useful from the perspective of functionality of a network or part of a
grid but requires more effort to extract failure probability of equipment, based on condition indicators
and failure and survival data analysis. This is not a trivial exercise but does offer a mathematically
sound outcome that also further enables/feeds into further risk assessment. Although more complex,
this approach is recommendable.
135
TB 858 - Asset Health Indices for Equipment in Existing Substations
136
TB 858 - Asset Health Indices for Equipment in Existing Substations
6. Conclusion
The role of this working group has been to produce a version of the Asset Health Index methodology
applicable to the various assets within a transmission substation. By defining the likelihood of failure of
each asset over various time scales it provides action plans for remediation or replacement. The
outcome leads to a list of assets such as the asset register but modified with additional data relating to
the condition of the asset in terms of possible failure modes. The second role of AHI analyses is to
combine failure likelihoods across the asset classes contained within a circuit end, bay or even a
complete substation. This is an outcome to be used for strategic planning and operational decision
making.
Important features are:
1. The use of AHIs is important to the application of condition-based decision making. As such it
is also the likelihood step within risk-based decision making. An important point is that there
should be a link between related diagnostics and failure mode modes. This comes through the
application of FMEA methods. The second important consideration is that the rate change in
the indicators of deterioration is sufficient for detection, decision making and corrective actions.
2. The AHI outcomes should not be confused with age, the period of prior service. Aspects such
as inferior designs, age, duty cycle and operational environment are not failure modes. They
are relevant in this context as hazard factors that might emphasise the likelihood of particular
failure modes becoming more dominant for a specific asset.
3. The AHI approach is based upon a systematic application of all failure modes, and so has the
attraction over a simple assessment based upon just one or two diagnostics. The resulting set
of AHIs should be calibrated for time. The AHI must uniformly reflect the same urgency of
intervention. Any AHI should identify changing likelihood with time periods by creating an
action plan for an intervention – maintenance, repair, or replacement. All assets with the same
score should have the same timescale for intervention, irrespective of failure mode or asset
type, otherwise there is confusion in applying AHIs consistently. A ‘poorer’ AHI should always
reflect a more urgent condition. This means that where several failure modes are being
assessed and the scores aggregated the method of aggregation should not produce any
violation of this principle.
4. The AHI is created by assimilating assessments from a number of Failure Mode scores.
Reducing scores from different failure modes into a single unified number is attractive,
particularly when the purpose is to create a prioritised action list. One option is to use only the
worst failure mode score to apply to the whole asset or across a bay or substation covering
many assets. This approach has both attractions and limitations. In order to achieve
prioritisation within an asset class it will be necessary to introduce time limited subcategories.
An alternative approach is where an exponential based score set could be added and so reflect
the condition of all failure modes. However, such an aggregation should not dilute a dominant
score of one failure mode, nor preclude any reverse audit. Combining scores across different
asset classes to provide a bay wide score again has operational attractions but also introduces
further complications. There can be no simple adding of scores when so many assets and
failure modes are involved.
5. The methodology should allow an auditable and direct trail between the outcome and
supporting evidence. The output should be clear, auditable and justifiable by those needing to
make decisions based on the output. And that it is not just a number as an output from an
automated analysis.
6. Creating an AHI approach is costly in time and effort. It is essential before starting out to clearly
establish the benefits and potential for cost savings and maintaining core business attributes of
safety, performance and reputation. This would identify the review level of the AHI method to
be adopted for each class of assets.
7. There needs to be an audit process – to tear down and examine forensically any unit scrapped
in order to confirm the AHI process. This should indicate the existence of all active failure
modes and their relationship to the assessment and AHIs made whilst the unit was in service.
With improvements identified it then can become part of the asset life plan within ISO 55000.
137
TB 858 - Asset Health Indices for Equipment in Existing Substations
138
TB 858 - Asset Health Indices for Equipment in Existing Substations
139
TB 858 - Asset Health Indices for Equipment in Existing Substations
140
TB 858 - Asset Health Indices for Equipment in Existing Substations
141
TB 858 - Asset Health Indices for Equipment in Existing Substations
[B27] TB 512 “Final Report of the 2004 - 2007 International Enquiry on Reliability of High Voltage
Equipment, Part 4”, WG A3.06
[B28] "Acceptance Testing Specifications for Electrical Power Distribution Equipment and
Systems”, NETA
[B29] TB 513 (2012) “Final Report of the 2004-2007, International Enquiry on Reliability of High
Voltage Equipment, Part 5: Gas Insulated Switchgear (GIS),” WG A3.06
[B30] “The Delphi Method: techniques and applications,” 1975, H.A. Linstone, M. Turoff
[B31] “Statistical Lifetime Management for Energy Network Components,” PhD Thesis, TU Delft,
the Netherlands, 2012, R. A. Jongen
[B32] IEC 60099-4 “Surge arresters - Part 4: Metal-oxide surge arresters without gaps for a.c.
systems”
[B33] “Condition Monitoring of Surge Arresters”, IndiaDoble symposium Delhi, 2005, R. K.
Tyagi
[B34] “Capacitor bank failure investigation”, Doble Engineering Conference, 2007, L. Pong and J-F
Chrétien
[B35] “Experience with leakage-current testing of 380 kV MOV surge arresters in the field,
utilizing an lcm portable instrument – section 9-3”, Proceedings of the 1994 International
Conference of Doble Clients, P Leemans and G Moulaert
[B36] TB 775 (2019) “Transformer bushing Reliability”
[B37] “Update – field testing capacitor bank with M4000 test instrument”, Doble Engineering
Conference, 2007, L Pong and D Wheat
[B38] “Managing Bushings: From Statistics to Singularities – Where to Focus?” Transformer
Technology, Issue 8, 2020, T. McGrail, https://www.transformer-technology.com/community-
hub/technical-articles.
[B39] “Advanced condition monitoring method for high voltage overhead lines based on visual
inspection”, IEEE PES General Meeting, Portland, 2018, H. Manninen, J. Kilter,
M. Landsberg
[B40] TB 300 (2006) “Guidelines to an optimized approach to the renewal of existing air insulated
substations Working”, WG B3.03
[B41] “Transformer Asset Management: How Well Are We Doing And Where Do We Need To Do
Better?”, International Conference of Doble Clients, Boston, 2015, Ryder S., Jarman P.,
Heywood R.
[B42] “T. (2016). Deriving a Useful Asset Health Index - Getting Started, Getting Value and Making
Use of Them”, Doble Client Conference, 2016, McGrail, T., Heywood, R.
[B43] “Transformer Health Index and Probability of Failure Based on Failure Mode Effects Analysis
(FMEA) of a Reliability Centered Maintenance Program (RCM)” CIGRE Session paper A2-
110, Paris, 2016, P. Lorin et al
[B44] “DNO Common Network Asset Indices Methodology”, UK Regulator, OFGEM, 2017,
www.ofgem.gov.uk/system/files/docs/2017/05/dno_common_network_asset_indices_method
ology_v1.1
[B45] “Hydro One Distribution - ACA Summary Report”, Hydro One, 2005
[B46] “Aggregate Health Indices as Used for Asset Investment Decisions and Universal
Understanding”, EuroDoble, UK, 2007, Kydd T.
[B47] “Transformer Condition Assessment”, International Doble Conference, 2003, Bennett, G.
[B48] “Development of Transformer Health Index (THI) – TATA Power Experience”, 13th IndiaDoble
Power Forum, 2015, Kini M.V. et al
[B49] “Current Situation and Recent Challenges in Asset Management of Aging T&D Substation
Facilities in Japan”, CIGRE Paper B3_302_2016, Paris, 2016, Kobayashi T., et al
142
TB 858 - Asset Health Indices for Equipment in Existing Substations
[B50] “Procedure for Using Condition Based Maintenance to Create the Health Indices of
Transmission Power Lines: A Case Study of the Kenyan Coast”, International Doble
Conference, 2012, Bosire E. & Yarrow A.
[B51] “Modern Insulation Condition Assessment for Instrument Transformers”, IEEE International
Conference on Condition Monitoring and Diagnosis, Bali, 2012, Stephanie Raetzke, Maik
Koch, Martin Anglhuber
[B52] TB 234 (2003) "SF6 Recycling Guide"
[B53] TB 567 (2014) "SF6 Analysis for AIS, GIS and MTS Condition Assessment”
143
TB 858 - Asset Health Indices for Equipment in Existing Substations
144
TB 858 - Asset Health Indices for Equipment in Existing Substations
Note, there is a point at which an organization may consider the probability and consequent risk of
failure in a 12-month period to be too high – at which point it is incumbent upon them to remove the
asset to meet their risk aversion. The level of risk aversion will vary between organizations. What is an
acceptable level of failure for an asset? For an asset group?
So – if we have a set of three independent assets of condition code B, what is the range of probability
of failure, p, for the set?
145
TB 858 - Asset Health Indices for Equipment in Existing Substations
The range must be somewhere between the three assets all being at 1% and all three assets being at
2% giving a range, as long as the assets are independent:
(1 – (1 − 0.01)3 ) < 𝑝 < (1 – (1 − 0.02)3 ) ≥ 2.97% < 𝑝 < 5.88%
The higher value shows that the set of three assets could easily now be a Code D.
The problem is that manipulating codes, rather than the root probability data, means that we have lost
sight of the data which puts it in a particular category to start with. The best approach would be to use
the original data – however it was derived or estimated - such that it allowed us to place the asset in a
particular category.
It may be that the placement of an asset in a category was done by pure estimate – a Delphic2
Approach, for example? – and the limits for the category are the best we have. In which case math of
the limits for the category may always be used to provide a range for a set of assets: a range which
depends on the number of assets.
The situation is compounded if the assets have an influence on each other such that the probability of
failure of a transformer is decreased if used in combination with a surge arrester or increased if used
with a breaker prone to restrike. The interconnectedness of assets may well affect their individual
PoF’s.
146
TB 858 - Asset Health Indices for Equipment in Existing Substations
D.1. UK TSO
This refers to the publication from the 2014 CIGRE Paris session [B14]. It describes a method
developed some 20 years ago by the TSO at the instigation of the UK Regulator who wanted to see
some justification for the transformer capital replacement plan. This was coupled with providing
evidence that the company was managing its supply business with due regard to the management of
risk, cost control and supply performance. This methodology has been used by the utility in its UK and
USA networks and their experience was described in this 2014 publication. It separates the population
of about 800 transformers into categories shown in Table D.1.
It is a 5-year plan to be used with the Regulator for justifying capital replacement planning in the
5-year rate review periods. It is not a prioritisation for planning maintenance, repairs etc, nor is it
directly addressing likelihood to fail.
It identifies which transformers are faulty and likely time scale for the end of life – AHI 1 and 2.
Noteworthy is that age is specifically not factored into the assessment since for this utility the
failure rate does not increase with time. Conversely their failure rates show that in the first 10
years it is 3× higher than the random failure rate over the following 40 years [B10]. However, low
asset lives can be associated with a poor performing design. Consequently, a unit with such a
poorly performing design but even with no evidence of the developing deterioration is identified in
a special category (3 in table) and requires increased surveillance. Transformers that are
deteriorating are assessed by their condition and failure mode into categories 1 and 2.
It allows clear identification of the dominant failure mode, but as developed it related to (only) the
transformer tank, core and windings, where any deterioration would likely lead to a replacement
unit. Component parts such as cooling, oil, OLTC, bushings and control systems could be
refurbished or replaced independently of the main tank and not included in this AHI.
Assessment has been achieved using a base 3 logarithmic scale, using no weighting to the
condition assessments in order to allow a clear identification of the weak link.
Whenever a transformer is being scrapped (for whatever reason), the company engages
specialist design engineers and forensic scientists to study the extent of deterioration, confirming
both the selected failure modes and the AHI ascribed prior to scrapping. This allows an audit of
the process as well as ensuring it evolves with more knowledge. The results of this audit were
described in the 2014 publication [B14].
This system has been used in UK since the mid-1990s and initially used by specialist transformer
engineers based within the utility’s Technology Division. After the closure of this unit in 2002 this
specialist engineering work transferred to an outsourced service provider. At this point the
methodology evolved beyond life-limiting failure modes to include all failure modes where failure could
prevent the transformer from meeting its performance requirement. The outcomes reflect likelihood to
fail and the impact on the score if recommended remediation is then carried out. As such this method
has been used by the service provider in other utilities in USA, Africa and Middle East.
147
TB 858 - Asset Health Indices for Equipment in Existing Substations
148
TB 858 - Asset Health Indices for Equipment in Existing Substations
They use a linear scoring system but recognise that simply adding scores hides damage. They prefer
to note all assessments with the number of aspects that score in each of the bands, as per Figure D.2.
Within any tabulation this score would translate to be 035310 and would be ranked higher than others
with scores 026310 or 04410 for example. This is acceptable because each category is calibrated in
terms of having the same time scales for activity to take place.
149
TB 858 - Asset Health Indices for Equipment in Existing Substations
150
TB 858 - Asset Health Indices for Equipment in Existing Substations
The problem with such a weighted system is the difficulty relating AHI and LoF. Furthermore, as
discussed earlier, a weighted aggregated AHI may actually be masking assets in poor condition and
may not be a good indication of overall health.
Data was assessed according to standards, experience and relevance. On this basis their fleet of
nearly 600 transformers were allocated into five classes ranging from those in need of more
immediate intervention to those operating as expected. There is no asset health index with scoring – it
is simply using diagnostic data to categorise the units into these five bands. They believe their
assessment has proven to be 96% correct. There is, however, no formal relationship identified
between coding and Likelihood of Failure (LoF).
151
TB 858 - Asset Health Indices for Equipment in Existing Substations
The final score is the sum of the weighted values. This is not unusual and is open to dilution effects as
described earlier. To mitigate this, although the normal scores are 1 (poor) to 4 (very good), there is a
fifth “special” category for units assessed to be “very poor” which has a score 0.01. This introduces an
element of extreme logarithmic scoring and so over-rules the scoring and weighting used in their
standard system. The AHI (or THI as authors refer to it) is developed based on a 1-100 system, similar
to others in the industry. Once sub-component weights are identified, major component analysis takes
place.
Figure F.1is a flowchart for THI. It shows that the process starts by identifying the analysis of basic or
Tier 1 factors. Manually assessed factors that are described as Tier 2 are used to adjust the final THI.
152
TB 858 - Asset Health Indices for Equipment in Existing Substations
Their process of developing the AHI is formal, but there are areas where judgment and heuristics
apply. The weighted system focuses on known failure modes and related diagnostics. The results
have estimated timescales that provide a basis for a loose LoF calculation. But the weighting
approach and so many parameters means that the relation is difficult to calibrate and evaluate, and
any sense of urgency is lost.
An AHI for 66 kV transformers has been developed. It is a mix of conditions and consequences, which
makes it more suited to risk analysis. The problems are the intermingling of possible condition factors
such as rising DGA or poor tap changer condition, with criticality factors such as presence of PCBs in
oil entailing a more substantial clean up. The system does not seem to be calibrated and has different
scores possible for different contributing factors. DGA may be up to 10 points, while winding
configuration may be allowed to score up to 47 points.
The calibration of scales is not stated. The system is proposed to allow ranking for replacement, and
the index developed is an indication of where to look. At present, we believe there would be little
chance of developing a relationship between the AHI and LoF.
LoF is discussed only qualitatively and the relation between the AHI and LoF is neither described nor
identified.
The authorErreur ! Source du renvoi introuvable. described a case study on transmission line AHIs
focusing on assets in one district. Four categories are defined, based on assessed imminence of
failure and the AHI developed previously:
CR1 is a condition in which there is no detectable or measurable deterioration and no increased
probability of failure.
CR2 is where there is evidence of deterioration that is considered to be normal ageing and has
no significant effect on the probability of failure.
153
TB 858 - Asset Health Indices for Equipment in Existing Substations
CR3 is a condition where there is significant deterioration that increases the probability of failure
in the short to medium term.
CR4 represents severe degradation and indicates an immediate, significant increase in the
probability of failure.
154
TB 858 - Asset Health Indices for Equipment in Existing Substations
The relationship described is monotonic, indicating that the lower the AHI, the more likely a unit is to
fail. The AHI is used to generate a LoF, and the LoF is used to drive action.
The problem with weighted systems will again arise and some units with a higher AHI may actually
have an increased likelihood of failure. In addition, the paper states that LoF will rise with age; which is
contrary to what has been noted by many other investigations and analysis. However, this may be true
depending on failure modes for lines where the condition can be improved via maintenance, e.g.
painting.
155
TB 858 - Asset Health Indices for Equipment in Existing Substations
156
TB 858 - Asset Health Indices for Equipment in Existing Substations
157
TB 858 - Asset Health Indices for Equipment in Existing Substations
Strategic condition
Service know-how
Spare parts
Experiences
Number of components in the grid
While the first group describes the actual condition of the installations, which can be expressed, for
example, by the hazard rate, the second group describes a strategic condition that leads to an artificial
aging of the installation. In this case, this has no effect on the failure rate of the installation. The
consequence is that this review can perform an investment control to replace individual devices or
groups earlier, for example, to force a change in technology or implement a new overall strategy. In
this case, the assessment would assign "artificially" a poor condition to the asset, so that, for example,
a replacement occurs earlier.
In the example shown in Table G.2, the final condition index c is calculated by these individual
evaluations as relative weighted sum of these values (weighting). The following rule is applied for the
considered equipment: the larger the calculated index c, the worse the condition of the equipment. For
c = 0, the equipment is in excellent condition, for c = 100 in very poor condition.
Table G.2 – Criteria for the condition assessment of equipment in case of a circuit-breaker
158
TB 858 - Asset Health Indices for Equipment in Existing Substations
In general, it is possible that an assessment of equipment consists of many criteria. A single poor
assessment can be perfectly compensated by other assessments, so this poor rating is not noticeable
in the overall result. To solve this problem, a message should occur, if a threshold is exceeded so that
immediate maintenance actions must be carried out.
The AHI algorithm accomplishes new benefits for the operational business. The number of oil
inspections can be reduced dramatically. The investigation of 50 % of transformers revealed the
possibility to predict the failure of 85 % of the critical transformers Therefore it is possible to reduce the
operational costs substantially. In the meantime, the second stage of this pilot project has started
already. Moreover, the described strategy will be continued with an expanded database (local
transformer data). The user interfaces will be further developed to enhance the usability of the
algorithms.
159
TB 858 - Asset Health Indices for Equipment in Existing Substations
The measured values may come from either monitoring systems or measurements.
An OEM-invented weighting system for condition parameter aggregation is in use. It is called the
weighted probability average (WPA which weights in general worse values exponentially higher than
good ones. The exponential function is equivalent to the probability of a failure.
160
TB 858 - Asset Health Indices for Equipment in Existing Substations
161
TB 858 - Asset Health Indices for Equipment in Existing Substations
162
TB 858 - Asset Health Indices for Equipment in Existing Substations
163
TB 858 - Asset Health Indices for Equipment in Existing Substations
164
TB 858 - Asset Health Indices for Equipment in Existing Substations
For each asset the diagnostic portfolio is determined together with values of acceptability. The range
of operating values is then divided into 5 sub-bands, assigned to each individual assessment:
"4" – There is no deviation of the measured parameters to requirements of normative and
technical and/or design (project) documentation that equipment complies with the required
functions in full.
"3" – Measured parameters are within the values defined by the normative and technical
documentation and/or design (project) documentation, but there is a tendency of deterioration in
the values of such parameter.
"2" – Measured parameters are within the values defined by the normative and technical
documentation and/or design (project) documentation, but there is a threat of occurrence of
failures
"1" – Measured parameters are at the level of maximum permissible values, defined by the
normative and technical documentation and/or design (project) documentation that equipment
complies with the required functions not fully
"0" – Measured parameters are outside of maximum permissible values, defined by the
normative and technical documentation and/or design (project) documentation
Each parameter is assigned to a design function (insulation, active part, magnetic, OLTC, bushings,
etc.). A weighting of the importance of the parameter to assessment of the function is assigned with
their sum adding to 1.
The score 0-4 is expressed as a fraction of 4 and then converted to a percent by:
𝑠𝑐𝑜𝑟𝑒
100 × 𝑤𝑒𝑖𝑔ℎ𝑡𝑖𝑛𝑔 ×
4
That percentage is again weighted by importance to yield a health index. This is illustrated in Table
G.4 for a transformer assessment. The total is then categorised in terms of excellent to critical, as
shown. The further step is to evaluate risk in order to identify appropriate next steps.
Table G.4 – Health index calculation.
165
TB 858 - Asset Health Indices for Equipment in Existing Substations
FAILURE RATE per 100 CB-bay years Fail to perform Dielectric breakdown
inside outside CIGRE [2] Utility CIGRE [2] Utility CIGRE [2]
The difference in failure patterns is attributed to the tropical environment. This is high average annual
relative humidity of 80% at high ambient temperatures of 27° C. The annual lightning flash density is
high at 15 per km2. In the wet half year there is an average of 200 mm rainfall. Corrosion is common,
exacerbated by pollution near Jakarta. This leads to leaks and acidic attack on gaskets. The AHI
methodology attempts to rank the system in terms of likely to fail periods. It introduces a cascade
based on AHI assessed firstly for parts, then components enclosures, bays and finally the AHI for the
substation.
They consider GIS as consisting of five subsystems:
1. Primary conductor (including main contacts)
2. Dielectric
3. Driving mechanism
4. Secondary
5. Construction and support
Each is analysed using FMEA methods, identifying diagnostic indicators to be applied. An example of
one of the most important indicators (the dielectric with its scale code indices) is shown in Table G.6.
The approach is in line with this TB, with a base 3 log scores (1,3,10,30 and 100) which are then
added.
Interesting point is the experience of cascading from parts to substations (example of which is shown
in Table G.7) for the consolidation from component to bay levels.
The other innovation is the use of “failure susceptibility indicators”. There are three used, and each is
assessed as low, medium and high in terms of impact in creating a failure:
Environmental
Pollutant level
Lightning density
Design
Short life failure modes
O-ring materials
Absorbent use
They and their assessments are used as guides for those preparing AHI by utilising the side notes.
These are important factors to help creating AHI. They are not failure modes nor are they evidence of
an active failure mode.
166
TB 858 - Asset Health Indices for Equipment in Existing Substations
167
ISBN : 978-2-85873-563-1
TECHNICAL BROCHURES
©2021 - CIGRE
Reference 858 - December 2021