Professional Documents
Culture Documents
Unit 1 Principles and Practices of Maintenance Planning KVN
Unit 1 Principles and Practices of Maintenance Planning KVN
1
• Good maintenance engineering is vital to the success of any manufacturing or processing
operation, regardless of size
• The maintenance engineer is responsible for the efficiency of daily operations and for
discovering and solving any operational problems in the plant
• A company's success may depend on a quality maintenance engineering department that
can be depended upon to discover systematic flaws and recommend solid, practical
solutions.
Definition of Maintenance
Maintenance is the routine and recurring process of keeping a particular machine or asset in its
normal operating conditions, so that it can deliver the expected performance or service without
any loss or damage.
Maintenance is defined as
All actions necessary for retaining an item, or restoring to it, a serviceable condition, include
• servicing, repair, modification, overhaul, inspection and condition verification
• Increase availability of a system
• Keep system’s equipment in working order
Purpose of Maintenance
• Attempt to maximize performance of production equipment efficiently and regularly
• Increase reliability of the operating systems
• Minimize production loss from failures
• Prevent breakdown or failures
2
• To keep the time schedule of delivery to the customers or to the sections for
further processing.
• To meet the availability requirements for critical equipments.
• To keep the maintenance costs as low as possible for non critical equipments.
• To control the cost of maintenance related activities
• To provide effective and trained supervision.
• To meet the quality requirements of the product.
• To increase the profits of production systems.
3
the modern techniques, recent trends in maintenance and to chalk out a strategy to meet
the growing demands of the industry.
4
MAINTENANCE FUNCTIONS AND ACTIVITIES
• Identifying areas for implementation of preventive maintenance program
• Making suitable arrangements for maintenance facilities for carrying out the maintenance
work properly
• Planning and scheduling the total maintenance work
• Ensuring proper and timely supply of spare parts
• Managing proper inventory control of materials spares an tools required for the
• maintenance
• standardization of maintenance work
• Impleting modifications to the existing wherever possible
• Assisting the purchase department in procuring materials Disbursement of services such
as water, electricity, steam, compressed air and other amenities required to carry out the
maintenance
• Identification of obsolete and surplus equipment for replacement and disposal
• Designing the systematic way for disposal of equipment an for maintaining floor space
• Training of maintenance personnel
• Analysis of future demands and forecast the role of maintenance activities
• Implementation safety norms and procedures
• Ensuring safety of personnel and equipment
Decentralized
This is suitable for large sized plants where inter unit communication is difficult to get. In
this type of organization, the maintenance is under the control of chief engineer of
production to ensure better understanding between the production and maintenance
department.
Centralized
This is suitable for small units where unit communication is feasible. In this type of
organization, the maintenance is under control of chief maintenance engineer. The
responsibilities and accountability is with the concerned department heads.
Partially Centralized
This is the modified version of centralized maintenance organization and suitable for the
industry where units are located at far away locations. In this type of organization, the
maintenance personal attached with production unit will carry out the routine
maintenance works. Scheduled maintenance works such as overhauls, planned
maintenance work, procurement of spare parts are under the control of chief maintenance
engineer at the central office.
5
TYPES OF ORGANIZATION:
There are basically two at least two types of organization are followed in most of the industries.
They are
• Line Organization
• Line staff Organization
LINE ORGANISATION
Line organization consists of a general foreman and a number of specialist foremen with their
team under them as shown in figure. The specialist foremen execute maintenance work in their
respective areas while the general foreman supervises the total work under his control and
coordinates the various maintenance tasks carried out in the industry,. This kind of structure is an
old type maintenance organization.
6
MAINTENANCE FUNCTIONAL ORGANIZATION
Maintenance functional organization is the structure based on craft concept. In this organization
structure persons joined the organization as apprentices and elevated to higher positions such as
foreman after sufficient experience in their respective jobs. This organization structure is based
on the craft skills acquired through experience. There is no formal training mechanism available
at each stage of the organization. The recent trends in maintenance management also favors this
kind of functional based organization based on craft skills. Figure below shows the maintenance
functional organization in which few workers are placed under each functional foreman.
7
MAINTENANCE ECONOMICS
Life-cycle cost analysis (LCCA) is a tool to determine the most cost-effective option among different
competing alternatives to purchase, own, operate, maintain and, finally, dispose of an object or process,
when each is equally appropriate to be implemented on technical grounds.
For example, for a highway pavement, in addition to the initial construction cost, LCCA
takes into account all the user costs, (e.g., reduced capacity at work zones), and agency
costs related to future activities, including future periodic maintenance and rehabilitation.
All the costs are usually discounted and total to a present day value known as net present
value (NPV). This example can be generalized on any type of material, product, or
system.
In order to perform a LCCA scoping is critical - what aspects are to be included and what
not? If the scope becomes too large the tool may become impractical to use and of limited
ability to help in decision-making and consideration of alternatives; if the scope is too
small then the results may be skewed by the choice of factors considered such that the
output becomes unreliable or partisan. Usually the LCCA term implies that energy and
8
environmental costs are included, whereas the similar Whole Life Costing generally has a
reduced scope.
The factors to be considered in the purchase of equipment of industries include the cost,
quality, performance and maintenance requirements. Some balance is to be made
between the capital cost and operating cost of the equipment in finding the suitability of
the equipment. The evaluation of any equipment for purchase should be made by keeping
into considerations that total cost incurred by the equipment over a span of time say ten
years. Life cycle costing is the cost analysis for the equipment in an industry that
accounts the total cost of the equipment over a span of time which includes the capital
cost, operating cost and maintenance costs. This analysis is the integration of
engineering, economic and financial strategies in relation to the equipment to be
purchased. The aim of life cycle costing is to ascertain the total cost of equipment over
the span of its entire life period.
9
Maintenance Cost:
Budgets are allocated for all the activities in planning stage itself which includes the
maintenance cost. The cost of maintenance is difficult to measure due to random nature
of failures.
One universal measurement of maintenance performance, and perhaps the measure that
matters most in the end, is the cost of maintenance. Unfortunately maintenance costs are
often used to compare maintenance performance between companies or between plants
within the same company. Equally unfortunately, there is no standard for measuring
maintenance costs. Each company, usually each plant within a company and often each
department within a plant develop their own definition of "maintenance costs." For this
reason, maintenance cost comparisons should always be accompanied by a clear
definition of what is included and excluded for each plant included in the comparison.
The records on maintenance history may be useful in determining the cost. The analysis
of maintenance cost is helpful in taking a decision regarding replacement of a machine or
any of its components.
10
Fig. Reliability Vs Maintenance cost
As the reliability cost decreases maintenance cost increases. And when more money is
spent on increasing the reliability the availability increases and the maintenance costs is
low. However these increase and decrease or disproportionate as visible from the figure
and from the total cost curve (total cost = reliability cost + maintenance cost), optimum
cost can be obtained.
Optimum cost is the tradeoff between the amount spent on equipment to improve
reliability and amount spent on equipment in reduce maintenance cost.
Maintenance Budget:
The need for a maintenance budget arises from the overall budgeting need of corporate
management and involves estimation of the cost of the resources (labour, spares etc.) that
will be needed in the next financial year to meet the expected maintenance workload. The
maintenance life plans and schedule have been laid down to achieve the maintenance
objective (which incorporates the production needs, e.g. operating pattern and
availability) and in turn generates the maintenance workload.
The maintenance budget is used to set aside certain amount of money to meet the
expenditures incurred in achieving the objectives of maintenance. The following are the
types of maintenance budget,
11
FACTORS CONSIDERED FOR EVALUATION OF MAINTENANCE COST
• Cost of maintenance from the recorded data (past experience
• Level and requirements of maintenance
• Cost of spare parts and materials
• Cost of replacement of components and assemblies subjected wear and tear
• Accounting the number of breakdowns with their levels
• Downtimes of the equipment for want of maintenance and repair
• Penalty cost due to the loss of production
• Cost of manpower involved
• Cost of additional manpower requirements for emergency breakdown and maintenance.
12
Advantages of Life Cycle Costing
• Integration of engineering, economics and financial aspects lead to the way of robust
metric for the selection and purchase equipment required for the industry.
• Reduced operating and maintenance cost of equipments due to cost analysis over span of
time.
• It leads to the selection equipment
13
• Most companies complete their preventive maintenance in the cooler months of
the year.
• Compressor and evaporator rebuilds are easier to perform in the winter because
the temperatures are not as hot, requiring less of a refrigeration load. This offsets
labour costs, too.
14
RELIABILITY
Reliability may be defined in several ways: The idea that an item is fit for a purpose with respect
to time. In the most discrete and practical sense: "Items that do not fail in use are reliable" and
"Items that do fail in use are not reliable".
The capacity of a designed, produced or maintained item to perform as required over time. The
capacity of a population of designed, produced or maintained items to perform as required over
time. The resistance to failure of an item over time. The probability of an item to perform a
required function under stated conditions for a specified period of time. In line with the creation
of safety cases for safety, the goal is to provide a robust set of qualitative and quantitative
evidence that an item or system will not contain unacceptable risk.
The basic sorts of steps to take are to: First thoroughly identify as many as possible reliability
hazards (e.g. relevant System Failure Scenarios item Failure modes, the basic Failure
mechanisms and root causes) by specific analysis or tests. Assess the Risk associated with them
by analysis and testing. Propose mitigations by which the risks may be lowered and controlled to
an acceptable level. Select the best mitigations and get agreement on final (accepted) Risk
Levels, possible based on cost-benefit analysis.
RELIABILITY THEORY
Reliability is defined as the probability that a device will perform its intended function during a
specified period of time under stated conditions. Mathematically, this may be expressed as,
Where is the failure probability density function and is the length of the period of time
(which is assumed to start from time zero).
Reliability
15
In Phase I, the failure pattern inherent in a new product because of manufacturing or design
defects. Therefore, the failure rate at the beginning of infant mortality stage is high and then it
decreases with time after early failures are removed by burn-in or other stress screening methods.
It is referred to infant mortality period of equipment.
Phase II shows the useful life period of an equipment where the failure rates are normally
moderate as the equipment gets set to the working environment. This period is usually given the
most consideration during the design stage and it is the most significant period for reliability
prediction and evaluation activities.
In Phase III, the failures are occurring due to wear outs which are caused due to ageing of the
equipment. The equipment life cycle is the essential requirement for the prediction of system
reliability. The other important parameters such as repair time distribution can be used to
estimate availability, maintainability and level of corrective and preventive maintenance.
Availability
Availability of a system may also be increased by the strategy of focusing on increasing
testability, diagnostics and maintainability and not on reliability. Improving maintainability
during the early design phase is generally easier than reliability (and Testability & diagnostics).
Availability is the ratio of the time at which the equipment is available for the designated
operation service to the total time of operation and maintenance of the equipment. It is also
defined as the ration of equipments uptime to the equipment uptime and down time over a
specified period of time.
The uptime of a machine / equipment is the time for which it is in actually available to
complete the desired function. The downtime or outage of a machine is the period of time
during which it is not in an acceptable working condition.
16
AVAILABILITY
A Reliability Program Plan may also be used to evaluate and improve availability of a system
by the strategy on focusing on increasing testability & maintainability and not on reliability.
Improving maintainability is generally easier than reliability. Maintainability estimates (Repair
rates) are also generally more accurate. However, because the uncertainties in the reliability
estimates are in most cases very large, it is likely to dominate the availability (prediction
uncertainty) problem; even in the case maintainability levels are very high.
When reliability is not under control more complicated issues may arise, like manpower
(maintainers / customer service capability) shortage, spare part availability, logistic delays, lack
of repair facilities, extensive retro-fit and complex configuration management costs and others.
The problem of unreliability may be increased also due to the "Domino effect" of maintenance
induced failures after repairs.
Only focusing on maintainability is therefore not enough. If failures are prevented, none of the
others are of any importance and therefore reliability is generally regarded as the most important
part of availability.
Achieved Availability:
It is the probability that a system or equipment shall operate satisfactorily when used
under prescribed conditions in an ideal support environment with periodic preventive and
corrective maintenance at any given time.
Achieved availability = MTBM/ MTBM+M
Operational Availability:
In industrial system a certain amount of delay will always caused by time element such as
supply downtime and administrative downtime.
Operational availability = MTBM/ MTBM+MDT
Where MDT is the mean downtime is the satisfied mean of the downtimes including the
supply downtime and administrative downtime.
17
In general availability of a system is a complex function of reliability, maintainability and
supply effectiveness.
As = f (Rs, Ms, Se)
In such a test, the product is expected to fail in the lab just as it would have failed in the field—
but in much less time.
The main objective of an accelerated test is either of the following:
• To discover failure modes.
• To predict the normal field life from the high stress lab life.
Accelerate cause (something) to happen sooner, Life is the period of duration, usefulness,
or popularity of something, testing means by which the presence, quality, or genuineness
of anything is determined by means of trial.
Goals of ALT Accelerated Life Testing: Most Important goals of Up Front Product Life
Testing (In-house or Beta Sites) & Data analyses are to gain information for Fundamental
Improvements and Proactive reliability improvement before Product Release.
➢ Purpose of ALT: Reliability estimation at user level and dominant failures
mechanism identification.
Types of accelerated tests: a) ESS and burn-in. b) Qualitative tests, c) Quantitative tests
➢ Burn-in:
Burn-in can be regarded as special case of ESS. It is a test performed for the purpose of
screening or eliminating marginal devices. These devices are those with inherent defects
or defects resulting from manufacturing aberrations which cause time and stress
dependent failures. As with ESS, burn-in is performed on the entire population.
➢ Qualitative tests:
An accelerated test that yields failure information or failure modes only is commonly
called a qualitative test or elephant test. Over stressing of products to “quickly” obtain
18
failures is perhaps the oldest form of reliability testing. It increases reliability by
revealing probable failure modes.
➢ Quantitative tests:
Quantitative accelerated life testing, unlike Qualitative testing, is designed to provide
reliability information on the product, component or system. Data needed in quantitative
test is time to failure, such as hours, days, cycles, miles, actuations etc.
Common form of ALT is continuous use acceleration. Estimate the life distribution of the
product in a shorter time.
The objective of this lecture is to bring clarity in understanding the two often confused terms
viz, Availability and Reliability, by explaining in simple perspective for the purpose of
understanding by a common maintenance man.
Let’s try to understand through this picture.
19
Here I want to share a screen-shot ( used in one of my previous documents ) . This has a realtime
data of an Equipment with performance relevant to present discussion. See this picture.
The Equipment03 in Year 1314 has Availability of the order 99.58% but
lowest Reliability (MTBR) of 95.57 Hrs. Compare this with the 2nd line: Equipment02 has
lowest Availability 99.39% but good Reliability (MTBR) around 400 Hrs.
Then, a question might arise, that ‘Why Reliability can not be directly based on the No. of
Failures?’ The answer could be ‘Yes, it is ! But it is calculated as a function of No. of Failures
per a Specified period , where this period might differ from process to process and hence the
acceptability of Reliability Index‘.
This is the time-line of a particular Equipment where U is Operating time (Uptime in Hrs), D is
Repair time (Downtime in Hrs). A total period of 6 weeks has been taken for analysis.
Two cases have been depicted here .
Case1:
No. of Failures = 6 (Denominator for MTTR, MTBR Calculations)
Total Uptime = U1 + U2 +U3 + U4 + U5 + U6 + U7. = Say 900 Hrs.
Total Downtime = D1 + D2 + D3 + D4 + D5 + D6. = Say 108 Hrs.
We know that MTTR (Mean Time to Repair in Hrs) = ( D1 + D2 + D3 + D4 + D5 + D6 ) / 6
= 18 .
Similarly MTBR (Mean Time Between Repairs in Hrs) = ( U1 + U2 +U3 + U4 + U5 + U6 +
U7 ) / 6 = 150 .
Now,
Equipment Availability (%) is: UpTime / Total Time = (900 / 1008) * 100 = 89.2
Another formula for Equipment Availability in practice is [MTBR / (MTTR + MTBR)] * 100
= (150 / 168 )*100 = 89.2
20
Case2:
No. of Failures = 2 (Denominator for MTTR, MTBR Calculations)
Suppose here too we get the same total values like:
Total Uptime = U1 + U2 +U3 = Say 900 Hrs
Total Downtime = D1 + D2 = Say 108 Hrs
MTTR (H) = ( D1 + D2) / 2 = 54 .
MTBR (H) = ( U1 + U2 + U3) / 2 = 450 .
Equipment Availability (%) is: Uptime / Total Time = (900 / 1008) * 100 = 89.2
Through other formula for Equipment Availability : [MTBR / (MTTR + MTBR)] * 100 = (450
/ 504 )*100 = 89.2
We have seen the Availability, Now let’s see the Reliability. What is Reliability?
Reliability can be broadly defined as the probability that an Equipment will perform its
intended functions continuously for a specified duration
MEAN TIME BETWEEN FAILURES
Mean time between failures (MTBF) is the predicted elapsed time between inherent failures of a
system during operation. MTBF can be calculated as the arithmetic mean (average) time between
failures of a system.
The MTBF can be defined in terms of the expected value of the density function ƒ(t)
where ƒ is the density function of time until failure – satisfying the standard requirement
of density functions –
21
The Overview
For each observation, downtime is the instantaneous time it went down, which is after i.e. greater
than) the moment it went up, uptime. The difference (downtime minus uptime) is the amount of
time it was operating between these two events.
MTTR
MTTR is an abbreviation that has several different expansions, with greatly differing meanings.
It is wise to spell out exactly what is meant by the use of this abbreviation, rather than assuming
the reader will know which is being assumed.
The M can stand for any of minimum, mean or maximum, and the R can stand for any of
recovery, repair, respond, or restore.
The most common, mean, is also subject to interpretation, as there are many different ways in
which a mean can be calculated.
Mean time to repair
Mean time to recovery/Mean time to restore
Mean time to respond
Mean time to replace
In an engineering context with no explicit definition, the engineering figure of merit, mean time
to repair would be the most probable intent by virtue of seniority of usage.
It is also similar in meaning to the others above (more in the case of recovery, less in the case of
respond, the latter being more properly styled mean "response time").
If n1 is the number of specimens that failed during the first hour, n2 is the number that
failed during the second hour and nk is the number that failed during the kth hour.
Then the mean time to failure for N specimens is
22
MTTF = (n1 + 2n2 + 3n3 +....knk) / N
It is better to represent the time as interval of the time as such as. Hence the time interval
is Δt instead of one hour.
Then MTTF is calculated as
MTTF = (n1 Δt +2n2 Δt +. ... + knkΔt +..... + lnl Δt) /N
MTTF = (n1 Δt +2n2 Δt +. ... + knkΔt +..... + rnr Δt) /N
MTTF = 1/N Σ knkΔt (k = 1 to r)
Where n1 is the number of specimens that failed during the first interval, n2 is the number
of specimens that failed during the second interval and nk is the number of specimens that
failed during the kth time interval and so on.
23