Download as pptx, pdf, or txt
Download as pptx, pdf, or txt
You are on page 1of 22

Module - 1

Introduction
to
Cloud Computing
Recap - Last Class

• Cloud Computing Defn.

• Cloud Cube Model

• History of Cloud Computing


This Class
• Availability

• Risk Management

• Business Continuity

• Storage concepts

• RAID
What is Business Continuity (BC)

• Business continuity planning (BCP) is a methodology used to


create and validate a plan for maintaining continuous
business operations before, during, and after disasters and
disruptive events

• Business Continuity is preparing for, responding to, and


recovering from an application outage that adversely affects
business operations
What is Continuous Availability (CA) /
Information Availability (IA)
• Refers to the ability of an infrastructure to function
according to business expectations during its specified
time of operation
• Can be defined in terms of three parameters:
– Reliability
• The components delivering the information should be able to
function without failure, under stated conditions, for a specified
amount of time
– Accessibility
• Information should be accessible at the right place and to the right
user
– Timeliness
• Information must be available whenever required
Availability Measurement – Levels of ‘9s’ Availability
Causes of Information Unavailability

Disaster (<1% of Occurrences)


Natural or man made
Flood, fire, earthquake
Contaminated building

Unplanned Outages (20%)


Failure
Database corruption
Component failure
Human error

Planned Outages (80%)


Competing workloads
Backup, reporting
Data warehouse extracts
Application and data restore
Impact of Downtime
• Average cost of downtime per hour = Average
Productivity Loss Per Hour (APLPH) + Average
Revenue Loss Per Hour (ARLPH)
• Where:
• Average Productivity Loss Per Hour = (total
salaries and benefits of all employees per week) /
(average number of working hours per week)
• Average Revenue Loss Per Hour = (total revenue
of an organization per week) / (average number of
hours per week that an organizations is open for
business)
BC Terminologies
• Disaster Recovery
– Disaster recovery (DR) is part of business continuity, and
deals with the immediate impact of an event.

– Restoring previous copy of data and applying logs to that


copy to bring it to a known point of consistency

• Disaster Re - start
– Process of restarting from disaster using mirrored
consistent copies of data and applications
Measuring Information Availability

MTTR – Time to repair or ‘downtime’

Response Time Recovery Time

Detection Repair Restoration

Time
Incident Diagnosis Recovery Incident

Detection Repair time MTBF – Time between


elapsed failures or ‘uptime’
time

• MTBF: Average time available for a system or component to perform its


normal operations between failures
• MTTR: Average time required to repair a failed component
IA = MTBF / (MTBF + MTTR) or IA = uptime / (uptime + downtime)
Problem : -

• A Car without oil changes might fail after 200


hours of driving. Assuming 8 hours to repair
the engine and change the engine oil.

• Calculate the availability percentage of the


engine.
Measuring Information Availability

MTTR – Time to repair or ‘downtime’

Response Time Recovery Time

Detection Repair Restoration

Time
Incident Diagnosis Recovery Incident

Detection Repair time MTBF – Time between


elapsed failures or ‘uptime’
time

• MTBF: Average time available for a system or component to perform its


normal operations between failures
• MTTR: Average time required to repair a failed component
IA = MTBF / (MTBF + MTTR) or IA = uptime / (uptime + downtime)
Solution : -

• MTBF – Mean Time Between Failures


– 200 Hours

• MTTR – Mean Time to repair


– 8 Hours
Solution : -

• Availability = MTBF / (MTBF + MTTR)

• = 200 / 208

• = 0.9615  96.15%
Problem : -
• A Car without oil changes might fail after 200 hours of
driving. Assuming 8 hours to repair the engine and change
the engine oil. Calculate the availability percentage of the
engine.

• Assume that we change oil every 50 hours


and oil change takes 30 minutes, calculate
the availability
Solution : -

• Availability = MTBF / (MTBF + MTTR)

• = 50 / 50.5

• = 0.9900  99.00%
Problem ( Solve ): -
• An outage was detected at a machine
installed at 36th Data center after 2 weeks
and 4 days of installation. It took 63 hours for
the engineers to bring it up and running.
• Simultaneously, another outage was detected
at 37th Data center after 3 weeks and 5 days
of installation. It took 4 days to restore.
• Calculate the availability percentage of the
machine under both conditions and compare.
Problem : -
• Company’s total production time recorded is
300 hours, out of which 60 hours were
calculated as the total downtime and the
number of breakdowns detected were 3.

• Calculate the MTBF and MTTR.


Solution : -

• Mean Time Between Failures =


(Total up time) / (number of breakdowns)

• Mean Time To Repair =


(Total down time) / (number of breakdowns)
Solution : -

• Total Production time = 300 Hours

• Total Downtime = 60 Hours

• No of Breakdowns = 3

• Total Uptime = PT – DT = 240 Hours


Solution : -

• MTBF = 80 Hours

• MTTR = 20 Hours
Business Continuity and Disaster Recovery
Planning, Implementation, and Revision Cycle

You might also like