Analysis of software safety and reliability methods in cyber physical systems

S. Oveisi and R. Ravanmehr
Abstract: Software-based systems alone do not pose any risk; risk arises when such systems operate in the context of larger systems where potential hazards exist. Cyber-physical systems (CPSs) are a prominent instance of software-based systems. Nowadays, the safety and reliability of cyber-physical systems are considerably important due to the increasing complexity of these systems, and risk management techniques are required to reduce risk to an acceptable level. Safety and reliability methods play an important role in the risk management process; among them, software fault tree analysis (SFTA) and software failure modes and effects analysis (SFMEA) can be utilised. The main purpose of this article is to provide a comprehensive survey and evaluation of the currently available approaches to software safety and reliability in cyber-physical systems, in order to reflect the state of the art of this active area.
1 Introduction
As observed in Figure 1, the chain begins when a fault occurs in a subsystem. The fault becomes an error, and the error propagates within the system until it reaches subsystem Y, where it leads to improper service. If subsystem Y fails, propagation of that failure can lead to failure of the entire system. A failure in a subsystem may also be identified and resolved; otherwise its effect appears as a failure in other subsystems, and in particular cases the operation of the system may lead to system failure. System failure may end in a safe or a dangerous state through safe or dangerous events. A combination of hazardous events caused by the environment, the overall system and its subsystems may endanger the whole system; such dangerous events are known as initiating events.
Software safety and reliability methods are explained in this section. These models are developed using either a bottom-up or a top-down analysis. In the bottom-up approach, the analyst constructs a model by repeatedly asking what may happen in case of a failure: the analyst views the system from a bottom-up perspective, starting from the lowest level of system detail and its behaviour and working upward. In the top-down approach, the analyst constructs a model by asking what could lead to system failure: the analyst views the system from a top-down perspective, starting from the highest level of system failures and moving downward through the system to trace the failure routes.
Generally, safety and reliability methods play a major role in the risk management process, which is required to reduce risk to an acceptable level (Jirsa and Zacek, 2010). Figure 2 shows the position of safety and reliability methods in risk management activities, and Table 1 reviews these methods against the evaluation criteria.
Failure modes and effects analysis (FMEA) is a bottom-up analysis that aims to identify, classify and evaluate hazards and the risks associated with them (Sozer et al., 2007). The analysis begins by identifying the scope and limits of the system. FMEA uses a flowchart of the process and maps of the system design. In the next step, potential failure modes are identified step by step; several worksheet formats exist for documenting an FMEA. The causes and effects of each failure mode are then determined. In the last step, the measures necessary to reduce the risks due to failure are identified (Menkhaus and Andrich, 2005).
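FMEA worksheets such as the ones described above are commonly quantified with a risk priority number, RPN = severity × occurrence × detection, so that mitigation effort targets the riskiest failure modes first. The following is a minimal sketch in Python; the failure-mode entries and ratings are hypothetical, not taken from the paper:

```python
# Hypothetical FMEA worksheet: each failure mode is rated 1-10 for
# severity (S), occurrence (O) and detection (D); RPN = S * O * D.
from dataclasses import dataclass

@dataclass
class FailureMode:
    item: str
    mode: str
    severity: int    # 1 (negligible) .. 10 (catastrophic)
    occurrence: int  # 1 (rare) .. 10 (frequent)
    detection: int   # 1 (always detected) .. 10 (undetectable)

    @property
    def rpn(self) -> int:
        return self.severity * self.occurrence * self.detection

worksheet = [
    FailureMode("speed sensor", "stuck-at-zero reading", 9, 3, 4),
    FailureMode("CAN bus", "message corruption", 7, 2, 3),
    FailureMode("brake controller", "delayed actuation", 10, 2, 5),
]

# Rank failure modes so mitigation measures address the highest risks first.
for fm in sorted(worksheet, key=lambda f: f.rpn, reverse=True):
    print(f"{fm.item:16s} {fm.mode:24s} RPN={fm.rpn}")
```

Ranking by RPN corresponds to the last FMEA step above: deciding which failure modes warrant risk-reduction measures.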
Computing and communication capabilities are embedded in all types of objects and
structures in the physical environment. Applications with enormous societal impact and
economic benefit are created by harnessing these capabilities across both space and time.
Such systems that bridge the cyber-world of computing and communications with the
physical world are referred to as CPSs. CPS are physical and engineered systems whose
operations are monitored, coordinated, controlled and integrated by a computing and
communication core (Jianhua et al., 2011). This intimate coupling between the cyber and
physical will be manifested from the nano-world to large-scale wide-area systems of
systems. The internet transformed how humans interact and communicate with one
another, revolutionised how and where information is accessed, and even changed how
people buy and sell products. Similarly, CPS will transform how humans interact with
and control the physical world around us.
Examples of CPS include medical devices and systems, aerospace systems,
transportation vehicles and intelligent highways, defence systems, robotic systems,
process control, factory automation, building and environmental control and smart
spaces. CPS interact with the physical world, and must operate dependably, safely,
securely, and efficiently and in real-time. CPS can be considered to be a confluence of
embedded systems, real-time systems, distributed sensor systems and controls. The
promise of CPS is pushed by several recent trends (Miclea and Sanislav, 2011): the
proliferation of low-cost and increased-capability sensors of increasingly smaller form
factor; the availability of low cost, low-power, high-capacity, small form-factor
computing devices; the wireless communication revolution; abundant internet bandwidth;
continuing improvements in energy capacity, alternative energy sources and energy
harvesting.
The need for CPS technologies is also being pulled by CPS vendors in sectors like
aerospace, building and environmental control, critical infrastructure, process control,
factory automation and healthcare, who are increasingly finding that the technology base
to build large-scale safety-critical CPS correctly, affordably, flexibly and on schedule is
seriously lacking (Wu et al., 2011).
CPS brings together the discrete and powerful logic of computing to monitor and
control the continuous dynamics of physical and engineered systems. The precision of
computing must interface with the uncertainty and the noise in the physical environment.
The lack of perfect synchrony across time and space must be dealt with. Failures in both the cyber and physical domains must be tolerated or contained. Security
and privacy requirements must be enforced. System dynamics across multiple time-scales
must be addressed. Scale and increasing complexity must be tamed.
These needs call for the creation of innovative scientific foundations and engineering
principles. Trial-and-error approaches to build computing-centric engineered systems
must be replaced by rigorous methods, certified systems, and powerful tools. Analyses
and mathematics must replace inefficient and testing-intensive techniques. Unexpected
accidents and failures must fade, and robust system design must become an established
domain. The confluence of the underlying CPS technologies enables new opportunities
and poses new research challenges.
As can be seen in Figure 3, CPS will be composed of interconnected clusters of
processing elements and large-scale wired and wireless networks that connect a variety of
smart sensors and actuators. The coupling between the cyber and physical contexts will
be driven by new demands and applications. Innovative solutions will address
unprecedented security and privacy needs. New spatial temporal constraints will be
satisfied. Novel interactions among communications, computing and control will be
understood. CPS will also interface with many non-technical users. Integration and
influence across administrative boundaries will be possible. The innovation and
development of CPS will require computer scientists and network professionals to work
with experts in various engineering disciplines including control engineering, signal
processing, civil engineering, mechanical engineering and biology. This, in turn, will
revolutionise how universities educate engineers and scientists. The size, composition
and competencies of industry teams that design, develop and deploy CPS will also
change dramatically. The global competitiveness of national economies that become
technology leaders in CPS will improve significantly (Rajkumar, 2010).
It is hard to discover defects as the complexity of a CPS increases. Software is the backbone of CPS, and the complexity of software comprising millions of lines of code makes the effects of software failure dangerous. Evaluation and verification of CPS software require ensuring stability and analysing failure modes. Potential deficiencies in the requirements, design or implementation of the software can cause adverse events at the next level of software integration. As mentioned earlier, one main challenge in CPSs lies in safety and reliability. In this section, two main approaches to safety and reliability are studied and evaluated.
Analysis at the system level determines which software products of the system could be responsible for a potential defect in the system (the top event). SFTA can be used to identify the software details embedded in a software product whose behaviour leads to the occurrence of the top event. If the top event is a critical flaw, the software details involved in it can be classified as critical software.
The events composing a tree are analysed as lower-level events, which are linked together through the logical gates defined in the methodology. When the analysis and integration of events is complete, the lowest granularity of the analysis is reached; this depends on the scope and purpose of the SFTA (Needham and Jones, 2006).
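The gate logic described above also supports quantitative evaluation: assuming the basic events are independent, an AND gate multiplies the probabilities of its children, while an OR gate combines them as 1 − ∏(1 − p). The following Python sketch evaluates a small hypothetical tree (the events and probabilities are illustrative, not from the paper):

```python
# Minimal quantitative fault tree evaluation, assuming independent events:
#   AND gate: product of child probabilities.
#   OR gate:  1 - product of (1 - child probability).
from math import prod

def and_gate(*children: float) -> float:
    return prod(children)

def or_gate(*children: float) -> float:
    return 1 - prod(1 - p for p in children)

# Hypothetical top event: "actuator commanded unsafely", which occurs if
# (sensor fault AND voter fault) OR a software command error.
p_sensor = 1e-3
p_voter = 1e-2
p_cmd_error = 1e-4

p_top = or_gate(and_gate(p_sensor, p_voter), p_cmd_error)
print(f"top event probability = {p_top:.3e}")
```

A qualitative analysis would instead enumerate the minimal cut sets (here {sensor fault, voter fault} and {command error}) without assigning probabilities.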
6 SFMEA analysis
In this section, SFMEA is studied at both system and detailed levels. In Table 4, these
two methods are evaluated based on evaluation criteria (Goddard, 2000).
Table 4 SFMEA at different levels

Application
• System level: software protection to prevent hazardous system behaviours; used to identify structural weaknesses in the software design; examines the effectiveness of the software architecture; aims to identify the major software details and functions.
• Detailed level: software protection to prevent hazardous system behaviours; checks the software to recognise the effects of errors in the unique product variables employed.

Time of application
• System level: in the software architecture design phase.
• Detailed level: when the code is fully available, in the software detailed design phase.

Runtime
• System level: much quicker than SFMEA at the detailed level.
• Detailed level: very time-consuming; used in special cases.

Output
• System level: shows the effects of failure modes on software outputs to identify any hazardous outputs; the criticality level of each detail is determined.
• Detailed level: whether protection at the high level of design is accomplished or not.

Failure modes
• System level: usual software (problems of output, input, quality, user, intermediate, etc.); embedded software (problems of control, relationship and transfer, computation, display, etc.).
• Detailed level: failure modes of the various variable types are examined: char, bool, int, float, double, etc.
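At the detailed level, Table 4 shows that SFMEA examines the failure modes of individual variables (char, bool, int, float, double, etc.). For a float variable, typical failure modes are not-a-number, overflow to infinity and out-of-range values. A minimal sketch of such per-variable checks in Python follows; the variable meaning and valid range are hypothetical:

```python
# Detailed-level SFMEA examines failure modes of individual variables.
# Sketch of runtime checks for a float variable's failure modes.
import math

def check_float(value: float, lo: float, hi: float) -> list[str]:
    """Return the failure modes exhibited by a float value."""
    modes = []
    if math.isnan(value):
        modes.append("not-a-number")
    elif math.isinf(value):
        modes.append("overflow to infinity")
    elif not (lo <= value <= hi):
        modes.append("out of valid range")
    return modes

# Example: a temperature reading expected in [-40.0, 125.0] degrees C.
print(check_float(float("nan"), -40.0, 125.0))  # ['not-a-number']
print(check_float(200.0, -40.0, 125.0))         # ['out of valid range']
print(check_float(21.5, -40.0, 125.0))          # [] -- no failure mode
```

Analogous failure-mode lists can be drawn up for int (overflow, wrap-around), bool (inverted logic) and char (invalid encoding) variables.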
7 SFTA analysis
After construction of the SFTA, the FTA can be carried out in two ways: quantitative analysis and qualitative analysis. Table 5 states the cases examined in each:
Table 5 Cases examined in qualitative and quantitative analyses
In Table 6, the advantages and disadvantages of SFTA and SFMEA are investigated (European Cooperation for Space Standardization, 2012).
Table 6 Advantages and disadvantages of the SFTA and SFMEA methods
This section explains the effective application of a set of software functions and safety methods that provide the conditions of a software safety program for many applications (for example, systems where software controls hardware, such as CPSs, in which the effect of a software failure is very serious).
These inputs, outputs, and tasks are software safety program requirements for CPSs
(Czerny et al., 2005) and are consistent with part 3 of the IEC 61508 standard that
addresses software safety. In Figure 4, software life cycle is represented.
Table 7 Relations between software development phases and software safety tasks
Table 7 shows the relationship between the software development process and the software safety procedure. Accordingly, a general overview of the system is produced during the operational design phase. Project leaders should decide whether a software safety plan is needed for product implementation or not.
These decisions are typically based on previous product knowledge or on a preliminary hazard analysis (PHA). If the PHA identifies any hazard to which a software failure could contribute, a software safety program is developed.
In the next phase, i.e., the analysis phase, the software requirements include the software safety program objectives, such as identification of the software safety requirements that eliminate, reduce or control potential hazards related to possible software failures. Software safety requirements cover applicable government regulations, international standards, and customer or internal corporate needs. A software safety requirement identification matrix may be used to trace the requirements throughout the development process.
Applied cases and procedures meet the software safety objectives as follows:
1 software hazard analysis
2 hazard testing
3 safety requirements review.
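The requirement identification matrix mentioned above can be sketched as a simple mapping from each safety requirement back to the PHA hazards it mitigates and forward to the artefacts that verify it. All identifiers below are hypothetical; the point is the traceability-gap check:

```python
# Hypothetical safety-requirement traceability matrix: each requirement
# traces back to PHA hazards and forward to verification artefacts.
trace = {
    "SSR-01": {"hazards": ["HAZ-03"], "verified_by": ["SFTA-01", "TEST-12"]},
    "SSR-02": {"hazards": ["HAZ-01", "HAZ-04"], "verified_by": ["TEST-07"]},
    "SSR-03": {"hazards": ["HAZ-02"], "verified_by": []},  # not yet verified
}

def unverified(matrix: dict) -> list[str]:
    """Requirements with no verification artefact: a traceability gap."""
    return [req for req, row in matrix.items() if not row["verified_by"]]

print(unverified(trace))  # ['SSR-03']
```

Keeping the matrix machine-checkable lets the safety review flag any requirement that no hazard analysis or test covers.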
Software hazard analysis detects the software scenarios that may lead to the potential hazards identified during the PHA. As mentioned before, a common method used to accomplish this task is SFTA. It should be noted that no software architecture or detailed design exists at this stage; therefore, software failure modes are identified in the FTA. Safety requirements are then defined by investigating the software safety requirements for each possible software failure. At the final stage, hazard testing subjects the system to a real test under probable risk. The results of this test show deviations in error response times and whether the system level is acceptable.
FTA and FMEA are performed at the system level in the software architecture design phase, which aims to identify the major software details and functions; the criticality level of each detail is determined. In the next phase, i.e., the detailed software design phase, detailed FTA is examined and the code is written safely, based on the results of the previous FMEA phase. Given that analysis at the detailed level is very time-consuming, detailed FTA and FMEA are investigated only in cases of high risk or severity according to the analysis results. Then, in secure programming, the functions necessary for coding are separated from unnecessary functions to reduce the probability that an unnecessary error leads to a probable risk; necessary and unnecessary functions are determined in the two FTA/FMEA studies at the system and detailed levels. Finally, software validation and verification are performed using unit tests, integration tests, etc. to ensure that the safety requirements are satisfied (Czerny et al., 2005).
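The final verification step can be illustrated with a unit test that checks a safety requirement directly against the code. The requirement and controller function below are hypothetical examples, not from the paper:

```python
# Hypothetical safety requirement: if the sensor reading is invalid,
# the controller must command the safe state (zero output).
def controller_output(sensor_ok: bool, demand: float) -> float:
    if not sensor_ok:
        return 0.0  # fail-safe: force the safe state
    return max(0.0, min(demand, 100.0))  # clamp demand to actuator limits

# Unit tests verifying the safety behaviour:
assert controller_output(False, 80.0) == 0.0    # invalid sensor -> safe state
assert controller_output(True, 80.0) == 80.0    # normal operation
assert controller_output(True, 150.0) == 100.0  # demand clamped to limit
print("all safety checks passed")
```

Tests of this form tie each FTA/FMEA finding to an executable check, so regressions against the safety requirements are caught automatically.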
9 Conclusions
In future research, we will utilise the results of this survey to evaluate and analyse software safety in a specific CPS. For this purpose, we will focus on software safety analysis based on the SFTA approach for an optical telescope.
References
Alur, R. (2015) Principles of Cyber-Physical Systems, MIT Press, Cambridge, MA.
Carlson, C.S. (2012) Effective FMEAs: Achieving Safe, Reliable, and Economical Products and
Processes using Failure Mode and Effects Analysis, John Wiley & Sons, Inc., Hoboken,
New Jersey.
Clemson, B. (1984) Cybernetics: A New Management Tool, Abacus Press, Tunbridge Wells, Kent,
UK.
Czerny, B.J. et al. (2005) ‘Effective application of software safety techniques for automotive
embedded control systems’, Transaction Journal of Passenger Cars: Electronic and Electrical
Systems, Vol. 114, No. 7, pp.20–33, Detroit, Michigan.
Dong, W. et al. (2009) ‘Automating software FMEA via formal analysis of dependence relations’,
IEEE 2008, 32nd Annual IEEE International Computer Software and Applications, Turku.
European Cooperation for Space Standardization (2012) Software Dependability and Safety, ECSS-
Q-HB-80-03A, Noordwijk, The Netherlands.
Goddard, P.L. (2000) ‘Software FMEA techniques’, IEEE 2000, Proceedings Annual Reliability
and Maintainability Symposium, Los Angeles, CA.
Helmer, G. et al. (2001) ‘A software fault tree approach to requirements analysis of an intrusion
detection system’, Journal of Requirements Engineering, Vol. 7, No. 4, pp.207–220.
Jianhua, S. et al. (2011) ‘A survey of cyber-physical systems’, IEEE 2011, International
Conference Wireless Communications and Signal Processing, Nanjing.
Jirsa, J. and Zacek, J. (2010) ’UML-oriented risk analysis in manufacturing systems’, Acta
Polytechnica, Vol. 50, No. 6, pp.41–48.
Lee, E.A. (2008) ‘Cyber physical systems: design challenges, center for hybrid and embedded
software systems’, IEEE 2008, 11th IEEE Symposium on Object Oriented Real-Time
Distributed Computing, Orlando, FL.
Lutz, R. and Nikora, A. (2012) Failure Assessment, in Nasa Technical Reports 2008: 1st
International Forum on Integrated System Health Engineering and Management in Aerospace,
Pasadena, CA, USA.
Menkhaus, G. and Andrich, B. (2005) ‘Metric suite for directing the failure mode analysis of
embedded software systems’, Paper Presented at the Proceedings of the Seventh International
Conference on Enterprise Information Systems, Miami, USA, pp.266–273.
Miclea, L. and Sanislav, T. (2011) ‘About dependability in cyber-physical systems’, IEEE 2011:
9th East-West Design & Test Symposium, Sevastopol.
Murali, D.V. (2013) Verification of Cyber Physical Systems, Unpublished Master of Science
Thesis, Virginia Polytechnic Institute and State University, Blacksburg, Virginia.
NASA Technical Standard (2004) NASA Software Safety Guidebook, NASA-GB-8719.13.
Needham, D. and Jones, S. (2006) ‘A software fault tree metric’, IEEE 2006: 22nd International
Conference on Software Maintenance, Philadelphia, PA.
Ozarin, N. and Siracusa, M. (2003) ‘A process for failure modes and effects analysis of computer
software’, IEEE 2009: Paper Proceedings Annual Reliability and Maintainability Symposium,
USA.
Rajkumar, R. (2010) ‘Cyber-physical systems: the next computing revolution’, ACM 2010: Design
Automation Conference, California, USA.
Raspotnig, C.H. and Opdahl, A. (2013) ‘Comparing risk identification techniques for safety and
security requirement’, The Journal of Systems & Software, Vol. 86, No. 4, pp.1124–1151.
Sanislav, T. and Miclea, L. (2012) ‘Cyber-physical systems – concept, challenges and research
areas’, Journal of Control Engineering and Applied Informatics, Vol. 14, No. 2, pp.28–33.
Snooke, N. and Price, C. (2011) ‘Model-driven automated software FMEA’, IEEE 2011:
Reliability and Maintainability Symposium (RAMS), 2011 Proceedings, Annual, Lake Buena
Vista, FL.
Sozer, H., Tekinerdogan, B. and Aksit, M. (2007) ‘Extending failure modes and effects analysis
approach for reliability analysis at the software architecture design level’, Journal of
Architecting Dependable Systems, pp.409–433, Berlin.
Wu, F.J., Kao, Y.F. and Tseng, Y.C. (2011) ‘Review from wireless sensor networks towards cyber
physical systems’, Journal of Pervasive and Mobile Computing, Vol. 7, No. 4, pp.397–413.
Wu, L. and Kaiser, G. (2013) ‘FARE: a framework for benchmarking reliability of cyber-physical
systems’, IEEE 2013: Systems, Applications and Technology Conference, Long Island.