Professional Documents
Culture Documents
Trends in Human Spaceflight Failure Tolerance High Reliability
Trends in Human Spaceflight Failure Tolerance High Reliability
1
historic files, including the Apollo Failure Modes and
Effects Analyses (FMEA) and Probabilistic Risk
Analyses (PRA). Failure reports are also largely
scattered and illegible, making for long hours of
research and unfulfilled leads.
Compounding the difficulty of maintaining historic
documents in hard copy, online records present a unique
set of challenges. Heritage Space Shuttle pre-flight
reports from the 1980’s and 1990’s are currently stored
in outdated and unsupported file formats, leaving the
research team with partial records and large gaps in
early and mid-program data.
Apollo, Space Shuttle and Orion propulsion subsystems Figure 2. Shuttle Orbital Manoeuvring System (OMS)
were generically designed with a similar set of
guidelines to enable active pressurization, pressure
relief, propellant distribution, and attitude and
translation control [1][2][3]. However, as bipropellant
system technology has remained fairly static since the
1960’s, qualitative and quantitative failure tolerance
requirements have changed much more frequently.
Apollo initially mandated a probability of safe crew
return design standard (0.999) and employed
redundancy wherever practical, with specific exceptions
for the structure, heat shield, and certain portions of the
main propulsion systems. The Space Shuttle directed a
fail-safe design for all nominal and abort conditions,
managing exceptions through Critical Items Lists (CIL) Figure 3. MPCV Orion European Service Module
[4]. And early Orion requirements defaulted to two- (ESM) Propulsion Subsystem
failure tolerance, but later evolved to specify no less
than single-failure tolerance with provisions for zero- To compare unique propulsion design attributes across
failure tolerant exceptions pending MSERP and multiple systems, a diagnostic tool was developed to
technical authority concurrence [5]. generically map component types, prevalent failure
modes and criticality designations from vehicle to
vehicle. As noted previously, Apollo Crew and Service
Module (CSM) and Space Shuttle Orbital Manoeuvring
System/Reaction Control System (OMS/RCS) hardware
FMEAs were not available within advertised folders at
the National Archives. In the absence of this data,
S&MA representatives defined criticalities based on
2
design files, an Apollo Single Point Failure (SPF) For this particular study, crew survival methods,
analysis [6] and a comparative FMEA analysis on the including cross-feed and lunar module recovery, were
Shuttle Knowledge Console [7]. Relative criticalities not considered in the raw criticality scores; however,
were then compared across programs as follows: these systems could influence the survivability of select
• (+1) Additional redundant instrumentation, failures including external leakage and blockage.
increased failure tolerance
• (-1) Additional hardware or failure modes, 3. FAILURE HISTORY CORRELATION
decreased failure tolerance 3.1 Analysis Methodology
Aggregate results for each type of hardware are
summarized in Table (1), and causal factors are As described above, the quantity and quality of readily
addressed in Table (2). available failure history data was somewhat limited. A
few additional, important caveats are highlighted below.
Component Orion vs. Apollo Shuttle vs. Apollo
Burst Disc/Relief Valves -1 0 Data acquisition for Apollo proved to be the most
Check Valves 0 -3 challenging piece of this investigation. Flight data was
Engines -2 6 generally available in post-mission reports; however,
Fill/Drain Valves 0 0 ground data was limited to sparse carbon copy
Filters -1 -1 Discrepancy Reports (DR) that were collected and
scanned at various National Archives locations. While
Lines 1 0
the Apollo ground failure database may not have been
Pressurization Valves -2 -4 complete, the raw number of retrieved DRs was within
Propellant Valves 0 -3 family with respect to recovered space shuttle and Orion
Pyro Valves -3 0 files.
Regulators 1 1
Sensors 7 8
Shuttle flight reports and pre-flight reports, while
extensive, were also not converted to an appropriate
Tanks 0 0
Microsoft Office format within sufficient time to
support this study. Due to the large volume of Orbital
Table 1. Multi-Vehicle Qualitative Failure Modes and Manoeuvring System (OMS) and Reaction Control
Effects Analysis Comparison System (RCS) data, this study has been constrained to
OMS data for forty-five (45) missions between Space
Transportation System STS-31 and STS-114. All OMS
Component FMEA Distinctions and RCS flight failures have been captured in Mission
Burst Disc/Relief Valves Pressurization system architecture Evaluation Reports (MER) and are included in this
report.
Check Valves Common feed system, internal and/or
external check valves
Finally, Orion propulsion anomalies were limited to
Engines Feed system complexity, pneumatic
redundancy
data contained within the cross-program Problem
Reporting and Corrective Action (PRACA) database.
Fill/Drain Valves Pressurization and propellant fill/drain These reports were also somewhat incomplete
valve failure tolerance
considering open component and vehicle qualification
Filters Single-point blockage failures testing, and unproven flight performance.
Lines Elimination of flexible line bellows
For the purpose of this study, criticality assignments on
Pressurization Valves Vapor migration protection, improved
instrumentation, cross-feed complexity all discrepancy reports were universally adapted to
reflect Orion FMEA ground rules and assumptions, as
Propellant Valves Tank isolation complexity, non-isolatable
bellows
defined in the Orion MPCV Program Hardware Failure
Modes and Effects Analysis/Critical Items List
Pyro Valves Additional valves (FMEA/CIL) Requirements Document [8]. Interfacing
Regulators Improved instrumentation ground system, electrical system and thermal system
Sensors Improved instrumentation failures were excluded from each data set, and are
considered forward work.
Tanks All propellant leakage assumed to be a
worst-case Criticality 1
3
3.2 Anomalies over Time 3.2.2 Flight Anomalies over Time
2.5
The following trend charts reflect Apollo, Space Shuttle
and Orion anomaly counts for a given failure criticality
over time. 2
Number of Anomalies
1.5
1
1R2
16 1R3
1R4
1 1R6
14
1SR
1S
12
0.5
10
3
Number of Anomalies
1
1R2 0
8 Apollo 9 Apollo 10 Apollo 11 Apollo 12 Apollo 13 Apollo 15 Apollo 16
1R3
Mission
1R4
1R6
6
1SR
1S
4
Figure 7. Apollo Propulsion Number of Flight
Anomalies by Mission
2
0
1967 1968 1969 1970 1971 1972 1973
14
Mission
12
10
8
1
1R2
Anomalies by Year 6
1R3
1R4
1R6
1SR
1S
4
12
10
0
1981 1982 1983 1984 1985 1986 1987 1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011
Year
8
3
Number of Anomalies
1
1R2
6
1R3
1R4
1R6
1SR
Figure 8. Space Shuttle Propulsion Number of Flight
4 1S
Anomalies by Year
2
0
1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005
Year
2 1S
high density of leakage failures highlighted the
importance of adequate leak detection.
1
4
3.3 Anomalies by Component Type 3.3.2 Flight Anomalies by Component Type
Regulator
16%
Lines
6% Propellant Valve
32%
Sensor
34%
Tank
8%
Burst Check Valve
Disc/Relief 8%
Valve
Engine
3% 50%
Sensor
18%
Engine
17%
Filter
0%
Propellant Valve
1%
Regulator
12%
Fill/Drain Valve Pressure Valve
Regulator 9% 2%
17% Filter
1%
Propellant Valve Burst Disc/Relief Valve Check Valve Engine Fill/Drain Valve
6%
Line Filter Line Pressure Valve Propellant Valve
Pressure Valve 7%
Regulator Sensor Tank
6%
5
a predisposition toward leakage and failed-off
conditions.
Number of Anomalies
1S
35 4 1SR
1R6
30 3
1R4
1R3
25 2
Number of Anomalies
1S 1R2
20 1SR 1 1
1R6 3
15 0
1R4
Burst Check Valve Engines Fill/Drain Filter Lines Pressure Propellant Regulator Sensor Tank
1R3 Disc/Relief Valves Valve Valve
10
1R2 Valve
5 1 Component
3
0
Burst Check Valve Engine Fill/Drain Filter Lines Pressure Propellant Regulator Sensor Tank
Disc/Relief Valves Valve Valve
Valve
Component
Figure 17. Apollo Number of Flight Anomalies by
Component based on Criticality
Figure 14. Apollo Number of Ground Anomalies by 90
70
Number of Anomalies
60 1S
50 1SR
35 1R6
40
1R4
30 30 1R3
20 1R2
25
Number of Anomalies
1S 1
10
20 1SR 3
0
1R6
Burst Check Engine Fill/Drain Filter Line Pressure Propellant Regulator Sensor Tank
15
1R4 Disc/Relief Valve Valve Valve Valve
Valve
1R3
10 Component
1R2
5 1
3
0
Burst
Disc/Relief
Check
Valve
Engine Fill/Drain
Valve
Filter Line Pressure Propellant Regulator
Valve Valve
Sensor Tank Figure 18. Space Shuttle Number of Flight Anomalies
Valve
Component
by Component based on Criticality
12 1S
Figure 16. Orion Propulsion Number of Ground Most plots indicated the highest-density failures were
Anomalies by Component based on Criticality observed on Criticality 1, 1R2, 1R3 and 1SR
components. The prevalence of these issues and the
large number of “close calls” reinforced the benefit of
maintaining current agency failure tolerance
requirements, at a minimum.
4. CONCLUSIONS
6
human-rated missions. Additional research is required
to define all corrective actions; however, interim 8. ABBREVIATIONS AND ACRONYMS
recommendations include additional controls for high-
density Criticality 1 anomalies, including additional 0FT Zero-Failure Tolerant
process inspections, cleanliness verifications, and CIL Critical Items List
targeted lessons from prior programs. DR Discrepancy Report
DFMR Design for Minimum Risk
As a secondary observation, a standalone review of ESA European Space Agency
human spaceflight failure history allowed the team to FMEA Failure Modes and Effects Analysis
recognize common failures across multiple programs. ISS International Space Station (ISS)
Several examples are highlighted below. In the future, MER Mission Evaluation Reports
additional coordination is needed to establish an MPCV Multi-Program Crew Vehicle
effective framework for capturing valuable mitigations MSERP MPCV Safety and Mission Assurance
from prior programs, to ensure failures are not regularly Safety and Engineering Review Panel
repeated. (MSERP)
NASA National Aeronautics and Space
Common Failures across Multiple Programs Administration
OMS Orbital Manoeuvring System
• Failed-closed pressurization valves PII Personal Identification Information
• Latch valve degaussing PRA Probabilistic Risk Analysis
• Loose sensor wires PRACA Problem Reporting and Corrective
• Bellows leakage at both welds and convolutes Action
• Fuel and Oxidizer Reaction Products (FORP) RCS Reaction Control System
• Valve and tank gauging failures SPF Single Point Failure
• Internal leakage of helium regulators and valves SBU Secure But Unclassified
STS Space Transportation System
• External leakage of main engine pneumatic pack
• Burst disc rupture
• Excessive pull-in voltage 9. REFERENCES
• Internal leakage of ball valve shaft seals
• Valve leakage due to pilot seat contamination 1. Formatted by Wood, B., NASA (1969). Apollo
Operations Handbook, Block II Spacecraft,
From a data integration standpoint, the team would like Volume 1, Spacecraft Description, SM2A-03-
to reiterate the need to finish reviewing and tabulating Block II-(1), SID 66-1508, 15 October 1969,
space shuttle RCS ground anomalies (all), space shuttle pp 175-162.
OMS ground anomalies (missing years), qualification
anomalies (all) and additional Apollo reports. These 2. Curated by Dimukes, Kim, NASA (2002).
missing data sets will continue to be developed and Shuttle Reference Manual (1988).
included in future versions of this assessment. https://spaceflight.nasa.gov/shuttle/reference/sh
utref/orbiter/.
Finally, the S&MA research team would like to offer a
few general process observations to improve failure 3. ESA (2018). Orion Propulsion.
history research, going forward. NASA documentation https://www.esa.int/Our_Activities/Human_an
at the National Archives was extremely difficult to d_Robotic_Exploration/Orion/Propulsion.
retrieve in a short amount of time, and required
personnel and financial resources. To minimize these 4. NASA Engineering and Safety Center (NESC)
demands, multiple agencies stands to benefit from (2007). Design, Development, Test and
retrieving, digitizing and managing these reports in a Evaluation (DDT&E) Considerations for Safe
controlled searchable platform. A shared database and Reliable Human Rated Spacecraft
would ensure files are subjected to a consistent set of Systems, RP-06-108, pp 33.
screens for sensitive but unclassified (SBU) and
Personal Identification Information (PII), while helping 5. Krevor, T., Epstein, G., Benfield, P. &
design engineers make critical decisions. As deep-space Deckert, G. (2009). Leveraging Reliability in
missions drive higher technical and programmatic risks, Support of Risk Balancing and Weight
a cross-program knowledge-sharing system would pay Reduction on Orion, AIAA 2009-6725, pp 3.
countless dividends in employee productivity,
minimized ground and flight delays, and improved
system reliability.
7
6. NASA (1968). CSM 101 Criticality I and II
Single Point Failure Summaries, SD68-547.