Professional Documents
Culture Documents
AS S H R R S: Imulation Tudy of AIL Esistant OOF Trategies
AS S H R R S: Imulation Tudy of AIL Esistant OOF Trategies
Robert L. McPherson
Simulation for Management Decision Making, CSCI E-135
Harvard University Extension
Abstract
An initial evaluation of the published statistics concerning the benefits of hail-resistant roofs might lead insurance
executives, state insurance regulators, and building code advocates to pursue incentives and laws that would strongly
promote investing extra money to install hail-resistant roofs when conventional roofs are destroyed as a result of hail-
damage. However, an evaluation of the long-term benefits with the use of simulation modeling technology reveals
the answer may not be so clear cut. Hail storm events that are sufficiently strong to cause significant damage (over
$1 million per event, which is a low threshold by insurance company standards) are relatively infrequent, and the
variance in the severity of these events is such that it could take an extremely long time to realize a clear payoff from
such efforts.
This paper is to fulfill the requirements for a final project for Simulation Modeling for Decision Making, ISMT
E-135, Harvard University Graduate Extension School.
1
Robert McPherson
ISMT E-135, Spring 2010
CONTENTS Final Project
Contents
1 Problem Formulation 4
4 Data Collection 5
4.1 Examining the Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
4.2 Data Analysis of Hail Storm Property Damage . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
4.3 Data Analysis of Hail Storm Inter-Arrival Times . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
4.4 Data on Standard and Hail Resistant Roofs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
4.5 Simulating the Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22
8 Conclusion 37
2 of 39
Robert McPherson
ISMT E-135, Spring 2010
LIST OF FIGURES Final Project
List of Figures
1 Basic design of proposed model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
2 Histogram of monthly hail property damage losses over $1 million, January 1993 - March 2010 . . . 8
3 Histogram of monthly inter-arrival times in days for hail property damage losses over $1 million,
January 1993 - March 2010 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
4 Scatter-plot matrix of all hail loss data and variables . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
5 Decomposition of seasonality and trend of hail property damage data . . . . . . . . . . . . . . . . . . 11
6 Decomposition of seasonality and trend for inter-arrival between hail events, in days . . . . . . . . . 12
7 GEV exponential distribution fitted to property damage data . . . . . . . . . . . . . . . . . . . . . . 15
8 Generalized Pareto fitted with exponential distribution, on property damage data with a threshold of
200 million dollars . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
9 Scatter plot of remaining nine extreme property damage values above the 200 million dollar threshold 17
10 Exponential distribution is best fitting from Arena Input Analyzer for property damage over 200 mil-
lion dollar threshold . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
11 Plot of autocorrelation function indicating little evidence of non-stationarity . . . . . . . . . . . . . . 20
12 Lognormal distribution fitted to untransformed inter-arrival data . . . . . . . . . . . . . . . . . . . . 21
13 Configuration of the “Create Hail Storm Event" object . . . . . . . . . . . . . . . . . . . . . . . . . 22
14 Configuration of the lognormal distribution for “Storm Severity" object . . . . . . . . . . . . . . . . 23
15 Configuration of the Hail Size Magnitude object . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
16 Configuration of gamma distribution random generator for hail stone size . . . . . . . . . . . . . . . 24
17 Configuration of property damage object (i.e., “Storm Severity") . . . . . . . . . . . . . . . . . . . . 24
18 Configuration of log-Weibull random generator behind “Storm Severity" object . . . . . . . . . . . . 24
19 Model structure for conventional asphalt roofing replacements . . . . . . . . . . . . . . . . . . . . . 26
20 Model structure for hail resistant roofing replacements . . . . . . . . . . . . . . . . . . . . . . . . . 26
21 A single year of simulated hail losses shows extreme variability inherent in losses from actual hail events 28
22 Histograms of inter-arrival times in days (left), and loss severity in dollars (right) . . . . . . . . . . . 31
23 Average losses for each of the 365 days in a year over a single year (i.e., one replication) . . . . . . . 32
24 Average losses for each of the 365 days in a year over 200 years (i.e., 200 replications) . . . . . . . . 33
25 Basic setup for comparing hail-resistant roof replacement strategies in Process Analyzer software . . . 33
26 Box plot of 50 repititions indicating no single scenario is significantly better . . . . . . . . . . . . . . 34
27 Box plot of 100 replications indicating no single scenario is significantly better, even after 100 years . 35
28 Box plot of 250 replications shows all three hail-resistant scenarios are finally significantly better than
non-hail resistant . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36
29 50-year comparison of exponential inter-arrival with mean of 16.2 days for sensitivity testing . . . . . 37
30 100-year comparison of exponential inter-arrival with mean of 16.2 days for sensitivity testing . . . . 38
3 of 39
Robert McPherson
ISMT E-135, Spring 2010
Final Project
1 Problem Formulation
Installation of hail-resistant roofs could save insurance companies significant claims costs, and reduce premiums for
consumers. However, hail storm events are extremely variable in terms of arrivals, as well as the amount of damage
incurred. Additionally, hail resistant roofing is more expensive than regular roofing, and this added cost is also variable,
depending on price differences among contractors, and suppliers. Because of these factors, determining whether
implementing various strategies to install hail-resistant roofs is not as simple as calculating average loss amounts for
various roof types, and examining average installation costs. It could take a great many years to realize a significant
return on such efforts.
A simulation model could help answer whether it be worthwhile for insurance companies to incent policyholders
to replace hail damaged roofs with upgraded hail resistant shingles. It could also provide information for legislators to
consider whether it is worthwhile to enact building codes requiring roof replacements with hail-resistant shingles.
As we shall see, studies indicate that hail-resistant roofs can reduce damageability of roofs by about half, and the
added cost to install these roofs is only a fraction of this amount. On the surface, it would appear that spending the
extra money to install these roofs is obviously a good thing for the insurance companies. We shall test this assumption
in the analysis to follow. Specifically, the analysis will explore whether the reduction in losses for hail-resistant roofing
is truly significant enough to outweigh the added cost, and whether the rarity of hail-storm events, as well as the high
degree of variability in losses, causes any potential payoffs to fail to be fully realized in a reasonable amount of time.
Also, there should be a means for accounting for the extreme amount of uncertainty inherent in natural catastrophe
events, such as large hail storms. Additionally, uncertainty should be accounted for in the amount of loss reduction
that is actually experienced from hail-resistant roofs, as well as in the added cost to install them.
4 of 39
Robert McPherson
ISMT E-135, Spring 2010
Final Project
An initial very simple model was built to validate that simulating the process in this manner could produce plausible
results. Several scenarios were developed with varying proportions of conventional roof replacements to hail-resistant
roofing. The Process Analyzer software module that is package with the Arena simulation software by Rockwell was
utilized to compare the results.
The initial proposed models are not shown here, since they differ from the final models, and could cause confusion
with the final result. The models implemented in this analysis were largely shaped by the data that were available, and
they evolved as discoveries about the data unfolded. For instance, the initial models indicated that hail storm events
only included the state of Colorado, and for a very limited time frame. Through later data analysis, it was discovered
there are far too few data points with which to find a good fitting distribution. Thus, the resulting model encompassed
hail events and losses for the entire U.S. over a seven year period (the maximum available). Also, the initial models
added a layer of complexity that greatly broadened the scope by considering claim paying operations as a submodel.
While this may be desirable for future models, it goes beyond the main focus of this analysis. However, despite the
changes in theh final models, the basic model logic, as shown in figure 1 remains the same.
4 Data Collection
Weather data for this study were collected from the National Oceanic and Atmospheric Administration’s free, on-line
data resource, called the National Climatic Data Resource Center (http://www.ncdc.noaa.gov/oa/ncdc.html). As of the
time of this study, hail data was available for the entire U.S. on a monthly basis, from January 1993, through January
2010. (Although this is being written in April, 2010, data after January, 2010, is not yet posted on the website.) All
of the available monthly hail data from January, 1993, through January, 2010, was collected and stored in a database.
Since the data can only be downloaded in smaller increments, one year had to be downloaded at a time, and assembled
in the database.
Initially, the intent for this study was to utilize Colorado data only. However, it became apparent upon analysis
of the Colorado data, that there were not sufficient observations to get a good distribution fit using the Arena Input
5 of 39
Robert McPherson
ISMT E-135, Spring 2010
4.1 Examining the Data Final Project
Analyzer. Therefore, all the following analysis is on consolidated data for all of the states in the continental U.S. Also,
a threshold of one million dollars was selected as the point above which data would be collected. This limited the
amount of the data so that it would be feasible to conform to the website’s limitation of returning no more than 1,000
records per query. The website required running a separate query for each year. All queries returned less than 1,000
records when limiting the dollar amount of property damage to one million dollars.
Utilizing thresholds in this manner for analyzing extreme weather data is common [14]. Also, insurance companies
are particularly interested in analyzing “tail events.” Variance for high frequency, low severity events is predictable
enough that they are adequately handled by conventional pricing models. However, the low frequency, high severity
events can cause losses not contemplated in conventional pricing models, and premium charges may not be adequate
to cover them. A typical threshold for catastrophic events is $250 million dollars, such as utilized in AIR’s Classic/2
catastrophe modeling software. This threshold is too high for this analysis, however, since we are interested in the
difference that hail resistant roofs have on general hail losses. On the other hand, a threshold below $1 million would
like be too low, as many of these claims would turn out to be less than the homeowner policy deductibles. Thus, due
to both practicalities of the website limitations, and the nature of the question at hand, a $1 million threshold was
considered to be adequate for this analysis.
There were three variables that were collected from the NOAA website. For each hail storm event, data were
provided as to the state and county where the event occurred; the month, day, and year of occurrence; the magnitude
of the typical hail stone size; as well as the amount of property damage in U.S. dollars.
After the data were imported into a newly created database, all data attributes were aggregated to the national level
for each day that an event occurred. The hail stone sizes were aggregated to represent the maximum size on a given
day. The property damage represents the sum of all losses for a given day. The inter-arrival times were calculated and
put into a new data column by subtracting the difference in days between events. Structured Query Language (SQL)
code is exhibited below, to further demonstrate how the data were assembled. Lastly, the data were grouped by month
to produce a monthly time series. Since this step was done with a pivot table, there is no SQL code to exhibit.
The hail data were aggregated and grouped by event date, as shown in this SQL snippet:
SELECT
"Date",
MAX( "Mag" ) AS "Magnitude",
SUM( "PrD" ) AS "PropertyDamage",
MAX( "InterArrival" ) AS "InterArrival"
FROM "AllStatesHail_1993_2009" GROUP BY "Date"
Note that since the data were supplied by county and state, there were many duplicate dates. Hence, this is the
reason for using the maximum (i.e., “MAX”) function when grouping the inter-arrival times, as shown in the SQL
snippet. To produce the inter-arrival time in terms of days, the data were initially sorted by date, and each preceding
date was subtracted from the subsequent date. This produced an inter-arrival time of zero for each row of duplicate
dates, while the first row in the block of duplicates contained the only amount greater than zero. Therefore, using the
maximum function was an effective way to group by date, and only show inter-arrival values that are larger than zero.
Table 1 shows a small sample of collected hail data from NOAA, prior to grouping and summing by date.
A sample of data after it has been grouped by date at a national level is shown in table 2.
6 of 39
Robert McPherson
ISMT E-135, Spring 2010
4.1 Examining the Data Final Project
Sample of Raw Data from NOAA Website, with Calculated Inter-Arrival in Days
Location or County Date Type Mag PrD InterArrival
6 Madras 02/18/09 Hail 4.25 4 0
7 Hampton 02/18/09 Hail 1.75 1.4 0
8 Loganville 02/18/09 Hail 3 2.5 0
9 Chamblee 02/18/09 Hail 1.75 2.5 0
10 Lilburn 02/18/09 Hail 2 2.5 0
11 Tyrone 02/18/09 Hail 3 8 0
12 Bill Arp 02/18/09 Hail 1.75 1 0
13 Jonesboro 02/18/09 Hail 3 6 0
44 Jonestown 03/25/09 Hail 3 160 35
14 Locust Grove 03/28/09 Hail 1.75 1 3
45 South Lake 03/30/09 Hail 2.75 35 2
46 Grapevine 03/30/09 Hail 1.75 5 0
47 Keller Alta Vista Ar 03/30/09 Hail 2.75 1 0
Table 1: Small sample of hail data from the National Oceanic and Atmospheric Administration
Table 2: Small sample of hail data that has been grouped by date, and aggregated nationally
7 of 39
Robert McPherson
ISMT E-135, Spring 2010
4.1 Examining the Data Final Project
dependencies among the variables. We can see from the histograms in figures 2 and 3, that these data are highly
skewed. It will, in fact, prove quite challenging to fit distributions to these data, as we will later see.
Histogram of PropertyDamage
350
300
250
200
Frequency
150
100
50
0
PropertyDamage
Figure 2: Histogram of monthly hail property damage losses over $1 million, January 1993 - March 2010
Examining independence The scatter-plot in figure 4 indicates that there is little to no visible dependence between
the inter-arrival times, and the property loss data. However, there appears to be some dependence between the mag-
nitude of the hail (i.e., hail size) and the property losses, as evidenced by the upward sloping dotted regression line.
This indicates a positive correlation, as one might expect. Losses tend to be higher for larger hail sizes. Hail stone
magnitude data were initially evaluated and fitted with distributions along with inter-arrival and loss data. However,
due to the correlation between loss severity and hail stone size, as well as the fact that not enough data was available to
convert hail stone size into equivalent losses, it was decided to omit this variable from the models as being unnecessary
for the goals of this analysis.
In addition to the dotted linear regression line, the solid line in the plot is produced by a non-linear optimized sum
of squares (LOESS) function, and reveals any non-linear relationships. The plots along the diagonal of the matrix
show the probability density for each variable.
Although visualizing the data relationships is useful, the data are so highly skewed, that appearances may be
deceiving. A nonparametric, Spearman rank order correlation matrix was run in R to further test for dependencies in
the data in a manner that is not dependent upon the data having any particular distribution assumption. The results
indicate that the strongest of these correlations is between the magnitude of the hail stone sizes, and the property
damage severity. (Note that the variables in the correlation analysis were detrendend and deseasonalized, utilizing a
method called “seasonal, trend, and LOESS decomposition’, or “STL.”)
Although the correlation between property damage and magnitude is not high, with a correlation coefficient of
r = 0.21, the p-value for this Spearman rank order correlation is 4.579e-05. This is well below the 0.05 level, which
causes us to reject the null hypothesis that this correlation coefficient is equal to zero at the 95% confidence level.
8 of 39
Robert McPherson
ISMT E-135, Spring 2010
4.1 Examining the Data Final Project
Histogram of InterArrival
300
250
200
Frequency
150
100
50
0
InterArrival
Figure 3: Histogram of monthly inter-arrival times in days for hail property damage losses over $1 million, January
1993 - March 2010
Table 3: Spearman rank order correlation matrix on deseasonalized, detrended, hail storm data
Examining stationarity It is important to examine the data for the possible presence of seasonality and trend. We
may or may not want to reflect seasonality and trend in a model, depending on the objectives of the model. However,
autocorrelation produced by seasonality and trend and greatly impact the results of a model that is based on time series
data. For instance, if there is a queue in the model, a particularly active season may produce a longer queue than quiet
seasons. Also, non-stationary (i.e., auto-correlative) data can make it quite difficult to fit distributions as well, as we
shall see in the upcoming section regarding distribution fitting. The purpose of the initial examination is simply to
“eyeball” the data to understand it at a high level. More in-depth statistical analysis concerning stationarity will be
performed later in the distribution fitting process, so as to determine whether to utilize the raw data, or data that has
9 of 39
Robert McPherson
ISMT E-135, Spring 2010
4.1 Examining the Data Final Project
● ●
6
● ●
Magnitude ● ● ● ●
5
●
●●●●●● ● ●●
●●●●●●● ●
●
●● ●
●●●● ● ●●●●● ● ●
●
●● ●
●● ● ●●●●● ●
4
● ●
●
●●●● ●●● ●● ●
● ●
●
●●●●
●●●● ● ●●
●●●●●● ●
3
●
●
●●
●●
●●
●●
●●●
● ● ● ● ●
●●
●●
●
●●
●●●●
●●
●●●
●●●● ●● ●● ●●
●
●
●●●
●● ● ●●
●●●●● ● ●
● ●
●
●●●● ●●●●● ●
●●●●● ●
●●● ●● ●● ●
2
●
●
●●●●● ● ● ●●
●●●
●●●
●●
●●●
●●●●●●● ●● ● ● ● ● ●● ● ●
●● ●● ●●
●
● ●● ●●
●
●●●● ● ●●
●●● ●
●●
●● ● ●
1
● ●● ●
●
● ● ●●
| | || | | | | | | | | | | | | | | | | | ● ● ●
1400
● ●
PropertyDamage
1000
600
● ●
● ● ●●
● ●
200
● ● ●●
● ● ● ●
●
● ● ● ●●●●●
●
● ● ● ● ●
●
●●● ●●● ●●
●
● ●
● ●●● ●● ● ●
●
●
●● ●● ●●●
● ●
● ● ●●
●●
● ● ●
●●● ● ●
●
●
●
●
●●●●●●
●●
● ●●●
● ● ● ●
● ●● ● ●
● ●●
●
●●●
● ●●● ● ●
●●●
●●●●●●● ●●
●●● ●
● ● ||||||||||
|||||||||||||| || | || | | ●
●
●●●
●
●●
●●
●●
●●
●
●●
●●
●●
●
●●
●●
●●
● ●
●●
●
●●●
●●● ●●
●●● ●●●● ●●●● ●●● ● ●● ● ● ●
0
250
● ●
● ● InterArrival
200
● ●
● ●
● ●
150
● ●
●● ● ●●
● ● ●●
100
● ●● ●●
● ● ●●
● ●
● ●
● ●
● ●
●
● ● ●
●
50
● ●
● ● ● ●
●
● ● ● ●● ● ●● ●
●●● ●●● ●
●
● ● ● ● ●
● ●● ●
●● ●
●●● ● ●
●● ● ● ●
●●
●
●●●
●●
●● ●
● ● ● ●● ● ● ●
●●
●
●●
●●●●● ●
● ● ●●
●
●●●
●●
● ●●
● ● ●● ●
●●
● ●
●
●
●
●
●
●
●●
●
●
●
●
●●
●
● ●
● ●●●
● ●
●
● ● ●
● ●
● ●●● ● ●
● ● ● ●
● ●●●●● ●●●
●●●● ● ● ●
●
●●
●
●●
●●
●●●
●
●
●
●●
●
●●
●●
●
●●● ● ● |||||
||||||||||||||||||||||||||||||||||||
|||| ||||||| || | | | | | |||| | || | | | | | |
0
1 2 3 4 5 6 0 50 100 150 200 250
10 of 39
Robert McPherson
ISMT E-135, Spring 2010
4.2 Data Analysis of Hail Storm Property Damage Final Project
1200
800
data
400
0
150
seasonal
100
50
0
150
100
trend
50
0
1000
remainder
600
200
−200
1980 1985 1990 1995 2000 2005 2010
time
Distribution: Weibull
Expression: 0.999 + WEIB(9.05, 0.379)
Square Error: 0.000287
11 of 39
Robert McPherson
ISMT E-135, Spring 2010
4.2 Data Analysis of Hail Storm Property Damage Final Project
200
data
50 100
0
20
seasonal
10
0
−10
30
20
trend
5 10
0
200
remainder
100
50
0
−50
1980 1985 1990 1995 2000 2005 2010
time
Figure 6: Decomposition of seasonality and trend for inter-arrival between hail events, in days
Kolmogorov-Smirnov Test
Test Statistic = 0.129
Corresponding p-value < 0.01
Data Summary
12 of 39
Robert McPherson
ISMT E-135, Spring 2010
4.2 Data Analysis of Hail Storm Property Damage Final Project
Histogram Summary
Various transformations we performed on the data to see whether it would improve the fit. Utilizing the Seasonal,
Trend, and LOESS (STL) decomposition function in the R Environment for Statistical Computing, the inherent au-
tocorrelation due to seasonality was removed, and the data were detrended as well. The illustration in figure 5 was
produced by the STL function, and it also shows the remainder after adjusting out the seasonality and trend. This
remainder represents a more stationary time series that is more suitable for analysis. However, a trade off in utilizing
this approach is that we would have to add a seasonality component back into the model if we wish to preserve the
time dependent aspects inherent in real-life.
Further analysis was done to determine whether the one million dollar threshold was sufficient for distribution
fitting purposes. Often, a better fit can be found, and the results more useful, if the data are broken into segments
within thresholds, based on shifts in the distribution at certain points in the data. For example, it would not make
sense to bias our results with commonly occurring events, when we are really mainly concerned with those that cause
significant damage.
An R software package called, “extRemes” [15] was utilized to examine the effects of only including data above
various thresholds, and also to try fitting various catastrophe related distributions that are not included in the Arena
Input Analyzer software. Of particular interest, was the Gumbel distribution, and the generalized Pareto distribution,
as these are often useful in fitting extreme, infrequent weather data, such as hurricane events.
Fitting Distribution to Deseasonalized and Detrended Property Damage Data An alternative to the threshold
method is to utilize the STL decomposition process that was shown in figure 5 to remove the seasonality and trend
from the data. A distribution could be fitted to the remaining, deseasonalized and detrended data, and this distribution
could be utilized to generate simulated events. Removing the auto correlative features of seasonality and trend from
the time series data should make it easier to find a distribution that fits with some statistical significance (i.e., p-value
greater than .05 at a 95% confidence level), while avoiding the need to utilize a threshold that would omit a large
portion of the data. When producing the simulated values from the fitted distribution, the seasonality could be added
back to the process by utilizing calculated seasonality factors.
The “extRemes” package in R was utilized to fit a generalized extreme value (GEV) distribution to the deseason-
alized, detrended (i.e., remainder) property damage data. The output from this analysis is shown below, and indicates
that the Gumbell distribution is statistically significant. With a p-value of 0.686, we would not reject the null hypoth-
esis that the Gumbell distribution fits this distribution.
************
GEV fit
-----------------------------------
Response variable: V1
13 of 39
Robert McPherson
ISMT E-135, Spring 2010
4.2 Data Analysis of Hail Storm Property Damage Final Project
Likelihood ratio test (5% level) for xi=0 does not reject Gumbel hypothesis.
likelihood ratio statistic is 0.1637489 < 3.841459 1 df chi-square critical value.
Diagnostics, along with a histogram fitted with the Gumbell distribution are shown in figure 7. The first of the four
graphs in this illustration (top left graph) is a probability plot. If the Gumbell distribution was a perfect fit for the data,
the points in this graph would form up directly on the diagonal line. However, it can be seen by the slight ‘S’ curve,
that the fit is not exact.
The graph at the top right of this illustration shows a quantile plot. The quantile plot indicates that, while the
Gumbell distribution fits pretty well for the lower property damage values, it does fit the more extreme values as well,
beginning at a property damage loss amount of between $300 million and $400 million.
The graph at the lower left of figure 7 is called a “Return Level Plot”. It is is important to extreme value analysis,
since it indicate the probability level at which the distribution is reliable. In this case, it indicates that the Gumbell
distribution fits well, up to the point just below the 100 in the x-axis, which represents a one in one-hundred year event.
This tells us that the distribution does not fit as well in the extreme tail where there is at least a nearly 1/100 chance of
occurrence.
A density plot for the Gumbell distribution is shown in the bottom right of figure 7, and illustrates with a histogram
of the ‘z’ statistic, how the distribution fits well for the bulk of the data, except for the extreme values in the right hand
side of the histogram.
All of these diagnostics confirm that the Gumbell distribution fits the bulk of the property damage data well.
However, the extreme values at the tail are problematic, and might be better fitted as a separate group through a
method known as threshold analysis. This method establishes a threshold, above which the data are dealt with by
fitting a different distribution than the data below the threshold amount.
Based on the diagnostics in figure 7, it would appear that it would be logical to establish a threshold somewhere
between $300 million and $400 million in property damage losses. However, this would result in a sample size of
only about five data points. A possible option to consider may be to reduce the the threshold to below $300, which
increases the sample size slightly. It turns out that establishing a threshold such that only property damage of $200
million for any event, results in a best fit of an exponential distribution for the extreme tail values. The Gumbel was
not a good fit, based on a p-value of nearly zero. The output from the “extRemes” package in R is shown below, and
includes the parameters of the distribution, as well as a p-value that is well above the 0.05 level we would expect to
see in order to avoid rejecting the null hypothesis at the 95% confidence level, that the exponential distribution is a
good fit for these data.
14 of 39
Robert McPherson
ISMT E-135, Spring 2010
4.2 Data Analysis of Hail Storm Property Damage Final Project
1.0
●
●●
● ●
●●
●
●
●
●
●
1000
●
●
0.8
●
●●
●
●
●●
●●
●
●
●●
●
●
●
●●
●
●
●●
●
●●
●
●
●●
Empirical
●
0.6
●
●
●●
●
●●
●
●●
●
●●
●
●●
●
●
●●
●
●●
●
●
Model
●
●
●●
●
●●
●
●
●
●
●●
●
●●
●
●●
●
●
600
●
●●
●
●●
●
●●
●
●
●●
●
●●
●
●●
●●
●
●●
●
●●
● ● ●
●●
●
●●
● ●
0.4
●
●●
●
●●
●
●
●
●●
●
●●
●
● ●
●
●
●●
●
●
● ●●
●●
●
●
●
●●
● ●●●●●
●
0.2
●
●●
●
●●
● ●
●
●●
●●
●
●●●
●●
200
●
●● ●●
●
●●
●
●●
●
●
●●
●
●
●
●● ●
●
●●
●
●●
●
●
●●
●
●
●●
●
●●
●
●●
●
●●
●●
● ●
●
●●
●
●●
●
●
●●
●
●
●●
●
●
●●
●
●●
●
●
●●
●
●●
●
●
●●
●
●
●
●
● ●●
●
●●
●
●
●●
●
●
●●
●
●●
●
● ●●
●
●●
●
●
●●
●
●●
●
●
●
●● ●
●●
●
●
●● ●
●
●
●
0.0
●●
●
●
●
●●
● ●●
0
0.0 0.2 0.4 0.6 0.8 1.0 100 200 300 400
Empirical Model
●
0.006
1000
Return Level
0.004
f(z)
600
●●
●
0.002
●
●●
●●●
●●●●
●●
●
●●
●
●●
●●
●
200
●
●
●●
●
●●
●●
●
●●
●
●
●
●
●●
●
●
●●
●
●●
●
●
●●
●
●
●●
●
●●
●
●●
●
●
●●
●
●
●
●
●
●●
●
●
●●
●
●
●●
●
●
●●
●
●
●●
●
●●
●
●
●●
●
●
●●
●
●
●●
●
●
●●
●
●●
●
●
●
0.000
●●
●●
●
●●
●
●
●● ●●
●
●●●
●
●●
●●
●●
●
●●
●
●
●●
●●
●●
●●
●
●
●●
●
●●
●●
●●●
●
●●
●
●
●●
●●●
●
●
●●
●●
●
●
●●
●
●
●●
●
●●
●●
●●
●
●●● ● ●● ●
0
Return Period z
Likelihood ratio test (5% level) for xi=0 does not reject Exponential hypothesis.
likelihood ratio statistic is 1.521590 < 3.841459 1 df chi-square critical value.
Convergence successful!
15 of 39
Robert McPherson
ISMT E-135, Spring 2010
4.2 Data Analysis of Hail Storm Property Damage Final Project
● ●
0.8
●
● ●
1000
0.6
●
Empirical
Model
0.4
600
●
●
● ●
0.2
●
●
●
200
●
0.0
● ●●●
Empirical Model
0.008
4e+05
Return level
f(x)
0.004
2e+05
0e+00
0.000
●●●●●● ● ●
●
Figure 8: Generalized Pareto fitted with exponential distribution, on property damage data with a threshold of 200
million dollars
16 of 39
Robert McPherson
ISMT E-135, Spring 2010
4.2 Data Analysis of Hail Storm Property Damage Final Project
There are only nine resulting data points that are this extreme, however. While this is not an uncommon scenario
with extreme weather data, it would obviously be preferable if it was possible to obtain more data points. However,
such historical data on extreme events is often not available, and is not available in this case either. Figure 9 shows a
scatter plot of the nine points that are over the $200 million threshold.
1200
1000
PropertyDamage
800
600
400
200
Date
Figure 9: Scatter plot of remaining nine extreme property damage values above the 200 million dollar threshold
When fitting distributions to extreme weather events it is sometimes necessary to estimate the distribution param-
eters, by either fitting to a small number of points, such as we will do in this case, or by utilizing Bayesian methods of
parameter estimation, such as with Markov Chain Monte Carlo methods (MCMC) [14].
Figure 10 shows the best fitting distribution for the hail property damage data over the extreme value threshold of
$200 million, utilizing the Arena Input Analyzer software. The output from this software for these data is shown below.
It indicates that the exponential distribution is the best fitting among the limited collection of distributions available in
this software package. While the p-value is lower than that of the generalized Pareto exponential distribution (GPD),
it is high enough at the 95% confidence level to avoid rejecting the null hypothesis, base on Kolmogorov-Smirnov
test. Utilizing the exponential distribution in Arena with the parameters shown in the Input Analyzer output would be
an easier alternative than creating an external sample file utilizing the GPD, although the GPD produces a better fit
overall.
Output of best fit analysis from Arena Input Analyzer of property damage:
=================================================================
Fit All Summary
17 of 39
Robert McPherson
ISMT E-135, Spring 2010
4.2 Data Analysis of Hail Storm Property Damage Final Project
Function Sq Error
-----------------------
Erlang 0.0297
Exponential 0.0297
Gamma 0.063
Weibull 0.0848
Lognormal 0.121
Triangular 0.143
Normal 0.151
Uniform 0.232
Beta 0.403
Distribution Summary
Distribution: Exponential
Expression: 201 + EXPO(0)
Square Error: 0.029666
Kolmogorov-Smirnov Test
Test Statistic = 0.304
Corresponding p-value > 0.15
Data Summary
Histogram Summary
18 of 39
Robert McPherson
ISMT E-135, Spring 2010
4.3 Data Analysis of Hail Storm Inter-Arrival Times Final Project
Figure 10: Exponential distribution is best fitting from Arena Input Analyzer for property damage over 200 million
dollar threshold
Box-Ljung test
data: InterArrivRaw$RawInterArriv
X-squared = 1.0205, df = 1, p-value = 0.3124
Lognormal distribution is best fit for untransformed inter-arrival data Running the Arena Input Analyzer on the
untransformed inter-arrival data results in the selection of a lognormal distribution as the best fit. The fit is exhibited
in figure 12..
The p-value for this distribution is 0.134, based on the chi-square test. While this is not as strong a fit as for the
other hail data variables, we would not reject the null hypothesis at the 95% confidence level. The p-value is above
0.05.
The equation output from Arena, however, did not produce a usable result. This output showed the equation as
-0.001 + LOGN(0,0), and of course we cannot simulate lognormal distribution values with all the parameters
19 of 39
Robert McPherson
ISMT E-135, Spring 2010
4.4 Data on Standard and Hail Resistant Roofs Final Project
1.0
0.8
0.6
ACF
0.4
0.2
0.0
0 5 10 15 20 25
Lag
equaling zero. The “fitdistr” function in R software was utilized to produce the correct parameters for the log-normal
distribution on the inter-arrival data. The R output below reveals a usable pair of parameters, as well as the respective
standard error value under each.
20 of 39
Robert McPherson
ISMT E-135, Spring 2010
4.4 Data on Standard and Hail Resistant Roofs Final Project
and labor [11]. To account for this, a triangular distribution was implemented in the Arena simulation model, with a
low estimate of 10%, modal estimate of 15%, and a high estimate of 20%. No modal statistics were available, so 15%
was chosen as the estimate for the mode, simply because it is midway between the low and high range limits.
With each hail event in the simulation, the total proportion of roofs that remain with conventional roofing must be
estimated. With each simulated hail event, the conventional roofs are being replaced with hail resistant roofs, causing
the proportion of conventional roofs in the stock of roofs for single family houses in the U.S. to diminish. Table 4
shows how the total value of roofs in the U.S. was estimated. Bibliography references to data sources are provided in
the table, and include the U.S. Census Bureau [9], and the Cost Helper website [1].
Other sources were also consulted for the roof cost, and a range of estimates can be found, depending on variables
such as roof size, and quality of materials and labor. Rather than add another stochastic variable to the model in this
case, it was decided to choose a midrange point estimate of $5,000, since this study is not concerned so much with
variability in roof costs in general, but is more concerned with variability in the cost difference between conventional
and hail resistant roofs. As previously mentioned, a triangular distribution was used to model this variability. The
$5,000 point estimate, however, can simply be thought of as a constant in the equation, and should not have a significant
impact on the comparison between hail resistant and non-resistant roofing costs.
Data were not found to determine how many roofs presently have hail resistant roofing. However, some sources
mentioned that most roofs for single family houses are asphalt shingle. For purposes of the modeling, it will be
presumed that a low percentage of roofs are currently hail resistant, begining at 10%. A ratio of hail resistant, to
conventional roofs will be calculated after each event. This gradually increasing ratio will be used to determine the
amount of losses for each subsequent event, by blending the higher loss costs for conventional roofs, with the lower
loss costs for hail resistant roofs.
Table 4: Calculation steps for estimating total roof value in the U.S.
21 of 39
Robert McPherson
ISMT E-135, Spring 2010
4.5 Simulating the Data Final Project
Loss costs are based on the hail property damage data from NOAA. These data are adjusted by the typical loss
reduction from hail resistant roofing, based on research. The research indicates that hail resistant roofs are 50% less
likely to be damaged in a hail storm than conventional roofs.
The setup for the random generation of hail stone size is shown in figure 15. The configuration of the random
generator for the gamma distribution behind the “Hail Stone Size” object is exhibited in 16.
Lastly, the setup of the object to generate property damage in dollars, called “Storm Severity”, is shown in figure
17. The setup of the log-Weibull distribution (i.e., Gumbel distribution) that is used to generate the property damage
amount for each storm is illustrated in figure 18.
1. The first model, shown in figure 19 is the simplest, and assumes all roof replacements for hail damage are done
with conventional, asphalt shingles, as is customary in the insurance industry today.
22 of 39
Robert McPherson
ISMT E-135, Spring 2010
5.1 Model Implementation Final Project
Figure 14: Configuration of the lognormal distribution for “Storm Severity" object
2. The second model, illustrated in figure 20 adds to the conventional roofing model, by assuming some or all of
the roof replacements for hail damage are done with hail resistant roofing.
23 of 39
Robert McPherson
ISMT E-135, Spring 2010
5.1 Model Implementation Final Project
Figure 16: Configuration of gamma distribution random generator for hail stone size
Figure 18: Configuration of log-Weibull random generator behind “Storm Severity" object
Both models are designed to compare estimates as to how many years it could take to realize a statistically sign-
ficant benefit, if insurance companies replace hail damaged roofs with hail resistant shingles, instead of the usual
conventional asphalt roofing shingles. The main factor in this evaluation is the very high amount of variability in hail
arrivals and resulting damange levels, given the nature of the extreme value distributions that are inherent in real-life
hail storm events. Thus, much of the effort in this research was given to researching and fitting distributions to these
24 of 39
Robert McPherson
ISMT E-135, Spring 2010
5.2 Simplifying Assumptions in the Models Final Project
extreme data. Model implementation mainly involves the incorporation of distribution generators that mimic real-life
hail storm events. Thus, there are no resources or servers required in either of the models. Most of the modules shown
in figures 19, and 20, serve the purpose of updating variables, simulating cost estimates, and tracking the results from
various hail-resistant roof replacement strategies.
In the case of the hail-resistant model version, the modules labeled I, J, and K, as well as N through S, were added
to estimate the proportion of existing roofs that have been converted from conventional to hail-resistant roofing. It is
presumed in this model that a certain portion of conventional roofs are replaced with hail-resistant roofs, as a result of
replacements after each hail storm event. Two variables, one containing the percentage of hail-resistant roofs, and one
containing the percentage of conventional roofs, are updated after each simulated hail storm event.
• No seasonal adjustments were made, as the purpose of the models is to evaluate results across many years,
rather than to evaluate results within any given year. Since each replication represents an entire year, seasonal
adjustments are not very useuful.
• In these models, it is assumed that conventional roofs only get replaced with hail-resistant roofs when a roof is
damaged by hail. There is no provision made for the possibility that some home owners may install roofs on their
own, prior to any hail damage. The object of these models is to examine the potential impact of replacing hail
damaged roofs due to hail damage. Such action would like occur through laws requiring insurance companies to
pay for replacing roofs with hail-resistant roofing, and by changing building codes to require such installations
so that homeowners will spend their claims dollars on hail-resistant roofing as well. (Insurance companies pay
claims, but do not actually make roof replacements.) Homeowners generally determine the contractors who will
do the work, as well as the materials that will be used. Thus, a chance in building codes would be necessary to
assure that roofs are replaced with the upgraded resistant roofing.
• It is assumed that all of the loss data provided by NOAA for hail storm events pertains to residential roof damage.
In fact, having personally worked as a claims manager for a major insurance carrier, I can attest that by far the
greatest losses we encountered in hail storm events was due to residential roof damage. Most residential roofs
are asphalt shingles, and these can be destroyed merely by knocking off some of the top layer of pebbles on the
shingles.
25 of 39
Robert McPherson
ISMT E-135, Spring 2010
5.2 Simplifying Assumptions in the Models Final Project
Conventional roof replacement model The conventional roof replacement model in figure 19 is explained step-by-
step in table 5, according to module letter name.
Hail resistant roof replacement model Figure 20 illustrates the module layout and logic for the hail-resistant re-
placement version of the model. The illustration also shows the histograms from a single run, as well as a textbox
indicating the proportion of roof replacements that should be done with hail resistant roofing. In this example, the
0.50 in the textbox indicates that 50% of all losses attributable to conventional roofing should be replaced with hail-
resistant roofing. The model multiplies this percentage times the total amount of conventional roofs remaining from
the previous hail-storm event.
The model also incorporates a ratio of the number of hail-resistant roofs to the number of all single family roofs
in the U.S. housing stock. The numerator, consisting of the number of hail resistant roofs, is increased after each hail
storm event, which increases the ratio. The ratio can be applied to the total number of roofs to determine the number
of hail-resistant roofs. The number of conventional roofs is simply determined after each event, by subtracting the
newly adjusted number of hail-resistant roofs from the number of conventional roofs after each round.
Table 6 provides the details behind each of the nodes in the hail-resistant roof replacement model that was illus-
trated in figure 20. (The table is continued on another page from models O through S.) A few of the modules were
26 of 39
Robert McPherson
ISMT E-135, Spring 2010
5.2 Simplifying Assumptions in the Models Final Project
B Separate for Inter Duplicates hail event entity so that data NA Provided for visual separation only;
Arrival Tracking can be collected on inter-arrivals no cost assigned to duplicates
H Storm Severity Generates a dollar loss value based on Loss Severity LN(WEIB(53.98324,.00528)+173)
a lognormal Weibull distribution, for
each hail storm event
I Losses for Non Hail Simply stores the Loss Severity Non Hail Loss Severity
Resistant Roofing variable under the descriptive name: Resistant Losses
Non Hail Resistant Losses
J Total Accum Accumulates losses across yearly Total Cummulative Non Hail Resistant Losses / (Total
Conventional Losses replications; does not reset after Conventional Losses Value of Roof Population)
each year
K Total Hail Losses Stores value under a name that is Total Loss Cost for Non Hail Resistant Losses
comparable to the hail-resistant Both Roof Types
version of the model
L Hail Event Ends section that generates and tracks NA NA
Resolution losses for conventional, non-hail
resistant roofs
27 of 39
Robert McPherson
ISMT E-135, Spring 2010
5.3 Testing, Verification, and Validation Final Project
mainly used for testing and validation purposes, and most of these are noted within the table.
Figure 21: A single year of simulated hail losses shows extreme variability inherent in losses from actual hail events
Table 8 includes basic statistics from the Arena output after running the simulated distributons for 500 replications.
As previously noted these amounts are less than the actual amounts in the data. Although the fit of the distributions was
seen to be statistically significant in terms of high p-values, as previously demonstrated, the quantile plots that were
reviewed earlier illustrated that the tails were not adequately captured in the fitted distributions. Although the outliers
in the tail are few, they can be enough to shift the mean. Truncating the distributions in this manner can result in the
understated averages we see in table 8. This should not be a serious problem for the purposes of these experiments,
however. We will just need to be aware that the inter-arrivals are likely to happen a bit more frequently than in real-life,
and we should keep in mind that the actual losses can, in rare circumstances, be higher than what is modeled. This
is not unlike what has been observed in some commercial catastrophe model results, when comparing them to real
events. Mother nature sometimes has a way of dealing out more than our laboratories and stock distributions can
reproduce.
In fact, sensitivity analysis was conducted on the inter-arrival distribution to test the impact on the outcome. The
result, as will be discussed later, does not change the findings for this analysis. Table 8 also includes a final row that
compares the mean and maximum of an exponential distribution with a mean of 16.2 days (the same as the actual
inter-arrival mean). The shape of the distribution will not match the actual, of course, but the resulting average is
closer to the actual, as is the maximum inter-arrival, although it is still understated. The shape of the exponential
version that was used for sensitivity testing is more gradually sloping than the real distribution. Again, more will be
discussed on this result later.
28 of 39
Robert McPherson
ISMT E-135, Spring 2010
5.3 Testing, Verification, and Validation Final Project
Table 6: Description of modules within the hail-resistant roof replacement model – continued on next page
29 of 39
Robert McPherson
ISMT E-135, Spring 2010
5.3 Testing, Verification, and Validation Final Project
Module Explanations for Hail-Resistant Roof Replacement Model – continued from previous page
ID Module Name Module Description Variable Expressions and Details
O Increased Installation Calculates extra installa- Hail Resistant Increased Installa- TRIA(0.1, 0.15, 0.2) *
Cost for Hail Resis- tion cost of hail-resistant tion Cost Hail Resistant Losses
tant roofs based in triangular
distribution
P Hail Losses Plus Calculates total cost of in- Hail Resistant Losses Plus Added Hail Resistant In-
Higher Installation stalling hail-resistant roof Installation Cost creased Installation
Costs by adding extra installa- Cost + Hail Resistant
tion cost to loss cost Losses
Q Total Accum Hail Re- Accumulates losses across Hail Resistant Losses Plus Installa- (Hail Resistant Losses
sistant Losses Includ- yearly replications; does tion Divided By Value of Hail Re- + Hail Resistant
ing Installation not reset after each year sistant Roofs Increased Installation
Cost) / Total Value of
Hail Resistant Roofs
R Increase Total Num- Increase the number of Total Value of Hail Resistant Roofs Total Value of Hail
ber of Hail Resistant hail resistant roofs in the Resistant Roofs +
Roofs population by the number (Non Hail Resistant
of hail resistant roofs re- Losses * InputPro-
placed in event portionToRepairWith-
HailResistant)
S Percentage of Total For model check: nor- Percentage of Hail Resistant Roofs Total Value of Hail
Hail Resistant Value malize conventional losses in Total Resistant Roofs / To-
to Total Roof Value per dollar of conventional tal Value of Roof Pop-
roofs in population ulation
Table 7: Continuation of description of modules within the hail-resistant roof replacement model
30 of 39
Robert McPherson
ISMT E-135, Spring 2010
Final Project
Nevertheless, the histograms shown in figure 22 show a strong resemblance to the distributions that were shown in
the previous histograms of the actual data.
Figure 22: Histograms of inter-arrival times in days (left), and loss severity in dollars (right)
Testing and validation of the hail-resistant roof replacement incrementer, and total losses If the formula to
increment the number of hail-resistant roofs in the U.S. is working properly, we should expect to see the lines pertainint
to hail resistant roof losses and installation costs move upward as the number of replications increases. That is indeed
what is seen in figures 23 and 24. These graphs compare the total losses from conventional and hail-resistant roofs after
one year, and 200 years. The models were set to replace 100% of hail damaged conventional roofs with hail-resistant
roofs to speed up the replacement process. Thus, the replacements can only occur as fast as the frequency and severity
of hail storms will allow.
Note that severity is not only determined by hail stone size in a real event, but also by geograophical coverage.
Since these models as national in scope, the severity distribution encompasses this consideration. Since the frequency
generator, and the severity generators are based on actual data, the speed at which the roof replacements occur should
mimic real life. However, as pointed out previously, the speed will likely be a bit faster in the models, since there
are tail events that cannot be adequately accounted for in these distributions without resorting to modeling the tails
separately.
The graph in figure 23 indicates the average daily conventional roof losses (top line), the average daily hail-resistant
roof losses, and the average daily hail-resitant losses plus slightly increase installation costs, over a single replication
(i.e., one year). The model was started with only 10% of the roofs having hail-resistant roofs. (While no available data
were found to indicate total hail-resistant roofs in the U.S., 10% was utilized as a starting point, as the number of roofs
with hail-resistant roofs is considered to be quite low. While it is probably lower than 10%, changing this number does
not impact the findings from the experiments that were done.) In any case, figure 23 indicates that the losses related
to hail-resistant roofs remains low relative to conventional roof losses, after only one year. This is, of course, what we
should expect if the incrementing formula is working properly.
Figure 24 shows the same graph as in figure 23, except that it covers 200 replications. We can see from this
graph, that after 200 years, the losses from hail resistant roofs account for nearly all of the losses. The line for the
conventional losses is near zero, and is difficult to see in the graph. After a time as long as 200 years, this seems a
plausible result.
31 of 39
Robert McPherson
ISMT E-135, Spring 2010
Final Project
Figure 23: Average losses for each of the 365 days in a year over a single year (i.e., one replication)
that the cost to install hail-resistant roofs is only a little bit more than to install conventional roofs: 10% to 20% more.
However, given the wide variance in extreme events such as severe hail storms, things that seem certain may not be so.
Or, it may take a great many years to see the benefits from such loss mitigating efforts. This experiment is intended to
determine whether the apparent advantage between various options for installing hail-resistant roofs is real, and if so,
to determine a minimum number of years it might it take to fully realize these benefits.
There are several scenarios that will be compared.
1. Do not replace any hail-damaged roofs with hail-resistant roofing. This is extreme scenario is more of a baseline
for comparison purposes. It could help answer the question as to whether it is worthwhile to even consider
embarking on any hail-resistant roofing strategies.
2. Replace all hail-damaged roofs with hail-resistant roofing. This scenario represents the opposite extreme case to
the first scenario, and is likewise mainly included for comparison purposes. It would likely take some extreme
insurance and building code legislation to accomplish anything even close to this scenario.
3. Replace 50% of hail-damaged roofs with hail-resistant roofing. This may also be a bit optimistic, but might be
plausible. For example, it might be possible to accomplish this kind of rate eventually, if enough insurance com-
panies can find a way to get the public’s attention through attractive incentives, combined with some legislative
changes in hail prone areas, such as improved building codes.
4. Replace 10% of hail-damaged roofs with hail-resistant roofing. This is replacement rate is perhaps a bit more
realistic.
5. Share costs for installing hail-resistant roofing with homeowners, resulting in a 10% take-up rate (i.e., 10% of
hail-damaged roofs replaced with hail-resistant roofing. This is perhaps the most realistic scenario, as some
insurance companies have begun to offer incentives, such as reduced deductibles, and/or reduced premiums
for residents with hail-resistant roofs. The take-up rate is not known, but does not appear to be getting much
attention in the industry. Hence, the take-up rate would appear to be rather low, so that 10% may well be an
overly optimistic estimate. This scenario is also intended to test whether the extra installation cost is much of an
offsetting factor as to the efficacy of installing hail-resistant roofing from a loss standpoint.
32 of 39
Robert McPherson
ISMT E-135, Spring 2010
Final Project
Figure 24: Average losses for each of the 365 days in a year over 200 years (i.e., 200 replications)
Figure 25 illustrates the basic initial setup for comparing the scenarios.
Figure 25: Basic setup for comparing hail-resistant roof replacement strategies in Process Analyzer software
33 of 39
Robert McPherson
ISMT E-135, Spring 2010
7.1 Comparisons over 50 Years Final Project
Figure 26: Box plot of 50 repititions indicating no single scenario is significantly better
The numbers behind the box plot for the 50 replications are shown in table 9. This table indicates the minimum,
maximum, and confidence interval ranges for the mean of each scenario across all the replications (50 years, in
this case). We can see confirmation of the overlapping confidence intervals, despite the seemingly large differences
between the mean values. This is indication of the obscuring affect that extreme events can have on seemingly obvious
courses of action. It turns out that embarking on a program to replace hail-damaged conventional roofs with hail-
resistant ones may not be such an obviously correct course of action after all, or at least, not within a 50-year timeframe.
Fifty years is obviously a long time to wait to see difinitive business results.
It was considered that there may be a potential need for a warm-up period. However, this did not make a difference
in the results, even after allowing for a 100-year (3,650 days) warm-up period. There really are not any significant
interactions within the model, such as between resources or entities, as there are no resources, and there is only
one entity. Thus, it is unlikely that a warm-up would make much difference in most cases with these rather simple
model structures. Although, increasing the number of replications does reduce the confidence intervals, warm-up
replications are not counted toward the value of the sample size, n, when the Arena software package calculates a
confidence interval.
34 of 39
Robert McPherson
ISMT E-135, Spring 2010
7.2 Comparisons over 100 Years Final Project
Figure 27: Box plot of 100 replications indicating no single scenario is significantly better, even after 100 years
In table 10, we can see that the 95% confidence interval low values are not quite as low as the result that was shown
for 50 replications in table 9. Converely, the high values are not as high. Thus, we see confimation of the confidence
interval shrinking with increaseing numbers of years, as expected. However, even with the shrinking confidence
intervals, none of the scenario outcomes is statistically significant. Again, all of the boxes are shaded red in the box
and whiskers plot that was shown in figure 27. None of them are shaded blue, which would have indicated a result that
would have allowed us to reject the null hypothesis that the scenario mean outcomes are statistically equal, given the
amount of variance in the results. Again, we see the effect of extreme events can have on such comparisons.
Total Loss Cost for Both Roof Types by Scenario 100 Replications
(Standard Error Tolerance = 1.12084)
Min Max 95% CI Low 95% CI High Mean
All Conentional Roof Replacements 293.74 5.15 27.69 56.52 42.1
All Hail Resistant Roof Replacements 268.7 2.8 26.73 50.07 38.4
Fifty Percent Hail Resistant Replacements 212.12 3.42 23.78 44.66 34.22
Ten Percent Hail Resistant Replacements 231.39 4.46 27.41 50.96 39.18
Cost Sharing with Ten Percent Hail Resistant Replacements 230.09 4.41 27.18 50.54 38.86
Table 10: Data behind the box plot for 100 replications
35 of 39
Robert McPherson
ISMT E-135, Spring 2010
7.3 Comparisons over 250 Years Final Project
Figure 28: Box plot of 250 replications shows all three hail-resistant scenarios are finally significantly better than
non-hail resistant
Table 11 shows just how narrow the confidence intervals have become after 250 replications. We can see how
differences in sencario outputs can begin to appear significant at this point.
Total Loss Cost for Both Roof Types by Scenario 250 Replications
(Standard Error Tolerance = 1.43306)
Min Max 95% CI Low 95% CI High Mean
All Conentional Roof Replacements 5.15 308.5 35.52 53.13 44.33
All Hail Resistant Roof Replacements 2.48 268.7 24.6 36.76 30.68
Fifty Percent Hail Resistant Replacements 2.56 212.12 23.91 36.09 30
Ten Percent Hail Resistant Replacements 3.87 257.49 29.64 44.58 37.11
Cost Sharing with Ten Percent Hail Resistant Replacements 3.8 251.55 29.25 44 36.62
Table 11: Data behind the box plot for 250 replications
36 of 39
Robert McPherson
ISMT E-135, Spring 2010
7.4 Sensitivity Testing Final Project
Figure 29: 50-year comparison of exponential inter-arrival with mean of 16.2 days for sensitivity testing
However, figure 30 show a similar result in 100 years with the new distribution, as we saw in 250 years with the
previous, fitted distribution. Thus, it appears that the exponential distribution tends to make the confidence intervals
more narrow at a faster rate. However, for business purposes, 100 years is still a very long time. This sensitivity
test appears to confirm our initial findings that hail-resistant roof replacement strategies may not be as appealing as it
might first appear.
8 Conclusion
An initial evaluation of the published statistics concerning the benefits of hail-resistant roofs might lead insurance
executives, state insurance regulators, and building code advocates to pursue incentives and laws that would strongly
promote investing extra money to install hail-resistant roofs when conventional roofs are destroyed as a result of hail-
damage. However, an evaluation of the long-term benefits with the use of simulation modeling technology reveals
the answer may not be so clear cut. Hail storm events that are sufficiently strong to cause significant damage (over
$1 million per event, which is a low threshold by insurance company standards) are relatively infrequent, and the
variance in the severity of these events is such that it could take an extremely long time to realize a clear payoff from
such efforts.
37 of 39
Robert McPherson
ISMT E-135, Spring 2010
REFERENCES Final Project
Figure 30: 100-year comparison of exponential inter-arrival with mean of 16.2 days for sensitivity testing
However, further research should be conducted, as this study was fairly limited in scope. Another possible av-
enues of research could include costs savings for wind damage combined with hail damage, as hail-resistant roofs are
purported to hold up better than conventional roofs in high wind as well. Other possibilities include doing a more
extensive study that would seek to overcome some of the simplifying assumptions that were listed for this study. One
such extension on this study could involve fitting additional distributions to account for the extreme tails. It may be
possible to do a piecewise combination of distributions at different thresholds.
In addition to modeling the impact of hail-resistant roof replacement strategies, much attention was given to the
extreme value distributions behind hail storm events in general. This study also revealed the high challenge of trying
to mimic mother nature, as she does not like to conform to the rather domestic varieties of distributions that are
commonly available in most software packages. The natural world can be much more variable and extreme than we
may tend to imagine.
References
[1] Cost of a new roof - get prices and estimates - CostHelper.com. http://www.costhelper.com/cost/home-
garden/roof.html, December 2008.
[2] Energy information administration - residential energy consumption survey. http://www.eia.doe.gov/emeu/recs/,
2009.
[3] NCDC: * national climatic data center (NCDC) *. http://www.ncdc.noaa.gov/oa/ncdc.html, August 2009.
[4] NCDC: query output. http://www4.ncdc.noaa.gov/cgi-win/wwcgi.dll?wwevent~storms, December 2009.
[5] Arena simulation software by rockwell automation: Home. http://www.arenasimulation.com/, 2010.
38 of 39
Robert McPherson
ISMT E-135, Spring 2010
REFERENCES Final Project
[11] Terry Blinton. Let it hail, let it hail, let it hail. http://www.insurancejournal.com/magazines/southcentral/2003/04/07/features/28144.
April 2003.
[12] NAHB Research Center. High wind- and Impact-Resistant asphalt roofing shingles.
http://www.toolbase.org/Technology-Inventory/Roofs/wind-resistant-asphalt-shingles, 2001.
[13] Stanley A. Chagnon and Tamara G. Creech. AMS online journals - sources of data on freezing rain and resulting
damages. Journal of Applied Meteorology, page 5, 2003.
[14] Stuart Coles. An Introduction to Statistical Modeling of Extreme Values. Springer Series in Statistics. Springer-
Verlag, London, 2001.
[15] Eric Gilleland, Rick Katz, and Greg Young. extRemes: Extreme value toolkit., 2010. R package version 1.62.
[16] Patricia Grossi and Howard Kunreuther. Catastrophe Modeling: a new approach to managing risk. Huebner
International Series on Risk Insurance and Economic Security. Springer Science+Business Media, Inc., New
York, NY, 2005.
[17] Ryan Jewell and Julian Brimelow. EVALUATION OF an alberta hail growth model using severe hail proximity
soundings in the united states. 2004.
[18] David W. Kelton, Randall P. Sadowski, and Nancy B. Swets. Simulation with Arena. McGraw Hill, New York,
NY, international edition, 2010.
[19] Inc. Pearson Education. U.S. home size infoplease.com. http://www.infoplease.com/askeds/us-home-size.html,
2007.
[20] Inc. Professional Investigative Engineers. Insurance-Canada.ca other claims information: Hail resistant roofs:
Fact or fiction? I-ENG-A report. http://www.insurance-canada.ca/claims/other/IENGAHail407.php, 2010.
[]
39 of 39