Professional Documents
Culture Documents
Evaluation of Charpy-V Transition Curve Fitting Based On Distribution-Free Statistical Assessment
Evaluation of Charpy-V Transition Curve Fitting Based On Distribution-Free Statistical Assessment
Evaluation of Charpy-V Transition Curve Fitting Based On Distribution-Free Statistical Assessment
A R T I C L E I N F O A B S T R A C T
Keywords: A non-parametric, distribution-free, statistical assessment of 5 large Charpy transition curve data sets is used for
Charpy impact test the optimum fitting equations for smaller data sets. The assessment makes use of a combination of rank prob
Transition curve ability and binomial probability analysis of the data. The original non-parametric assessment method is
Fitting
improved by combining upper and lower bound binomial estimates, thus removing a bias that exists in the
Non-parametric assessment
Statistics
original method. The non-parametric assessment is not suitable as a standard method because it requires too
Rank probability many data points to give a reliable result. It is, however, ideal as a research tool to examine transition curve
Binomial probability shape and scatter. Based on the assessment several recommendations for transition curve fitting can be made.
* Corresponding author.
E-mail addresses: kim.wallin@kwsolutions.fi (K. Wallin), wolfram.baer@bam.de (W. Baer).
https://doi.org/10.1016/j.ijpvp.2023.105079
Received 11 September 2023; Received in revised form 9 October 2023; Accepted 16 October 2023
Available online 18 October 2023
0308-0161/© 2023 Elsevier Ltd. All rights reserved.
K. Wallin and W. Baer International Journal of Pressure Vessels and Piping 206 (2023) 105079
presented in Fig. 1 [14]. The scatter may at a first glance appear large, then the temperature only specifies where on the curve the specimen
but this is attributed to the large number of tests (200). Normally CVN lies. In this case the physical argument changes to one where parallel
transition curves contain around 30 specimens but may be as few as only transition curves differ by some temperature amount. In this case a
10–12 specimens, which is common for example, in nuclear surveillance temperature-based fit is more appropriate. This latter assumption is
programs [13]. When the Tanh equation is fitted to such a small data set, supported by the fact that data scatter is more consistent with respect to
the scatter appears less than what it really is. temperature than energy as shown in Fig. 2. The scatter with respect to
There are several different techniques by which one can fit the Tanh energy was determined at each test temperature with 20 specimens per
equation to CVN impact data. They range from quite sophisticated to temperature. The scatter with respect to temperature was obtained by
simple least square methods. Most often, the Tanh equation is fitted by a ordering the 200 data points by energy and dividing the data in 10
least squares fit with relation to absorbed energy KV, lateral expansion groups with 20 specimens each. The scatter and mean temperature were
LE or shear fracture appearance SFA. This is the method described in then determined for each group.
different standard procedures like ASTM E2215 [13] and NIST [11], but The scatter in terms of KV (LE, SFA) is not constant, but varies over
the method is not always optimal for CVN fitting. Another possible the transition region, being largest in the centre of the transition region
parameter, on which the least square analysis can be based, is the and smallest in the lower and upper shelves. The scatter in terms of
temperature. Normally it is assumed that temperature is defined without temperature, however, is nearly constant, as seen from Fig. 2, thus fa
scatter and all the variability occurs in terms of energy. This is the voring parallel transition curves. The least square fitting methods as
common argument for using the energy-based fitting. However, if we sume a constant scatter over the whole fitting range. This means that too
assume that each specimen is connected to a specific transition curve, much weight is put on the data in the middle of the transition, when
fitting in terms of energy. One can try to estimate the scatter along the
Fig. 1. Typical CVN transition curve for nuclear grade forging based on 200 Fig. 2. Standard deviation of data presented in Fig. 1, showing the difference in
tests [14]. The temperature and energy based fits are explained below. scatter behaviour in terms of energy and temperature [14,15].
2
K. Wallin and W. Baer International Journal of Pressure Vessels and Piping 206 (2023) 105079
transition curve and use this to weigh the individual energy values, but The role of the dummy parameter is to ensure that the Tanh equation
this is practically impossible for a small data set. reaches upper shelf in the correct temperature region. This is especially
Fitting in terms of temperature removes this problem. It automati important for steep transition curves. For large data sets, the dummy
cally ensures the correct weighing of each data point regardless of test parameter has only limited effect, but for small data sets the use of the
temperature. The only problem is that very close to LS and US, the dummy parameter is important.
scatter in terms of temperature increases. Therefore, these regions The steepness parameter C is obtained from Eq. (4) [11] and the
should not be included in the fit, except for estimating LS and US. The normalization temperature Tt50 from Eq. (5) [2].
inverted Tanh equation has the form shown in Eq. (2). ( ) ( )
∑n
US− LS
∑ n ∑n
US− LS
( ) n⋅ Ti ⋅ln KV − LS
− 1 − T i ⋅ ln KV − LS
− 1
C KV − LS i i
T = Tt50 + ⋅ln (2) C = 2⋅ (i=1 ( ))2
i=1 i=1
( )2 (4)
2 US − KV ∑n
US− CV min ∑ n
US− LS
ln KV i − CV min
− 1 − n⋅ ln KVi − LS
− 1
Contrary to the energy-based fit, which requires a numerical fitting i=1 i=1
3
K. Wallin and W. Baer International Journal of Pressure Vessels and Piping 206 (2023) 105079
estimates are obtained by taking the average of the upper and lower
bound estimates. Note that the probabilities in the upper and lower
bound estimates are reversed with respect to probability.
Fig. 7 shows as an example the estimates of the ±1σ fractiles. The
upper and lower bound Binomial estimates show a clear bias. The points
marked “combined” are the averages of the Binomial estimates and the
estimates based on the Rank probability. As seen, they form together an
estimate with very little scatter. The combined estimates were fitted by a
so-called Burr equation [1,6,7,10] which is quite flexible. For this large
data set and material, the Burr equation fits the non-parametric esti
mates satisfactorily.
The Burr equation used here has the form given by Eq. (9).
(US − LS)
KV = [ ( )]m + LS (9)
1 + exp − T− kT0
If fixed, US and LS can represent the upper shelf and lower shelf
absorbed energies, but for a better overall fit to the combined estimates,
Fig. 4. Principle of rank and binomial analysis. they were used as fitting parameters. Thus, they do not represent true
upper or lower shelf energies in this case. Unfortunately, the parameters
the Binomial probability is constant. The size of the bias is then T0, k and m lack any physical meaning. From this respect the Tanh
dependent on the size of the intervals between test temperatures. By equation is physically sounder.
combining both values above a certain energy up to a selected temper Fig. 8 shows a comparison of the non-parametric estimates for me
ature and values below a certain energy down to a selected temperature dian and ±1σ fractiles and estimates based on the Tanh equation using
and taking their average, the bias is avoided. both temperature and energy as fitting parameter. The median estimates
One of the most common simple analytical median Rank probability do not differ significantly, but the energy-based median fit seems to
estimates has the form given in Eq. (7) [15,16]. overestimate the lower transition region with respect to the non-
parametric analysis. The temperature-based Tanh equation ±1σ frac
PR0.5 ≈
i − 0.3
(7) tile estimates show good agreement with the non-parametric estimates.
n + 0.4 It should be noted that the data set is sufficiently large to provide a high
For the Binomial median probability estimate a similar equation has accuracy for the non-parametric analysis. It was established that, in
been developed, Eq (8) [15]. order to have a good reliability in the estimates, the non-parametric
analysis requires the average number of specimens both for the Rank
i + 0.684
PB0.5 ≈ (8) and Binomial probability to be n and Σn ≥ 8 [17]. This is clearly fulfilled
n + 1.368
with this large data set. The n and Σn varied in all cases between 20 …
in Eq. (8), i corresponds either to the upper bound or lower bound 50 making the result quite reliable.
number of data and n is the corresponding total number for upper or Below the analysis is applied on four additional comparatively large
lower bound analysis. CVN data sets originally published in Ref. [12].
The resulting Rank probability diagram is shown in Fig. 5. The figure
includes the 5 %, -1σ, median, +1σ, and 95 % lines. The corresponding 2. Non-parametric analysis of different strength steels
Binomial probability diagrams are shown in Fig. 6. These figures also
include the 5 %, -1σ, median, +1σ, and 95 % lines. The combined Four different structural steels with comparatively large CVN data
sets were selected for the analysis. The data is taken from Ref. [12]. The
steels corresponded to the EN 10025 [18] standard designations:
S235JR, S355 N, S460 N, and S960QL. The yield strengths varied from
282 MPa for the S235JR steel to 1015 MPa for the S960QL steel. Charpy
tests were carried out according to ISO 148-1 [19] and ISO 14556 [20]
using an instrumented ISO type striker with a 2 mm radius of the striking
edge. The capacity of the machine was 300 J. The impact test results for
all four steels are shown in Fig. 9. Besides performing a non-parametric
analysis of the data, also more conventional data analysis (Tanh fit) was
performed. In Ref. [12], the upper shelf was estimated as the average
energy of all specimens at temperatures where all specimens showed
100 % ductile fracture. Here, all specimens showing 100 % ductile
fracture, regardless of temperature were used to estimate the upper shelf
energy US. Also, in Ref. [12], the lower shelf was taken as the average of
all specimens showing 0 % ductile fracture. Here a more conservative
value, based on a visual estimate of the apparent asymptotic value
corresponding to temperature insensitivity was used for the lower shelf
energy LS.
The Rank probability analyses for the four steels are shown in Fig. 10.
There should be preferably a minimum of 8 test results per temperature
to obtain a reliable analysis. Unfortunately, only the S235JR steel to a
significant degree fulfils this requirement. The other steels have less tests
per temperature and the Rank probability estimate is not quite accurate.
Fig. 5. Rank probability analysis of the data in Fig. 4.
4
K. Wallin and W. Baer International Journal of Pressure Vessels and Piping 206 (2023) 105079
Fig. 7. Distribution-free ±1σ estimates for the data in Fig. 4, showing bias in one-sided Binomial probabilities and the beneficial effect of combining upper and lower
bound values. The combined estimates are the averages of the Binomial estimates combined with the Rank estimates.
5
K. Wallin and W. Baer International Journal of Pressure Vessels and Piping 206 (2023) 105079
6
K. Wallin and W. Baer International Journal of Pressure Vessels and Piping 206 (2023) 105079
7
K. Wallin and W. Baer International Journal of Pressure Vessels and Piping 206 (2023) 105079
8
K. Wallin and W. Baer International Journal of Pressure Vessels and Piping 206 (2023) 105079
Fig. 15. Non-parametric median and ±1σ estimates for the four structural steels. The non-parametric fit is described with the Burr equation fit to the median and
±1σ estimates.
9
K. Wallin and W. Baer International Journal of Pressure Vessels and Piping 206 (2023) 105079
Fig. 17. Comparison of energy- and temperature-based Burr equation fits to the actual test results and the non-parametric median Burr fit for the four steels.
10
K. Wallin and W. Baer International Journal of Pressure Vessels and Piping 206 (2023) 105079
Fig. 20. Two random subsets from the 15Ch2MFA steel data set fitted with Eq. (12).
The fitting parameters a0 … a5 do not have physical meaning. Thus, However, for small data sets with less than 30 test results, the benefit
they cannot be connected to LS or US. As an example of the flexibility of of using a more flexible equation is questionable, and only a rough es
Eq. (12), two subsets from the 15Ch2MFA steel data set were randomly timate of the transition curve can be obtained. For such data sets the
selected by taking one random value at each temperature, thus obtaining simple Tanh equation may be a suitable choice.
two subsets with 10 data points each. The data was fitted with Eq. (12). As to whether the equation should be fit with respect to energy or
As seen in Fig. 20, the equation is extremely flexible and provides a very temperature, a previous study [15] compared 28 J and 41 J tempera
good fit to the data. The problem is that the data sets are too small to tures fitted with both methods for small data sets with 10 data points
give a reliable description of the transition curve. Even though the data each. The data was taken by randomly sampling data from the
subsets represent the same material, the resulting fits would not indicate 15Ch2MFA steel data set, similarly to the data shown in Fig. 20. The
this. This is what is meant by “overfitting” in the case of small data sets. distributions of 28 J and 41 J temperatures estimated by the two
Based on an earlier assessment of the 15Ch2MFA steel [15], an methods are shown in Fig. 21. For both energy levels, the estimates
approximate equation for the standard deviation of different tempera based on temperature produced normally distributed values. The esti
ture estimates corresponding to a given energy x has the form of Eq. mates based on energy showed tails that did not follow a normal dis
(13). tribution. Due to this, the energy-based fits showed a larger scatter than
σT the temperature-based fits. The temperature-based fits standard devia
σTt xJ ≈ ( )0.3 (13) tion is well in line with Eq. (12). For the larger data sets, the
neff − 4
energy-based fits tend to produce lower temperatures than the
The parameter neff relates to the number of data points below US and temperature-based fits. The best estimate non-parametric values are
above LS. For the four steels, neff varies in the range 38–57. Considering Tt28J = − 60 ◦ C and Tt41J = − 53 ◦ C. The respective median values for
that σTt for all four steels is close to 10 ◦ C, the estimate for σTtxJ is temperature-based fits are − 59 ◦ C and − 51 ◦ C, whereas the corre
approximately 3 ◦ C. For the 15Ch2MFA steel where neff is 125, the es sponding values for the energy-based fits are − 62 ◦ C and − 52 ◦ C. Even
timate for σTtxJ is approximately 2 ◦ C. This is fully in line with the trend though the differences are small, the trend is the same. The
seen in Fig. 18. temperature-based fits are slightly conservative, whereas the
Based on the above results, it is recommended to use the Burr energy-based fits are slightly non-conservative.
equation fitted with respect to temperature as given by Eq. (9) for larger In order to examine more closely the effect of overfitting on 28 J and
data sets with approximately 30 test results or more. 41 J temperature estimates, Eq. (12) was used to analyze the same data
Fig. 21. Distribution of temperatures (Tt28J and Tt41J) estimated from random sub-sets with 10 data points from the large 15Ch2MFA data set with 200 data,
analyzed with energy- and temperature-based Tanh equations [15].
11
K. Wallin and W. Baer International Journal of Pressure Vessels and Piping 206 (2023) 105079
Fig. 22. Distribution of temperatures (Tt28J and Tt41J) estimated from random sub-sets with 10 data points from the large 15Ch2MFA data set with 200 data,
analyzed with energy-based Böhme [8] and temperature-based Tanh equations [15]. Also included are the corresponding distributions for the inverted Burr equation.
sets as the ones leading to Fig. 21. Fig. 22 shows the comparison between 4) Small data sets should not be fitted with equations containing too
Eq. (12) estimates and the temperature-based Tanh estimates. The effect many parameters, since it will lead to overfitting and more scatter in
of overfitting is seen as an increased uncertainty in the estimate. The specific temperature estimates, like Tt27J and Tt40J.
result is even worse than for the energy-based Tanh estimates in Fig. 21.
The same data sets were also analyzed with the inverted Burr equation Author statement
and the resulting distributions are also included in Fig. 22. The inverted
Burr equation provides almost the same distributions as the The corresponding author is responsible for ensuring that the de
temperature-based Tanh estimates. This indicates that the inverted Burr scriptions are accurate.
equation can also be used for relatively small data sets without danger of
overfitting.
A practical additioinal advantage of the temperature-based fit Declaration of competing interest
compared to the energy-based fit is that the obtained formula easily
allows calculating transition temperatures at desired KV values. The The authors declare that they have no known competing financial
more complicated search of target values using solver routines - as interests or personal relationships that could have appeared to influence
needed with energy-based fit formulae is unnecessary. the work reported in this paper.
A non-parametric, distribution-free, statistical assessment of 5 large The authors do not have permission to share data.
CVN transition curve data sets has been applied to look for optimum
fitting equations for smaller data sets. The assessment method makes use Acknowledgements
of a combination of rank probability and binomial probability analysis
of the data. The non-parametric assessment requires comparatively large The outstanding experimental work of the Fracture Mechanics and
data sets with at least 100 data points to obtain reliable descriptions of Structural Integrity Laboratory at BAM Federal Institute for Materials
both curve shape and scatter. With data sets consisting of 50–100 data Research and Testing, Berlin, Germany, which performed the compre
points, the non-parametric assessment can be used to estimate the me hensive test program to establish the material data base for the struc
dian behaviour of the data, but the estimate of the scatter contains un tural steels discussed in this paper and in Ref. [12] is gratefully
certainty. The original non-parametric assessment method was acknowledged.
improved by combining upper and lower bound binomial estimates,
thus removing a bias that existed in the original method. The non- References
parametric assessment is not suitable as a standard method because it
requires too many data points to provide a reliable result. It is, however, [1] I.W. Burr, Cumulative frequency functions, Ann. Math. Stat. 13 (2) (1942)
215–232.
ideal as a research tool to examine transition curve shape and scatter.
[2] G. Hofer, C.C. Hung, Ü. Günes, A mathematical function for the description of
Based on the assessment the following recommendations can be made: Charpy impact tests, Z. für Werkst. 8 (1977) 109–111.
[3] W. Oldfield, Fitting curves to toughness data, Journal of Testing and Evaluation,
1) It is recommended that the CVN transition curve fitting is perfomed JTEVA 7 (6) (1979) 326–333.
[4] F.W. Stallman, Evaluation and uncertainty estimates of Charpy—impact data, in:
based on temperature since the scatter in terms of temperature is Proceedings of the Fourth ASTM-EURATOM Symposium on Reactor Dosimetry,
more constant than for energy (or LE or SFA). National Bureau of Standards, Gaithersburg, Maryland, 1982. March 22-26.
2) For data sets with more than 30 results, fitting may be based on the [5] P. Urwank, Unambiguous curve fitting and error estimation for Charpy impact test
data of reactor pressure vessel steels, suitable for small number of samples, J. Nucl.
Burr equation fitted in terms of temperature. Mater. 161 (1989) 24–29.
3) For data sets with less than 30 results, fitting can simply be based on [6] J.L. Helm, The interpretation of Charpy impact test data using hyper-logistic fitting
the standard Tanh equation fitted in terms of temperature, even functions, in: 17:th Symposium on Effects of Radiation on Materials, ASTM special
technical publication, 1996, 1270.
though the Burr equation fitted in terms of temperature can also be [7] P.L. Windle, M. Crowder, R. Moskovic, A statistical model for the analysis and
used with sufficient confidence, if there are more than 10 results. prediction of the effect of neutron irradiation on Charpy impact energy curves,
Nucl. Eng. Des. 165 (1–2) (1996) 43–56.
[8] W. Böhme, W. Schmitt, Comparison of results of instrumented Charpy- and min-
Charpy tests with different RPV-steels, in: Small Specimens Test Techniques, ASTM
12
K. Wallin and W. Baer International Journal of Pressure Vessels and Piping 206 (2023) 105079
STP 1329, Corwin WR, Rosinski ST, Van Walle E., American Society for testing and quality system tool applicable for steel producers and steel users of heavy plates,
materials, 1998. profiles and weldments. Technical steel research, European commission, research
[9] L.W. Cao, S.J. Wu, P.E.J. Flewitt, Comparison of ductile-to-brittle transition curve fund for coal and steel, EUR 22454 (2007).
fitting approaches, Int. J. Pres. Ves. Pip. 93–94 (2012) 12–16. [15] K. Wallin, Fracture Toughness of Engineering Materials – Estimation and
[10] J. Kohout, Various Regression Functions Fitting Transition Curves, Internal Report Application, EMAS Publishing, Warrington, 2011.
for the Ministry of Defense of the Czech Republic, 2012. [16] K. Wallin, Distribution free statistical assessment of scatter and size effects in the
[11] E. Lucon, J. Splett, A. Koepke, D. Newton, NIST software package for obtaining Euro fracture toughness data set, Eng. Fract. Mech. 103 (1) (2013) 69–78.
Charpy transition curves, NIST Technical Note 2158 (2021), https://doi.org/ [17] K. Wallin, Simple distribution free statistical assessment of structural integrity
10.6028/NIST.TN.2158. material property data, Eng. Fract. Mech. 78 (9) (2011) 2070–2081.
[12] W. Baer, P. Wossidlo, B. Abbasi, On the question of how to analyze and apply [18] prEN 10025-1, Hot Rolled Products of Structural Steels – Part 1, General Technical
Charpy pendulum impact test results correctly - shortcomings in corresponding Delivery Conditions, 2011.
procedures and standards, JTE 50 (2022), https://doi.org/10.1520/JTE20210781. [19] ISO 148-1, Metallic Materials – Charpy Pendulum Impact Test – Part, vol. 1, Test
[13] ASTM E2215 Standard Practice for Evaluation of Surveillance Capsules from Light- Method, 2016.
Water Moderated Nuclear Power Reactor Vessels. [20] ISO 14556, Metallic Materials – Charpy Pendulum Impact Test – Instrumented Test
[14] P. Langenberg, J. Buchholz, A. Völling, W. Bleck, P. Balladon, K. Wallin, Method, 2015.
P. Nevasmaa, A. Laukkanen, QUALYTOUGH - development of a fracture toughness [21] NORSOK, STANDARD N-004: Design of Steel Structures, 2013.
13