Professional Documents
Culture Documents
EpshteinN A Assesmentofmethodslinearityinpracticef
EpshteinN A Assesmentofmethodslinearityinpracticef
net/publication/334971858
CITATIONS READS
6 4,579
1 author:
Naum Epshtein
STADA CIS
52 PUBLICATIONS 242 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Naum Epshtein on 05 August 2019.
INTRODUCTION
“Linearity” is one of the required characteristics when validating analytical
methods [1-8]. According to ICH, “The linearity of an analytical procedure is its ability
(within a given range) to obtain test results which are directly proportional to the con-
centration (amount) of analyte in the sample” [1]. The goal of linearity validation of
the method, according to the USP “is to have a model, whether linear or nonlinear, that
describes closely the concentration-response relationship” (by a response, we mean the
measured signal or the result of its processing); “If linearity is not attainable, a nonlin-
ear model may be used” [2]. Therefore, the confirmation of the linear model of the
concentration-response relationship in methods validation is desirable but not neces-
sary. We emphasize that the essence of the “Linearity” test is to check the possibility
of using the linear model before confirming the accuracy and precision of the method
as the simplest and preferred practice. At the same time, the results of the “Linearity”
test may indicate the need for a non-linear model if the nonlinear model de-scribes
more closely (more accurately) the concentration-response relationship and confirms
the accuracy and precision of the method than a linear one.
2
The linearity of methods is most often confirmed in accordance with ICH [1]:
visually by graph of dependence of response Y (the measurement signal, for example,
the peak area in HPLC) on concentration C of analyte and the acceptable value of cor-
relation coefficient R of linear regression dependence of Y on C; also one has to give
the regression equation or statistical characteristics of regression. In practice, other cri-
teria are also used to assess the linearity or non-linearity of the dependence of Y on C,
based on mathematical statistics [6-24]. There are certain reasons for this, as will be
seen from the text of this article. No wonder, therefore, that up to now remain relevant
questions which connected with criteria of linearity of analytical methods, both in prac-
tical validation of methods and in the expert appraisal of validation documentation.
The aim of the present work is to consider the main graphical and calculation
criteria for confirming the linearity of analytical methods and their limitations, and also
to give recommendations.
MATERIALS AND METHODS
The statistical calculations were made using MS Excel, including using the sta-
tistical package included in its composition. The experimental data given as examples
were obtained by HPLC on a Waters e2695 liquid chromatograph with a Waters 2998
Photodiode Array Detector.
RESULTS AND DISCUSSION
First of all, let us discuss why the linearity of the method is important, although
it is known that a nonlinear model of dependence of Y on C can be used for the cali-
bration graph [6, 11-22]. This can be explained by the following reasons. First, the
possibility of using the simplest method - absolute calibration (one concentration of a
standard solution) with negligible systematic error [6]; secondly, in the routine analysis
in case of line calibration chart it is enough to use 2-3 points while at the nonlinear
calibration schedule not less than 5 points will be required, and these are big expenses
of time especially in case of rather slow chromatographic techniques of the analysis.
The required number of experimental points and solutions used for linear-
ity validation. In the ICH validation manual, it is recommended to use at least 5 ex-
perimental points (concentrations of the substance being determined) in linearity vali-
dation of method [1]; this is consistent with the recommendation from mathematical
statistics [25]. But sometimes five points may not be enough for a reliable conclusion
about the linearity or nonlinearity of the method [6]. ICH allows for the validation of
method linearity to use solutions of required concentration by diluting the reference
standard solution. But this ignores the possibility of the influence of excipients since it
is indirectly verified by the “Accuracy” test. At the same time, according to ICH, to
confirm the linearity of the method, instead of diluting of reference standard solution
you can use the results obtained on mixtures of the analyte with placebo when testing
accuracy of the method. This usually reduces the amount of work and significantly
increases the reliability of the conclusions about the linearity or nonlinearity of the
method, since the effect of placebo is taken into account and the number of experi-
mental points increases (at least 9, distributed as evenly as possible in the analytical
3
range of the method). In practice, we use such an approach for methods of determining
the content of main substances.
The range (analytical range) of the method for which it is necessary to vali-
date linearity depends on the type of method [1, 6]:
• For assay of a drug substance or a finished product: “not less than 80% to
120% of the nominal content” (i.e., the concentration of the tested solutions should
cover a range corresponding to at least 80,0% - 120,0% of the nominal content of the
analyte);
• For methods for determining the uniformity of dosage units: “not less than
70% to 130% of the nominal content, if a wider interval is not required for testing (for
example, for aerosols)”;
• For dissolution testing procedures: “± 20% (absolute) with respect to the up-
per/lower limits of the range specified in the specification”;
• For methods for determining impurities: “from the threshold of ignoring im-
purities (registration threshold) [26] or the limit of quantitation (LOQ) to no less than
120% of the nominal limit.” In the US Pharmacopoeia, another requirement is “from
50% to not less than 120% of the acceptance criterion” [2].
Let us consider the main criteria for assessing the linearity of the dependence of
Y on C, used in the validation of analytical methods and their limitations.
Correlation coefficient R
The Pearson’s correlation coefficient R and a graph of the measured response Y
on the concentration C of the analyte are most often used in the analytical methods
validation for confirmation of their linearity [1-6]. At the same time as the criterion of
the acceptability for the correlation coefficient R, it is possible to be guided by R≥
0,999 for methods of determination of the main substances [5, 27] and R ≥ 0,98 for
methods of determination of impurities [4]. Also other criteria of the acceptability, for
example, for methods of determination of the content of the main substances R> 0.998
for API and R> 0.990 for medicines [28], R≥ 0.995 [29] meet. R> is 0.998 for API and
R> 0,990 for medicines [28], R≥ 0,995 [29]. The lowest allowable (critical) values of
R, taking into account the possibility of rounding off of numbers, specified in the GF
XIV: " In most instances one uses the linear dependence relevant to condition R≥ 0.99,
and only in the analysis of trace quantities consider linear dependences for which R≥
0.9" [3]; in our experience such low acceptable values of R should be viewed as insuf-
ficiently reliable, accepted only at confirmation of accuracy and precision of the
method.
The above admissible values of R for determining the content of main substances
are significantly more than the critical values of the Pearson’s correlation coefficient
𝑅𝑅𝑐𝑐𝑐𝑐𝑖𝑖𝑖𝑖 used for theoretical estimates of existence of linear relationship between Y and
C. For example, for n = 5 points of experimental data for one-sided test with a proba-
bility P = 95,0% 𝑅𝑅𝑐𝑐𝑐𝑐𝑐𝑐𝑐𝑐 = 0,805, with P = 99,0% 𝑅𝑅𝑐𝑐𝑐𝑐𝑐𝑐𝑐𝑐 = 0,934, and with an increase in
the number of points, the value of 𝑅𝑅𝑐𝑐𝑐𝑐𝑐𝑐𝑐𝑐 only decreases (http://www.mag-
4
Fig. 1. The results of determining the dependence of the peak area (S) of the impurity on its
concentration (C) - above, and the corresponding residual plot - below, which formally indi-
cates a non-linear trend, however, this trend can be neglected from a practical point of view
(see text).
Plot of the dependence of Y/C vs. C
For assessing the linearity of the method instead of residuals plot, one can use
plot of Y/C on C called the “sensitivity plot” [6]. In accordance with the formula
Y=bC+a it can be expected that on the plot of Y/C on C in the case of linearity of the
method, the points will be located near the trend line ≈ b (characterizes the sensitivity
of method) and almost in parallel to the concentration axis. However, it should be borne
in mind that a necessary condition for estimating the linearity of the method according
to the Y/C on C versus C plot is the proximity of the free regression term a to zero,
since Y/C=b+a/C≈ b only for a ≈ 0. Therefore, before making plot of Y/C on C must
be verified that the free term of the regression equation is statistically insignificant. To
do this, you can use the student's t-test. The value of a is statistically insignificant (a≈0)
if [30]:
|𝑎𝑎|
𝑡𝑡𝑎𝑎 = < 𝑡𝑡 (P, f=n-2) (1)
𝑆𝑆𝑆𝑆𝑎𝑎
where 𝑡𝑡𝑎𝑎 is the estimated value of student's criterion, |𝑎𝑎| is the absolute value of free
member of linear dependence of Y on C; 𝑆𝑆𝑆𝑆𝑎𝑎 is the standard deviation of values 𝑎𝑎;
6
𝑡𝑡(P, f=n-2) is the tabulated value of student's t-test at a probability of P= 95%; f is the
number of degrees of freedom; n is the number of experimental points on regression
line.
Criteria for analysis of variance (ANOVA):
F-criteria and p-value
To verify the linearity of the method, sometimes used analysis of variance
(ANOVA), together with visual confirmation of the linearity of the dependence of Y
on C. The main are two versions of ANOVA.
• LOF-test (“Lack of fit test”): a test with a statistical calculation of the re-
sults of parallel measurements. Its potential advantage is the independence of the ac-
ceptance criterion from the accepted model of the relationship between Y and C. In this
variant several repeated measurements are carried out for each concentration, for ex-
ample, 3 injections in the case of the HPLC procedure [10]. At the same time, it is
recommended to use at least 8 concentrations in the intended range of the method [6].
The LOF test based on consideration of certain residuals and their squares, on
which the variability of total measurement error depends [6, 9, 24]:
𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅 = 𝑦𝑦𝑖𝑖𝑖𝑖 − 𝑦𝑦�𝑖𝑖 = 𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃 𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒 + 𝐿𝐿𝐿𝐿𝐿𝐿 𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒, (2)
𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃 𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒 = 𝑦𝑦𝑖𝑖,𝑗𝑗 − 𝑦𝑦�𝑖𝑖 ; 𝐿𝐿𝐿𝐿𝐿𝐿 𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒 = 𝑦𝑦�𝑖𝑖 − 𝑦𝑦�𝑖𝑖 (3)
where 𝑦𝑦𝑖𝑖,𝑗𝑗 is the result of 𝑗𝑗-th measurement at the point 𝑖𝑖 (i.e., at a concentration of 𝐶𝐶𝑖𝑖 );
𝑦𝑦� is the average value of measurement and 𝑦𝑦�𝑖𝑖 is the value of 𝑦𝑦 calculated on regression
equation at the point 𝑖𝑖. From (3) it can be seen that 𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅 is the deviation of meas-
ured value 𝑦𝑦𝑖𝑖𝑖𝑖 from predicted value 𝑦𝑦�𝑖𝑖 by regression equation at the point 𝑖𝑖. Residual
is the sum of Pure error (PE), which is a random error, does not depend on the regres-
sion model adopted and is the difference between the result of 𝑗𝑗-th measurement at the
point 𝑖𝑖 (𝑦𝑦𝑖𝑖𝑖𝑖 ) and the average value of the parallel measurements at point 𝑖𝑖 (𝑦𝑦�𝑖𝑖 ), and
LOF (“lack of fit error” [6]), which is the difference between the average value of the
parallel measurements at point 𝑖𝑖 (𝑦𝑦�𝑖𝑖 ) and the value of 𝑦𝑦 calculated on regression equa-
tion at the point 𝑖𝑖.
One can represent the variability of the total measurement error 𝑆𝑆𝑆𝑆𝑇𝑇 as a sum of
squares for the pure error (𝑆𝑆𝑆𝑆𝑃𝑃𝑃𝑃 ), 𝐿𝐿𝐿𝐿𝐿𝐿 (𝑆𝑆𝑆𝑆𝐿𝐿𝐿𝐿𝐿𝐿 ) and the factor associated with the re-
gression model (𝑆𝑆𝑆𝑆𝑅𝑅𝑅𝑅𝑅𝑅 ) [9]:
𝑘𝑘 𝑚𝑚𝑖𝑖
2
𝑆𝑆𝑆𝑆𝑇𝑇 = � ��𝑦𝑦𝑖𝑖,𝑗𝑗 − 𝑦𝑦�� =
𝑖𝑖=1 𝑗𝑗=1
𝑘𝑘 𝑚𝑚𝑖𝑖 𝑘𝑘 𝑘𝑘
𝑆𝑆𝑆𝑆𝑅𝑅𝑅𝑅𝑅𝑅 𝑆𝑆𝑆𝑆𝑅𝑅𝑅𝑅𝑅𝑅
𝑆𝑆𝑆𝑆𝑃𝑃𝑃𝑃 𝑆𝑆𝑆𝑆𝐿𝐿𝐿𝐿𝐿𝐿
7
parallel determinations, one can obtain 𝐹𝐹𝑒𝑒𝑒𝑒𝑒𝑒 ≤ 𝐹𝐹 (𝑃𝑃, 𝑓𝑓1 , 𝑓𝑓2 )), and at the same time, the
method may have the necessary precision [24, p. 103].
Fig. 2. Experimental (A) and corrected (by increasing the relative standard deviation of RSD
parallel measurements) - (B), the results of determining the dependence of the peak area (S)
of the API on concentration. In case (A), due to the very high reproducibility of parallel
measurements (RSD from 0.04% to 0.22%), the LOF test did not confirm linearity, although
the points on the residual plot ( ) indicated the linearity of the technique. With an increase
in RSD of parallel measurements to 0.54% -1.03% - case (B), there was only a slight change
in the parameters of the regression dependence of Y on C and the residual plot ( ) compared
to (A), however, this turned out to be enough so that the LOF test confirms the linearity of
the method (see text for explanation).
calculated value of F-criterion (𝑭𝑭𝒐𝒐 ; formula (6)) obtained using the function
=LINEST(range Y; range X; TRUE; TRUE) was greater than the critical value of the
Fisher criterion 𝐹𝐹𝑐𝑐𝑐𝑐𝑐𝑐𝑐𝑐 (calculated using the function =F.INV.RT(𝛼𝛼,1, k-2) and the calcu-
lated value of the criterion p (calculated using the function =F.DIST.RT(𝐹𝐹𝑐𝑐𝑐𝑐𝑐𝑐𝑐𝑐 ;1; k-2)
would be less than a given level of significance 𝛼𝛼 (usually 𝛼𝛼 = 0.05 or 𝛼𝛼 = 0.01, which
corresponds to a probability of 95% or 99%, respectively).
𝑆𝑆𝑆𝑆𝑆𝑆⁄1
𝐹𝐹𝑜𝑜 = , where (6)
𝑆𝑆𝑆𝑆𝑆𝑆 ⁄(𝑘𝑘−2)
𝑘𝑘 𝑘𝑘
If the requirements (9) and (12) are met the method is considered as linear. Note
that the correlation index 𝑅𝑅𝑐𝑐 does not depend on the model 𝑌𝑌 = 𝑓𝑓(𝐶𝐶) and therefore has
a significant advantage over the Pearson’s correlation coefficient 𝑅𝑅 in estimating the
linearity of the methods. This approach is considered most detail from a theoretical and
practical point of view in the book [8]. This book also contains the most complete table
with the critical values of the criteria considered above (right parts of formulas (12)
and (9) with 𝑏𝑏 ≈ 1). This facilitates the validation of methods for assay, uniformity of
dosage units and dissolution testing. Even though this approach is based on theoretical
assumptions and uses the criteria of "practical insignificance", it can be concluded,
based on an Internet search, that it is used only in validation methods in Ukraine and
partially in Russia and some other Post-Soviet states. Most researchers use ICH rec-
ommendations [1, 2].
Additional recommendations:
• If during the validation of the method its non-linearity is established, one
11
should try to reduce the range or change the range of the concentrations under consid-
eration or use weighted linear regression if necessary.
• To verify that a particular experimental data point 𝑦𝑦𝑗𝑗 is outlier relative to
the regression model, you can use the t-test [11]. Calculate the experimental value of
test:
|𝑦𝑦𝑖𝑖 −𝑦𝑦�𝑖𝑖 |𝑚𝑚𝑚𝑚𝑚𝑚
𝑡𝑡𝑒𝑒𝑒𝑒𝑒𝑒 = 2
(13)
��
1 �𝑦𝑦𝑗𝑗 −𝑦𝑦
𝑆𝑆𝑆𝑆0 ∙�1+ +
𝑛𝑛 (𝑛𝑛−1)∙𝑆𝑆𝑆𝑆2
𝑦𝑦
where |𝑦𝑦𝑖𝑖 − 𝑦𝑦�𝑖𝑖 |𝑚𝑚𝑚𝑚𝑚𝑚 is the maximum absolute value of the difference between the ex-
perimental 𝑦𝑦𝑖𝑖 and the calculated 𝑦𝑦�𝑖𝑖 (by regression) response value among all data points
(from 𝑖𝑖=1 to 𝑛𝑛); 𝑛𝑛 is the number of experimental points; 𝑦𝑦𝑗𝑗 is the experimental value
of the response at the potential outlier point 𝑗𝑗; 𝑦𝑦�= ∑ 𝑦𝑦𝑖𝑖 ⁄𝑛𝑛; 𝑆𝑆𝑆𝑆0 is the residual standard
deviation and 𝑆𝑆𝑆𝑆𝑦𝑦 is the standard deviation of the experimental response values 𝑦𝑦,
calculated by the formulas:
∑(𝑦𝑦𝑖𝑖 −𝑦𝑦�𝑖𝑖 )2 ∑(𝑦𝑦𝑖𝑖 −𝑦𝑦�)2
𝑆𝑆𝑆𝑆0 = � , 𝑆𝑆𝑆𝑆𝑦𝑦 = � (14)
𝑛𝑛−2 𝑛𝑛−1
Compare 𝑡𝑡𝑒𝑒𝑒𝑒𝑒𝑒 with the tabular value of Student's t(95%, f=n- 2). If 𝑡𝑡𝑒𝑒𝑒𝑒𝑒𝑒 is greater
than the tabular value of 𝑡𝑡(95%, f=n- 2), then with a 95% probability this point does
not correspond to the accepted regression model and is an outlier that requires 𝑦𝑦𝑗𝑗 to be
re-determined. An alternative to this test is the F-test given in [31].
CONCLUSION
The main criteria for assessing linearity at validation of analytical methods and
their limitations were considered. It is shown that these criteria separately cannot pro-
vide a reliable assessment of method linearity. Therefore, in practice, when validating
methods, it is necessary to prove (confirm) their linearity using two or three criteria (it
is more reliable) one of which has to be a plot of measured response (Y) on concentra-
tion (C) of the substance to be determined. On the other hand, as shown above when
using graphical and calculation criteria based on mathematical statistics one should not
formally approach the assessment of non-linearity of the method since they don’t take
into account the possibility of practical insignificance of small deviations from the lin-
ear dependence of Y on C.
REFERENCES
1. ICH Harmonised Tripartite Guideline. Validation of Analytical Procedures: Text
and Methodology Q2(R1). ICH, 2005; 13.
2. United States Pharmacopoeia. <1225> Validation of compendial procedures.
USP40–NF35. 2017; 1640-1646.
3. State Pharmacopoeia of the Russian Federation XIV edition, 2018, V.1; 276-288 (In
Russ.).
12