Professional Documents
Culture Documents
Stochastic Ranking and Dominance in DEA
Stochastic Ranking and Dominance in DEA
PII: S0925-5273(19)30127-6
DOI: https://doi.org/10.1016/j.ijpe.2019.04.004
Reference: PROECO 7345
Please cite this article as: Davtalab-Olyaie, M., Asgharian, M., Nia, V.P., Stochastic ranking and
dominance in DEA, International Journal of Production Economics (2019), doi: https://doi.org/10.1016/
j.ijpe.2019.04.004.
This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to
our customers we are providing this early version of the manuscript. The manuscript will undergo
copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please
note that during the production process errors may be discovered which could affect the content, and all
legal disclaimers that apply to the journal pertain.
ACCEPTED MANUSCRIPT
PT
Mostafa Davtalab-Olyaie ∗, Masoud Asgharian†, and Vahid Partovi Nia‡
RI
SC
Abstract
Data Envelopment Analysis (DEA) requires deterministic input/output data for effi-
U
ciency evaluation of a set of Decision Making Units (DMUs). When there are more
AN
than one set of input/output data for each DMU, however, such requirement is infeasi-
M
ble. Stochastic DEA (SDEA), where input/output data are assumed to be stochastic,
naturally calls for ranking methods that can account for stochastic fluctuations of the in-
TE
put/output, and hence the efficiency score. None of the proposed methods in the current
literature incorporates all the information in the efficiency score distributions for ranking
EP
DMUs. To fill this gap, we introduce two ranking methods, a partial and a linear, for
C
performance evaluation in SDEA using the reliability function of the efficiency scores.
AC
Our proposed partial ranking is based on the notion of stochastic ordering, while our
∗
Correspondence: Mostafa Davtalab-Olyaie, Department of Applied Mathematics, Faculty of Math-
m.davtalab-olyaie@kashanu.ac.ir).
†
Dept of Math & Stat, McGill Univ, Burnside Hall, Room 1224, 805 Sherbrooke Street West, Montreal,
1
ACCEPTED MANUSCRIPT
linear ordering is a weighted average of the reliability function of the efficiency scores.
Special cases of our proposed linear ranking method include mean and median order-
ing in SDEA. Our proposed partial ordering provides a notion for stochastic dominance
PT
using which one can define a natural notion of admissibility as a minimal performance
RI
and illustrate the methods using the Grundfeld data, analyzed using both parametric
SC
Keywords: Data Envelopment Analysis; Stochastic Frontier Analysis; Reliability Func-
U
tion; Stochastic Ranking; Admissibility.
AN
M
D
TE
C EP
AC
2
ACCEPTED MANUSCRIPT
PT
RI
SC
Abstract
Data Envelopment Analysis (DEA) requires deterministic input/output data for effi-
U
ciency evaluation of a set of Decision Making Units (DMUs). When there are more
AN
than one set of input/output data for each DMU, however, such requirement is infeasi-
M
ble. Stochastic DEA (SDEA), where input/output data are assumed to be stochastic,
naturally calls for ranking methods that can account for stochastic fluctuations of the in-
TE
put/output, and hence the efficiency score. None of the proposed methods in the current
literature incorporates all the information in the efficiency score distributions for ranking
EP
DMUs. To fill this gap, we introduce two ranking methods, a partial and a linear, for
C
performance evaluation in SDEA using the reliability function of the efficiency scores.
AC
Our proposed partial ranking is based on the notion of stochastic ordering, while our
linear ordering is a weighted average of the reliability function of the efficiency scores.
Special cases of our proposed linear ranking method include mean and median order-
ing in SDEA. Our proposed partial ordering provides a notion for stochastic dominance
using which one can define a natural notion of admissibility as a minimal performance
1
ACCEPTED MANUSCRIPT
and illustrate the methods using the Grundfeld data, analyzed using both parametric
PT
tion; Stochastic Ranking; Admissibility.
RI
1 Introduction
SC
Efficiency evaluation of decision making units (DMUs) is often a question of prime
interest in many areas of application ranging from banking, business and economy to
U
health care. Efficiency analysis concerns the performance of each unit in transforming
AN
their inputs into quantities of outputs. The relative comparison in efficiency analysis
M
is examined against the efficient production frontier. In fact, the efficiency is measured
based on the deviation of the position of a specific DMU from the efficient frontier. The
D
are two approaches often used in estimating the production frontier, parametric and
TE
postulates a known functional form for the production frontier beforehand. The non-
EP
parametric known as data envelopment analysis (DEA) evaluates the relative efficiency
C
In many real applications the inputs and outputs of DMUs are subject to techno-
logical uncertainties. This can occur when the observed data are collected over several
time periods. Stochastic models where the inputs and/or outputs are considered to be
2
ACCEPTED MANUSCRIPT
ations when analyzing such data. When inputs and/or outputs are random then so is
the DEA efficiency scores of DMUs. Thus, the DEA efficiency score of each DMU has a
distribution function.
PT
As discussed in the literature review, there are several approaches in DEA to rank
DMUs when data are known with certainty, such as the cross-efficiency evaluation, su-
RI
per efficiency methods, among others. When the inputs and/or outputs are random,
some probabilistic, chance constraint models, and statistical methods, such as summery
SC
statistics of the efficiency score distribution and bootstrapping method, have been pro-
U
posed to account for uncertainties. However, none of these approaches incorporates all
AN
the information of the efficiency score distributions for ranking and ordering DMUs. To
fill this gap, we use the notions and tools from reliability and statistical decision theory
M
and introduce ranking methods in stochastic DEA.
D
Our starting point is the estimation of the DEA efficiency distributions of DMUs.
TE
Having estimated the efficiency distributions, we propose two ranking methods using
the estimated efficiency distributions. Our first ranking method is a partial ranking
EP
method using the notion of stochastic ordering that encompasses all the information of
of the reliability function, the complementary distribution function that is. This partial
AC
see Definition 2. Using the notion of admissibility one can categorize DMUs into two
distribution of the DEA efficiency. As observed by Simar and Wilson (2000, 2007) and
3
ACCEPTED MANUSCRIPT
Kao and Liu (2009), the distribution of the DEA efficiency estimator has a mixture
structure with a point mass at 1, i.e., pδ1 + (1 − p)g where δ1 is the Dirac delta function
at point 1, g is a continuous density on (0, 1) and 0 < p < 1. We formalize this result
PT
in Theorem 2, showing that the DEA efficiency distribution does not have a continuous
distribution even if both the random input and output variables are continuous. Using
RI
the point mass decomposition of the DEA efficiency distribution (Theorem 2), we then
provide a linear ranking method and some conditions to check stochastic ranking and a
SC
sufficient condition for admissibility.
U
We also propose a linear ordering method based on a weighted average of the reli-
AN
ability function of efficiency scores. Special cases of this ranking method include mean
and median ranking. This ranking method may also be viewed as an interactive ranking
M
where one incorporates prior knowledge about possible performance of different DMUs
D
or how much inefficiency can be tolerated. Such information may be available to a man-
TE
ager and can be formulated in terms of a weight function which can take the production
For implementation of the proposed ranking methods we need to estimate the ef-
ficiency score distribution of each DMU. The estimation of the efficiency score distri-
C
butions of DMUs comprises two steps; first using a performance evaluation method to
AC
generate a sample from efficiency score distribution of each DMU, and second an esti-
mation method to estimate the efficiency score distribution for each DMU. To this end,
one can use DEA, SFA, etc, for step 1, and empirical cumulative distribution function
efficiency evaluation approaches to obtain a sample from the efficiency score distribu-
4
ACCEPTED MANUSCRIPT
tion of each DMU. We then use both empirical and model-based estimation methods to
The rest of this article is organized as follows. Section 2 presents the related liter-
PT
ature. The notion of stochastic ordering and admissibility are presented in Section 3.
The proposed linear ranking method is also presented in this section. Implementation
RI
of the proposed ranking methods is discussed in Section 4. A simple sufficient condi-
tion for admissibility in terms of the point-mass magnitude is presented in this section.
SC
We illustrate our methods using Grundfeld data in Section 5 where we use the Hasse
U
diagram and graphical tools to visualize the result of our stochastic ranking. The last
AN
section contains the conclusion and some closing remarks. Proofs of the theorems are in
Appendix I.
M
D
2 Literature Review
TE
we will review some of the models proposed in the literature to handle the presence
of stochastic elements. As mentioned above, there are two main approaches for es-
EP
timating the production frontier to evaluate the efficiency of DMUs, parametric and
C
in a stochastic environment falls into the realm of the parametric analysis. Aigner et al.
(1977) and Meeusen and van den Broeck (1977) proposed some models by measuring
one-sided error caused by general statistical noise known as stochastic frontier approach.
Battese and Coelli (1995) also proposed a stochastic frontier production function model
for panel data on firms, in which the non-negative technical inefficiency effects are as-
5
ACCEPTED MANUSCRIPT
sumed to be a function of firm-specific variables and time. The reader can refer to
Kumbhakar and Lovell (2000) for a review on theoretical and practical aspects of effi-
PT
The nonparametric approach known as the Data Envelopment Analysis (DEA), in-
troduced by Charnes et al. (1978, 1979) and extended by Banker et al. (1984), offers
RI
a method widely used for estimating the efficiency of a set of multi-input multi-output
DMUs. Following the criticisms of DEA raised by Schmidt (1985) and echoed further
SC
by Greene (1993), for the lack of solid statistical foundation, two different perspectives
U
based on the source of the randomness, inefficiency versus inefficiency and noise, were
AN
introduced in the literature to fill the gap. The former assumes that there is no noise in
the data generating process (DGP) and all possible realizations belong to the production
M
possibility set (PPS). Therefore the distance from the frontier just indicates the ineffi-
D
ciency term. The latter assumes that there is noise in the DGP, and hence the distance
TE
Banker (1993) established the first building block of a solid statistical foundation for
EP
DEA by showing that the DEA estimator is essentially the maximum likelihood estimator
of the production function under certain conditions. Gijbels et al. (1999) provided the
C
asymptotic distribution of DEA estimator in the case of the single input and output.
AC
Kneip et al. (1998) generalized this result to the multiple inputs and outputs case.
Simar and Wilson (1998) and Simar and Wilson (2000) suggested bootstrap techniques
for evaluating the sampling variability of the efficiency estimator. Kneip et al. (2008)
provided a full theory on the asymptotic properties of DEA estimator and a double-
smooth bootstrap technique. Kneip et al. (2011) presented a simplified and consistent
6
ACCEPTED MANUSCRIPT
version of the double-smooth bootstrap method developed by Kneip et al. (2008). Simar
and Wilson (2007) proposed a two-stage method for estimation and statistical inference
on the efficiency score using single and double bootstrap techniques; and Barros and
PT
Peypoch (2009) applied this method to evaluate European airlines. Hall and Simar
(2002) showed a fully nonparametric model with noise and inefficiency is not identifiable
RI
and proposed a method that allows for introduction of noise into the model. Simar (2007)
extended these ideas to multivariate setting. Having taken a different perspective, Gong
SC
and Sun (1995) proposed some approaches to measure the relative efficiency of DMUs via
U
estimating the performance of one random DMU with respect to a set of deterministic
AN
DMUs. A thorough review of the subject can be found in Simar and Wilson (2015).
Another common approach to handle the uncertainty is via chance constrained mod-
M
els where the random Production Possibility Set (PPS) is replaced by an average PPS
D
where the average is in the sense of Vorob’ev (1984). There has been a surge of articles
TE
on chance constrained models over the past two decades, see for example Land et al.
(1993), Olesen and Petersen (1995), Cooper et al. (1998), and Bruni et al. (2009), among
EP
others. A recent review of this subject can be found in Cooper et al. (2011). The effi-
ciency measured with respect to the average PPS is a fixed value. As discussed by Kao
C
and Liu (2009) the inherent random fluctuation of the efficiency score, caused by the
AC
random nature of the input and output variables, cannot be captured using the chance
constrained models.
Kao and Liu (2009) discussed how to obtain the DEA efficiency distributions of
each DMU via a simulation technique, and used the mean of these distributions to rank
the DMUs. Lamb and Tee (2012) derived confidence intervals for the DEA efficiency
7
ACCEPTED MANUSCRIPT
ranking methods are all based on a summary of the DEA efficiency distributions.
As reviewed in Adler et al. (2002), Angulo-Meza and Lins (2002) and Aldamak and
PT
Zolfaghari (2017), there are different categories of ranking methods in deterministic DEA.
For instance, the cross-efficiency evaluation (see Sexton et al. (1986), Liang et al. (2008),
RI
Wang and Chin (2010) and Davtalab-Olyaie (2018)), supper efficiency (see Andersen
and Petersen (1993), Mehrabian et al. (1999), and Chen et al. (2013)), and statistical
SC
technique (see Friedman and Sinuany-Stern (1997), Sinuany-Stern and Friedman (1998),
U
and Sinuany-Stern and Friedman (2016)), among others.
AN
3 Ranking using stochastic ordering
M
We first recall some basic concepts in efficiency analysis of DMUs. Consider a set of n
D
m s
DMUs, each using m inputs, x ∈ R+ , to produce s outputs, y ∈ R+ . The Production
TE
The frontier of Ψ, denoted by ∂Ψ, is called the production function. The set Ψ can be
AC
m s
X(y) = {x ∈ R+ | (x, y) ∈ Ψ} Y (x) = {y ∈ R+ | (x, y) ∈ Ψ}. (1)
8
ACCEPTED MANUSCRIPT
∂X(y) = {x | x ∈ X(y), θx ∈
/ X(y) ∀0 < θ < 1} (2)
∂Y (x) = {y | y ∈ Y (x), φy ∈
/ Y (x) ∀φ > 1}, (3)
PT
RI
using which Farrell input and output efficiency measures, θj and φj can be defined for
SC
3.1 Stochastic ordering
U
AN
When inputs or outputs of DMUs are random variables, the θj = θ(xj , yj ) will also be
a random variable. To distinguish between random variables and their observed values,
M
we use capital letters for random variables, while retaining small letters for the observed
values. Suppose Θj = Θ(xj , yj ) is the efficiency of DMUj . Let FΘj (·) and SΘj (·) be
D
respectively the cumulative distribution function (cdf) and the reliability function (the
TE
complementary cdf) of Θj . One may use different measures of central tendency, such
EP
as mean, median or quantiles of FΘj (·), for j = 1, . . . , n to rank DMUs. These ranking
methods may be called, mean, median and quantile ranking. While the ranking methods
C
are all based on a summary of FΘj (·), borrowing ideas from reliability and decision theory,
AC
one can consider the so-called stochastic ordering using the whole distribution of Θj , i.e.,
efficient than, DMUj 0 on ∆ ⊆ [0, 1], denoted by Θj ∆ Θj 0 , if Sθj (θ) ≥ Sθj 0 (θ), for all θ ∈ ∆,
and the inequality is strict at some point in ∆. In particular, if ∆ = [0, 1], we write Θj Θj 0 ,
9
ACCEPTED MANUSCRIPT
It is reasonable to prefer DMUj over DMUj 0 if Θj > Θj 0 is more likely to happen than
its reverse. The following calculation shows that stochastic dominance is a sufficient
PT
condition for this intuitive preference, i.e., P (Θj > Θj 0 ) > 1/2 if Θj Θj 0 , Θj is
RI
Z ∞ Z ∞ Z ∞
SC
{SΘj (θ) − SΘj0 (θ)}dFΘj (θ) = SΘj (θ)dFΘj (θ) − SΘj0 (θ)dFΘj (θ)
−∞ −∞ −∞
U
which implies AN
M
Z ∞
P (Θj > Θj 0 ) = 1/2 + {SΘj (θ) − SΘj0 (θ)}dFΘj (θ). (4)
−∞
D
Now continuity of the reliability functions and stochastic dominance (Θj Θj 0 ) imply
TE
Figure 1 illustrates the notion of stochastic ordering. It depicts the probability density
functions (pdf), f (θ), and the reliability functions, S(θ), of the efficiency of two DMUs.
C
We notice that while the pdfs are overlapping (left panel), the reliability function of
AC
DMU1 , the solid curve, is always below the reliability function of DMU2 , the dashed
DMU2 . In other words, for any given efficiency level ξ, the efficiency of DMU2 has a
greater chance to be above ξ than the efficiency of DMU1 . That is, the performance
10
ACCEPTED MANUSCRIPT
2.5
1.0
2.0
0.8
1.5
0.6
)
)
S(
f(
1.0
0.4
PT
0.5
0.2
0.0
0.0
RI
0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0
SC
Figure 1: Stochastic dominance - density (left) and reliability (right) functions of efficiency scores.
U
being above an efficiency threshold.
AN
Let the mean of the random variable Θj be denoted by E(Θj ) and its β-quantile by
Ẽβ (Θj ), where 0 < β < 1. Each of these quantities can be used for a linear (total)
M
ordering of DMUs. For instance, mean ranking can be performed based on E(Θj ) and
β-quantile ranking based on Ẽβ (Θj ). As a special case using Ẽβ=0.5 (Θj ), one can order
D
DMUs based on the median of their efficiency distributions. It follows from Definition 1
TE
that ranking using stochastic ordering implies mean, median and quantile ranking. The
EP
Remark 1. A simple partial reverse connection between ranking based on quantiles and stochas-
C
tic ordering immediately follows. If for all 0 < β < 1, Ẽβ (Θj ) > Ẽβ (Θ0j ), then Θj Θj 0 . This
AC
and distinguish inadmissible DMUs from the admissible ones in Ψ, we need the following
{SΘ | Θ ∈ Γ}.
11
ACCEPTED MANUSCRIPT
respect to G ⊆ F, if there is no S∗ ∈ G such that S∗ (θ) ≥ S(θ) for all θ ∈ [0, 1], and the
PT
3.2 General linear ranking method
RI
Stochastic ordering provides a partial ordering of the DMUs based on their efficiency
SC
distributions. A linear (complete) ordering can be defined using a weighted average
of the reliability distribution of the efficiency score. This linear ranking method is a
U
generalization of the mean ranking and can be appealing when, for example, possible
AN
preferences and tolerance of a manager is available. Such preference and tolerance can
modelled using information given by a manager for DMUj . Let Ij = Iπj (SΘj ) =
TE
R
θ
SΘj (θ)πj (dθ). Then Ij can be interpreted as an interactive efficiency score accord-
we only consider the common weight function for all DMUs, i.e. we assume πj = π, for
C
j = 1, 2, . . . , n.
AC
Definition 3. We say DMUj is interactively more efficient than DMUj 0 , if Iπ (SΘj ) ≥ Iπ (SΘj 0 ).
Although a linear (complete) ordering of DMUs can also be achieved using measures
of central tendencies such as mean and median of the efficiency score distribution, ranking
levels of inefficiencies. A manager can specify different weights over different regions in
12
ACCEPTED MANUSCRIPT
[0, 1] through the probability measure π. The following theorem establishes a close tie
between admissibility and ranking using Ij . The proof of the theorem is similar to that
PT
Theorem 1. SΘj ∈ G is admissible with respect to G, if there exists a π such that Iπ (SΘj ) >
RI
SC
4 Implementing ranking methods
U
To implement the ideas developed in the previous section, one can measure efficiency
AN
using DEA based on information on a finite number, say n, of DMUs. Motivated by
the setting of our data example, we explain the implementation of the proposed ranking
M
methods using panel data. In what follows, the (m + s)-vector zjt denotes the vector of
t
j=1 j=1 j=1
AC
where L(0 ≤ L ≤ 1) and U (U ≥ 0) are lower and upper bounds for the sum of λj .
et al., 1978); while setting L = U = 1, variable returns to scale assumption, gives ΨBCC
(Banker et al., 1984). The frontier of ΨDEA , ∂ΨDEA , provides an estimate of ∂Ψ, the
production function. Should we take ΨCCR , for instance, we can evaluate the relative
13
ACCEPTED MANUSCRIPT
PT
j=1
Xn
λj yrj ≥ yro , r = 1, . . . , s
RI
j=1
λj ≥ 0, j = 1, . . . , n.
SC
If θo∗ = 1, then DMUo is CCR-efficient.
U
n
Let θjt be the efficiency score of DMUj at time t using available information on n
AN
DMUs. The Empirical Cumulative Distribution Function (ECDF) of Θnj , i.e. the cdf of
n
θjt , t = 1, 2, . . . , T , is a natural estimate for FΘnj , the cdf of the efficiency score of DMUj
M
using information on n DMUs. We denote this estimate by FbT,Θnj . Under relatively mild
D
n
The fundamental underlying assumption for the consistency is that θjt , t = 1, 2, . . . , T
The above approach is a nonparametric approach. One can also take a parametric
approach by specifying a parametric form for the data generating process. Kao and Liu
C
(2009) have taken such an approach. We try both the nonparametric and the parametric
AC
The following result shows that when there are finitely many DMUs, there is at least
one DMU whose efficiency score has a mixture structure with a point mass at 1. This
is true irrespective of the input and output variables being discrete or continuous. This
point has also been implicitly mentioned by Simar and Wilson (2007) and Kao and Liu
14
ACCEPTED MANUSCRIPT
Theorem 2. Let Θnj be the efficiency score of DMUj , for j = 1, . . . , n. Then there is at least
PT
Let FΘnj be the cumulative distribution function of Θnj , which is an estimate of FΘj .
RI
SC
SΘnj (θ) = pj + (1 − pj )SΘ<nj (θ), (7)
U
where pnj = P (Θnj = 1), and SΘnj (θ) = 1 − FΘnj (θ). Similarly we define SΘ<nj (θ) =
AN
1 − FΘ<nj (θ), where FΘ<nj is the cdf of the inefficiency component of the DEA efficiency
score distribution. In the other words, FΘ<nj is the cdf of Θnj when Θnj < 1.
M
The point mass decomposition structure of DEA efficiency distribution can be used
D
to introduce a further simple ranking method using the point mass at 1. DMUs can be
TE
ranked according to the point mass of their DEA efficiencies at 1, the greater the point
mass of a DMU at 1, the higher the ranking of the DMU. We call this ranking method
EP
p-ranking. The p-ranking method is perhaps the simplest method of ranking among
can be cumbersome. Using the mass point decomposition of the efficiency distribution,
AC
equation (7), the following result whose proof is given in Appendix I can be established.
√
Theorem 3. If pno > 3 − 6, then DMUo is admissible.
In real application the value of pn0 is estimated from the available data. Caution
should be taken when the estimated value is used to make conclusion about admissibility
of a DMU. This is, particularly, so when the sample size, T , is small. As a minimum
15
ACCEPTED MANUSCRIPT
requirement we suggest that the lower bound of the confidence interval for pn0 be clearly
√
above 3- 6 ≈ 0.55.
PT
5 Application of the method in the parametric and
nonparametric setting
RI
SC
We illustrate the methodologies developed in the previous sections using the Grundfeld
data of Greene (2011)1 . The data is on 10 firms where each firm uses two inputs (x1 =
U
market value of the firm at the end of the previous year (Fit ), and x2 = value of the stock
AN
of plant and equipment at the end of the previous year (Cit)) to produce one output
(y = gross investment (Iit)). These information were collected on yearly basis for each
M
firm over a period of 20 years (1935–1954). We first apply our ranking methods on the
D
results obtained from DEA estimator. We then use SFA model to estimate efficiency of
TE
To analyze Grundfeld data, we first calculate the efficiency of each firm using CCR model
for 20 consecutive years. Figure 2 below shows the box-plot and the trend of efficiency
ply the proposed ordering methods to rank DMUs using FbT,Θnj . Figure 3 depicts the
1
The data is available online through
http://people.stern.nyu.edu/wgreene/Text/tables/Grunfeld.txt.
16
ACCEPTED MANUSCRIPT
PT
RI
1.0
DMU1
SC
DMU2
DMU3
0.8
DMU4
Relative Efficiency
DMU5
DMU6
0.6
DMU7
U
DMU8
DMU9
DMU10
0.4
AN
0.2
M
0.0
5 10 15 20 25
TIME
D
Figure 2: Grundfeld data - Box-plot and efficiency pattern of the firms between 1935-1954.
TE
cannot easily rank firms based on their 20-year performance using a linear ordering
C
method. For example, the median ordering according to observed data cannot distinguish
AC
between DMU2 and DMU7 , and p-ordering can not rank DMU3 , DMU4 , DMU6 , DMU9
and DMU10 . Besides, linear ordering methods do not use all the information in the
efficiency score distributions of DMUs, and hence cannot capture the whole picture.
For example, DMU1 has a higher ranking in mean ordering in comparison with DMU5 ,
but it cannot stochastically dominate DMU5 (see Column 5 of Table 1). Moreover, the
17
ACCEPTED MANUSCRIPT
DMU 1 DMU 2
1.0
1.0
0.8
0.8
Reliability function
Reliability function
0.6
0.6
0.4
0.4
0.2
0.2
0.0
0.0
PT
0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0
Efficiency levels Efficiency levels
DMU 3 DMU 4
RI
1.0
1.0
0.8
0.8
Reliability function
Reliability function
0.6
0.6
0.4
0.4
SC
0.2
0.2
0.0
0.0
0.00 0.05 0.10 0.15 0.20 0.25 0.30 0.35 0.0 0.2 0.4 0.6 0.8
Efficiency levels Efficiency levels
U
DMU 5 DMU 6
1.0
1.0
AN
0.8
0.8
Reliability function
Reliability function
0.6
0.6
0.4
0.4
0.2
0.2
M
0.0
0.0
0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8
Efficiency levels Efficiency levels
D
DMU 7 DMU 8
1.0
1.0
TE
0.8
0.8
Reliability function
Reliability function
0.6
0.6
0.4
0.4
0.2
0.2
EP
0.0
0.0
0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0
Efficiency levels Efficiency levels
DMU 9 DMU 10
C
1.0
1.0
0.8
0.8
Reliability function
Reliability function
AC 0.6
0.6
0.4
0.4
0.2
0.2
0.0
0.0
0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.0 0.1 0.2 0.3 0.4 0.5 0.6
Efficiency levels Efficiency levels
Figure 3: Grundfeld data - empirical estimates of the reliability functions of the CCR efficiency of 10 firms
over 20 years with 95% CI.
18
ACCEPTED MANUSCRIPT
PT
7 1.00 (1) 0.926 (2) 0.70 (2) {3, 4, 6, 8, 9, 10} 0.815 (2) 0.763 (2)
8 0.436 (7) 0.494 (7) 0.15 (5) {3, 10} 0.159(5) 0.147 (5)
9 0.454 (6) 0.486 (8) 0.00 (6) {3, 10} 0.024 (8) 0.00 (8)
10 0.309 (8) 0.327 (9) 0.00 (6) {} 0.009 (9) 0.00 (8)
RI
Table 1: Grundfeld data - partial and linear ranking using different empirical summaries of efficiencies obtained
from DEA estimator.
SC
rankings of DMUs is sensitive to which linear ordering methods have been used in our
U
analysis, as seen in Table 1, there are different rankings of DMUs according to different
AN
linear ranking methods. For instance, DMU8 is ranked 5th using π1 in the weighted
average of its reliability distribution of the efficiency score, Column 6, while it is ranked
M
8th using the same weight in the weighted average method (mean ordering), Column
3. In contrast, using stochastic ordering we observe that DMU2 dominates all the other
D
DMUs, and hence has the highest ordering using all the ranking methods.This is so since
TE
stochastic ordering use all the information of the estimated reliability functions. This
EP
means that DMU2 is the only admissible DMU. The inadmissibility of DMU7 calls for a
careful attention since the estimated value for pn7 is 0.7. Using Theorem 3, pnj ≥ 0.55 is
C
a sufficient condition for admissibility. We emphasize that 0.7 is only an estimate of pn7
AC
based on 20 observations, and not the actual value of pn7 . A closer inspection of Figure 3
actually shows that the lower bound of the confidence interval for pn7 is 0.525 which is
19
ACCEPTED MANUSCRIPT
In a model-based approach we typically assume that the form of the data generating
process is know up to finitely many unknown parameters. The specific form chosen for
PT
the data analysis may be motivated by prior information about the inputs/outputs data
or using methods like maximum entropy. Should one take this latter approach, one may
RI
assume a uniform distribution for each input/output of each DMU over their observed
SC
range. To allow some flexibility, one may use beta distribution and use some calibration
by estimating the unknown parameters for each input/output of each DMU using the
U
available data. We refer the reader to Law and Kelton (2000) and Kao and Liu (2009)
AN
for further discussion on the choice of a parametric model.
parameter values. The distribution parameters of every input/output of every DMU are
TE
estimated using the input/output information collected for every DMU over the period
1935-1954.
EP
Given that there are two inputs and one output for each DMU and there is a to-
tal of ten DMU, we have 30 beta distributions. The α1 and α2 parameters of these
C
beta distributions are estimated using the observed data over 20 years. Note that the
AC
standard beta distribution has a domain of [0, 1]. We use a linear transformation for
each input/output of each DMU to produce a beta distribution whose support is the
range covered by the observed minimum and maximum of that input/output. For a
generalized beta distribution defined on the interval [a, b], one can estimate α1 and α2
for each of the 30 distributions using α1 = λ(µ − a)/b − a, α2 = λ(b − µ)/b − a, where
20
ACCEPTED MANUSCRIPT
λ = ((µ − a)(b − µ)/σ 2 ) − 1 where µ and σ 2 are the mean and variance estimated for
Having estimated the parameters of the beta distribution for each input/output, we
PT
generate B = 2000 data sets by generating B observations from the 30 beta distributions
as described above. We then generate 2000 efficiency score for each DMU by applying
RI
CCR model for each data set.
The generated efficiency samples can then be used to estimate pnj and the reliability
SC
function SΘ<nj (.) for DMUj , for j = 1, 2, . . . , 10. The reliability function of the efficiency
U
score of each DMU along with their 95% confidence bounds are depicted in Figure 4.
AN
The codes are implemented in R package (R Development Core Team, 2005) using the
package Benchmarking (Bogetoft and Otto, 2010). We have summarized our findings
M
using this model-based approach in Table 2. Columns 2–4 in Table 2 provide the results
D
1 0.855 (3) 0.729 (4) 0.444 (3) {3, 4, 6, 8, 9, 10} 0.544 (3) 0.430 (3)
2 1.000 (1) 0.878 (1) 0.603 (1) {1, 3, 4, 6, 8, 9, 10} 0.758 (1) 0.603 (1)
3 0.220 (9) 0.270 (10) 0.010 (10) {} 0.029(10) 0.017 (10)
4 0.662 (5) 0.661 (5) 0.326 (5) {3, 6, 8, 9, 10} 0.429 (5) 0.331 (5)
{3, 4, 6, 8, 9, 10}
C
5 0.804 (4) 0.764 (3) 0.341 (4) 0.510 (4) 0.379 (4)
6 0.559 (6) 0.598 (6) 0.276 (6) {3, 8, 10} 0.362 (6) 0.278 (6)
7 0.972 (2) 0.824 (2) 0.482 (2) {1, 3, 4, 6, 8, 9, 10} 0.638 (2) 0.496 (2)
AC
8 0.349 (8) 0.447 (7) 0.145 (7) {3, 10} 0.198 (7) 0.148 (7)
9 0.386 (7) 0.430 (8) 0.022 (9) {3} 0.076 (8) 0.040 (8)
10 0.217 (10) 0.283 (9) 0.028 (8) {} 0.053 (9) 0.035 (9)
Table 2: Grundfeld data - partial and linear ranking using beta distribution.
Clearly those DMUs whose efficiency scores have a greater chance to be near 1 are
more preferred. Hence, the DMUs whose efficiency score distribution have heavier right
21
ACCEPTED MANUSCRIPT
DMU 1 DMU 2
1.0
1.0
0.8
0.8
Reliability function
Reliability function
0.6
0.6
0.4
0.4
0.2
0.2
0.0
0.0
PT
0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0
Efficiency levels Efficiency levels
DMU 3 DMU 4
RI
1.0
1.0
0.8
0.8
Reliability function
Reliability function
0.6
0.6
0.4
0.4
SC
0.2
0.2
0.0
0.0
0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0
Efficiency levels Efficiency levels
U
DMU 5 DMU 6
1.0
1.0
AN
0.8
0.8
Reliability function
Reliability function
0.6
0.6
0.4
0.4
0.2
0.2
M
0.0
0.0
0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0
Efficiency levels Efficiency levels
D
DMU 7 DMU 8
1.0
1.0
TE
0.8
0.8
Reliability function
Reliability function
0.6
0.6
0.4
0.4
0.2
0.2
EP
0.0
0.0
0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0
Efficiency levels Efficiency levels
DMU 9 DMU 10
C
1.0
1.0
0.8
0.8
Reliability function
Reliability function
AC 0.6
0.6
0.4
0.4
0.2
0.2
0.0
0.0
0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0
Efficiency levels Efficiency levels
Figure 4: Grundfeld data - model-based estimates of the reliability functions of the CCR efficiencies distribu-
tions with 95% CI.
22
ACCEPTED MANUSCRIPT
tail are more likely to be efficient, and therefore they are more preferred. As seen
in Figure 4, the reliability functions of the efficiencies of DMU2 and DMU7 have the
top two heaviest right tails among the ten DMUs while DMU3 and DMU10 have the
PT
lightest right tails. Thus DMU2 and DMU7 are clearly have the best performance while
DMU3 and DMU10 have the worst. The linear ordering methods reported in Table 2
RI
indicates that DMU2 has the best while DMU3 has the worst performance among the
ten DMUs. It should be noted that the results of stochastic ordering, column 5 of
SC
Table 2, is consistent with Figure 4. For example, as seen in Figure 4 the reliability
U
functions of the efficiencies of DMU2 and DMU7 are superior to that of other DMUs
AN
except DMU5 . The results of median and mean ordering are almost the same, but there
are some differences. For example, DMU5 has a higher ranking than DMU1 using mean
M
ordering, while using median ordering DMU1 is ranked higher than DMU5 . As seen
D
in Table 2, each DMU has a superior performance compared to all of its stochastically
TE
dominated DMUs in all ranking methods. For example DMU5 stochastically dominates
all ranking methods compared with all DMUs in that set. As discussed in the previous
section, linear ordering methods do not use all the information in the efficiency score
C
distributions of DMUs, and hence cannot capture the whole picture. For example, there
AC
are some DMUs which are unordered by DMU5 , {DM U1 , DM U2 , DM U7 }, such that
these DMUs may have a better rank using some ranking methods than DMU5 . As seen
in Table 2, DMU2 and DMU7 have a better ranks in all ranking method than DMU5 ,
while DMU5 is better than DMU1 just in mean ordering. This indicates that although
23
ACCEPTED MANUSCRIPT
median and p-ordering methods. As seen in Table 2, DMU1 and DMU5 are unordered,
but p-ordering assigns a better rank for DMU1 than DMU5 . It means that DMU1 is
PT
5.1.3 Interactive ordering
RI
Given that stochastic ordering provides a partial ordering of the DMUs, several DMUs
naturally remain unordered. The preference of a manager can be used to achieve a linear
SC
(complete) ordering of DMUs by averaging the reliability functions of the efficiency scores
U
with respect to the weights (preferences and tolerance) of the manager. For instance,
AN
suppose that a manager accepts DMUs whose efficiencies are above 0.8 (high range
efficiency) majority of time (60%), and he/she can tolerate low range of efficiency (below
M
0.5) 10% of the time; and the rest of time the manager can tolerate a mid range efficiency
The task is then to devise a distribution that fulfills these preferences, i.e. putting
TE
60% of its mass between [0.8, 1], 30% between [0.5, 0.8] and the rest between [0, 0.5]. It
EP
is clear that there are infinitely many distributions that can fulfill such constraints. A
24
ACCEPTED MANUSCRIPT
where U[a,b] is the Uniform distribution over [a, b]. In other words,
1
5
if θ ∈ [0, 0.5)
π1 (θ) = 1 if θ ∈ [0.5, 0.8) (8)
PT
3 if θ ∈ [0.8, 1].
RI
The probability density function (pdf) π1 is non-informative on each subregion. One
SC
may want to introduce a distribution that is increasing over each sub region, i.e. giving
more priority on the right end point of each subregion, while fulfilling the constraints.
U
To this end, we determine ai , bi for i = 1, 2, 3 such that the following distribution fulfills
the conditions.
AN
a1 θ + b1 if θ ∈ [0, 0.5)
M
π2 (θ) = a2 θ + b2 if θ ∈ [0.5, 0.8) (9)
a3 θ + b3 if θ ∈ [0.8, 1].
D
TE
Using π2 (θ) we can calculate the plug-in estimate of the interactive efficiency index
AC
Iπ (SΘnj ) by Iπ (S
dΘn
j
), for j = 1, 2, . . . , n where
Z 1
Iπ\
(SΘnj ) = Iπ (S
dΘn
j
)= S
d Θn
j
(θ)π(θ)dθ.
0
According to the two last columns of Table 2, interactive ordering using both (8) and
(9) suggest that DMU2 and DMU3 have the best and worst performance, respectively.
25
ACCEPTED MANUSCRIPT
It should be noted that the ranking obtained using (8) and (9) are the same, i.e. some
robustness to changing the weight function from π1 to π2 . As seen in Table 2, DMU9 has
a lower ranking using p-ordering than DMU10 while its interactive ranking using both
PT
weight functions, π1 and π2 , is higher than that of DMU10 .
RI
5.1.4 Comparison between the two analyses
Here we present some comparison between the two approaches taken for the analysis of
SC
the data.
U
• The median ordering using the empirical distribution cannot distinguish between DMU2
AN
and DMU7 , while the efficiency of DMU2 has a greater median in the model-based analysis.
M
• Since po = 0 for DMU3 , DMU4 , DMU6 , DMU9 and DMU10 using the empirical approach,
the first approach cannot provide a p-ranking for these DMUs. The second approach
D
provides a p-rank for all DMUs. It is worth noting that DMU8 has a lower p-rank in com-
TE
parison with DMU4 and DMU6 in the second approach. This is expected from Figure 2.
In fact, even though DMU4 and DMU6 are not efficient over 20 years, but their trend of
EP
• There are some differences between interactive orderings according to these two ap-
AC
proaches. In both interactive ordering using π1 and π2 , the rank of DMU8 is increased to
5, and the ranks of DMU4 and DMU6 are fallen by 1 in the second approach. Moreover,
the empirical approach cannot produce distinctive ranking of DMU3 , DMU9 and DMU10 .
• The following Hasse diagrams (Rutherford (1965)), based on the results reported in Table
26
ACCEPTED MANUSCRIPT
PT
RI
SC
Figure 5: Grundfeld data-Hasse diagrams for the empirical (Left) and model-based (right) approaches.
U
AN
The Hasse diagrams indicate some differences. As seen in Figure 5, DMU4 is stochasti-
tically dominate DMU6 and DMU8 using the empirical approach. DMU6 stochastically
D
dominates DMU9 using the first approach while using the second approach it stochastically
TE
dominates DMU1 . DMU10 is stochastically ordered by DMU9 using the first approach,
while they are unordered using the second approach. The main difference between order-
C
ing using these two approaches is that DMU5 and DMU7 are stochastically ordered by
AC
DMU2 using the empirical approach, while they are unordered using the second approach.
In addition to DMU2 , DMU5 and DMU7 are also admissible units using the model-based
approach.
27
ACCEPTED MANUSCRIPT
Here, we use the stochastic frontier function approach, proposed by Battese and Coelli
(1992), to evaluate the efficiency of each firm in the Grundfeld data. As mentioned in the
PT
beginning of this section, each firm use two inputs, Fit and Cit, to produce one output,
RI
SC
ln(litjt ) = β0 + β1 ln(F it) + β2 ln(Cit) + Vjt − Ujt (10)
U
where the subscripts j and t refer to jth firm and tth observation, respectively. The Vjt s
AN
are statistical noise and assumed to be independent and identically distributed N (0, σV2 ).
jth firm are obtained, and the Ui s are non-negative random variables corresponding to
D
the inefficiency terms which are assumed to be independent and identically distributed
TE
according to half normal distribution. This model is such that the non-negative firm
EP
appropriate when firms tend to improve their level of technical efficiency over time. We
AC
note that in model (10) the time dimension of the inefficiency term will be allowed to
change over time, and so this model is called time variant. In contrast, if in model (10)
To apply our proposed ranking methods using the SFA model and theory developed
by Battese and Coelli (1992), we make our panel data unbalanced by randomly choosing
28
ACCEPTED MANUSCRIPT
different number of years for different DMUs2 . The following table shows the years
considered randomly for different DMUs, where ”\” stands for set minus.
DMU The number of considered year The years randomly considered for each DMU
1 10 {1937,1938,1940,1942,1944,1945,1947,1948,1950,1953}
2 13 {1935–1954}\{1937,1939,1941,1942,1944,1948,1951}
PT
3 17 {1935–1954}\{1943,1951,1952}
4 16 {1935–1954}\{1938,1946,1949,1951}
5 14 {1935–1954}\{1943,1944,1949,1950,1952,1954}
6 20 {1935–1954}
RI
7 15 {1935–1954}\{1936,1947,1949,1951,1952}
8 11 {1935–1954}\{1939,1940,1943,1944,1945,1946,1947,1948,1951}
9 12 {1935–1954}\{1940,1943,1945,1946,1948,1951,1952,1953}
SC
10 18 {1935–1954}\{1936,1942}
U
When we take the above time-variant SFA model on the unbalanced Grundfeld data,
AN
we will have a sample from efficiency score distribution of each DMU. Figure 6 below
distribution of each firm, and apply the proposed ordering methods to rank DMUs.
TE
Figure 7 depicts the estimated reliability functions of different firms. The codes are im-
plemented in R package (R Development Core Team, 2005) using the package frontier.
EP
Table 4 provides the partial and linear ranking using different empirical summaries of
C
efficiencies obtained from SFA estimator. The efficiency scores of DMUs obtained from
AC
time-invariant SFA approach are reported in Column 2. Columns 3–5 provide median,
mean and stochastic ordering of DMUs using time-variant SFA approach, respectively.
As seen in Columns 3 and 4, we have the same rankings of DMUs using Time-
29
ACCEPTED MANUSCRIPT
1.0
0.8
Efficiency (time-variant SFA method)
0.6
0.4
PT
0.2
1 2 3 4 5 6 7 8 9 10
DMU
RI
1.0
DMU1
DMU2
DMU3
DMU4
DMU5
SC
0.8
DMU6
DMU7
Efficiency (time-variant SFA method)
DMU8
DMU9
DMU10
0.6
0.4
U
0.2
AN
0.0
5 10 15 20 25
TIME
M
Figure 6: Unbalanced Grundfeld data - Box-plot and efficiency pattern of the firms with SFA estimator.
DMU Time-invariant efficiencies Time-variant efficiencies
Median Mean Stochastically dominated units
D
Table 4: Unbalanced Grundfeld data - partial and linear ranking using different empirical summaries of
efficiencies obtained from SFA estimator.
AC
of DMUs in comparison with the mean and median orderings in time-variant technique.
For example, DMU2 has the best ranking using time-invariant approach, while the best
30
ACCEPTED MANUSCRIPT
DMU 1 DMU 2
1.0
1.0
0.8
0.8
Efficiency (time-variant SFA method)
0.6
0.4
0.4
0.2
0.2
PT
0.0
0.0
0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.0 0.2 0.4 0.6 0.8
DMU 3 DMU 4
1.0
1.0
RI
0.8
0.8
Efficiency (time-variant SFA method)
0.6
0.4
0.4
SC
0.2
0.2
0.0
0.0
0.00 0.05 0.10 0.15 0.20 0.25 0.30 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7
U
DMU 5 DMU 6
1.0
1.0
AN
0.8
0.8
Efficiency (time-variant SFA method)
0.6
0.4
0.4
0.2
0.2
M
0.0
0.0
0.0 0.2 0.4 0.6 0.8 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7
DMU 7 DMU 8
D
1.0
1.0
0.8
0.8
Efficiency (time-variant SFA method)
0.6
0.4
0.4
0.2
0.2
EP
0.0
0.0
0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.1 0.2 0.3 0.4 0.5
DMU 9 DMU 10
1.0
1.0
C
0.8
0.8
Efficiency (time-variant SFA method)
0.6
AC
0.4
0.4
0.2
0.2
0.0
0.0
0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.00 0.05 0.10 0.15 0.20 0.25 0.30 0.35
Figure 7: Unbalanced Grundfeld data - empirical estimates of the reliability functions of the efficiency distri-
butions using time-variant SFA approach with 95% CI.
31
ACCEPTED MANUSCRIPT
Here we present some comparisons between the results of our proposed ranking meth-
ods using the empirical estimated SFA and DEA reliability functions of efficiency score
PT
distributions of DMUs.
• DMU7 is ranked 1st by mean and median orderings using empirical SFA efficiencies, while
RI
DMU2 is ranked 1st by these ordering methods using DEA. We only have two DMUs,
SC
{DM U3 , DM U10 } ({DM U5 , DM U7 }), with the same ranks in mean ordering (median
U
AN
• The following Hasse diagrams, based on the results reported in Tables 1 and 4, summarize
the stochastic dominance using the two approaches. Clearly DMU2 and DMU7 show
M
D
TE
C EP
AC
Figure 8: Grundfeld data-Hasse diagram for the empirical DEA (left) and empirical SFA (right) analyses.
better performances compared to the other DMUs using different analyses. Yet there are
some differences. As seen in Figure 8, DMU2 stochastically dominates all other DMUs
using the empirical estimation of the DEA efficiencies, while it is dominated by DMU7
32
ACCEPTED MANUSCRIPT
The set of DMUs dominated using DEA approach is a subset of the set of DMUs domi-
nated using SFA approach for all, but DMU7 . For example, DMU6 is stochastically more
PT
{DM U1 , DM U3 , DM U4 , DM U8 , DM U9 , DM U10 } using the SFA analysis while it cannot
stochastically dominate DMU1 , DMU4 and DMU8 using the DEA analysis.
RI
DMU3 and DMU10 are unordered using DEA analysis, while DMU10 dominates DMU3 in
SC
the SFA analysis. Therefore, DMU3 has the worst performance among all DMUs in the
SFA analysis.
U
6 Conclusions
AN
M
The proposed methods in the current literature on stochastic DEA do not use all the in-
formation contained in the efficiency score distributions for ranking and ordering DMUs.
D
In this manuscript, we have addressed this gap by introducing two ranking methods,
TE
partial and general linear ordering, borrowing ideas from reliability and statistical de-
EP
cision theory. The partial ordering paved the way further to the introduction of the
also proposed a linear ordering method based on a weighted average of the reliability
AC
function of efficiency scores. Special cases of this ranking method include mean and
median ranking. This ranking method may also be viewed as an interactive ranking
where one incorporates prior knowledge about possible performance of different DMUs,
The implementation of the proposed ranking methods comprises two steps; first using
33
ACCEPTED MANUSCRIPT
efficiency score distribution for each DMU. To this end, one can use DEA, SFA, etc,
PT
for step 2. For DEA approach in step 1 we used CCR model in this manuscript. One
can however used BCC model (Banker et al. (1984)). All theorems presented in this
RI
manuscript hold true if we replace CCR model by BCC model in step 1. We have
illustrated our proposed ranking methods using both parametric and nonparametric
SC
efficiency evaluation approaches, and empirical and model-based estimation methods.
U
The analysis of Grundfeld data using different methods indicates some similarities
AN
and dissimilarities. The dissimilarities posed a question as to which analyses should
be given more credibility. To this end, we have depicted the scatter plot of data in the
M
following figure. The scatter plot indicates a linear relationship between the logarithm of
D
TE
8
EP 6
4
Inv
Fit
9
2
8
C
7
0
4
AC
-2
-2 0 2 4 6 8
Cit
the output and the logarithm of the inputs, hence providing strong evidence in support
of SFA as the proper method of efficiency evaluation for Grundfeld data. It should be
noted that the low dimensionality of inputs and outputs of Grundfeld data facilitated
34
ACCEPTED MANUSCRIPT
depiction of the data which gave us decisive evidence for choosing SFA as the more
appropriate approach for analyzing Grundfeld data. When the dimension of the outputs
and/or the inputs is above three, the data visualization will not be as easy. The difficulty
PT
exacerbates as the dimension of the outputs and/or inputs grow. In such situations, it
is more reasonable to use a robust approach for efficiency evaluation, such as DEA, than
RI
a parametric approach, such as SFA. This is so since model validation is hard if it is not
impossible.
SC
Another type of uncertainty, not discussed in this manuscript, which has been consid-
U
ered in the literature, is the so-called market uncertainty where the prices of the inputs
AN
or the outputs of DMUs are subject to uncertainties (see for example Sengupta (1999,
Acknowledgement: The authors would like to express their sincere gratitude to the
TE
Editors, the AE handling this article and two anonymous referees whose comments and
suggestions considerably improved this manuscript. This research was partly supported
EP
by the Natural Sciences and Engineering Research Council (NSERC) of Canada [NSERC-
RGPIN-2018-05618].
C
AC
References
Adler, N., Friedman, L. and Sinuany-Stern, Z. (2002) Review of ranking methods in the data
Aigner, D., Lovell, C. A. K. and Schmidt, P. (1977) Formulation and estimation of stochastic
35
ACCEPTED MANUSCRIPT
Aldamak, A. and Zolfaghari, S. (2017) Review of efficiency ranking methods in data envelop-
PT
Andersen, P. and Petersen, N. C. (1993) A procedure for ranking efficient units in data envel-
RI
opment analysis. Management science 39(10), 1261–1264.
SC
Angulo-Meza, L. and Lins, M. P. E. (2002) Review of methods for increasing discrimination in
U
Asgharian, M. and Noorbaloochi, S. (1998) Note on a fundamental relationship between ad-
AN
missible and Bayesian decision rules. Statistics 31, 21–34.
M
Banker, R. (1993) Maximum likelihood, consistency and data envelopment analysis: a statistical
Banker, R., Charnes, A. and Cooper, W. (1984) Some models for estimating technical and scale
Battese, G. E. and Coelli, T. J. (1992) Frontier production functions, technical efficiency and
panel data: with application to paddy farmers in India. Journal of productivity analysis
3(1-2), 153–169.
Battese, G. E. and Coelli, T. J. (1995) A model for technical inefficiency effects in a stochastic
frontier production function for panel data. Empirical economics 20(2), 325–332.
36
ACCEPTED MANUSCRIPT
Bogetoft, P. and Otto, L. (2010) Benchmarking with DEA, SFA, and R. New York: Springer.
Bruni, M., Conforti, D., Beraldi, P. and Tundis, E. (2009) Probabilistically constrained models
for efficiency and dominance in DEA. International Journal of Production Economics 117(1),
PT
219–228.
RI
Charnes, A., Cooper, W. and Rhodes, E. (1978) Measuring the efficiency of decision making
SC
Charnes, A., Cooper, W. W. and Rhodes, E. (1979) Short communication: Measuring the
U
efficiency of decision-making units. European Journal of Operational Research 3(4), 339.
AN
Chen, Y., Du, J. and Huo, J. (2013) Super-efficiency based on a modified directional distance
Cooper, W. W., Huang, Z., Lelas, V., Li, S. X. and Olesen, O. B. (1998) Chance constrained
D
run is much better than two doubles” criterion. To appear Journal of the Operational Research
Friedman, L. and Sinuany-Stern, Z. (1997) Scaling units via the canonical correlation analysis
37
ACCEPTED MANUSCRIPT
Gijbels, I., Mammen, E., Park, B. and Simar, L. (1999) On estimation of monotone and concave
Gong, L. and Sun, B. (1995) Efficiency measurement of production operations under uncer-
PT
tainty. International Journal of Production Economics 39(1-2), 55–66.
RI
Greene (1993) The econometric approach to efficiency analysis, in H. O. Fried, C. A. K. Lovell,
and S. S. Schmidt, eds., The Measurement of Productive Efficiency and Productivity growth.
SC
Oxford: Oxford University Press.
U
Greene, W. H. (2011) Econometric Analysis. 7th edition. New York: Prentice Hall.
AN
Hall, P. and Simar, L. (2002) Estimating a changepoint, boundary or frontier in the presence
Kao, C. and Liu, S.-T. (2009) Stochastic data envelopment analysis in measuring the efficiency
D
Kneip, A., Park, B. and Simar, L. (1998) A note on the convergence of nonparametric DEA
EP
Kneip, A., Simar, L. and Wilson, P. W. (2008) Asymptotics and consistent bootstraps for DEA
AC
Kneip, A., Simar, L. and Wilson, P. W. (2011) A computationally efficient, consistent bootstrap
for inference with non-parametric dea estimators. Computational Economics 38, 483–515.
38
ACCEPTED MANUSCRIPT
Lamb, J. D. and Tee, K. H. (2012) Resampling DEA estimates of investment fund performance.
Land, K., Lovell, C. and Thore, S. (1993) Chance-constrained data envelopment analysis. Man-
PT
agerial and Decision Economics 14(6), 541–554.
RI
Liang, L., Wu, J., Cook, W. and Zhu, J. (2008) Alternative secondary goals in DEA cross
SC
Meeusen, W. and van den Broeck, J. (1977) Efficiency estimation from Cobb-Douglas produc-
U
tion functions with composite error. International Economic Review 18, 435–444.
AN
Mehrabian, S., Alirezaee, M. R. and Jahanshahloo, G. R. (1999) A complete efficiency ranking
R Development Core Team (2005) R: A Language and Environment for Statistical Computing.
Rutherford, D. (1965) Introduction to Lattice Theory. 7th edition. UK: Oliver and Boyd, Ed-
inburgh.
Sengupta, J. K. (1999) A dynamic efficiency model using data envelopment analysis. Interna-
39
ACCEPTED MANUSCRIPT
Sengupta, J. K. (2005) Nonparametric efficiency analysis under uncertainty using data envel-
Sexton, T. R., Silkman, R. H. and Hogan, A. J. (1986) Data envelopment analysis: Critique
PT
and extensions. New Directions for Program Evaluation 32, 73–105.
RI
Simar, L. (2007) How to improve the performances of DEA/FDH estimators in the presence of
SC
Simar, L. and Wilson, P. (1998) Sensitivity of efficiency scores: How to bootstrap in nonpara-
U
metric frontier models. Management Science 44, 49–61.
AN
Simar, L. and Wilson, P. (2000) A general methodology for bootstrapping in nonparametric
Simar, L. and Wilson, P. (2007) Estimation and inference in two-stage, semi-parametric models
D
Simar, L. and Wilson, P. W. (2015) Statistical approaches for non-parametric frontier models:
EP
Sinuany-Stern, Z. and Friedman, L. (1998) DEA and the discriminant analysis of ratios for
AC
Sinuany-Stern, Z. and Friedman, L. (2016) Statistical analysis in the DEA context. 2016 Second
40
ACCEPTED MANUSCRIPT
Wang, Y. M. and Chin, K. S. (2010) Some alternative models for DEA cross-efficiency evalua-
PT
Appendix I
RI
Proof of theorems
SC
Proof of Theorem 2: We first note that Θn is a random variable defined on the probability
U
measure on =. We further note that for any ω ∈ Ω we have a ΨCCR . Let Ai = {ω ∈
AN
Ω : Θni (ω) = 1} for i = 1, . . . , n. Since in any ΨCCR there is at least one efficient DMU,
n
M
S
we have Ω = Ai . Now suppose that there is no mass point at 1 for any DMU, i.e.,
i=1
n
S n
P n
S
P( Ai ) ≤ P (Ai ) = 0. On the other hand, P ( Ai ) = P (Ω) = 1. This is a
i=1 i=1 i=1
TE
contradiction.
EP
Lemma 1. If DMUo is inadmissible, then there exists λ̃ = λ̃1 , ..., λ̃n ≥ 0 such that P (Θnλ̃ ≥
C
2
Θno ) ≥ 2po − po2+1 , where Θnλ̃ indicates the efficiency of the virtual stochastic DMU ( nj=1 λ̃j Xj , nj=1 λ̃j Yj ).
P P
AC
Proof. Since DMUo is inadmissible, then there exists λ̃ = λ̃1 , ..., λ̃n ≥ 0 such that
where SΘn (·) is the survival function of the efficiency of the virtual stochastic DMU
λ̃
41
ACCEPTED MANUSCRIPT
Pn Pn
that uses the input j=1 λ̃j Xj to produce the output j=1 λ̃j Yj . For any λ ≥ 0 define
PT
P (Ωλ̃ ) = P (Θnλ̃ ≥ Θno )
RI
= P (Θnλ̃ ≥ Θno | Θno = 1)P (Θno = 1) + P (Θnλ̃ ≥ Θno | Θno < 1)P (Θno < 1)
U SC
where
=
0
TE
S S
We note that DMUo is not on the frontier for any ω ∈ Ωλ . Thus given ω ∈ Ωλ , the
λ λ
EP
efficiency of DMUo cannot affect the efficiency of other DMUs. Thus Θnλ̃ is independent
of Θno given ω ∈
S
λ Ωλ . We therefore have
C
R1 R1
P (Θnλ̃ ≥ Θno | Θno < 1) = P (Θnλ̃ ≥ θ)dFΘ<no (θ) = SΘn (θ)dFΘ<no (θ).
AC
0 0 λ̃
Using (11),
R1
P (Θnλ̃ ≥ Θno | Θno < 1) ≥ 0
SΘno (θ)dFΘ<no (θ).
42
ACCEPTED MANUSCRIPT
R1 R1 (1−po ) 1+po
0
SΘno (θ)dFΘ<no (θ) = po + (1 − po ) 0
SΘ<no (θ)dFΘ<no (θ) = po + 2
= 2
. Then
(1 − p2o )
P (Ωλ̃ ) ≥ P (Θnλ̃ ≥ Θno | Θno = 1)P (Θno = 1) +
2
(1 − p2o )
PT
≥ P (Θnλ̃ = 1, Θno = 1) +
2
(1 − p2o )
= pλ̃ + po − P (Θnλ̃ = 1 or Θno = 1) +
RI
2
2
(1 − po )
≥ 2po − P (Θnλ̃ = 1 or Θno = 1) +
2
SC
2
p +1
≥ 2po − o .
2
U
Proof of Theorem 3: Suppose DMUo is inadmissible, then using Lemma 1, there
AN 2
exists λ̃ = λ̃1 , ..., λ̃n such that P (Ωλ̃ ) = P (Θnλ̃ ≥ Θno ) ≥ 2po − po2+1 .
M
On the other hand, {ω ∈ Ω | Θno (ω) = 1} = Ω −
S
λ Ωλ . Thus
D
[
po = P (Θno = 1) = 1 − P ( Ωλ )
TE
≤ 1 − P (Ωλ̃ )
p2o + 1 p2 + 3
≤ 1 − 2po − = −2po + o .
EP
2 2
p2o +3
C
43