Professional Documents
Culture Documents
Aggregation Error Bounds
Aggregation Error Bounds
MODELS
R. L. FRANCIS
Department of Industrial and Systems Engineering, University of Florida, Gainesville, Florida 32611; francis@ise.ufl.edu
T. J. LOWE
College of Business Administration, University of Iowa, Iowa City, Iowa 52242, timothy-lowe@uiowa.edu
ARIE TAMIR
Department of Statistics & Operations Research, School of Mathematical Sciences, Tel-Aviv University, Tel-Aviv 69978, Israel, atamir@math.tau.ac.il
(Received February 1997; revisions received January 1998, November 1998; accepted December 1998)
Many location models involve distances and demand points in their objective function. In urban contexts, there can be millions of
demand points. This leads to demand point aggregation, which produces error. We identify a general model structure that includes
most such location models, and present a means of obtaining error bounds for all models with this structure. The error bounds
suggest how to do the demand point aggregation so as to keep the error small.
model. Indeed, this difference suggests that asymptotic gate demand points can be approximated within reason-
analysis, as done by Zemel (1985) and by Rayco and Fran- able accuracy in polynomial time. For example, if qmin
cis (1996), may be a fruitful approach to solving the sec- denotes the minimum number of aggregate demand points
ondary models. needed and the space is the Euclidean plane, Hochbaum
For more detail on using error bounds to suggest aggre- and Maas (1985) provide efficient algorithms that will find
gation methods, using simpler distances, and adjusting 兩Q兩, q⬘ centers (aggregate points) ensuring the desired bound,
see Francis et al. (1996), Rayco et al. (1997), and Francis and q⬘ is arbitrarily close to qmin. Specifically, for any fixed
et al. (1999). In the first two listed papers the authors find positive ⑀ their scheme computes in time which is polyno-
that the error bound for the n-median problem was usually mial in m ⫽ 兩P兩 and ⑀, a set of q⬘ aggregate demand points,
not tight but worked well as a surrogate error measure, such that the error bound is at most B, and q⬘ 聿 [(1 ⫹
and that experimentally, the error bound for the (un- ⑀)2]qmin. They give similar results for d-dimensional Eu-
weighted) n-center problem was usually found to be tight. clidean and rectilinear spaces. For general metric spaces
In the latter paper, the authors observed that, based on or networks the best known approximation of qmin is not as
experimental results, error is decreasing in 兩Q兩/兩X兩. good. The best values of q⬘ that can be computed efficiently
It is also possible, as discussed by Andersson et al. are of the order [log兩P兩]qmin. Such bounds are guaranteed if
(1998), to view the use of approximating planar distances we use, for example, a greedy scheme to solve the above
in the secondary problems as being “strategic” planning. covering problem on the points in P (see Chvatal 1979).
One can then obtain a family of smaller “tactical” sub- The above approach to bound and control the number
problems to solve, where the distances can be the original of aggregate demand points can also be adapted to apply
distances, but using the aggregate demand points. They to the n-median problem. Again, suppose that we want to
found that this approach worked quite well for n-median find the minimum number of aggregate demand points
aggregation, whereas results were mixed for n-center ag- qmin needed to ensure an upper bound of B on the error
gregation. The approach may be viewed as a heuristic de- term 2eb for the n-median problem. Unlike the above re-
composition method. sults for the n-center problem, we are unaware of any
Consider now how the error bound expressions can be polynomial time algorithm that can approximate qmin by a
useful in controlling or reducing the error by adjusting q, constant factor and at the same time ensure the bound B
the number of distinct aggregate demand points in P⬘, i.e., on the error term 2eb. However, the results of Lin and
the number of points in Q. As an illustration, consider first Vitter (1992) can be used to achieve the following result.
the (unweighted) n-center model. Suppose we wish to ag- For any fixed positive ⑀ we can compute, in time that is
gregate the original vector of demand points P and ensure polynomial in m ⫽ 兩P兩 and ⑀, a set of q⬘ aggregate demand
that the error term, 2eb (from (3)), is at most B. The points, such that the error bound is at most 2(1 ⫹ ⑀) B and
relevant question is how big should the set Q be, i.e., what q⬘ 聿 (1 ⫹ 1/⑀)qmin. For example, if ⑀ ⫽ 1, an error bound
is the minimum number of aggregate demand points of 4B can be guaranteed by computing in polynomial time
needed to ensure that 2eb 聿 B. Of course, this problem by q⬘ aggregate demand points, where q⬘ 聿 2qmin.
itself is NP-hard for general metric spaces or networks. It To conclude, we have demonstrated that for several im-
is equivalent to the problem of finding the minimum num- portant location models, the error bounds we will derive in
ber of centers (i.e., aggregate demand points) needed to §3 can be used quite efficiently to assess and control the
cover each one of the original demand points within a “cost of approximation” by selecting the set of aggregate
distance of B/ 2. However, the minimum number of aggre- demand points of the appropriate size.
300 / FRANCIS, LOWE, AND TAMIR
Figure 1. Construction process for SAND location models.
r
each point u of R⫹ as the maximum (or sup) of the values hi(u) is called the jth largest component sum of u.
of { gi(u)} at that point is a SAND function.
r
S-5. If g(u) is a SAND function over R⫹ , then for any THEOREM 1. For each j ⫽ 1, . . . , r, hj(u) is an isotone
fixed u ⫽ (u1, . . . , ur), the function g(u1, . . . , ur) is a function defined on Rr, and gj(u) is a SAND function de-
SAND function of the single nonnegative variable . fined on Rr.
302 / FRANCIS, LOWE, AND TAMIR
r
PROOF. To prove the isotonicity of the function hj, con- tion g(u1, . . . , ur) defined over R⫹ , the nonnegative or-
r
sider a pair of vectors, u, v, satisfying ui 肁 vi, for i ⫽ thant of R . Specifically,
1, . . . , r. Without loss of generality, suppose that v1
f共X:P兲 ⫽ g共T 1共X:P兲, . . . , T r共X:P兲兲 ⫽ g共T共X:P兲兲.
肁 . . . 肁 v r.
The function hj(u) is the jth largest element of the set At the beginning of §3 we discussed two special cases of
{u1, . . . , uj, . . . , ur}. Therefore, it is greater than or equal this model: the classical n-median and n-center models.
to the jth largest element of the subset {u1, . . . , uj}, which Every model in the table, except the last, is a special case
is equal to min{u1, . . . , uj}. Using the isotonicity of the of this model.
minimum operator we obtain We now continue by formally defining the distance-like
assumptions on the function T(X:P).
h j 共v兲 ⫽ v j ⫽ min兵v 1 , . . . , v j 其 ⭐ min兵u 1 , . . . , u j 其
Associated with T(X:P) we assume that there is a second
⭐ h j 共u兲. mapping. Given any pair of vectors P ⫽ ( p1, . . . , pm) and
We conclude that hj is isotone. P⬘ ⫽ ( p⬘1, . . . , p⬘m), this second mapping is defined by
The function gj is the sum of j isotone functions, and S(P:P⬘) ⫽ (S1(P:P⬘), . . . , Sr(P:P⬘)), from Ym ⫻ Ym into
r
therefore it is isotone. Because gj is positively homoge- R⫹ , where, for j ⫽ 1, . . . , r, Sj(P:P⬘) is a nonnegative real
neous, it follows from Rockafellar (1970, Theorem 4.7), number. The two mappings satisfy the following assump-
mentioned above, that gj is subadditive if and only if it is tions.
convex. Thus, it remains to show that gj is convex. DISTANCE-LIKE ASSUMPTION. For each triplet of vectors of
Let u, v be a pair of vectors in Rr and let t be any real points, X ⫽ ( x1, . . . , xn), P ⫽ ( p1, . . . , pm), and P⬘ ⫽
number satisfying 0 聿 t 聿 1. We have gj(tu ⫹ (1 ⫺ t)v) ⫽ ( p⬘1, . . . , p⬘m), in the metric space Y,
¥ {tuk ⫹ (1 ⫺ t)vk兩k 僆 I} for some I 傺 {1, . . . , r}, 兩I兩 ⫽ j.
Thus S共P:P⬘兲 ⫽ S共P⬘:P兲,
g j共tu ⫹ 共1 ⫺ t兲v兲 ⫽ t 冘 兵u 兩k 僆 I其
k
T共X:P兲 ⭐ T共X:P⬘兲 ⫹ S共P⬘:P兲.
⫹ 共1 ⫺ t兲 冘 兵v 兩k 僆 I其 ⭐ tg 共u兲 ⫹ 共1 ⫺ t兲 g 共v兲
k
j j The above assumption generalizes the triangle inequal-
ity property of the distance (metric) function. For example,
We now conclude gj is a SAND function. □ consider again the n-median model, where Ti(X:P) ⫽ D(X,
pi). For i ⫽ 1, . . . , m, define Si(P:P⬘) ⫽ d( pi, p⬘i). It is easy
THEOREM 2. Let a1 肁 . . .肁 ar be a sequence of nonnega- to verify that the distance-like property is implied by the
r
tive reals. The function g(u) ⫽ ¥j⫽1 ajhj(u) is a SAND symmetry of distance, the triangle inequality, and Lemma
r
function defined on R . 1.
Our last supposition of the general location model in-
PROOF. Using summation by parts, the function g(u) can
volves the objective function f(X:P) ⫽ g(T1(X:P), . . . ,
be represented as
Tr(X:P)). The real function g(u1, . . . , ur), defined on R⫹ r
,
冘 共a ⫺ a
r⫺1
is assumed to be SAND.
g共u兲 ⫽ a r g r共u兲 ⫹ j j⫹1 兲 g
j
共u兲.
j⫽1
3.4. Derivation of the Error-Bound Expression
Using this representation, the result follows directly from We now use the general model presented above to derive
property S-1 and the previous theorem, because gj(u) is a explicit expressions for eb, which are independent of X but
SAND function, and aj ⫺ aj⫹1 肁 0 for j ⫽ 1, . . . , r ⫺ 1. □ depend on the original set of demand points P and the set
of aggregate demand points P⬘ to be selected.
3.3. The General SAND Location Model
In a typical facility location model there is a vector of THEOREM 3. Consider the location model f(X:P) ⫽ g(T(X:
demand points P ⫽ ( p1, . . . , pm), embedded in some met- P)), where T(X:P) satisfies the distance-like assumption,
ric space Y (see Goldberg 1976). The objective is to select and g is a SAND function. Then,
a vector X ⫽ ( x1, . . . , xn) of facilities (servers) in Y to
兩f共X:P兲 ⫺ f共X:P⬘兲兩 ⭐ g共S共P:P⬘兲兲 for all X.
optimize some function of the distances {d( xj, pi)兩j ⫽
1, . . . , n; i ⫽ 1, . . . , m} between the m demand points PROOF. The distance-like property implies
and the n servers and the distances {d( xj, xt)兩j, t ⫽ 1, . . . ,
T共X:P兲 ⭐ T共X:P⬘兲 ⫹ S共P⬘:P兲.
n} among the n servers themselves.
To formulate a model, let T(X:P) denote a “distance- Next, from the isotonicity of the function g, we have
like” vector (to be defined shortly) of nonnegative reals,
g共T共X:P兲兲 ⭐ g共T共X:P⬘兲 ⫹ S共P⬘:P兲兲.
involving the above distances. Formally, T(X:P) has r real
components, say T1(X:P), . . . , T r(X:P), then T j(X:P), j ⫽ Applying the subadditivity of g, we obtain
1, . . . , r, is viewed as a mapping from the space Yn ⫻ Ym
g共T共X:P兲兲 ⭐ g共T共X:P⬘兲兲 ⫹ g共S共P⬘:P兲兲.
into the real line R. The objective function of the location
model f(X:P) is modeled by some real-valued costing func- Due to symmetry, we also have
FRANCIS, LOWE, AND TAMIR / 303
g共T共X:P⬘兲兲 ⭐ g共T共X:P兲兲 ⫹ g共S共P:P⬘兲兲. min兵d 2 共 p i , v t 兲, min 兵d 1 共 p i , x j 兲 ⫹ d 1 共 x j , v t 兲其其,
j⫽1,. . .,n
Finally, combining the above inequalities and using the ⭐ min兵d 2 共 p⬘i , v t 兲, min 兵d 1 共 p⬘i , x j 兲 ⫹ d 1 共 x j , v t 兲其其
symmetry assumption S(P:P⬘) ⫽ S(P⬘:P), it follows that j⫽1,. . .,n
To demonstrate the above result we apply it to the sup- T i,t共共X, V兲:P兲 ⭐ T i,t共共X, V兲:P⬘兲 ⫹ S i,t共P:P⬘兲.
porting median model discussed in Mirchandani and Thus, we conclude that an error bound for the supporting
Odoni (1979). The input consists of a set of demand points median model is given by
P ⫽ ( p1, . . . , pm), a set of existing facilities (e.g., airports)
冘冘w
m r
V ⫽ (v1, . . . , vr), and nonnegative weights {wi,t兩i ⫽ 1, . . . , i,t
i,t S 共P:P⬘兲
m; t ⫽ 1, . . . , r}. The objective is to select a set of new i⫽1 t⫽1
facilities to optimize the objective,
冘冘w
m r
⫽ max兵d 1 共 p i , p⬘i 兲, d 2 共 p i , p⬘i 兲其.
冘冘
m r i,t
i⫽1 t⫽1
w i,t min兵d 2 共 p i , v t 兲, min 兵d 1 共 p i , x j 兲
i⫽1 t⫽1 j⫽1,. . .,n
Now consider the n-median model with concave SAND
⫹ d 1 共 x j , v t 兲其其. functions of nearest distances. From Property S-6, §3.1
(d1 and d2 are two different metrics defined on Y, repre- (see Rosenbaum 1950, Theorem 1.4.3) we know that a
senting travel by a supporting median and direct travel, concave, isotone function g(u) of a single real variable u,
respectively.) satisfying g(0) 肁 0, is a SAND function. As an example
We now define the functions T((X, V):P), S(P:P⬘), and consider the following extension of the classical n-median
g, corresponding to the supporting median model. For model, where we build in economies of scale in distance
each pair of indices i, t; i ⫽ 1, . . . , m; t ⫽ 1, . . . , r, set into the model.
Suppose that each customer pi, i ⫽ 1, . . . , m, is associ-
T i,t共共X, V兲:P兲 ⫽ min兵d 2 共 p i , v t 兲, min 兵d 1 共 p i , x j 兲 ated with a transportation cost function ci of the distance
j⫽1,. . .,n
traveled to the respective server, D(X, pi). Specifically, ci is
⫹ d 1 共 x j , v t 兲其其, a concave, isotone function of a single variable, satisfying
i,t
S 共P:P⬘兲 ⫽ max兵d 1 共 p i , p⬘i 兲, d 2 共 p i , p⬘i 兲其. ci(0) 肁 0. The location model is defined by the objective
冘 c 共D共X, p 兲兲.
Finally, set m
f共X:P兲 ⫽ i i
冘冘w
m r i⫽1
g共u兲 ⫽ i,t u i,t .
i⫽1 t⫽1 To obtain an error bound for this extended model, we
apply Theorem 3. Define the vectors T(X:P) ⫽ (T⬘(X:
To apply Theorem 3 we demonstrate that all the assump-
P), . . . , Tm(X:P)) and (S⬘(P:P⬘), . . . , Sm(P:P⬘)) as follows:
tions are satisfied. It is clear that g is a SAND function,
and S(P:P⬘) is symmetric and nonnegative. We only need T i共X:P兲 ⫽ c i 共D共X, p i 兲兲, S i共P:P⬘兲 ⫽ c i 共d共 p i , p⬘i 兲兲,
to satisfy the distance-like assumption,
for all i ⫽ 1, . . . , m. Using the concavity of the function ci,
T i,t共共X, V兲:P兲 ⭐ T i,t共共X, V兲:P⬘兲 ⫹ S i,t共P:P⬘兲. we note that ci is a SAND function and therefore,
Indeed, for any fixed i, t, and j ⫽ 1, . . . , n, c i 共D共X, p i 兲兲 ⭐ c i 共D共X, p⬘i 兲兲 ⫹ c i 共d共 p⬘i , p i 兲兲.
d 1 共 p i , x j 兲 ⫹ d 1 共 x j , v t 兲 ⭐ d 1 共 p⬘i , x j 兲 ⫹ d 1 共 x j , v t 兲 Thus,
⫹ d 1 共 p i , p⬘i 兲. T i共X:P兲 ⭐ T i共X:P⬘兲 ⫹ S i共P⬘:P兲.
Also m
Setting g(u1, . . . , um) ⫽ ¥i⫽1 ui and applying Theorem 3,
d 2 共 p i , v t 兲 ⭐ d 2 共 p⬘i , v t 兲 ⫹ d 2 共 p i , p⬘i 兲. we conclude that an error bound for this model is given by
Let
冘 c 共d共 p , p⬘ 兲兲.
m
i i i
A i ⫽ max兵d 1 共 p i , p⬘i 兲, d 2 共 p i , p⬘i 兲其. i⫽1
Then from the above inequalities, Finally we point out that the results of Theorem 3 ex-
d 1 共 p i , x j 兲 ⫹ d 1 共 x j , v t 兲 ⭐ d 1 共 p⬘i , x j 兲 ⫹ d 1 共 x j , v t 兲 ⫹ A i, tend easily to models that include fixed or setup costs for
the new facilities, as well as other terms that are indepen-
d 2 共 p i , v t 兲 ⭐ d 2 共 p⬘i , v t 兲 ⫹ A i,
dent of the set of customers P. To formulate this exten-
for a fixed pair i, t and all j ⫽ 1, . . . , n. Taking the sion, let H(X) be a mapping from Yn to the space Rl.
minimum operator over both sides of the above n ⫹ 1 Specifically, if H(X) has l real components, H1(X), . . . ,
inequalities implies that H1(X), then Hj(X), j ⫽ 1, . . . , l, is viewed as a mapping
304 / FRANCIS, LOWE, AND TAMIR
from Yn into the real line R. For example, H(X) can be a g1(T(X:P)) is the median objective, f2(X:P) ⫽ g2(T(X:P))
vector of setup costs of some or all of the facilities in X or is the center objective, and g0(v1, v2) ⫽ v1 ⫹ v2.
the vector of distances between some pairs of points in X. We note that the above results for the hyper-SAND
We have the following generalization of Theorem 3. (For model apply to any composition of any number of SAND
brevity we omit the proof because it is very similar to the models from the table in §2.
proof of Theorem 3.)
3.6. Obnoxious Location Models
THEOREM 4. Consider the location model f(X:P) ⫽ g(T(X: The crucial assumption in the general model requiring that
P), H(X)), where T(X:P) satisfies the distance-like assump- the objective is a SAND function does not hold for prob-
tion, and g is a SAND function. Then, lems dealing with the location of obnoxious facilities,
兩f共X:P兲 ⫺ f共X:P⬘兲兩 ⭐ g共S共P:P⬘兲, 0兲. where the goal is to maximize some isotone function of the
distances between the customers and the new facilities.
As an example of Theorem 4, f(X:P) might be the Therefore, the above approach for deriving error bounds is
n-median model including a collection of fixed costs, as- not applicable to obnoxious models.
sembled into a vector H(X), and added to the sum of We present a weaker model that is useful to obtain error
weighted closest distances. The error bound for this model bounds for some obnoxious facility location problems. We
would be the same as for the model without the fixed costs. replace the subadditivity property by the following assump-
tion: Let e 僆 Rr denote the vector, all of whose compo-
3.5. Hyper-SAND Models
nents are equal to 1. A function g(u) is e-subadditive on
The above general model can be further generalized to R⫹r r
, if for any u 僆 R⫹ , and 僆 R⫹1
,
obtain error bounds for complex location models, defined
by compositions of simpler location models. For example, g共u ⫹ e兲 ⭐ g共u兲 ⫹ g共 e兲.
consider the n-cent-dian model, where the objective is de- To motivate this definition, note that the function
fined as a sum (or convex combination) of the objective min{uj兩j ⫽ 1, . . . , r} is e-subadditive but not subadditive.
functions of the n-median and the n-center models. In this (Indeed, it is superadditive.)
case, it can be shown directly that the sum of the error Consider now a location model f(X:P) ⫽ g(T(X:P)),
bounds for the n-median and the n-center models is an where T(X:P) satisfies the distance-like assumption, and g
error bound for the n-cent-dian model. We will show that is isotone and e-subadditive. From the distance-like as-
this result can be extended to any composition of models sumption we have
as long as the composition function itself can be expressed
in terms of a SAND function. T j共X:P兲 ⭐ T j共X:P⬘兲 ⫹ S j共P:P⬘兲,
Consider a collection of s location models, { f1(X: for all j ⫽ 1, . . . , r. Let (P:P⬘) ⫽ max{Sj(P:P⬘)兩j ⫽
P), . . . , fs(X:P)}. Suppose that for each j ⫽ 1, . . . , s, 1, . . . , r}. Then,
fj(X:P) ⫽ gj(T(X:P)), where gj is a SAND function. Let g0
be a SAND function, and define a composite model by T j共X:P兲 ⭐ T j共X:P⬘兲 ⫹ 共P:P⬘兲,