Expert Systems With Applications

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 9

Expert Systems with Applications 38 (2011) 6556–6564

Contents lists available at ScienceDirect

Expert Systems with Applications


journal homepage: www.elsevier.com/locate/eswa

Combined heat and power economic dispatch by mesh adaptive direct


search algorithm
Seyyed Soheil Sadat Hosseini a,⇑, Ali Jafarnejad b, Amir Hossein Behrooz c, Amir Hossein Gandomi d,e
a
College of Electrical Engineering, Tafresh University, Tafresh, Iran
b
Department of Construction Management & Engineering, Islamic Azad University, Science and Research Branch, Tehran, Iran
c
Faculty of Management and Accounting, Allameh Tabatabai University, Tehran, Iran
d
College of Civil Engineering, Tafresh University, Tafresh, Iran
e
The Highest Prestige Scientific and Professional National Foundation, National Elites Foundation, Tehran, Iran

a r t i c l e i n f o a b s t r a c t

Keywords: The optimal utilization of multiple combined heat and power (CHP) systems is a complex problem.
Economic dispatch Therefore, efficient methods are required to solve it. In this paper, a recent optimization technique,
Combined heat and power namely mesh adaptive direct search (MADS) is implemented to solve the combined heat and power eco-
Mesh adaptive direct search algorithm nomic dispatch (CHPED) problem with bounded feasible operating region. Three test cases taken from the
Optimization
literature are used to evaluate the exploring ability of MADS. Latin hypercube sampling (LHS), particle
swarm optimization (PSO) and design and analysis of computer experiments (DACE) surrogate algo-
rithms are used as powerful SEARCH strategies in the MADS algorithm to improve its effectiveness.
The numerical results demonstrate that the utilized MADS–LHS, MADS–PSO, MADS–DACE algorithms
have acceptable performance when applied to the CHPED problems. The results obtained using the
MADS–DACE algorithm are considerably better than or as well as the best known solutions reported pre-
viously in the literature. In addition to the superior performance, MADS–DACE provides significant sav-
ings of computational effort.
Ó 2010 Elsevier Ltd. All rights reserved.

1. Introduction power demands. They can be built in urban areas and utilized as
distributed electrical energy sources.
The conversion of primary fossil fuels to electricity is a rela- Several researches worked in the field of the CHPED problem.
tively inefficient process. Even the most modern combined cycle Non-linear optimization algorithms, such as dual and quadratic
plants can only obtain efficiencies of between 50% and 60% (Vasebi, programming (Rooijers & van Amerongen, 1994), and gradient des-
Fesanghary, & Bathaee, 2007). Most of the energy wasted in this cent approaches, such as Lagrangian relaxation (Guo, Henwood, &
conversion process is released to the environment as waste heat. van Ooijen, 1996), have been applied to it. However, these algo-
Cogeneration or CHP generation is a mature and established tech- rithms cannot deal with discontinuous and/or non-monotonic
nology which has energy efficiency and environmental advantages input–output models for generator fuel characteristics. Alterna-
over other forms of energy supply. Economic dispatch (ED) must be tives to the conventional mathematical approach: evolutionary
applied to obtain the optimal use of CHP units. The main objective computation techniques such as genetic algorithm (GA) (Song &
of economic dispatch problem in a conventional power plant is to Xuan, 1998; Su & Chiang, 2004), evolutionary programming (EP)
find the optimal solution for the power production such that the (Wong & Algie, 2002), multi-objective particle swarm optimization
total demand matches the generation with minimum fuel cost. (MPSO) (Wang & Singh, 2008), a hybrid of genetic algorithm with
The mutual dependencies of heat and power generation propose tabu search (GT) (Sudhakaran & Slochanal, 2003), harmony search
a complication in the integration of cogeneration units into the (HS) (Vasebi et al., 2007), fuzzy decision making (FDM) (Chang &
power system economic dispatch. The best CHP schemes can ob- Fu, 1998), improved ant colony search algorithm (ACSA) (Song,
tain fuel conversion efficiencies of the order of 90% (Vasebi et al., Chou, & Stonham, 1999) and self adaptive real-coded genetic algo-
2007). Cogeneration systems have extensively been used by the rithm (SARGA) (Subbaraj, Rengaraj, & Salivahanan, 2009) have suc-
industry, recently. Some industrial processes have large heat cessfully been applied to the CHPED problem. In the optimization
requirements, either as process steam or piped hot fluid, and large area, many interesting results come from the utilization of pattern
search (PS) methods (Torczon, 1997). At this stage, mesh adaptive
⇑ Corresponding author. Tel.: +98 912 419 9906. direct search (MADS) (Audet & Dennis, 2006) is one of the most
E-mail addresses: s.sadathosseini@gmail.com (S.S. Sadat Hosseini), a.h.gandomi@
powerful optimization algorithms that has recently emerged. The
gmail.com (A.H. Gandomi). MADS method lends itself to optimization problems with discrete

0957-4174/$ - see front matter Ó 2010 Elsevier Ltd. All rights reserved.
doi:10.1016/j.eswa.2010.11.083
S.S. Sadat Hosseini et al. / Expert Systems with Applications 38 (2011) 6556–6564 6557

and continuous variables, such as the generator loads of the CHP The CHPED problem of a system is to determine the unit heat
problem. MADS does not depend on derivatives of the objective and power production so that system production cost is minimized
function of the problem to be solved, non-monotonic functions while the heat-power demands and other constraints are met. It
and accommodating discontinuous. Despite significant advantages can be mathematically stated as follows:
of MADS over other optimization approaches, there have been
Np
X Nb
X Nh
X
some little scientific efforts directed at applying it to academic
Min C i ðPi Þ þ C j ðPj ; Hj Þ þ C k ðHk Þ ð1Þ
and practical problems (Audet, Bechard, & Chaouki, 2008; Nicosia
i¼1 j¼1 k¼1
& Stracquadanio, 2007).
The main objective of this study is to introduce the MADS algo-
Subject to the equilibrium constraints of heat and electricity pro-
rithm to solve the CHPED problem. Three four-unit systems previ-
duction, and the capacity limits of each unit
ously presented in the literature have been used as case studies.
The Latin hypercube sampling (LHS), particle swarm optimization Np
X Nb
X
(PSO) and design and analysis of computer experiments (DACE) Pi þ Pj ¼ Pd ð2Þ
surrogate algorithms are used as search strategies in MADS to solve i¼1 j¼1
each of the CHPED problems. The results obtained by the MADS– Nh
X
Nc X
LHS, MADS–PSO, MADS–DACE methods are further compared with Hj þ Hk ¼ Hd ð3Þ
those generated with other (evolutionary and mathematical pro- j¼1 k¼1
gramming) techniques reported in the literature. This paper is or-
ganized as follows: Section 2 describes the characteristics of a Pmin
i 6 P i 6 Pmax
i ; i ¼ 1; . . . ; Np ð4Þ
cogeneration unit and formulation of the CHPED problem, Section Pmin ðHj Þ 6 Pj 6 Pmax ðHj Þ; j ¼ 1; . . . ; Nb ð5Þ
j j
3 deals with mesh adaptive direct search, Section 4 describes
implementation of MADS to the CHPED problem, and Section 5 dis- Hmin
j ðPj Þ 6 Hj 6 Hmax
j ðPj Þ; j ¼ 1; . . . ; Nb ð6Þ
cusses the MADS performance on this specific problem.
Hmin
k 6 Hk 6 Hmax
k ; k ¼ 1; . . . ; Nh ð7Þ

2. Formulation of the CHPED problem where f is the total heat and power production cost, C is the unit
production cost; P the unit power generation; H the unit heat
Combined heat and power generation is a mature and estab- production; Hd and Pd the system heat and power demands, respec-
lished technology. It has higher energy efficiency and less green tively; i, j and k the indices of conventional power units, cogenera-
house gas emission compared with the other forms of energy sup- tion units and heat-only units; Np, Nb and Nh the numbers of the
ply (Vasebi et al., 2007). The essential difference between com- kinds of units mentioned above; Pmin and Pmax the unit power
bined heat and power units and conventional condensing plant is capacity limits; Hmin and Hmax the unit heat capacity limits.
in the type of the power obtained and the overall efficiency of each The mutual dependencies of heat and power generations from
plant. In conventional condensing plants, the energy from the fuel Eqs. (5) and (6) introduce a complication in the integration of
is utilized to produce electrical power only, while in CHP systems, cogeneration units. Hence, the optimization problem of the CHPED
the energy from the fuel is utilized to produce both electrical and is non-linear and highly constrained in nature.
thermal power thus increasing its efficiency. The heat production
depends on power generation and inversely. This introduces com-
plexity due to the non-separable heat in the CHP units and nature 3. Mesh adaptive direct search algorithm
of electrical power.
The heat-power feasible operation region (FOR) of a combined The PS optimization algorithm is a class of direct search meth-
cycle cogeneration unit is shown in Fig. 1. The boundary curve ods. This algorithm is suitable to solve different optimization prob-
MNOPQR enclosed the feasible operation region. The upper and lems that lie outside the scope of the standard optimization
lower bounds of heat and power units are restricted by their methods. In general, PS has the advantage of being very simple
own generation limits. Along the boundary curve BC, the heat in concept, easy to implement and computationally efficient. A
capacity increases as the power generation declines. useful review of direct search methods for unconstrained optimi-
zation is introduced in Conn, Scheinberg, and Vicente, (2009), in
which the authors give a modern perspective on the classical fam-
ily of derivative-free algorithms, focusing on the development of
direct search methods.
The MADS algorithm for non-linear optimization extends the
generalized pattern search (GPS) (Kolda, Lewis, & Torczon, 2004;
Lewis & Torczon, 2000) algorithms. A main advantage of MADS
over the GPS method for both unconstrained and linearly con-
strained optimization is that local exploration of the space of vari-
ables is not restricted to a finite number of directions, called poll
directions. This is the primary drawback of the GPS algorithms,
and the essential motivation in defining MADS was to overcome
this restriction (Audet & Dennis, 2006). The MADS algorithm is a
frame-based method. The MADS frames are specifically aimed at
ensuring an asymptotically dense set of polling directions.

Notation. R, Z, and N, respectively, denote the sets of real


numbers, integers, and non-negative integers.
For a matrix D, the notation d e D indicates that d is a column of
Fig. 1. Feasible operation region for a cogeneration unit. D. The iteration numbers are denoted by the index i.
6558 S.S. Sadat Hosseini et al. / Expert Systems with Applications 38 (2011) 6556–6564

3.1. Features of the MADS algorithm

MADS is an iterative algorithm. The MADS class of algorithms


starts with an initial point with finite function value. This algo-
rithm does not need any derivative information for f. This is impor-
tant when rf is unavailable, either because it does not exist, or it
cannot be accurately evaluated due to noise in f or other reasons.
Each iteration is divided into two steps. The first one is called the
SEARCH step. In this step, any finite set of mesh points can be eval-
uated. This allows great flexibility to choose strategies. The SEARCH
is expressed to be empty when no trial points are considered. The
SEARCH step adds nothing to the convergence theory, except to
provide counterexamples as in Audet (2004). It is notable that
well-chosen search strategies can greatly improve algorithm perfor-
mance (see Abramson, Audet, & Dennis, 2004; Audet & Orban, 2006;
Booker et al., 1999; McKay, Conover, & Beckman, 1979).
The SEARCH step is essential in concentrated on. Lewis and
Torczon (1996) identified practice because it is so flexible, but it
is a difficulty for the theory for the same reason (Abramson, Audet,
& Dennis, 2006). SEARCH can return any point on the underlying
mesh. Of course, it is trying to identify an unfiltered point. Since
the POLL step is the basis of the convergence analysis, it is the part
of the algorithm where most research has been that POLL should
consider points on the mesh neighboring the incumbent solution
in a set of directions whose non-negative linear combinations span
Fig. 2. Basic MADS algorithm.
the space. This may seem simple, but it is an essential observation.
Coope and Price (2002) extended this concept to the idea of frames,
which can be thought of as doing away with the requirement that the present study are briefly described in the following
the POLL points be mesh neighbors. Audet and Dennis (2006) pro- subsections.
posed MADS as a way to implement frames so that the directions
normalized used in infinitely many POLL steps generate a dense 3.2.2.1. Latin hypercube sampling. An easy and a popular experi-
set on the unit sphere at a MADS limit point. This allows strong mental design technique to provide a good global search of the fea-
convergence (Abramson & Audet, 2006; Audet & Dennis, 2006) sible region is Latin hypercube sampling (LHS). The statistical
and excellent computational results for the MADS algorithms method of LHS was made to produce a distribution of plausible col-
(Audet & Orban, 2006; Marsden, 2004). lections of parameter values from a multi-dimensional distribu-
tion. For the first time, the technique was described by McKay
3.2. Description of MADS algorithm et al. (1979). In the area of statistical sampling, a square grid
including sample positions is a Latin square if (and only if) there
The steps in the MADS algorithm are summarized in Fig. 2. is only one sample in each row and each column. A Latin hyper-
These steps are described in the next five subsections. cube is the generalization of this concept to an arbitrary number
of dimensions, whereby each sample is the only one in each axis-
3.2.1. Initialization aligned hyperplane including it.
In the MADS algorithm, a starting point x0 must be specified When a function of n variables is sampled, the range of each
with finite function value, and an initial mesh size parameter variable is divided into M equally probable intervals. Therefore,
Dm0 2 Rþ . The superscripts m and p stand for mesh and poll, M sample points are placed to satisfy the Latin hypercube require-
respectively. ments; it should be noted that this forces the number of divisions,
The algorithm then defines: M, to be equal for each variable. Also note that this sampling
scheme does not need more samples for more dimensions (vari-
p
 Dm
0 6 D0 : the poll parameter size. ables); this independence is one of the most important advantages
 D ¼ fek ; k ¼ 1; 2; . . . ; ng: the positive basis needed to generate of this sampling scheme. One of the other advantages is that ran-
polling directions, where ek is the kth coordinate direction, and dom samples can be taken one at a time, remembering which sam-
n is the number of variables. ples were taken so far.
 M0 ¼ fx 2 S0 g [ fx0 þ Dm 2n
0 Dz : z 2 N g; the mesh, where S0 is the The maximum number of combinations for a Latin hypercube of
set of points where the objective function f had been evaluated M divisions and n variables (i.e., dimensions) can be calculated
Q
by the initialization. Then, MADS proceeds to the quest for an with the following formula: nk¼0 ðM  kÞn1 .
improved mesh point. For instance, a Latin hypercube of M = 4 divisions with n = 2
variables (i.e., a square) has 24 possible combinations. A Latin
3.2.2. SEARCH step hypercube of M = 4 divisions with n = 3 variables (i.e., a cube) has
The SEARCH step can be empty. This means that the algorithm 576 possible combinations. This is convenient for the MADS algo-
can only be implemented as a sequence of POLL steps. This is an rithms, since the feasible region is already equally divided by the
acceptable choice when a local minimizer in the same basin as mesh, and thus the design points must simply be distributed
the initial guess is adequate. Another reasonable strategy is to try properly.
a step in the same direction as a previously successful POLL step. On the region [0; 1] £ [0; 1], Fig. 3 illustrates an example of 4
It should be noted that although this seems reasonable, some random points selected on a two-dimensional Latin hypercube
researchers have found this approach of limited value at best design. From the 16 sectors, 4 random points are selected so that
(Abramson et al., 2006). Three of the SEARCH strategies used in no row or column is repeated.
S.S. Sadat Hosseini et al. / Expert Systems with Applications 38 (2011) 6556–6564 6559

and yet it will serve as a valuable tool for steering the optimizer to-
wards a function minimum.
Surrogates are a specifically good match for combining with
MADS. One reason for this is the inherent flexibility of MADS, be-
cause it can be separated into a SEARCH step, which offers the user
flexibility, and a POLL step which provides the basis for proof of
convergence. Due to this, a surrogate can be incorporated to search
the design space in a matter well-suited to the problem, and the
choice of surrogate is flexible.
The first step in the optimization process is to choose a set of
initial data. The main reason of using LHS method is to find a well
distributed set of initial data in the parameter space, thus ensuring
that each input variable has all portions of its range represented in
the chosen data set. Once the initial data set, x1, ... , xm, is chosen,
the cost function is evaluated at these points, and an initial surro-
gate model is constructed (Marsden, Wang, Dennis, & Moin, 2004).
A Kriging surrogate model is utilized to interpolate the data, and to
predict the value of the function at a particular location in the
parameter space (Marsden, 2004). Kriging is a statistical method
based on the usage of spatial correlation functions. The Kriging
Fig. 3. Latin hypercube design (2-D, 4-points).
algorithm is implemented in DACE software package. It is easily
extended to multiple dimensions, making it attractive for optimi-
zation problems with several variables. A detailed derivation of
3.2.2.2. Particle swarm optimization. Particle swarm optimization the Kriging approximation can be found in Marsden et al. (2004),
(PSO) is a population-based, self-adaptive, stochastic optimization following Lophaven et al. (2002). If the surrogate optimization fails
method Kennedy and Eberhart (1995). The fundamental idea of the to find an improved mesh point, then the surrogate is updated and
PSO is the mathematical modeling and simulation of the food made more accurate with the new point. The use of surrogates usu-
searching activities of a swarm of birds (particles). In the multi- ally yields significant improvement in the objective function value
dimensional space where the optimal solution is sought, each early on in the iteration process (Abramson, 2007).
particle in the swarm is moved toward the optimal point by the Boeing utilizes DACE surrogates (Santner, Williams, & Notz,
addition of a velocity with its position. The velocity of a particle 2003) in their Design Explorer filter implementation (Audet,
is influenced by three elements, namely, inertial, cognitive, and so- Booker, Dennis, Frank, & Moore, 2000). They generate data sites
cial. The inertial element simulates the inertial behavior of the bird by an orthogonal array, and thereafter fit a DACE model to the data.
to fly in the previous direction. The cognitive element models the The SEARCH includes a global Newton SQP method applied to the
memory of the bird about its previous best position, and the social surrogate problem to try to generate several good local optimizers
element models the memory of the bird about the best position for that problem. Afterwards, they use the expensive ‘‘true’’ prob-
among the particles (interaction inside the swarm). The particles lem functions at those points to decide whether the SEARCH has
move toward the multi-dimensional search space until they find been successful. Whenever new values of the true problem func-
the food (optimal solution). tions have been computed, they are utilized to recalibrate the sur-
rogates. This surrogate management framework results in very
successful methods. Details are given in Audet et al. (2000). Alison
3.2.2.3. Surrogate-based methods. In problems with expensive func- Marsden has solved trailing edge shape design problems utilizing
tion evaluations, it is usually beneficial to substitute a model, or both types of surrogates in an insightful way. She generates trial
computationally cheaper approximation to stand in for the true points using the MATLAB DACE surrogate package (Lophaven,
function. This substitution can give an insight into the behavior Nielsen, & Sondergaard, 2002) and thereafter uses a less expensive
of the function without performing excessive costly function eval- turbulence model to check whether a trial point is in X. If it is, then
uations. The term surrogate is used rather than approximation be- she runs the more precise simulation. Her SEARCH includes apply-
cause there is no tendency for implying that anything is required ing an evolutionary algorithm to the DACE surrogates (Marsden,
with respect to how well the surrogates approximate the problem 2004). Another interesting application of surrogates is in Marsden
functions (Booker et al., 1999). In engineering applications, surro- et al. (2004), where a framework to identify good algorithmic
gates can often be differentiated into two basic types: numerical parameter values is provided. To demonstrate this framework,
solutions of simpler physical models, and interpolation based mod- MADS was applied to an objective function that measured the
els of the solutions that are constructed using knowledge of the CPU time required by a trust-region algorithm (Gould, Orban, &
true function values. The former utilizes a simplified model of Toint, 2003) to solve a set of difficult problems.
the physics that hopes to capture some aspect of the function
behavior. The latter utilizes no knowledge of the physics, and is 3.2.3. The POLL step
based only on knowledge of the function values at a set of known The POLL step is more rigidly defined than the SEARCH step.
data points. Whenever the SEARCH step fails to generate an improved mesh
A disadvantage may be that some simplified physics models are point, then the POLL step is invoked before terminating the itera-
costly to evaluate compared to interpolation-based models. In tion. The POLL step includes a local exploration of the space of opti-
addition, a simplified physics model that is not well understood mization variables near the current incumbent solution xi (called
could result in a misleading surrogate that slows down the optimi- the frame center). The set of trial points considered during the
zation. In the current study, an interpolation-based surrogate is POLL step is called a frame. If the POLL step fails to produce an im-
used. It is notable that surrogates are not required to accurately proved mesh point, Pi is expressed as a minimal frame with mini-
represent the function. A surrogate may efficiently capture the glo- mal frame center xi. If both the SEARCH and POLL step is successful
bal behavior of the function, but be inaccurate for local prediction, in finding an improved mesh point, the improved mesh point
6560 S.S. Sadat Hosseini et al. / Expert Systems with Applications 38 (2011) 6556–6564

becomes the new current iterate xi+1 and the mesh is either re- partial derivatives are known, MADS can exploit this information
tained or coarsened. If none of the steps is successful, then the min- to decrease the number of function evaluations in each POLL step
imal frame center is retained as the current iterate (i.e., xi+1 = xi) (Abramson et al., 2004) without sacrificing theoretical convergence
and the algorithm continues to the parameters update step. properties. In related work, Custodio and Vicente (2007) compute a
For MADS, the poll size parameter Dpi 2 Rþ for iteration i is pre- simplex gradient from a subset of formerly evaluated points having
sented. This new parameter determines the magnitude of the dis- certain geometrical properties, and they have studied its use as a
tance from the trial points generated by the POLL step to the potential direction of descent in an effort to speed convergence.
current incumbent solution xi. Because MADS is opportunistic, in that it moves immediately to a
The MADS frame is constructed by the usage of a current new improved mesh point as soon as it is found, the order in which
incumbent solution xi and the poll and mesh size parameters Dpi POLL points are evaluated can impact performance. One approach
and Dmi to obtain a positive spanning set of directions Di. In general, in such improvement can be witnessed is what is called dynamic
the MADS set of directions Di is not a subset of D. polling, in which the most recent successful direction is moved
At iteration i, the MADS frame is defined to be the set to the front of the queue after each successful POLL step. Dynamic
polling was presented useful in Audet and Dennis (2006) on a
Pi ¼ fxi þ Dm
i d : d 2 Di g  M i ð8Þ chemical engineering parameter fit problem (Hayes, Bertrand,
where Di is a positive spanning set such that 0 R Di and for each Audet, & Kolaczkowski, 2003). Custodio and Vicente (2007) have
d e Di, also seen a reduction in function evaluations by computing a sim-
plex gradient and ordering POLL points in accord with how small
 d can be written as a non-negative integer combination of the an angle the corresponding poll directions make with the negative
directions in D: d = Du for some vector u 2 N nDi that may depend of the simplex gradient. However, these strategies (dynamic poll-
on the iteration number i, ing, surrogate and simplex gradient ordering) do not necessarily
 the distance from the frame center xi to a frame point lead to improved computational times in all cases.
xi þ Dm
i d 2 P i is bounded above by a constant times the poll size
p 0 0
parameter: Dm i kdk 6 Di maxfkd k : d 2n Dg, o 3.2.4. Parameters update
 limits of the normalized sets Di ¼ kdk d
: d 2 Di are positive Rules for refining and coarsening the mesh are as follows.
spanning sets. Given a fixed rational number s > 1 and two integers w 6 1
and w+ P 0, the mesh size parameter Dm i is updated according to
The set of all poll directions D ¼ [1 i¼1 Di is said to be asymptoti-
the rule
cally dense if the closure of D equals Rn.
Dm wi m
iþ1 ¼ r Di for some wi
Graphically, the definition of a hypothetical sequence of three  
POLL sets is given by Fig. 4. We can easily check if the above four f0; 1; . . . ; wþ g if an improv ed mesh po int is found
2
properties are satisfied. It is demonstrated in Audet and Dennis fw ; w þ 1; . . . ; 1g otherwise
(2006) that the set of normalized directions formed by [1 i¼1 Di is ð9Þ
dense in the unit circle with probability 1.
There are two strategies to evaluate f at the trial POLL points Pi. In MADS, the strategy for updating Dpi must be such that Dm
i 6 Dpi for
The first one is to terminate iteration i whenever there is a decline all i, and more, it must satisfy
in f. The exhaustive one consists of evaluating the entire POLL set Pi
lim Dm p
i ¼ 0 if and only if lim Di
and thereafter choosing the feasible point leading to the largest de- i2K i2K
crease of the objective function value f. In the present paper, the ¼ 0 for any infinite subset of indices K ð10Þ
first strategy was used since it was practically observed that it usu-
ally reduces the overall number of function evaluation. In some
cases, incomplete derivative information might be available. For 3.2.5. Termination
instance, in some multi-disciplinary design optimization (MDO) Some termination criterion must be specified, such as a mini-
problems, derivatives for some disciplines might be available, but mal value on the mesh size parameter Dm i , a maximal number of
not for others, and derivatives across multiple disciplines are not objective function evaluations, or a maximal number of consecu-
available. If the full gradient is available, directions can be selected tive unsuccessful function evaluations. It is shown in Dolan, Lewis,
so that all but one are ascent directions, which can be ignored, thus and Torczon (2003) that the mesh size parameter is a measure of
reducing the required number of function evaluations to one per first-order stationarity for GPS in the unconstrained case. As soon
iteration (Abramson et al., 2004). In this case, the MADS algorithm as one termination criterion is achieved, the algorithm terminates.
reduces to an approximation of steepest descent. Even if only some Otherwise, it returns to step 2.

Fig. 4. Examples of POLL sets Pi.


S.S. Sadat Hosseini et al. / Expert Systems with Applications 38 (2011) 6556–6564 6561

3.3. Constraints handling Table 1


Parameter settings for the MADS algorithm.

For bound and linearly constrained problems, the only addi- Parameters Setting
tional requirement over those of unconstrained problems is that Termination parameters
the set of poll directions must be selected so as to conform to Mesh size tolerance 106
the geometry of the constraints. In this study, non-linear con- Maximum number of function evaluations 1000
straints are treated using a filter. Filter-based methods attempt Maximum number of consecutive POLL failures 50
Maximum CPU time (s) 120
Mesh parameters
Initial mesh size ðDm
0 Þ
1
Mesh refinement factor 0.5
Mesh coarsening factor 1

to minimize both the objective function f and a non-negative


aggregate constraint violation function h, where h satisfies the con-
dition that h(x) P 0 with h(x) = 0 if and only if x is feasible.
In the context of MADS, a filter is a collection of points, none of
which dominates any other in the set with respect to their objec-
tive and constraint violation function values. That is to say, for
any two points x; y in the filter, the condition f(x) < f(y) and
h(x) < h(y), cannot hold. Polling is performed around either the best
feasible point found thus far, or the least infeasible point in an at-
tempt to add new points to the filter-preferentially new poll cen-
ters. A more detailed explanation can be found in Audet and
Dennis (2004).
Equality constraints are difficult to handle because of basic
properties of the MADS algorithm, and also due to numerical con-
siderations. Equality constraints are difficult to handle because of
Fig. 5. Feasible operation region for the cogeneration unit 2.
basic properties of the MADS algorithm, and also due to numerical
considerations. In order to avoid the problems that equality con-
straints present, one variable should be eliminated and these con-
straints should be reformulated as inequalities.

4. Case studies

Three examples taken from the optimization literature are used


to show the validity and effectiveness of the MADS algorithm. The
MADS algorithm can handle the cases of multiple heat areas and
power areas. For the demonstration, a single heat area and power
area system is first considered (case study I). This case study was
originally proposed by Guo, Henwood, and van Ooijen (1996).
The system tested is comprised of a conventional power unit,
two cogeneration units and a heat-only unit. The objective is to ob-
tain the minimum overall cost of units subject to constraints on
heat and power production and demands. The power capacity lim-
its for the conventional power unit are 0.0 and 150 MW. The heat
capacity limits for the heat only unit are 0.0 and 2695.2 MWth. The
feasible operating regions of the two cogeneration units are shown
in Figs. 5 and 6. The system power demand Pd and the heat demand
Fig. 6. Feasible operation region for the third unit. Hd are respectively 200 MW and 115 MWth.

Table 2
Optimal results for case study I.

Methods Optimal results Cost ($)


P1 P2 P3 H2 H3 H4
GA [5] 0 159.23 40.77 39.94 75.06 0 9267.2
LR [3] 0 160 40 40 75 0 9257.07
ACSA [10] 0.08 150.93 49 48.84 65.79 0.37 9452.2
GT [8] 0 157.92 42.08a 26 89a 0 9207.64
HS [1] 0 160 40 40 75 0 9257.07
SARGA [11] 0 159.99 40.01 39.99 75 0 9257.07
MADS–LHS 0.0017 159.80 40.2014 42.4042 72.3904 0.2054 9277.131
MADS–PSO 0.0092 157.9392 42.0516 42.4459 72.5522 0.0019 9301.357
MADS–DACE 0 160 40 40 75 0 9257.07
a
Outside the feasible operating region of cogeneration unit 3.
6562 S.S. Sadat Hosseini et al. / Expert Systems with Applications 38 (2011) 6556–6564

The objective function of the CHPED problem is: 2007). In order to further test the MADS approach and compare it
with conventional methods, two other tests (case study II and III)
C 1 ¼ 50P1 ð11Þ
are conducted with various power and heat demands. These cases
C 2 ¼ 2650 þ 14:5P 2 þ 0:0345P22 þ 4:2H2 þ 0:03H22 þ 0:031P2 H2 were presented by Song and Xuan (1998). To verify the performance
ð12Þ of the MADS, the program is ran a 50 times on the examples. The
parameters settings for the MADS algorithm are shown in Table 1.
C 3 ¼ 1250 þ 36P3 þ 0:0435P23 þ 0:6H3 þ 0:027H23 þ 0:011P3 H3
The poll directions can be set to MADS Positive basis N + 1 and
ð13Þ 2N, where N is the number of independent variables for the objec-
C 4 ¼ 23:4H4 ð14Þ tive function. The MADS Positive basis 2N directions explore more
X
4 points around the current point at each iteration. Therefore, the Po-
Min f ¼ Ci ð15Þ sitive basis 2N directions are considered as the POLL method in the
i¼1 present study to avoid finding a local minimum rather than the
X
3
global minimum. Both of the dynamic polling and the simplex gra-
Pd ¼ Pi ð16Þ dient strategies are considered to solve the problem. In addition to
i¼1
better performance, the simplex gradient strategy led to improved
X
4
Hd ¼ Hi ð17Þ computational times in all cases compared to the dynamic polling.
i¼2 Each of the CHPED problems was solved using the LHS, PSO and
s:t: 0 6 P1 6 150MW ð18Þ DACE SEARCH strategies.
These examples have been previously solved using a variety of
0 6 H4 6 2695:2MWth ð19Þ
techniques (both evolutionary and traditional mathematical pro-
The CHPED problem formulation of all test cases has been solved gramming methods). The resulting minimum cost are used to com-
using MADS incorporated in the NOMADm software (Abramson, pare the performance of MADS with those of other methods. No
constraints are active for this solution. Table 2 presents the best
solution of this problem obtained using the MADS algorithm and
those reported by other researchers. It can be seen from Table 2
that the result obtained using the MADS–DACE algorithm is the
same as the best known solution reported previously in the litera-
ture (Guo et al., 1996; Subbaraj et al., 2009; Vasebi et al., 2007).
Although Sudhakaran and Slochanal (2003) reported better solu-
tion, this solution is not feasible due to violation of constraints.
Also, the results reported by Subbaraj et al. (2009) have some
problem about their considered feasible operating region (Sadat
Hosseini & Gandomi, 2010). For brevity, only the convergence nat-
ure of the MADS–DACE algorithm is shown in Fig. 7. It is evident
from this figure that MADS–DACE has a considerably fast conver-
gence. Moreover, the best, worst and mean solutions, the standard
deviation and the average CPU times obtained the MADS–LHS,
MADS–PSO, MADS–DACE algorithms are shown in Table 3. It can
Fig. 7. The convergence nature of the MADS–DACE algorithm for case study I.
be seen from this table that MADS–DACE provides significant sav-
ings of computational effort in addition to its superior performance
compared to the other SEARCH methods. These short computing
Table 3
Performance of MADS based on different SEARCH methods for case study I. times allow for more cases to be studied with the goal of increasing
the confidence in the final solution.
Performance SEARCH method
Optimal results obtained by different methods for case studies II
LHS PSO DACE and III are shown in Table 4. It can be seen from this table that the
Best solution 9277.1311 9301.3567 9257.0754 best performance is obtained by the MADS–DACE algorithm for
Worst solution 10005.3805 9997.6576 9260.4317 both case studies. For case study III, the MADS–LHS and MADS–
Mean solution 9547.9151 9551.277 9257.5148
PSO algorithms also provide superior performance compared to
Standard deviation 213.3516 251.1569 1.0743
CPU time (s) 7.0422 7.5594 2.3078 the available methods in the literature. The convergence nature
of the MADS–DACE algorithm is shown in Fig. 8(a) and (b).

Table 4
Optimal results for case studies II and III.

Cases Demand Demand Optimal results Cost ($)


Pd Hd P1 P2 P3 H2 H3 H4
II 175 110 MADS–LHS 0.0066 134.3325 40.6609 44.371 65.6061 0.0229 8622.0748
MADS–PSO 0.1381 128.9305 45.9314 29.9893 80.0027 0.0080 8629.4156
MADS–DACE 0 135 40 35 75 0 8555.9625
GA1 [5] NA 8622.9
GA2 [5] NA 8769.6
III 175 110 MADS–LHS 0 181 44 47.3125 77.6875 0 10101.4753
MADS–PSO 0.0144 182.0725 42.9131 49.0611 75.9298 0.0091 10101.8942
MADS–DACE 0 185 40 50 75 0 10074.4875
GA1 [5] NA 10128
GA2 [5] NA 10132.4
S.S. Sadat Hosseini et al. / Expert Systems with Applications 38 (2011) 6556–6564 6563

Fig. 8. The convergence nature of the MADS–DACE algorithm for (a) case study II, (b) case study III.

Table 5
Performance of MADS based on different SEARCH methods for case studies II and III.

Cases Demand Performance SEARCH method


Pd Hd LHS PSO DACE
II 175 110 Best solution 8622.0748 8629.4156 8555.9625
Worst solution 9442.1843 9046.6340 8555.9652
Mean solution 8921.7626 8799.8773 8555.9625
Standard deviation 252.3315 136.8504 0.0009
CPU time (s) 63.6000 56.5656 3.7094
III 225 125 Best solution 10101.4753 10101.8942 10074.4875
Worst solution 10833.7313 11393.4252 10074.4883
Mean solution 10295.4723 10455.1238 10074.4907
Standard deviation 236.1302 478.2026 0.0011
CPU time (s) 8.1516 7.0563 5.5281

Performance of MADS based on different SEARCH methods for PSO, MADS–DACE methods are compared to that of other tech-
these case studies is illustrated in Table 5. niques reported in the literature. The results clearly demonstrate
that the MADS-based methods are practical and valid for CHPED
5. Conclusion applications. The MADS–DACE algorithm performs superior than
or as well as the other recent methods in terms of solution quality,
This paper has introduced a recent optimization method, handling constraints and computation time.
namely MADS to solve the CHP economic dispatch problem consid-
ering the feasible operating region. MADS is a recently developed References
algorithm that is supported by a thorough convergence analysis.
The MADS method is illustrated using three test cases taken from Abramson, M. A. (2007). NOMADm version 4.6 User’s Guide (pp. 1–19).
the literature. The LHS, PSO and DACE algorithms are employed Abramson, M. A., & Audet, C. (2006). Convergence of mesh adaptive direct search to
second-order stationary points. SIAM Journal on Optimization, 17(2), 606–619.
as effective search strategies in MADS to solve each of the CHPED Abramson, M. A., Audet, C., & Dennis, J. E. Jr., (2004). Generalized pattern searches
problems. The performance of the utilized MADS–LHS, MADS– with derivative information. Mathematical Programming, 100, 3–25.
6564 S.S. Sadat Hosseini et al. / Expert Systems with Applications 38 (2011) 6556–6564

Abramson, M. A., Audet, C., & Dennis, J. E. Jr., (2006). Nonlinear programming by Lewis, R. M., & Torczon, V. (2000). Pattern search algorithms for linearly constrained
mesh adaptive direct searches. SIAG/Optimization Views-and-News, 17, 2–11. minimization. SIAM Journal on Optimization, 10(3), 917–941.
Audet, C. (2004). Convergence results for pattern search algorithms are tight. Lophaven, S. N., Nielsen, H. B., & Sondergaard, J. (2002). Dace: A matlab kriging
Optimization and Engineering, 5, 101–122. toolbox version 2.0. Technical Report [IMM-TR-2002-12]. Technical University of
Audet, C., Bechard, V., & Chaouki, J. (2008). Spent potliner treatment process Denmark. Copenhagen.
optimization using a MADS algorithm. Optimization and Engineering, 9, 143–160. Marsden, A. L. (2004). Aerodynamic noise control by optimal shape design. PhD thesis.
Audet, C., & Dennis, J. E. Jr., (2004). A pattern search filter method for nonlinear Stanford University.
programming without derivatives. SIAM Journal on Optimization, 14(4), Marsden, A. L., Wang, M., Dennis, J. E., Jr., & Moin, P. (2004). Optimal aeroacoustic
980–1010. shape design using the surrogate management framework. Optimization and
Audet, C., & Dennis, J. E. Jr., (2006). Mesh adaptive direct search algorithms for Engineering, 5(2), 235–262.
constrained optimization. SIAM Journal on Optimization, 17(1), 188–217. McKay, M. D., Conover, W. J., & Beckman, R. J. (1979). A comparison of three
Audet, C., Booker, A. J., Dennis, J. E., Jr., Frank, P. D., & Moore, D. W. (2000). A methods for selecting values of input variables in the analysis of output from a
surrogate-model-based method for constrained optimization. In 8th Proceedings computer code. Technometrics, 21, 239–245.
of the AIAA/NASA/USAF/ISSMO symposium on multidisciplinary analysis and Nicosia, G., & Stracquadanio, G. (2007). Generalized pattern search and mesh
optimization. adaptive direct search algorithms for protein structure prediction. In S. Istrail, P.
Audet, C., & Orban, D. (2006). Finding optimal algorithmic parameters using the Pevzner, & M. Waterman (Eds.), Algorithms in bioinformatics (pp. 183–193).
mesh adaptive direct search algorithm. SIAM Journal on Optimization, 17(3), Berlin: Springer.
642–664. Rooijers, F. J., & van Amerongen, R. A. M. (1994). Static economic dispatch for
Booker, A. J., Dennis, J. E., Jr., Frank, P. D., Serafini, D. B., Torczon, V., & Trosset, M. W. cogeneration systems. IEEE Transactions on Power Systems, 93, 1392–1398.
(1999). A rigorous framework for optimization of expensive functions by Sadat Hosseini, S. S., & Gandomi, A. M. (2010). Discussion on enhancement of
surrogates. Structural and Multidisciplinary Optimization, 17, 1–13. combined heat and power economic dispatch using self adaptive real-coded
Chang, C. S., & Fu, W. (1998). Stochastic multi-objective generation dispatch of genetic algorithm. In P. Subbaraj et al. (Eds.), Applied energy (Vol. 86. pp. 915–
combined heat and power systems. IEEE Proceedings-Generation Transmission 921). Applied Energy, 87(4), 1459.
and Distribution, 145(5), 583–591. Santner, T. J., Williams, B. J., & Notz, W. I. (2003). The design and analysis of computer
Conn, A. R., Scheinberg, K., & Vicente, L. N. (2009). Introduction to derivative-free experiments. New York: Springer-Verlag.
optimization. Philadelphia: SIAM. Song, Y. H., Chou, C. S., & Stonham, T. J. (1999). Combined heat and power dispatch
Coope, I. D., & Price, C. J. (2002). Positive bases in numerical optimization. by improved ant colony search algorithm. Electric Power Systems Research, 52,
Computational Optimization and Applications, 21(2), 169–175. 115–121.
Custodio, A. L., & Vicente, L. N. (2007). Using sampling and simplex derivatives in Song, Y. H., & Xuan, Q. Y. (1998). Combined heat and power economic dispatch
pattern search methods. SIAM Journal on Optimization, 18, 537–555. using genetic algorithm based penalty function method. Electric Power
Dolan, E. D., Lewis, R. M., & Torczon, V. (2003). On the local convergence properties Components and Systems, 26(4), 363–372.
of pattern search. SIAM Journal on Optimization, 14(2), 567–583. Su, C. T., & Chiang, C. L. (2004). An incorporated algorithm for combined heat and
Gould, N. I. M., Orban, D., & Toint, P. L. (2003). GALAHAD – a library of thread-safe power economic dispatch. Electric Power Systems Research, 69(2–3), 187–195.
FORTRAN 90 packages for large-scale nonlinear optimization. ACM Transactions Subbaraj, P., Rengaraj, R., & Salivahanan, R. (2009). Enhancement of combined heat
on Mathematical Software, 29(4), 353–372. and power economic dispatch using self adaptive real-coded genetic algorithm.
Guo, T., Henwood, M. I., & van Ooijen, M. (1996). An algorithm for heat and power Applied Energy, 86, 915–921.
dispatch. IEEE Transactions on Power Systems, 11(4), 1778–1784. Sudhakaran, M., & Slochanal, S. M. R. (2003). Integrating genetic algorithms and
Hayes, R. E., Bertrand, F. H., Audet, C., & Kolaczkowski, S. T. (2003). Catalytic tabu economic dispatch search for combined heat and power. Power System
combustion kinetics: Using a direct search algorithm to evaluate kinetic Stability and Control, 67–71.
parameters from light-off curves. Canadian Journal of Chemical Engineering, Torczon, V. (1997). On the convergence of pattern search algorithms. SIAM Journal
81(6), 1192–1199. on Optimization, 7, 1–25.
Kennedy, J., & Eberhart, R. (1995). Particle swarm optimization. Proceeding IEEE Vasebi, A., Fesanghary, M., & Bathaee, S. M. T. (2007). Combined heat and power
International Conference on Neural Networks, 1942–1948. economic dispatch by harmony search algorithm. Electric Power Systems
Kolda, T. G., Lewis, R. M., & Torczon, V. (2004). Optimization by direct search: New Research, 29, 713–719.
perspectives on some classical and modern methods. SIAM Review, 45(3), Wang, L. F., & Singh, C. (2008). Stochastic combined heat and power dispatch based
385–482. on multi-objective particle swarm optimization. International Journal of
Lewis, R. M., & Torczon, V. (1996). Rank ordering and positive bases in pattern search Electrical Power & Energy Systems, 30, 226–234.
algorithms. Technical Report [96-71]. Institute for Computer Applications in Wong, K. P., & Algie, C. (2002). Evolutionary programming approach for combined
Science and Engineering NASA Langley Research Center. Hampton, Virginia. heat and power dispatch. Electric Power Systems Research, 61, 227–232.

You might also like