Neural Network Sorting

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 6

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/5600563

A novel artificial neural network for sorting

Article in IEEE TRANSACTIONS ON CYBERNETICS · February 1999


DOI: 10.1109/3477.752799 · Source: PubMed

CITATIONS READS
20 744

1 author:

Tatiana Tambouratzis
University of Piraeus
85 PUBLICATIONS 308 CITATIONS

SEE PROFILE

Some of the authors of this publication are also working on these related projects:

Analysis and custom-made presentation of music notation View project

Power Plant Operation, Monitoring and Control View project

All content following this page was uploaded by Tatiana Tambouratzis on 22 December 2015.

The user has requested enhancement of the downloaded file.


IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS—PART B: CYBERNETICS, VOL. 29, NO. 2, APRIL 1999 271

of the equalizer, first, the equalizer was trained with 5000 symbols [11] , “Functional link neural network-based adaptive equalization of
to obtain optimum weight values. Then, subsequent 1000 symbols nonlinear channels with QAM signal,” in Proc. IEEE Int. Conf. Systems,
were transmitted and the outputs of the equalizer were obtained for Man, Cybernetics, Vancouver, B.C., Canada, Oct. 1995, pp. 2081–2086.
[12] N. Sadegh, “A perceptron based neural network for identification and
plotting the eye-patterns. control of nonlinear systems,” IEEE Trans. Neural Networks, vol. 4, pp.
The signal constellation of 1000 noisy received signal samples 982–988, Nov. 1993.
(before equalization) along with the eye-patterns of 1000 symbols [13] S. Siu, G. J. Gibson, and C. F. N. Cowan, “Decision feedback equal-
at SNR of 15 dB for CH = 6 with nonlinear model (NL = 1) ization using neural network structures and performance comparison
with standard architecture,” Proc. Inst. Elect. Eng., vol. 137, pt. 1, pp.
is shown in Fig. 10. It may be seen that the effectiveness of 221–225, Aug. 1990.
channel equalization using ANN’s is superior to the linear LMS- [14] J. T. Tou and R. C. Gonzalez, Pattern Recognition Principles. Reading,
based equalizer. Out of the three ANN structures, FLANN-based MA: Addison-Wesley, 1981.
equalizer provides the most effective equalization for both linear and [15] B. Widrow and M. A. Lehr, “30 Years of adaptive neural networks:
nonlinear channel models. Similar observations were made for all the perceptron, madaline and backpropagation,” Proc. IEEE, vol. 78, pp.
1415–1442, Sept. 1990.
six channels with linear and three nonlinear models studied. [16] Z. Xiang, G. Bi, and T.-L. Ngoc, “Polynomial perceptrons and their
applications to fading channel equalization and co-channel interference
suppression,” IEEE Trans. Signal Processing, vol. 42, pp. 2470–2479,
VIII. CONCLUSION Sept. 1994.
Considering channel equalization as a multi-category classification [17] S. S. Yang and C. S. Tseng, “An orthonormal neural network for
problem, it is shown that performance of ANN-based equalizers function approximation,” IEEE Trans. Syst., Man, Cybern. B, vol. 26,
pp. 779–785, Oct. 1996.
provides substantial improvement in terms of convergence rate, MSE
floor level and BER. In a linear equalizer the performance degrades
drastically with increase in EVR, specially when the channel is
nonlinear. However, it is shown that, in the ANN-based equalizer
the performance degradation with increase in EVR is not so severe.
We have introduced a novel FLANN-based equalizer structure for A Novel Artificial Neural Network for Sorting
adaptive channel equalization of nonlinear channels. Because of its
single layer structure the FLANN offers advantages over MLP. The T. Tambouratzis
performance of PPN and MLP is found to be similar in most of
the experiments. But the single layer PPN structure is preferable
Abstract—An artificial neural network (ANN) is employed for sorting
than MLP as it offers less computational complexity. Out of the a sequence of real elements in monotonic (descending or ascending)
three ANN equalizer structures, the performance of the FLANN is order. Although inspired by harmony theory (HT), whereby the same
found to be the best in terms of MSE level, convergence rate, BER construction as for the HT ANN is followed, the proposed ANN differs in
and computational complexity for linear as well as nonlinear channel the mode of operation, namely the obliteration of the consensus (harmony)
function, the circumvention of simulated annealing as a means of settling
models over a wide range of SNR and EVR variations. Because of
to a solution, the simplification of the activation updating of the nodes
computational advantages the FLANN may be used in other signal of the upper layer, the clamping of the nodes of the lower layer, the
processing applications. gradual shrinking of the ANN and the use of an automatic termination
criterion. The creation of the sorted sequence is progressive, whereby at
most as many network updates are required as there are elements in the
REFERENCES sequence. Ties between elements are resolved by simultaneous activation
of the corresponding nodes. Finally, the min and max problems are solved
[1] S. Chen, G. J. Gibson, and C. F. N. Cowan, “Adaptive channel in a single network update.
equalization using a polynomial perceptron structure,” Proc. Inst. Elect.
Eng., vol. 137, pt. 1, pp. 257–264, Oct. 1990. Index Terms—Artificial neural networks, harmony theory, sorting.
[2] S. Chen, G. J. Gibson, C. F. N. Cowan, and P. M. Grant, “Adaptive
equalization of finite nonlinear channels using multilayer perceptrons,”
Signal Process., vol. 20, pp. 107–119, 1990. I. INTRODUCTION
[3] W. S. Gan, J. J. Soraghan, and T. S. Durrani, “A new functional-link
based equaliser,” Electron. Lett., vol. 28, pp. 1643–1645, Aug. 1992. Sorting constitutes a basic operation of many computing tasks,
[4] G. J. Gibson, S. Siu, and C. F. N. Cowan, “The application of nonlinear including VLSI design, digital signal processing, network commu-
structures to the reconstruction of binary signals,” IEEE Trans. Signal nications, database management and data processing, for which it
Processing, vol. 39, pp. 1877–1884, Aug. 1991.
[5] S. Haykin, Adaptive Filter Theory, 2nd ed. Englewood Cliffs, NJ:
has been estimated that sorting operations account for over 25% of
Prentice-Hall, 1991. the total processing time. The significance of sorting is reflected in
[6] M. Meyer and G. Pfeiffer, “Multilayer perceptron based decision feed- the multitude of sorting techniques that have been proposed over the
back equalizers for channels with intersymbol interference,” Proc. IEE, previous decades. The main aim of these techniques, be they serial
vol. 140, pt. 1, pp. 420–424, Dec. 1993. or parallel in construction and operation, is the minimization of the
[7] Y.-H. Pao, Adaptive Pattern Recognition and Neural Networks. Read-
ing, MA: Addison-Wesley, 1989. time and storage requirements of the sorting operation. Instructive
[8] Y.-H. Pao, G.-H. Park, and D. J. Sobajic, “Learning and generalization overviews of serial and parallel sorting techniques are given in [1]
characteristics of random vector functional-link net,” Neurocomputation, and [2], respectively. Representative serial sorting techniques appear
vol. 6, pp. 163–180, 1994.
[9] J. C. Patra, “Some studies on artificial neural networks for signal pro- Manuscript received June 19, 1998; revised November 8, 1998. This paper
cessing applications,” Ph.D. dissertation, Indian Inst. Technol., Kharag- was recommended by Associate Editor R. Popp.
pur, Dec. 1996. The author is with the Institute of Nuclear Technology—Radiation Protec-
[10] J. C. Patra and R. N. Pal, “A functional link artificial neural network for tion, NCSR “Demokritos,” Aghia Paraskevi 153 10, Athens, Greece (e-mail:
adaptive channel equalization,” Signal Process., vol. 43, pp. 181–195, tatiana@ipta.demokri-tos.gr).
May 1995. Publisher Item Identifier S 1083-4419(99)02296-7.

1083–4419/99$10.00  1999 IEEE


272 IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS—PART B: CYBERNETICS, VOL. 29, NO. 2, APRIL 1999

consensus (harmony) function associated with the HT ANN


assigns a harmony value to every state corresponding to its
fitness in constituting a solution of the problem; the states
where the harmony function is maximized (harmony maxima
states) correspond to the optimal solutions. The harmony value
is raised when such activation values are assigned to the HT
ANN nodes as to (for more details on the harmony function the
reader is referred to [19] and [21]): a) maximize the number of
active nodes of the upper layer, b) render the active nodes of the
upper layer compatible (i.e. the signs of the nonzero weights
connecting all the active nodes of the upper layer with the same
node of the lower layer must be identical; this must hold for
all the nodes of the lower layer), and c) render the nodes of the
lower layer compatible (i.e. the signs of the activation values
Fig. 1. The general structure of the HT ANN. of the nodes of the lower layer and the signs of the weights of
the connected active nodes of the upper layer must coincide).
in [3]–[9], while, more recently, parallel sorting techniques have been 4) The HT ANN employs simulated annealing in order to settle
proposed by [10]–[18]. (reach a solution). During simulated annealing, the HT ANN
In this piece of research, a novel artificial neural network (ANN), performs a number of network updates (stochastic updates of
characterized by simplicity and transparency in construction as well the activation values of its nodes) which collectively aim at
as during operation, is proposed for sorting a sequence of N real ele- maximizing harmony. Each update is executed at a specific
ments fni g; i = 1; 2; . . . ; N . The same ANN can sort the elements in value of the temperature parameter T , which is decremented
descending or in ascending order, while ties between the elements are before the next update. The stochastic nature of the updates
accommodated for. The creation of the sorted sequence is progressive, causes nonmonotonic ascent in terms of harmony which, in
whereby at most N node activation updates are required; additionally, turn, ensures that the network settles at a global harmony
an automatic termination criterion of operation is utilized. maximum state (optimal solution).
The proposed ANN is derived from harmony theory (HT) [19] 5) During simulated annealing, each network update is performed
whereby the same construction as for the standard HT ANN is in two passes for a specific value of the temperature parameter
T . The first pass comprises the stochastic update of the activa-
followed. However, its mode of operation constitutes a modified
version of the corresponding HT ANN in terms of the clamping of the tion values of the nodes of the upper layer, while the second
nodes of the lower layer, the simplification of the activation updating pass comprises the stochastic update of the activation values of
of the nodes of the upper layer, the circumvention of the slow and the nodes of the lower layer. For the first network update only,
time-consuming process of simulated annealing as a means of settling an initial random assignment of f01; +1g activation values to
to a solution and the obliteration of the consensus (harmony) function. the nodes of the lower layer is made. Assuming that the HT
This paper is organized as follows: Section II describes the struc- ANN consists of x and y nodes in the upper and lower layer,
ture of the HT ANN, while Section III introduces the modifications respectively, the first pass is described by (1)–(3). The total
leading to the ANN proposed for sorting. The general results as input Ii (i = 1; 2; . . . ; x) from all the nodes of the lower layer
well as a comparison with three recent—and distinct—ANN sorting to each node of the upper layer is calculated:
techniques appear in Section IV. Section V concludes the paper. y
Ii = wij actlj 0 k (1)
j =1
II. HARMONY THEORY ARTIFICIAL NEURAL NETWORKS (HT ANN’s)
The ANN proposed for sorting is derived from the harmony theory where wij denotes the weight of the connection between the
(HT) ANN [19]. The main characteristics of the HT ANN, shown in ith node of the upper layer and the j th node of the lower layer
Fig. 1, are as follows. (j = 1; 2; . . . ; y ); actlj denotes the current activation value of
Relating to construction the j th node of the lower layer and k denotes the threshold
parameter of HT. k is crucial in guiding the HT ANN toward
1) The nodes assume binary activation values and are partitioned
a state of maximum harmony: it specifies the required degree
into two groups, each of which is arranged in a separate layer.
of agreement between the connections of a node of the upper
The nodes of the lower layer assume activation values of
f01; +1g and represent the elements of the problem to be layer and the activation values of the connected nodes of the
lower layer in order for the node of the upper layer to raise
solved, while those of the upper layer assume activation values
of f0; +1g and represent the constraints between the elements.
harmony by becoming active (for more details the reader is
referred to [19] and [21]). Ii is processed through a sigmoid
2) HT is furnished with a training phase whereby the connections
nonlinearity, producing a quantity P between 0 and 1
and the accompanying weights evolve over time; the alternative
of direct construction also exists. In both cases, the connections 1
1 + e0I=T
P = (2)
are symmetric and only allowed between nodes of different
layers. The weights of the connections serve to enforce the where I = Ii . The new activation value actui is produced by
constraints between the elements of the problem. comparing P with a random number R in the range [0; +1]
+1 if P > R
Relating to the mode of operation actui = (3)
0 otherwise
3) The HT ANN state is described by the activation values
of its nodes; these values collectively represent the current The second pass is described by (2), (4), and (5). The total input
problem state and, consequently, the proposed solution. The Ij(j = 1; 2; . . . ; y ) from all the nodes of the upper layer to
IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS—PART B: CYBERNETICS, VOL. 29, NO. 2, APRIL 1999 273

each node of the lower layer is calculated: the first update determines the min and max elements of the
x sequence.
I j =2 ij actui
w (4) 50 ) Each network update comprises a single deterministic pass of
j =1 activation. The activation values of the nodes of the lower
layer aclj (j = 1; 2; . . . ; N ) are uniformly clamped to +1
which is again processed as shown in (2) for I = j . The new
I
or 01 depending on whether the desired order of sorting is
activation value actlj is given by descending or ascending, respectively. Owing to clamping, the
+1 if P > R second pass of activation is obliterated. Concerning the first
actlj =
01 otherwise.
(5) pass, (2) and (3) are substituted by

The comparisons in (3) and (5) are responsible for the stochastic actui =
1 i0
I
(8)
nature of the activation updates of the nodes of both layers. The 0 i<0
I

interested reader is referred to [21] for more details on the effect


i.e., the nodes of the upper layer are reduced into simple
of T on the sharpness of the sigmoid in (2) and, subsequently,
threshold units. Once the activation value of a node of the
on the monotonicity of the ascent in terms of harmony.
upper layer becomes +1, the corresponding element of the
Directly constructed HT ANN’s have been employed in constraint- given sequence is added to the sorted sequence and the node
propagation tasks (electrical circuit operation, scene analysis via line is deactivated (not considered in the next network updates).
labeling) [19], [20], in optimization tasks (map coloring, the n-queens Node deactivation also provides the criterion for the automatic
problem, graph planarization, satellite broadcasting) [21]–[26] and, termination of ANN operation; once all the nodes of the upper
more recently, in clustering tasks [27]. It has been demonstrated that, layer have been deactivated, the entire sorted sequence has
for appropriate values of the HT parameters, an optimal solution of been produced. Equation (1) becomes
the problem is invariably settled upon. Additionally, if more than one
optimal solutions exist, each of them is settled upon with roughly I i = Compi 0 km (9)
equal probability.
i.e., i
I is decomposed into a constant term Compi (i =
III. MODIFIED HT ANN FOR SORTING and the variable term km 1 Compi equals
1; 2; . . . ; N )

The proposed ANN for sorting is derived from the HT ANN. In N


this section it will be explained how Compi = ij actlj
w

j =1
• the construction characteristics of the proposed ANN are inher- N wij ;
j =1 if descending order
0 Nj=1 wij ;
ited from the HT ANN; = (10)
• the mode of operation of the proposed ANN differs from the HT if ascending order
ANN in terms of the circumvention of both simulated annealing
while km (m = 1; 2; . . . ; N ) stands for the progressively de-
and the harmony consensus function as a means of settling to
creasing threshold parameter employed at the network updates
a solution, the simplification of the activation updating of the
and is given by
nodes of the upper layer, the clamping of the nodes of the lower
layer and the progressive deactivation of the nodes of the upper m = N + 1 0 2m
k (11)
layer.
Assuming that a sequence of N elements fni g (i = 1; 2; . . . ; N ) such that the required degree of agreement in order for a node
is to be sorted, the ANN is characterized as follows. of the upper layer to become active is reduced at each update.
Relating to construction As a result of (8)–(11), the nodes of the upper layer assume
10 ) 2 1 N nodes, arranged in two layers of N nodes each. The activation values of +1 in the order in which the corresponding
ith node of the upper layer (i = 1; 2; . . . ; N ) represents the
elements ni appear in the sorted sequence. In the event of ties
ith element ni of the given sequence and its activation value
between l elements at a particular value of km , these elements
of 0 or +1 denotes whether ni must be entered in the sorted are added to the sorted sequence at the same network update,
sequence. while the threshold parameter jumps l consecutive values for
20 ) N 2 symmetric connections. The network is fully connected the next network update (km+l ). If M distinct elements appear
and the weights of the connections are directly determined by in the given sequence, exactly M network updates are required
for producing the entire sorted sequence.
+1; if ni > nj
w ij = 01; if ni < nj (6)
IV. RESULTS COMPARISONS
ni = nj
AND
A; if
In order to sort a given sequence of N elements fni g (i =
(i; j = 1; 2; . . . ; N ),
where the value of A depends on the 1; 2; . . . ; N ), the proposed ANN comprises two layers of N nodes
desired order of sorting each, with full connectivity between nodes of different layers. The
+1; if descending order nodes of the lower layer are clamped to activation values of 61
A =
01; if ascending order.
(7) depending on the desired order of the sorted sequence, whereby
each network update consists of updating the activation values of
the nodes of the upper layer only. As many updates are required for
Relating to the mode of operation producing the sorted sequence as there are distinct elements in the
30 ) The harmony function is not employed as a means of reaching given sequence, while the min and max elements are output in the
a solution. first network update. At each update, as many nodes of the upper layer
40 ) No use of simulated annealing is made. At most N network are deactivated as there are elements added to the ordered sequence.
updates are required for producing the sorted sequence, while Fig. 2 illustrates the proposed ANN for sorting the sample sequence
274 IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS—PART B: CYBERNETICS, VOL. 29, NO. 2, APRIL 1999

TABLE I
COMPARISON OF CONSTRUCTION AND PERFORMANCE
CHARACTERISTICS OF FOUR ANN SORTING TECHNIQUES

Fig. 2. The proposed ANN for sorting the sample sequence f08 72 2 07
: ; : ;
3:15; 0; 2:07g.

f08 72 2 07 3 15 0 2 07g, where the weights have been marked


: ; : ; : ; ; :

next to the corresponding connections.


To create the descending order sequence Sdesc ; A = +1 in (7)
(i.e. w11 = w22 = w33 = w44 = w55 = w25 = w52 = +1),
while all the nodes of the lower layer are clamped to actlj = +1
(j = 1; 2; 3; 4; 5). The values Compi (i = 1; 2; 3; 4; 5) are given by
(10) as Comp1 = 03; Comp2 = 3; Comp3 = 5; Comp4 = 01; [17] can sort a sequence of real elements in descending as well as
and Comp4 = 3, while the consecutive values of the threshold in ascending order, while that of [14] can only sort positive integers
parameter are given by (11) as k1 = 4; k2 = 2; k3 = 0; k4 = 02 in ascending order; the ANN of [18] can also sort the elements of
and k5 = 04. the given sequence in bitonic order. Table I contrasts the four ANN’s
• For the first network update (k1 = 4); I1 = 07; I2 = 01; in terms of construction and performance characteristics. The main
I3 = 1; I4 = 05; and I5 = 01 from (9), whereby only features distinguishing the proposed ANN from the other three ANN’s
actu3 = +1 from (8). The third element of the given sequence are summarized next.
n3 = 3:15 is added to the sorted sequence Sdesc = f3:15g and
1) The proposed ANN is simple in terms of construction and
the third node of the upper layer is deactivated. Additionally, operation. As seen in Table I, it is smaller in the number of
3:15 constitutes the max element of the given sequence. nodes and connections, while simple pair-wise comparisons
• For the second network update (k2 = 2); I1 = 05; I4 = 03 between elements suffice for ANN construction. Additionally,
and I2 = I5 = 1, whereby actu2 = actu5 = +1; a tie l = 2 the total number of operations as well as the node complexity
occurs between the second and fifth elements of the sequence (average number of operations per node activation update) of
n2 = n5 = 2:07 that are simultaneously added to the sorted
the proposed ANN are significantly lower than those of the
sequence Sdesc = f3:15; 2:07; 2:07g; the corresponding two other ANN’s, implying faster sorting. Owing to the modifica-
nodes are deactivated and the value of the threshold parameter tion of the HT ANN the following is true.
jumps to k = k2+l = k4 for the next network update.
• For the third network update (k4 = 02); I1 = 01 and I4 = 1, • Simulated annealing and the stochasticity of correspond-
whereby actu4 = +1 and n4 = 0 is added to the sorted ing updating procedure are substituted by a deterministic
sequence Sdesc = f3:15; 2:07; 2:07; 0g, while the corresponding network updating scheme, while no consensus (harmony)
node is deactivated. function is employed. As a result, 1) the need for a large
• For the fourth update (k5 = 04); I1 = 1; actu1 = +1 number of network updates in order to reach a solution (as in
and n1 = 08:72 is added to the sorted sequence, while the [18]) 2) the search for exact parameter values accomplishing
corresponding node is deactivated. As all the nodes of the upper successful as well as fast ANN settling (as in [14]), and 3)
layer of the ANN have been deactivated, the sorted sequence a consensus-function-based termination criterion concerning
becomes finalized to Sdesc = f3:15; 2:07; 2:07; 0; 08; 72g. ANN operation are obliterated. Unlike the ANN’s of [14]
An identical procedure is followed for the creation of the ascending and [18] whose computational requirements remain con-
order sequence Sasc . The same consecutive values of the threshold stant throughout sorting, the proposed ANN progressively
parameter k are used (k1 = 4; k2 = 2; k3 = 0; k4 = 02 and shrinks, thus further limiting the total number of required
k5 = 04) and A = 01 (w11 = w22 = w33 = w44 = w55 = w25 = operations.
w52 = 01). The nodes of the lower layer are clamped to actlj = 01 • Owing to the clamping of the nodes of the lower layer,
(j = 1; 2; 3; 4), whereby Comp1 = 5; Comp2 = 1; Comp3 = 03; the removal of the sigmoid activation function and the
Comp4 = 3 and Comp5 = 1. obliteration of the consensus function, each network update
In the following, a comparison between the proposed ANN and comprises a single pass of activation which is straight-
three recent ANN sorting techniques [14], [17], [18] is made; the forward to construct and rapid to perform. The relative
three ANN’s are characterized by distinct architectures, none of which simplicity of the activation functions of the nodes of the
is related to harmony theory. Like the proposed ANN, the ANN of proposed ANN becomes evident when the node complexity
IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS—PART B: CYBERNETICS, VOL. 29, NO. 2, APRIL 1999 275

of the proposed ANN is compared to that of [17]: the [4] D. T. Lee, H. Chang, and C. K. Wong, “An on-chip compare/steer bubble
observed difference is due to the intricacy of the activation sorter,” IEEE Trans. Comput., vol. C-30, pp. 396–405, June 1981.
functions of the nodes of first hidden and output layers [17] [5] G. Bilardi and F. P. Preparata, “A minimum area VLSI network for
O(log n) time sorting,” IEEE Trans. Comput., vol. C-34, pp. 336–343,
(both ANN’s are characterized by the lack of a consensus Apr. 1985.
function, a similar logic for the production of the ordered [6] D. Rotem, N. Santoro, and J. B. Sidney, “Distributed sorting,” IEEE
sequence and a comparable total number of operations). Trans. Comput., vol. C-34, pp. 372–376, Apr. 1985.
[7] Y. H. Choi and M. Malek, “A fault-tolerant systolic sorter,” IEEE Trans.
2) The proposed ANN is transparent during operation. Comput., vol. 37, pp. 621–624, May 1988.
[8] R. S. Francis and I. D. Mathieson, “Benchmark parallel sort for
• Unlike the two consensus-function ANN’s [14] and [18], shared memory multiprocessors,” IEEE Trans. Comput., vol. 37, pp.
the termination criterion of the proposed ANN is automatic; 1619–1626, Dec. 1988.
once all the nodes of the upper layer of the ANN are [9] H. M. Alnuweiri and V. K. P. Kumar, “Optimal VLSI sorting with
deactivated, the entire sorted sequence has been produced. reduced number of processors,” IEEE Trans. Comput., vol. 40, pp.
105–110, Jan. 1991.
• The elements are output sequentially, in the order in which [10] M. Atkins, “Sorting by Hopfield net,” in Proc. Int. Joint Conf. Neural
they appear in the sorted sequence. This is preferable to Networks, Washington, DC, 1989, pp. 65–68.
the precedence-matrix [14] and the position-matrix [18] [11] W. Chen and K. Hsieh, “A neural sorting algorithm with (1) timeO
representations of the solution, both of which require further complexity,” in Proc. Int. Joint Conf. Neural Networks, San Diego, CA,
1990, pp. 793–798.
processing for determining the ordered sequence.
[12] K. Y. Siu and J. Bruck, “Neural computation of arithmetic functions,”
• The ties between elements of the given sequence are re- Proc. IEEE, vol. 78, pp. 1669–1675, Oct. 1990.
solved by simultaneous activation of the corresponding [13] Y. Takefuji and K. C. Lee, “A two-step parallel sorting algorithm based
nodes of the upper layer, i.e., in a straightforward and on neural networks,” J. Neural Network Comput., vol. 2, no. 1, pp.
order-of-appearance-independent manner. This contrasts the 30–32, 1990.
[14] Y. Takefuji and K. C. Lee, “A super-parallel sorting algorithm based on
appearance of equiprobable solutions in the same matrix neural networks,” IEEE Trans. Circuits Syst., vol. 37, pp. 1425–1429,
representation [14] as well as the appearance—in the sorted Nov. 1990.
sequence—of equal-valued elements in the order in which [15] R. W. Brockett, “Dynamical systems that sort lists, diagonize matrices
the elements appear in the given sequence [17]. and solve linear programming problems,” Linear Algebra Applicat., vol.
146, pp. 79–91, 1991.
• The min and max elements of the sequence are output in
[16] T. M. Kwon and M. Zervakis, “A parallel sorting network without
a single (first) iteration. Similarly, the proposed ANN is comparators: A neural network approach,” in Proc. Int. Joint Conf.
capable of seeking the part of the sorted sequence that is Neural Networks, Baltimore, MD, 1991, pp. 701–706.
contained within given bounds without the need to perform [17] Y. H. Tseng and J. L. Wu, “Solving sorting and related problems by
all of the network updates. Such an approach is preferable quadratic perceptrons,” Electron. Lett., vol. 28, no. 10, pp. 906–908,
1992.
to the other ANN’s which operate completely in parallel [18] J. Wang, “Analysis and design of an analog sorting network,” IEEE
and, thus, output the entire sorted sequence one-shot. The Trans. Neural Networks, vol. 6, no. 4, pp. 962–971, 1995.
advantage of the proposed ANN becomes especially evident [19] P. Smolensky, “Information processing in dynamical systems: Foun-
when large sequences are involved, whereby a fraction dations of harmony theory,” in Parallel Distributed Processing: Explo-
only of the total number of computations is necessary for rations in the Microstructure of Cognition Foundations, D. E. Rumelhart
and J. L. McClellands, Eds. Cambridge, MA: MIT Press, 1986, pp.
executing the aforementioned operations. 194–281, 1986.
[20] T. Tambouratzis, “An implementation of a harmony theory network for
V. CONCLUSION interpreting line-drawings,” Network: Computation Neural Syst., vol. 2,
no. 4, pp. 443–454, 1991.
A new artificial-neural-network-based sorting technique has been [21] , “A consensus-function artificial neural network for map-
proposed which is capable of sorting a given sequence of real coloring,” IEEE Trans. Syst., Man, Cybern. B, vol. 28, pp. 721–728,
elements in monotonic (descending or ascending) order. Although the Oct. 1998.
[22] , “A simulated annealing artificial neural network implementation
employed artificial neural network is derived from the harmony theory
artificial neural network [19] in terms of construction characteristics,
n
of the -queens problem,” Int. J. Intell. Syst., vol. 12, no. 10, pp.
739–751, 1997.
its mode of operation has been modified in the following respects: [23] , “Edge crossing minimization via artificial neural networks,”
neither the harmony consensus function nor simulated annealing Electron. Lett., vol. 33, no. 19, pp. 1621–1623, 1997.
are employed, the activation functions are simplified, determinis- [24] , “Graph planarization employing a harmony theory artificial neural
network,” Neural Comput. Applicat., vol. 6, no. 4, pp. 116–124, 1997.
tic activation updating is performed, progressive deactivation of [25] P. Burrascano and M. Melis, “A neural technique or satellite coverage
the activated nodes occurs and an automatic termination criterion plans optimization,” in Artificial Neural Networks 2, I. Aleksander and J.
concerning operation is provided. These modifications render the Taylor, Eds. Amsterdam, The Netherlands: North-Holland, 1992, pp.
proposed technique simple in construction as well as transparent and 1499–1502.
rapid during operation. [26] T. Tambouratzis, “A decomposition-coordination artificial neural net-
work for satellite broadcast scheduling,” Electron. Lett., vol. 34, no. 15,
pp. 1503–1504, 1998.
ACKNOWLEDGMENT [27] G. Tambouratzis, T. Tambouratzis, and D. Tambouratzis, “Neural net-
work versus statistical clustering techniques: A pilot study in a phoneme
The author wishes to thank the anonymous referees for their helpful recognition task,” in Mathematics of Neural Networks: Models, Algo-
comments in improving the manuscript. rithms and Applications, Massachusetts, S. W. Ellacott, J. C. Mason,
and I. J. Anderson, Eds. Norwell, MA: Kluwer, 1997, pp. 355–360.
REFERENCES

[1] D. E. Knuth, The Art of Computer Programming: Sorting and Searching.


Reading, MA: Addison-Wesley, 1973.
[2] S. G. Akl, Parallel Sorting Algorithms. New York: Academic, 1985.
[3] D. P. Agrawal and K. K. Agarwal, “Efficient sorting with CCD’s
and magnetic bubble memories,” IEEE Trans. Comput., vol. C-30, pp.
153–157, Feb. 1981.

View publication stats

You might also like