Professional Documents
Culture Documents
Neural Network Sorting
Neural Network Sorting
Neural Network Sorting
net/publication/5600563
CITATIONS READS
20 744
1 author:
Tatiana Tambouratzis
University of Piraeus
85 PUBLICATIONS 308 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Tatiana Tambouratzis on 22 December 2015.
of the equalizer, first, the equalizer was trained with 5000 symbols [11] , “Functional link neural network-based adaptive equalization of
to obtain optimum weight values. Then, subsequent 1000 symbols nonlinear channels with QAM signal,” in Proc. IEEE Int. Conf. Systems,
were transmitted and the outputs of the equalizer were obtained for Man, Cybernetics, Vancouver, B.C., Canada, Oct. 1995, pp. 2081–2086.
[12] N. Sadegh, “A perceptron based neural network for identification and
plotting the eye-patterns. control of nonlinear systems,” IEEE Trans. Neural Networks, vol. 4, pp.
The signal constellation of 1000 noisy received signal samples 982–988, Nov. 1993.
(before equalization) along with the eye-patterns of 1000 symbols [13] S. Siu, G. J. Gibson, and C. F. N. Cowan, “Decision feedback equal-
at SNR of 15 dB for CH = 6 with nonlinear model (NL = 1) ization using neural network structures and performance comparison
with standard architecture,” Proc. Inst. Elect. Eng., vol. 137, pt. 1, pp.
is shown in Fig. 10. It may be seen that the effectiveness of 221–225, Aug. 1990.
channel equalization using ANN’s is superior to the linear LMS- [14] J. T. Tou and R. C. Gonzalez, Pattern Recognition Principles. Reading,
based equalizer. Out of the three ANN structures, FLANN-based MA: Addison-Wesley, 1981.
equalizer provides the most effective equalization for both linear and [15] B. Widrow and M. A. Lehr, “30 Years of adaptive neural networks:
nonlinear channel models. Similar observations were made for all the perceptron, madaline and backpropagation,” Proc. IEEE, vol. 78, pp.
1415–1442, Sept. 1990.
six channels with linear and three nonlinear models studied. [16] Z. Xiang, G. Bi, and T.-L. Ngoc, “Polynomial perceptrons and their
applications to fading channel equalization and co-channel interference
suppression,” IEEE Trans. Signal Processing, vol. 42, pp. 2470–2479,
VIII. CONCLUSION Sept. 1994.
Considering channel equalization as a multi-category classification [17] S. S. Yang and C. S. Tseng, “An orthonormal neural network for
problem, it is shown that performance of ANN-based equalizers function approximation,” IEEE Trans. Syst., Man, Cybern. B, vol. 26,
pp. 779–785, Oct. 1996.
provides substantial improvement in terms of convergence rate, MSE
floor level and BER. In a linear equalizer the performance degrades
drastically with increase in EVR, specially when the channel is
nonlinear. However, it is shown that, in the ANN-based equalizer
the performance degradation with increase in EVR is not so severe.
We have introduced a novel FLANN-based equalizer structure for A Novel Artificial Neural Network for Sorting
adaptive channel equalization of nonlinear channels. Because of its
single layer structure the FLANN offers advantages over MLP. The T. Tambouratzis
performance of PPN and MLP is found to be similar in most of
the experiments. But the single layer PPN structure is preferable
Abstract—An artificial neural network (ANN) is employed for sorting
than MLP as it offers less computational complexity. Out of the a sequence of real elements in monotonic (descending or ascending)
three ANN equalizer structures, the performance of the FLANN is order. Although inspired by harmony theory (HT), whereby the same
found to be the best in terms of MSE level, convergence rate, BER construction as for the HT ANN is followed, the proposed ANN differs in
and computational complexity for linear as well as nonlinear channel the mode of operation, namely the obliteration of the consensus (harmony)
function, the circumvention of simulated annealing as a means of settling
models over a wide range of SNR and EVR variations. Because of
to a solution, the simplification of the activation updating of the nodes
computational advantages the FLANN may be used in other signal of the upper layer, the clamping of the nodes of the lower layer, the
processing applications. gradual shrinking of the ANN and the use of an automatic termination
criterion. The creation of the sorted sequence is progressive, whereby at
most as many network updates are required as there are elements in the
REFERENCES sequence. Ties between elements are resolved by simultaneous activation
of the corresponding nodes. Finally, the min and max problems are solved
[1] S. Chen, G. J. Gibson, and C. F. N. Cowan, “Adaptive channel in a single network update.
equalization using a polynomial perceptron structure,” Proc. Inst. Elect.
Eng., vol. 137, pt. 1, pp. 257–264, Oct. 1990. Index Terms—Artificial neural networks, harmony theory, sorting.
[2] S. Chen, G. J. Gibson, C. F. N. Cowan, and P. M. Grant, “Adaptive
equalization of finite nonlinear channels using multilayer perceptrons,”
Signal Process., vol. 20, pp. 107–119, 1990. I. INTRODUCTION
[3] W. S. Gan, J. J. Soraghan, and T. S. Durrani, “A new functional-link
based equaliser,” Electron. Lett., vol. 28, pp. 1643–1645, Aug. 1992. Sorting constitutes a basic operation of many computing tasks,
[4] G. J. Gibson, S. Siu, and C. F. N. Cowan, “The application of nonlinear including VLSI design, digital signal processing, network commu-
structures to the reconstruction of binary signals,” IEEE Trans. Signal nications, database management and data processing, for which it
Processing, vol. 39, pp. 1877–1884, Aug. 1991.
[5] S. Haykin, Adaptive Filter Theory, 2nd ed. Englewood Cliffs, NJ:
has been estimated that sorting operations account for over 25% of
Prentice-Hall, 1991. the total processing time. The significance of sorting is reflected in
[6] M. Meyer and G. Pfeiffer, “Multilayer perceptron based decision feed- the multitude of sorting techniques that have been proposed over the
back equalizers for channels with intersymbol interference,” Proc. IEE, previous decades. The main aim of these techniques, be they serial
vol. 140, pt. 1, pp. 420–424, Dec. 1993. or parallel in construction and operation, is the minimization of the
[7] Y.-H. Pao, Adaptive Pattern Recognition and Neural Networks. Read-
ing, MA: Addison-Wesley, 1989. time and storage requirements of the sorting operation. Instructive
[8] Y.-H. Pao, G.-H. Park, and D. J. Sobajic, “Learning and generalization overviews of serial and parallel sorting techniques are given in [1]
characteristics of random vector functional-link net,” Neurocomputation, and [2], respectively. Representative serial sorting techniques appear
vol. 6, pp. 163–180, 1994.
[9] J. C. Patra, “Some studies on artificial neural networks for signal pro- Manuscript received June 19, 1998; revised November 8, 1998. This paper
cessing applications,” Ph.D. dissertation, Indian Inst. Technol., Kharag- was recommended by Associate Editor R. Popp.
pur, Dec. 1996. The author is with the Institute of Nuclear Technology—Radiation Protec-
[10] J. C. Patra and R. N. Pal, “A functional link artificial neural network for tion, NCSR “Demokritos,” Aghia Paraskevi 153 10, Athens, Greece (e-mail:
adaptive channel equalization,” Signal Process., vol. 43, pp. 181–195, tatiana@ipta.demokri-tos.gr).
May 1995. Publisher Item Identifier S 1083-4419(99)02296-7.
each node of the lower layer is calculated: the first update determines the min and max elements of the
x sequence.
I j =2 ij actui
w (4) 50 ) Each network update comprises a single deterministic pass of
j =1 activation. The activation values of the nodes of the lower
layer aclj (j = 1; 2; . . . ; N ) are uniformly clamped to +1
which is again processed as shown in (2) for I = j . The new
I
or 01 depending on whether the desired order of sorting is
activation value actlj is given by descending or ascending, respectively. Owing to clamping, the
+1 if P > R second pass of activation is obliterated. Concerning the first
actlj =
01 otherwise.
(5) pass, (2) and (3) are substituted by
The comparisons in (3) and (5) are responsible for the stochastic actui =
1 i0
I
(8)
nature of the activation updates of the nodes of both layers. The 0 i<0
I
j =1
• the construction characteristics of the proposed ANN are inher- N wij ;
j =1 if descending order
0 Nj=1 wij ;
ited from the HT ANN; = (10)
• the mode of operation of the proposed ANN differs from the HT if ascending order
ANN in terms of the circumvention of both simulated annealing
while km (m = 1; 2; . . . ; N ) stands for the progressively de-
and the harmony consensus function as a means of settling to
creasing threshold parameter employed at the network updates
a solution, the simplification of the activation updating of the
and is given by
nodes of the upper layer, the clamping of the nodes of the lower
layer and the progressive deactivation of the nodes of the upper m = N + 1 0 2m
k (11)
layer.
Assuming that a sequence of N elements fni g (i = 1; 2; . . . ; N ) such that the required degree of agreement in order for a node
is to be sorted, the ANN is characterized as follows. of the upper layer to become active is reduced at each update.
Relating to construction As a result of (8)–(11), the nodes of the upper layer assume
10 ) 2 1 N nodes, arranged in two layers of N nodes each. The activation values of +1 in the order in which the corresponding
ith node of the upper layer (i = 1; 2; . . . ; N ) represents the
elements ni appear in the sorted sequence. In the event of ties
ith element ni of the given sequence and its activation value
between l elements at a particular value of km , these elements
of 0 or +1 denotes whether ni must be entered in the sorted are added to the sorted sequence at the same network update,
sequence. while the threshold parameter jumps l consecutive values for
20 ) N 2 symmetric connections. The network is fully connected the next network update (km+l ). If M distinct elements appear
and the weights of the connections are directly determined by in the given sequence, exactly M network updates are required
for producing the entire sorted sequence.
+1; if ni > nj
w ij = 01; if ni < nj (6)
IV. RESULTS COMPARISONS
ni = nj
AND
A; if
In order to sort a given sequence of N elements fni g (i =
(i; j = 1; 2; . . . ; N ),
where the value of A depends on the 1; 2; . . . ; N ), the proposed ANN comprises two layers of N nodes
desired order of sorting each, with full connectivity between nodes of different layers. The
+1; if descending order nodes of the lower layer are clamped to activation values of 61
A =
01; if ascending order.
(7) depending on the desired order of the sorted sequence, whereby
each network update consists of updating the activation values of
the nodes of the upper layer only. As many updates are required for
Relating to the mode of operation producing the sorted sequence as there are distinct elements in the
30 ) The harmony function is not employed as a means of reaching given sequence, while the min and max elements are output in the
a solution. first network update. At each update, as many nodes of the upper layer
40 ) No use of simulated annealing is made. At most N network are deactivated as there are elements added to the ordered sequence.
updates are required for producing the sorted sequence, while Fig. 2 illustrates the proposed ANN for sorting the sample sequence
274 IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS—PART B: CYBERNETICS, VOL. 29, NO. 2, APRIL 1999
TABLE I
COMPARISON OF CONSTRUCTION AND PERFORMANCE
CHARACTERISTICS OF FOUR ANN SORTING TECHNIQUES
Fig. 2. The proposed ANN for sorting the sample sequence f08 72 2 07
: ; : ;
3:15; 0; 2:07g.
of the proposed ANN is compared to that of [17]: the [4] D. T. Lee, H. Chang, and C. K. Wong, “An on-chip compare/steer bubble
observed difference is due to the intricacy of the activation sorter,” IEEE Trans. Comput., vol. C-30, pp. 396–405, June 1981.
functions of the nodes of first hidden and output layers [17] [5] G. Bilardi and F. P. Preparata, “A minimum area VLSI network for
O(log n) time sorting,” IEEE Trans. Comput., vol. C-34, pp. 336–343,
(both ANN’s are characterized by the lack of a consensus Apr. 1985.
function, a similar logic for the production of the ordered [6] D. Rotem, N. Santoro, and J. B. Sidney, “Distributed sorting,” IEEE
sequence and a comparable total number of operations). Trans. Comput., vol. C-34, pp. 372–376, Apr. 1985.
[7] Y. H. Choi and M. Malek, “A fault-tolerant systolic sorter,” IEEE Trans.
2) The proposed ANN is transparent during operation. Comput., vol. 37, pp. 621–624, May 1988.
[8] R. S. Francis and I. D. Mathieson, “Benchmark parallel sort for
• Unlike the two consensus-function ANN’s [14] and [18], shared memory multiprocessors,” IEEE Trans. Comput., vol. 37, pp.
the termination criterion of the proposed ANN is automatic; 1619–1626, Dec. 1988.
once all the nodes of the upper layer of the ANN are [9] H. M. Alnuweiri and V. K. P. Kumar, “Optimal VLSI sorting with
deactivated, the entire sorted sequence has been produced. reduced number of processors,” IEEE Trans. Comput., vol. 40, pp.
105–110, Jan. 1991.
• The elements are output sequentially, in the order in which [10] M. Atkins, “Sorting by Hopfield net,” in Proc. Int. Joint Conf. Neural
they appear in the sorted sequence. This is preferable to Networks, Washington, DC, 1989, pp. 65–68.
the precedence-matrix [14] and the position-matrix [18] [11] W. Chen and K. Hsieh, “A neural sorting algorithm with (1) timeO
representations of the solution, both of which require further complexity,” in Proc. Int. Joint Conf. Neural Networks, San Diego, CA,
1990, pp. 793–798.
processing for determining the ordered sequence.
[12] K. Y. Siu and J. Bruck, “Neural computation of arithmetic functions,”
• The ties between elements of the given sequence are re- Proc. IEEE, vol. 78, pp. 1669–1675, Oct. 1990.
solved by simultaneous activation of the corresponding [13] Y. Takefuji and K. C. Lee, “A two-step parallel sorting algorithm based
nodes of the upper layer, i.e., in a straightforward and on neural networks,” J. Neural Network Comput., vol. 2, no. 1, pp.
order-of-appearance-independent manner. This contrasts the 30–32, 1990.
[14] Y. Takefuji and K. C. Lee, “A super-parallel sorting algorithm based on
appearance of equiprobable solutions in the same matrix neural networks,” IEEE Trans. Circuits Syst., vol. 37, pp. 1425–1429,
representation [14] as well as the appearance—in the sorted Nov. 1990.
sequence—of equal-valued elements in the order in which [15] R. W. Brockett, “Dynamical systems that sort lists, diagonize matrices
the elements appear in the given sequence [17]. and solve linear programming problems,” Linear Algebra Applicat., vol.
146, pp. 79–91, 1991.
• The min and max elements of the sequence are output in
[16] T. M. Kwon and M. Zervakis, “A parallel sorting network without
a single (first) iteration. Similarly, the proposed ANN is comparators: A neural network approach,” in Proc. Int. Joint Conf.
capable of seeking the part of the sorted sequence that is Neural Networks, Baltimore, MD, 1991, pp. 701–706.
contained within given bounds without the need to perform [17] Y. H. Tseng and J. L. Wu, “Solving sorting and related problems by
all of the network updates. Such an approach is preferable quadratic perceptrons,” Electron. Lett., vol. 28, no. 10, pp. 906–908,
1992.
to the other ANN’s which operate completely in parallel [18] J. Wang, “Analysis and design of an analog sorting network,” IEEE
and, thus, output the entire sorted sequence one-shot. The Trans. Neural Networks, vol. 6, no. 4, pp. 962–971, 1995.
advantage of the proposed ANN becomes especially evident [19] P. Smolensky, “Information processing in dynamical systems: Foun-
when large sequences are involved, whereby a fraction dations of harmony theory,” in Parallel Distributed Processing: Explo-
only of the total number of computations is necessary for rations in the Microstructure of Cognition Foundations, D. E. Rumelhart
and J. L. McClellands, Eds. Cambridge, MA: MIT Press, 1986, pp.
executing the aforementioned operations. 194–281, 1986.
[20] T. Tambouratzis, “An implementation of a harmony theory network for
V. CONCLUSION interpreting line-drawings,” Network: Computation Neural Syst., vol. 2,
no. 4, pp. 443–454, 1991.
A new artificial-neural-network-based sorting technique has been [21] , “A consensus-function artificial neural network for map-
proposed which is capable of sorting a given sequence of real coloring,” IEEE Trans. Syst., Man, Cybern. B, vol. 28, pp. 721–728,
elements in monotonic (descending or ascending) order. Although the Oct. 1998.
[22] , “A simulated annealing artificial neural network implementation
employed artificial neural network is derived from the harmony theory
artificial neural network [19] in terms of construction characteristics,
n
of the -queens problem,” Int. J. Intell. Syst., vol. 12, no. 10, pp.
739–751, 1997.
its mode of operation has been modified in the following respects: [23] , “Edge crossing minimization via artificial neural networks,”
neither the harmony consensus function nor simulated annealing Electron. Lett., vol. 33, no. 19, pp. 1621–1623, 1997.
are employed, the activation functions are simplified, determinis- [24] , “Graph planarization employing a harmony theory artificial neural
network,” Neural Comput. Applicat., vol. 6, no. 4, pp. 116–124, 1997.
tic activation updating is performed, progressive deactivation of [25] P. Burrascano and M. Melis, “A neural technique or satellite coverage
the activated nodes occurs and an automatic termination criterion plans optimization,” in Artificial Neural Networks 2, I. Aleksander and J.
concerning operation is provided. These modifications render the Taylor, Eds. Amsterdam, The Netherlands: North-Holland, 1992, pp.
proposed technique simple in construction as well as transparent and 1499–1502.
rapid during operation. [26] T. Tambouratzis, “A decomposition-coordination artificial neural net-
work for satellite broadcast scheduling,” Electron. Lett., vol. 34, no. 15,
pp. 1503–1504, 1998.
ACKNOWLEDGMENT [27] G. Tambouratzis, T. Tambouratzis, and D. Tambouratzis, “Neural net-
work versus statistical clustering techniques: A pilot study in a phoneme
The author wishes to thank the anonymous referees for their helpful recognition task,” in Mathematics of Neural Networks: Models, Algo-
comments in improving the manuscript. rithms and Applications, Massachusetts, S. W. Ellacott, J. C. Mason,
and I. J. Anderson, Eds. Norwell, MA: Kluwer, 1997, pp. 355–360.
REFERENCES