Professional Documents
Culture Documents
Day 2004
Day 2004
www.elsevier.com/locate/sysarc
Received 1 January 2003; received in revised form 19 April 2004; accepted 20 May 2004
Available online 23 July 2004
Abstract
This paper derives a number of results related to the topological properties of OTIS k-ary n-cube interconnection
networks. The basic topological metrics of size, degree, shortest distance, and diameter are obtained. Then results re-
lated to embedding in OTIS k-ary n-cubes of OTIS k-ary (n1)-cubes, cycles, meshes, cubes, and spanning trees are
derived. The OTIS k-ary n-cube is shown to be Hamiltonian. Minimal one-to-one routing and optimal broadcasting
algorithms are proposed. The OTIS k-ary n-cube is shown to be maximally fault-tolerant. These results are derived
based on known properties of k-ary n-cube networks and general properties of OTIS networks.
2004 Elsevier B.V. All rights reserved.
Keywords: Interconnection networks; k-ary n-cube; Optical transpose interconnection systems; Topological properties
1383-7621/$ - see front matter 2004 Elsevier B.V. All rights reserved.
doi:10.1016/j.sysarc.2004.05.002
698 K. Day / Journal of Systems Architecture 50 (2004) 697–705
different parts of the world [3,16]. Optical technol- The OTIS architecture has gained considerable
ogies can provide thousands of high bandwidth attention recently. A number of algorithms have
channels at a single communication point (chip, been developed such as BPC permutations [17],
board, or computer). However, electronic inter- routing, selection and sorting [14,15], data rear-
connects perform better when the distance is up rangement [25], matrix multiplication [22], and
to few millimeters [8]. Motivated by these observa- image processing [21]. Several general topological
tions, new hybrid computer architectures utilizing properties of OTIS networks are obtained in [7].
both optical and electronic technologies have been This paper presents a further contribution in
proposed and investigated [11,18,19]. In these sys- this direction by studying the properties of k-ary
tems short interconnects at the chip level are elec- n-cube networks including basic topological prop-
tronic; while optical interconnects are used for erties, embedding, routing, broadcasting, and
larger distances. fault-tolerance. These results are derived based
Optical Transpose Interconnection Systems on known properties of k-ary n-cube networks
(OTIS) present a new optoelectronic computer [2,6] and general properties of OTIS networks [7].
architecture that takes benefits from both optical
and electronic technologies. In this architecture,
processors are divided into groups where elec- 2. Definition and basic properties
tronic interconnects are used to connect processors
within the same group; while optical interconnects Definition 1. The k-ary n-cube Qkn has N = kn nodes
are used for inter-group communication. The each of the form X = xn1xn2. . .x0, where 0 6
OTIS architecture was used in [15,26] to propose xi < k, for all 0 6 i < n. Two nodes X = xn1xn2. . .x0
interconnection networks for multiprocessor sys- and Y = yn1yn2. . .y0 in Qkn are connected if and
tems, by replacing a subset of wires with pairs of only if there exists i, 0 6 i < n, such that xi = yi ± 1
transmitters and receivers. The result is an interest- (mod k) and xj = yj, for i 6¼ j. For clarity we omit to
ing trade-off between electrical and optical inter- write mod k in similar expressions for the remainder
connects in terms of speed, power consumption of the paper.
and space reduction [8]. It has been shown in [10]
that the bandwidth and the power consumption It is shown in [6] that Qkn has degree 2n and
are optimized when the number of processors in diameter n ºk/2ß. Given two nodes X = xn1xn2
a group equals the number of groups. Several stud- . . .x0 and Y = yn1yn2. . .y0 in Qkn , we denote by
ies of OTIS architecture have then been limited to dH (X,Y) the Hamming distance between X and
the case where the group size is equal to the num- Y, i.e., the number of digits in which X and Y dif-
ber of groups in the system [15,17,23,24,26]. In fer. The length of a shortest path between X and Y
such an OTIS system, an inter-group (optical) link is
Pequal to the Lee distance [2] given by: d L ðX ; Y Þ ¼
n1
connects processor p of group g to processor g of i¼0 w i , where wi = min0 6 i < n(jxiyij, kjxiyij).
group p. The intra-group (electronic) links form
an interconnection topology corresponding to a Definition 2. The OTIS-Qkn network is an undi-
k-ary n-cube in the OTIS k-ary n-cube. rected graph (V, E) given by
More recently, Coudert et al. have studied a
more general OTIS architecture where the group V ¼ fhg; pijg; p 2 Qkn g
size is not necessarily equal to the number of
groups. In [4], authors study the relationship be- and
tween the OTIS architecture and the family of de E ¼ fðhg; p1 i; hg; p2 iÞjðp1 ; p2 Þ is an edge in Qkn g
Bruijn digraphs which leads to minimize the total
[ fðhg; pi; hp; giÞjg 6¼ p in Qkn g:
number of lenses of the optical layout. In [5],
authors have proposed an optical layout of several The OTIS-Qkn is composed of N = kn node-disjoint
kðN Þ
graph topologies (including de Bruijn and Kautz sub-graphs Qkð1Þ kð2Þ
n ; Qn ; . . . ; Qn , called groups.
digraphs) using OTIS. Each of these groups is isomorphic to a k-ary n-
K. Day / Journal of Systems Architecture 50 (2004) 697–705 699
jGj size of a graph G, i.e., the number of We start by showing how to embed disjoint
nodes of G OTIS-Qkn1 sub-graphs in OTIS-Qkn . An example
degG (p) degree of a node p in a graph G showing how three disjoint OTIS-Q31 Õs are embed-
distG (p1, p2) the length of a shortest path between ded in OTIS-Q32 is given in Fig. 2.
the nodes p1 and p2 in G
diam(G) diameter of a graph G, i.e. the maxi- Theorem 1. OTIS-Qkn embeds k disjoint OTIS-Qkn1
mum distG (p1, p2) for all p1 and p2 in G sub-graphs.
The following basic topological metrics of Proof. Qkn can be partitioned into k disjoint Qkn1
OTIS-Qkn are derived from Definitions 1 and 2 sub-graphs. This can be seen by considering the
and from similar results in the literature [17,24]: sub-graph Qk;i k k
n1 of Qn containing all nodes in Qn
k;i
of the form X = ixn2. . .x1x0. Clearly each Qn1 is
1. j OTIS Qkn j¼ k 2n isomorphic to Qkn1 . Furthermore, it is shown in
2n; if g ¼ p [7] that if G embeds k disjoint sub-graphs each iso-
2. degOTISQkn ðhg; piÞ ¼
2n þ 1; if g 6¼ p morphic to a graph H, then OTIS-G embeds k dis-
joint sub-graphs each isomorphic to OTIS-H.
Hence OTIS-Qkn embeds k disjoint copies of
OTIS-Qkn1 . h
00 01 02 00 01 02 00 01 02
Now, we present some additional results related
10 11 12 10 11 12 10 11 12 to the embedding of other structures in OTIS-Qkn
such as cycles, meshes, binary cubes, and spanning
20 21 22 20 21 22 20 21 22 trees.
00 01 02
00 01 02 00 01 02 00 01 02 Theorem 2. For any i, 0 6 i 6 n1, OTIS-Qkn
10 11 12 10 11 12 10 11 12
embeds kn + i disjoint cycles each of length kni.
Fig. 2. (a) Q32 embeds 3 disjoint copies of Q31 (3 cycles of 3 nodes The spanning tree construction is illustrated on
each). (b) OTIS-Q32 embeds 3 disjoint copies of OTIS-Q31 (for OTIS-Q32 . First, an overview of the spanning tree
clarity only the optical links appearing in the OTIS-Q31 sub- construction for Qkn described in [2] is given. Start-
graphs are drawn). ing with node 00. . .0 as a root, each node
X = xn1xn2. . .x0 has two children for each lead-
ing 0 digit in the address. These two children are
Theorem 3. For any i, 0 6 i 6 n1, OTIS-Qkn em- obtained by keeping all digits unchanged in the ad-
beds, with dilation 3, ki node-disjoint kni · kni dress except for one of the leading zeros which is
wrap-around meshes (tori). replaced by 1 in one child and by k1 in another
child. A node may have one additional child ob-
Proof. We have seen in the proof of theorem 2 tained by incrementing (modulo k) the leftmost
that Qkn can be partitioned into ki disjoint cycles non zero digit (if it exists) if it is less than ºk/2ß
each of length L = kni, for any i, 0 6 i 6 n1. It or by decrementing it if it is larger than ºk/2ß + 1.
is shown in [7] that if G embeds a cycle of length Using this tree as a basis, a spanning tree rooted
L, then OTIS-G embeds an L · L wrap-around at Æ00, 00æ in OTIS-Q32 is obtained as shown in
mesh with dilation 3 with group and processor ad- Fig. 3.
dresses corresponding to node addresses in the cy- The construction of a Hamiltonian cycle in
cle of length L. Therefore OTIS-Qkn embeds, with OTIS-Qkn is addressed next.
dilation 3, ki node-disjoint kni · kni wrap-around
meshes. h Lemma 1. If k is odd then OTIS-Qkn is Hamiltonian.
K. Day / Journal of Systems Architecture 50 (2004) 697–705 701
00 01 02 00 01 02 00 01 02
at start and ending at end. This path is ob-
tained from H by deleting its edge between
10 11 12 10 11 12 10 11 12 the nodes start and end.
(ii) The start node of a group is the group number
20 21 22 20 21 22 20 21 22
of the previous group (cyclically) in the above
00 01 02 sequence and the end node of a group is the
00 01 02 00 01 02 00 01 02
group number of the next group in the se-
10 11 12 10 11 12 10 11 12 quence. Therefore there exists an optical link
in OTIS-Qkn connecting the last node in a
20 21 22 20 21 22 20 21 22 group to the first node of the next group in
10 11 12 the circular list.
00 01 02 00 01 02 00 01 02
We therefore have a sequence of Hamiltonian
10 11 12 10 11 12 10 11 12
paths for the kn groups which can be intercon-
20 21 22 20 21 22 20 21 22 nected by optical links to form a Hamiltonian
cycle in OTIS-Qkn . h
20 21 22
Fig. 3. A Spanning tree (thick lines) rooted at Æ00, 00æ in OTIS- Lemma 2. For n P 2, if k is even then there exists
Q32 . a Hamiltonian cycle H ¼ X 0 ; X 1 ; . . . ; X kn 1 ; X kn ;
2 2
Proof. Since k is odd, kn (size of Qkn ) is also odd. X kn 1 in Qkn such that the two nodes X0 and X kn 1
2
k
Let kn = 2p1 and H = X0, X1, . . .,Xp1, Y0, Y1, . . ., are connected in Qn and the two nodes X kn and
2
Yp2, X0 be a Hamiltonian cycle in Qkn . We con- X kn 1 are also connected in Qkn .
struct a Hamiltonian cycle for OTIS-Qkn by con-
necting Hamiltonian paths in the groups of Proof. (by induction on n) For n = 2, Fig. 4 illus-
OTIS-Qkn (each group is a Qkn ) in the following trates the construction of the desired Hamiltonian
group order: X0, Y0, X1, Y1, . . ., Xp2, Yp2, Xp1. cycle in Q62 . The same dual ÔzigzagÕ construction is
The traversed Hamiltonian path in each of these possible for any even k. Now assume n P 3. Let
groups is obtained by following the Hamiltonian X0, X1, . . ., Xkn11, X0 be a Hamiltonian cycle in
cycle H backward starting at a specific node and Qkn1 such that X0 and X kn 11 are connected and
2
ending at the next node listed in the sequence H also X kn1 and X kn1 1 are connected. Let
n1 2
above. The start node and end node for each of q ¼ k 2 1. Since k is even and n P 3, q must be
the groups X0, Y0, X1, Y1, . . ., Xp2, Yp2, Xp1 in odd and kn11 must also be odd. Consider the
this order are given below in the form [start, end] ordering of the nodes of Qkn shown in Fig. 5.
written under the corresponding group number.
This sequence has the same properties (i) and lowing node C on a minimum routing path to-
(ii) listed in the proof of Lemma 1. We therefore wards node D. It uses a routing vector R = Rn1,
have a sequence of Hamiltonian paths for the kn Rn2, . . ., R0 where Ri encodes both the sign and
groups which can be interconnected by optical magnitude of the minimum routing distance along
links to form a Hamiltonian cycle in OTIS-Qkn . h dimension i.
K. Day / Journal of Systems Architecture 50 (2004) 697–705 703
function NEXT Qkn ðC; DÞ //invoked at node Ægs, pcæ to contribute in broadcast-
//returns the node following C on a minimal routing ing M originating at source Ægs, psæ
path from C to D {
{ if gs6¼pc then initiate BROADCAST Qkn of M in
let R = Rn1Rn2. . .R0 be the routing vector from group pc starting at source gs
C to D for each p in ChildrenQkn (ps, pc) do
let i be the smallest j such that Rj 6¼ 0 invoke BROADCAST OTISQkn (M, Ægs, psæ,
if (i does not exist) then Ægs, pæ)
return null //destination has been reached }
else if (Ri > 0) then
return Cn1Cn2. . .Ci + 1 Ci + 1 Ci1. . .C0 BROADCAST OTISQkn traces the optimal height
else return Cn1Cn2. . .Ci + 1 Ci1Ci1. . .C0 spanning tree constructed in Theorem 5 delivering a
} single message copy to each node in OTIS-Qkn .
00 References
01 02 00 01 02 00 01 02
[18] T. Szymanski, Hypermesh optical interconnection net- [25] C.-F. Wang, Algorithms for the OTIS optoelectronic
works for parallel computing, Journal of Parallel and computer, Ph.D. Thesis, Department of Computer Science,
Distributed Computing 26 (1995) 1–23. University of Florida, 1998.
[19] T. Szymanski, H. Hinton, Architecture of a terabit free- [26] F. Zane, P. Marchand, R. Paturi, S. Esener, Scalable
space intelligent optical backplane, Journal of Parallel and network architectures using the optical transpose intercon-
Distributed Computing 55 (1) (1998) 1–31. nection system (OTIS), Journal of Parallel and Distributed
[20] B. Vanvoorst, S. Seidel, E. Barscz, Workload of an iPSC/ Computing 60 (5) (2000) 521–538.
860, Proceedings of Scalable High-Performance Comput-
ing Conference (1994) 221–228.
[21] C.-F. Wang, S. Sahni, Image processing on the OTIS- Khaled Day received his undergraduate
Mesh optoelectronic computer, IEEE Transactions on degree in computer science from
the University of Tunis in 1986 and the
Parallel and Distributed Systems 11 (2) (2000) 97–109. M.Sc. and Ph.D. degrees from the
[22] C.-F. Wang, S. Sahni, Matrix multiplication on the OTIS- University of Minnesota (USA) in
mesh optoelectronic computer, IEEE Transactions on 1989 and 1992 respectively. Dr. Day is
Computers 50 (7) (2001) 635–646. currently an Associate Professor at the
Department of Computer Science of
[23] C.-F. Wang, S. Sanhi, OTIS optoelectronic computers, in: Sultan Qaboos University in Oman.
K. Li, Y. Pan, S.Q. Zhang (Eds.), Parallel Computation His areas of interest include intercon-
Using Optical Interconnections, Kluwer Academic, 1998, nection networks, parallel algorithms,
pp. 99–116. distributed computing and cluster
computing. He is a senior member of
[24] C.-F. Wang, S. Sahni, Basic operations on the OTIS-mesh the IEEE.
optoelectronic computer, IEEE Transactions on Parallel
and Distributed Systems 9 (12) (1998) 1226–1236.