Professional Documents
Culture Documents
Congestion Aware Fplan Synthesis
Congestion Aware Fplan Synthesis
936
capacity of a grid is defined as the number of available
vertical routing tracks. The usage of a grid is defined as 5. Congestion aware high level synthesis
the number of used routing tracks within the grid. The algorithm
horizontal usage of a grid is the number of used horizontal
routing tracks and the vertical usage of a grid is the The congestion aware high level synthesis algorithm
number of used vertical routing tracks. proposed in this paper is based on simulated annealing
Assume the pins are located at the lower left and upper (SA) algorithm. High level synthesis algorithm and the
right corners of the bounding box, the two-pin net covers method of computing cost function have been introduced
an m × n mesh, where m and n are the number of rows and in section III and IV. The algorithm flow chart is show as
the number of columns. We define F(m, n) as the total in Figure 2. The algorithm is a double loop. The outer
number of possible ways to optimally route a two-pin net loop is a gradually cooling process, and the inner loop is a
covering the mesh. practical iteration. The iteration count and the reject
F(m, 1)= F(1, n)=1 function are determined by the current temperature. When
F(m, n)= F(m-1, n)+ F(m, n-1) each loop is executed, we get a new high level synthesis
F(m, n)= F(n,m) result, and then accordingly update the routing demand of
F(m, n)=C(m+n-2, m-1)= C(m+n-2, n-1) global edges and computer the new cost function value. If
Assume Px(i,j) and Py(i,j) represent the probabilities of the cost function value of the new solution is equal of less
horizontal and vertical usages for this net in grid(i, j) than the current cost function value, we accept it.
⎧a : F (m, n − 1)
⎪b :1 (1) Otherwise, we accept it with the probability of
exp ( ( cost-new cost ) / ( k * T ) ) .
⎪
⎪c : F (m − i + 1, n − 1)
1 ⎪
Px (i, j ) = × ⎨ F (m, n − j + 1) + F (m, n − j )
F ( m, n ) ⎪ d :
⎪ 2
⎪ F (i, j ) F (m − i + 1, n − j ) + F (i, j − 1) F (m − i + 1, n − j + 1)
⎪⎩e : 2
⎧ a : F (m − 1, n)
⎪b :1 (2)
⎪
⎪ F (m − i + 1, n) + F (m − i, n)
1 ⎪c :
Py (i, j ) = ×⎨ 2
F (m, n) ⎪
d : F (m − 1, n − j + 1)
⎪
⎪ F (i, j ) F (m − i, n − j + 1) + F (i − 1, j ) F (m − i + 1, n − j + 1)
⎪⎩e : 2
case a : i = 1, j = 1
case b : i = 1, j = n
case c :1 < i < m, j = 1
case d : i = 1,1 < j < n
case e : other
According to the formula (1) and (2), we can obtain
the routing demand of every global edge and the routing
demand of the whole chip using the same method. Then
we can find the routing overflow edge.
Since our objective of optimization is to distribute the
routing most evenly, the cost function here is to indicate
the degree of evenly-distribute of the routing demands.
The objective cost function is shown as formula (3) (4).
⎧⎪0 di < d Figure 2 Flow chart of the congestion aware high level
ci = ⎨ (3)
synthesis method
⎪⎩ di / d d i ≥ d
Cost= ∑ci
i (4) 6. Experimental results
Where di is the routing demand on global edge i; d is The experiments are carried out on a PC Pentium 4
with 3.0GHz master frequency and 512M memory. We
the average routing demand; ci is the cost on global edge i.
have implemented our methods in C language. Three
The bigger the value of ci is, the more the routing demand benchmarks are used in our high level synthesis method,
exceeds the average value. Formula (4) obtains the cost of which are 11th order FIR filter FIR11, 7th order IIR filter
the high level synthesis scheme by accumulating all the IIR7 and elliptical wave filter EWF. We compared our
cost of global edges. method with traditional method and CRSF method in [3].
937
Table 1 Maximum congestion value
FIR11 IIR7 EWF 7. Conclusion
Traditional 91.4 112.2 24.5
This paper proposes a novel routing congestion aware
CRSF 52.9 62.9 20.6 high level synthesis algorithm. A probabilistic congestion
Our method 49.7 58.6 19.1 estimation model is given first, and then the routing
congestion method based on simulated annealing and
Table2 Routing demand distribution of fir11 genetic algorithm is proposed. Experimental data show
that our algorithm can make the wires evenly distributed
Number of global edges routing demands range
on the chip before physical design begin. The maximum
0~10 11~25 26~50 >50 congestion value is about 6% less than CRSF, and 40%
Traditional 3768 204 25 14 less than the traditional method.
CRSF 3908 94 7 2
Our method 3941 65 5 0 References
Table3 Routing demand distribution of iir7 [1] Jinan Lou, Shashidhar Thakur, Shankar Krishnamoorthy,
Henry S. Sheng, Estimating routing congestion using
Number of global edges routing demands range probabilistic analysis, IEEE transactions on computer aided
0~10 11~25 26~50 >50 design of integrated circuits and systems, Vol.21, No.1,2002
[2] Vijay Sundaresan, Ranga Vemuri, A novel approach to
Traditional 1127 131 28 14 performance-oriented datapath allocaton and floorplanning,
CRSF 1134 137 26 3 Proceedings of the 2006 Emerging VLSI Technologies and
Our method 1139 140 20 1 Architectures, 2006
[3] Um Junhyung , Kim Jae-hoon , Kim Taewhan, Layout
Table4 Routing demand distribution of EWF driven resource sharing in high-1evel synthesis. Proceedings of
International Conference of Computer Aided Design,San Jose
Number of global edges routing demands range ,2002:614—618
0~5 6~15 16~22 >22 [4] Y.Wang, J.Bian, Q.Wu and H.Hu, Reallocation and
rescheduling after floor-planning for timing optimization, ASIC,
Ttraditional 2133 211 12 2
2003, Proceedings 5th International Conference, pp.212-215
CRSF 2235 118 5 0 Vol.1. 2003.
Our method 2234 120 4 0 [5] A stammermann, D.Helms, M.Schulte, A Schulz and
W.Nebel, Binding allocation and flooplanning in low power
high-level synthesis, Computer Aided Design, 2003, ICCAD,
Table I shows that the maximum congestion value in International conference,pp.544-550,2003.
our algorithm is about 6% less than CRSF, and about 40% [6] Mohamed A.Elgamel, Magdy A.Bayoumi, On low power
less than traditional method. In our algorithm, each global hign level synthesis using Genetic Algorithm, Proceeding of the
edge has assigned a value indicating routing demands. For 9th International conference on Electronics, Circuits and
benchmark FIR11, Table II shows the routing demands systems, Vol.2, pp:725-728,2002
amount within different value regions of global edges. It [7] Lin Zhong, Niraj K.Jha, Interconnect-aware low-power high-
can be seen that the number of global edges with high level synthesis, IEEE transactions on computer aided design of
routing demands decrease, at the same time, the number integrated circuits and systems, 2005,24(3):336-351
of global edges with low routing demands increase in our [8] Davide Pandini, Lawrence T.Pileggi, and Andrzej
J.Strojwas, Congestion aware logic synthesis, Proceedings of the
algorithm. For FIR11 in Table II, the edge with highest
2002 Design, Automation and Test in Europe Conference and
routing demands (>50) completely disappears, and the Exhibition, 2002
edges with middle routing demands also reduced while [9] Thomas Kutzschebauch, Leon Stok, Congestion aware
the edges with low routing demands increase compared layout driven logic synthesis, Proceedings of the International
with CRSF algorithm. The data show that the routing conference on Computer-aided design, 2001, 216 - 223
demands are distributed much more evenly on the chip [10] Wang W J, Bian J N, Wang Y F. A congestion driven re-
without congestion in some local areas by our method. synthesis method after floorplanning. Proceedings of
Experimental results of testing benchmark IIR7 and EWF International Conference on Communications, Circuits and
are shown in table III and IV, which also indicate the Systems.2005:1220 一 1240
routing demands are more evenly distributed in our
algorithm than traditional method and CRSF.
938