Professional Documents
Culture Documents
12
12
12
Abstract—In this paper (split into Parts I and II), an extensive So far various analyses have been carried out, each focusing
comparison of existing flip-flop (FF) classes and topologies is car- on aspects pertinent with FFs comparison [1], [2], [4]–[11].
ried out. In contrast to previous works, analysis explicitly accounts Among these works, [4]–[6] are the most thorough in terms of
for effects that arise in nanometer technologies and affect the en-
ergy-delay-area tradeoff (e.g., leakage and the impact of layout and adopted figures of merit and evaluated parameters. However,
interconnects). Compared to previous papers on FFs comparison, previous comparisons exhibit (some of) the following lacks.
the analysis involves a significantly wider range of FF classes and • They involve a limited number of topologies and/or do not
topologies. In particular, in this Part I, the comparison strategy, cover the entire spectrum of applications and con-
which includes the simulation setup, the energy-delay estimation straints that are observed in real designs (therefore, no uni-
methodology, and an overview of an optimum design strategy, to-
gether with the introduction of the analyzed FF classes and topolo- form comparison is available for the wide range of existing
gies, are reported. FF classes).
• The area-delay and leakage-delay tradeoffs are usually not
Index Terms—Clocking, energy-delay tradeoff, energy effi- considered at all.
ciency, flip-flops (FFs), high speed, interconnects, leakage, Logical
Effort, low power, nanometer technologies, VLSI.
• Circuit designers are typically accustomed to think in terms
of minimum energy-delay products or . Instead, a
fair comparison should take into account the FFs behavior
over the whole space [6], [12]–[14].
I. INTRODUCTION
• The FF input capacitance is typically assumed fixed
or at most swept as a parameter in a narrow range, whose
extent is selected in a naïve manner. Hence, it is not clear
to a well-
microprocessors [1]. Indeed, FFs affect the clock frequency, defined point in the space.
since their delay occupies a significant fraction of the clock • Till now, the most significant FF analyses in the litera-
cycle, especially in fast micro-architectures with low logic depth ture have not adopted sub-100-nm technologies, thereby
[2]. Moreover, together with the circuits devoted to the clock neglecting:
generation and distribution, FFs are part of the clock network, • the leakage influence in active and in standby modes;
which is responsible for 30%–50% of the whole chip energy • the impact of layout parasitics associated with intercon-
budget [3], [4]. nects, which degrade both speed and energy.
Various classes of FFs have been proposed to achieve a de- In this paper (split into Parts I and II), a novel analysis and
sired energy-delay tradeoff and depending on the fea- comparison strategy is proposed, which suitably accounts for all
tures of the application (high speed, low energy, low standby the previously mentioned aspects to achieve fair and meaningful
energy, etc.). Understanding the suitability of FFs for a given results. Such strategy is applied to compare a large number of FF
application is difficult and so is their selection, since it involves classes (4) and topologies (19) in a 65-nm CMOS technology.
a large number of existing topologies and depends on transis- In particular, we show the following.
tors sizing. In particular, an appropriate sizing methodology is 1) The comparison is carried out by including local wires par-
necessary to get reliable results usable in practical designs. asitics within the transistors sizes optimization. To limit the
number of real layouts, wires parasitics are extracted from
Manuscript received September 10, 2009; revised December 21, 2009. some reference layouts and are estimated for different tran-
M. Alioto is with the Dipartimento di Ingegneria dell’Informazione (DII), sistors sizes from the analysis of stick diagrams.
Università di Siena, 53100 Siena, Italy and also with the Berkeley Wireless
Research Center—Electrical Engineering and Computer Science Department,
2) Leakage is separately evaluated and its impact is analyzed
University of California, Berkeley, CA 94704-1302 USA (e-mail: malioto@dii. in both active and standby mode.
unisi.it; alioto@eecs.berkeley.edu) 3) The space is explored by considering the points
E. Consoli and G. Palumbo are with the Dipartimento di Ingegneria Elettrica, where products are minimized ( and are widely
Elettronica e dei Sistemi (DIEES), Università di Catania, I-95125 Catania, Italy
(e-mail: econsoli@diees.unict.it; gpalumbo@diees.unict.it). varied to cover this space). Accordingly, every design is
Digital Object Identifier 10.1109/TVLSI.2010.2041376 associated with a point in the space that has a clear
1063-8210/$26.00 © 2010 IEEE
Authorized licensed use limited to: BANGALORE INSTITUTE OF TECHNOLOGY. Downloaded on March 30,2010 at 02:23:17 EDT from IEEE Xplore. Restrictions apply.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
the FF input capacitance seen from the clock terminal, both the
transistors and local (i.e., internal to the FF) interconnects ca-
pacitances are taken into account.
As concerns the FF data input signal, we follow a different
approach with respect to [1], [2], where another constant slope
policy was adopted for simplicity. Indeed, in real pipelines, the
speed of the logic block driving the FF data input is obviously
comparable to the FF speed. Accordingly, the size of the data-
driving inverter close to the FF, , is set so that the slope of
the FF data input signal ( in Fig. 1) is equal to the slope at the
output of the FF first stage that is driven by . The latter slope
is estimated by resorting to the Logical Effort (LE) model [16].
Indeed, during the exploration of the design space, the sizes of
all FF transistors are known and LE model can be applied (also
including the layout parasitics).
In the case of circuits that are driven by complementary clock
(e.g., master-slave FFs) or data (e.g., differential FFs) signals,
both polarities are generated through buffers that are considered
Fig. 1. Test bench circuit used to characterize a generic FF.
as external to the FF, in order to avoid a penalty with respect
to other circuits [8]. Moreover, we assume that the comparison
of inverting and non-inverting FFs does not require further ar-
meaning, which links results to the hardware intensity con-
rangements and that neither of them is presumptively better than
cept in [15]. This allows for gaining a deeper insight into
the other ones [8].
the tradeoff.
Finally, the output load is swept to test the FF response under
4) is a design variable allowing for further exploring the
light, moderate and heavy loading conditions [8]. Typical rea-
potentials of each topology in the minimization of different
sonable loads are , being the input capac-
figures of merit, differently from [6], where separate en-
itance of a symmetrical minimum inverter (i.e., with
ergy-efficient curves (EECs) were extracted under very few
). Greater loads are not considered since, ac-
(three) different parametrical values.
cording to LE, they usually require the insertion of a buffer at
5) In addition to the thorough investigation of the
the FF output, which alters the intrinsic energy-delay FF fea-
tradeoff, the interdependence of several other circuit pa-
tures [8]. Observe that the first loading inverter in Fig. 1 that
rameters is analyzed, including leakage, silicon area and
loads the FF output is in turn loaded by another inverter, which
clock load. To this aim, appropriate figures of merit to rank
is 4 times wider to avoid an unrealistically strong Miller effect
the considered FF classes and topologies are introduced.
in the gate-drain capacitances at the FF output.
This paper is divided into two parts: Part I presents the com-
parison methodology, the design strategies and the FF topolo- B. Definition of Speed and Timing Parameters
gies to be analyzed, whereas the results are reported and com-
mented in detail in Part II [42]. The timing parameters characterizing a FF are well-known
Part I is structured as follows. In Section II the simulation and are accurately described in [2]. They are as follows:
and analysis setup is shown. The space analysis and the 1) the minimum data-to-output delay , which is ob-
related topics are in Section III. The design-optimization strate- tained by selecting the optimum data-to-clock delay;
gies for FFs are discussed in Section IV. Section V presents the 2) the setup time , which is the optimum data-to-clock
selected FF classes and topologies. Finally, Section VI reports a delay that leads to ;
brief conclusion. This paper also includes an Appendix related 3) the minimum clock-to-output delay , occurring
with the accurate evaluation of the transient energy dissipation. when the data input transition occurs well before the clock
transition;
4) the hold time , which is the clock-to-data delay that
II. SIMULATION SETUP, ENERGY-DELAY ESTIMATION AND leads to a 5% increment of clock-to-output delay with
ANALYSIS OF LAYOUT ISSUES respect to .
In the analysis of the FF behavior within the space,
A. Test Bench: Driving Circuits and Output Load
the speed is quantified through the minimum achievable
Fig. 1 shows the setup used to test a generic FF, which is data-to-output delay, i.e., , according to [6]. In-
similar to that proposed in [1], [2] but with some differences. deed, represents the FF timing contribution to the cycle
The clock signal fed to the FF comes from a two-stage buffer, time when the FF is placed into a critical path [5]. Moreover,
sized to attain a typical slope [1], [2] at the clock input every delay is evaluated by considering the greatest among
node of the FF ( is the slope of the output waveform of all the possible data-to-output paths [namely two
an inverter loaded by inverters of the same size [16]). Hence, for the single-edge-triggered (SET) FFs and four paths for the
the size of the clock-driving inverter close to the FF is dual-edge-triggered (DET) FFs]. On the other hand, FFs lying
set to get an electrical effort equal to 3 [16]. When evaluating in fast paths do not affect system speed. Anyhow, data races
Authorized licensed use limited to: BANGALORE INSTITUTE OF TECHNOLOGY. Downloaded on March 30,2010 at 02:23:17 EDT from IEEE Xplore. Restrictions apply.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
ALIOTO et al.: ANALYSIS AND COMPARISON IN THE ENERGY-DELAY-AREA DOMAIN OF NANOMETER CMOS FLIP-FLOPS 3
Authorized licensed use limited to: BANGALORE INSTITUTE OF TECHNOLOGY. Downloaded on March 30,2010 at 02:23:17 EDT from IEEE Xplore. Restrictions apply.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
and where only Poly, Metal1, and Metal2 layers are employed
to connect transistors. In particular, the length of these inter-
connects depend on the relative position of the transistors along
their path, as well as the physical dimension of these transistors.
Stick diagrams are a useful tool to evaluate the relative posi-
tion [17], whereas the physical dimension of transistors can be
found from detailed layout consideration. From these consid-
erations, the adopted systematic methodology to extract inter-
connects parasitics is that reported in the Appendix A of [24].
Once the interconnect lengths are derived according to the above
methodology, the layout parasitics are found by multiplying the
lengths by the capacitances per unit length of the connecting Fig. 2. Energy-Efficient curve and minimum E D design points.
layers. Observe that the capacitances per unit length depend on
the detailed layout configuration, hence they must be properly
extracted from the design kit (see [24, App. A] for details). for a given delay (energy). Given the circuit topology, load and
The above procedure leads to the definition of expressions of supply voltage, the EEC has a hyperbolic shape, as shown in
parasitics that depend on the sizes of the FF transistors. Hence it Fig. 2 [15]
can be automated in circuit simulations to include the effect of
interconnects when sizing transistors. The accuracy of the above (2)
procedure has been validated through comparison with real lay-
outs of minimum- designs. On average, the parasitics are where and are the energy and delay asymptotes of the cir-
underestimated by 10%–25% (for a detailed example, see Ap- cuit, and the parameter is determined by fitting experimental
pendix A of [24]). The error in the delay and energy estimation is data (typically ). Actually, there is a minimum energy
actually lower (5%–10%), because each node capacitance also that is achievable with the minimum transistors sizes al-
includes the contribution of transistors. lowing correct operation. Therefore, the points between and
The analysis of all topologies shows that layout parasitic ca- must be discarded in the EEC (see Fig. 2). Regarding ,
pacitances are comparable to (or even greater than) the capaci- it can be only asymptotically approached through transistors
tances of all transistors connected to the same node. From the LE sizing, and measures the speed potential of a specific topology.
perspective, this means that the branching effort due to the pres- can be estimated through methods that are focused on speed
ence of layout parasitics is typically 2 or more, thus confirming optimization (e.g., the LE method [16]).
that interconnects have a huge impact on the FF speed and en- The resulting EEC is made up by the points minimizing the
ergy. Compared with the case where such parasitics are not in- products , with , as shown in Fig. 2. This result
cluded (as in previous works, [1]–[11]), the transistors sizes to was found in [15], [28] referring to the slightly different metrics
maintain a given speed strongly increase in high-speed designs , where the parameter is called “hardware intensity”.
and the energy is nearly doubled. A fair comparison among FFs requires an exhaustive anal-
To conclude, layout parasitics must be included in the tran- ysis across the entire space, i.e., the extraction of
sistor-level design loop to correctly characterize the the EEC under different loading, switching activity and logic
tradeoff of an FF, as well as to fairly compare different topolo- depth conditions. This task was somewhat accomplished in [6]
gies. Moreover, observe that this procedure can predict the size through extensive simulations, but with no reference to any
of the FF cell, other than the length of interconnects. Hence, it design metric. Instead, in the following, the relation between
is also a useful tool to estimate the FF area, thereby enabling a the EEC and the metrics is exploited to efficiently extract
thorough analysis of the energy-delay-area tradeoff for the first the EEC curve, as well as to give a clear physical meaning to
time. each point of the EEC.
Authorized licensed use limited to: BANGALORE INSTITUTE OF TECHNOLOGY. Downloaded on March 30,2010 at 02:23:17 EDT from IEEE Xplore. Restrictions apply.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
ALIOTO et al.: ANALYSIS AND COMPARISON IN THE ENERGY-DELAY-AREA DOMAIN OF NANOMETER CMOS FLIP-FLOPS 5
Authorized licensed use limited to: BANGALORE INSTITUTE OF TECHNOLOGY. Downloaded on March 30,2010 at 02:23:17 EDT from IEEE Xplore. Restrictions apply.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
(3)
Authorized licensed use limited to: BANGALORE INSTITUTE OF TECHNOLOGY. Downloaded on March 30,2010 at 02:23:17 EDT from IEEE Xplore. Restrictions apply.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
ALIOTO et al.: ANALYSIS AND COMPARISON IN THE ENERGY-DELAY-AREA DOMAIN OF NANOMETER CMOS FLIP-FLOPS 7
The simulated and theoretical sensitivities are plotted in Fig. 5 VI. CONCLUSION
for all the optimum designs found within this work under var- In Part I, exhaustive analysis and design methodologies for
ious conditions and for all the considered FFs (see detailed re- nanometer CMOS FFs have been presented. Such methodolo-
sults in Part II). Detailed numerical values of the mean, standard gies are based on the notion of the Energy Efficient Curve and
Authorized licensed use limited to: BANGALORE INSTITUTE OF TECHNOLOGY. Downloaded on March 30,2010 at 02:23:17 EDT from IEEE Xplore. Restrictions apply.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
Fig. 6. Schematics of the analyzed FFs and variable widths (w ) to be optimized: (a) TGFF; (b) WPMS; (c) GMSL; (d) DTLA; (e) HLFF; (f) SDFF; (g) USDFF;
(h) IPPFF; (i) CPFF; (j) SEPFF; (k) TGPL; (l) MSAFF; (m) STFF; (n) CCFF; (o) VSWFF; (p) DET-TGLM; (q) DET-SPGFF; (r) DET-SPL; (s) DET-CDFF.
on the evaluation of its points that correspond to figures of merit meaning. Moreover, the FF input capacitance is considered as a
in the energy-delay space, which have a clear physical further independent variable to be optimized and the impact of
Authorized licensed use limited to: BANGALORE INSTITUTE OF TECHNOLOGY. Downloaded on March 30,2010 at 02:23:17 EDT from IEEE Xplore. Restrictions apply.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
ALIOTO et al.: ANALYSIS AND COMPARISON IN THE ENERGY-DELAY-AREA DOMAIN OF NANOMETER CMOS FLIP-FLOPS 9
Fig. 7. Layouts of the analyzed FFs (minimum ED sizing): (a) TGFF; (b) WPMS; (c) GMSL; (d) DTLA; (e) HLFF; (f) SDFF; (g) USDFF; (h) IPPFF; (i) CPFF;
(j) SEPFF; (k) TGPL; (l) MSAFF; (m) STFF; (n) CCFF; (o) VSWFF; (p) DET-TGLM; (q) DET-SPGFF; (r) DET-SPL; (s) DET-CDFF.
Authorized licensed use limited to: BANGALORE INSTITUTE OF TECHNOLOGY. Downloaded on March 30,2010 at 02:23:17 EDT from IEEE Xplore. Restrictions apply.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
complicated task. In [2], [5] some significant guidelines were input signal (clock or data) that is varying and to the specific
outlined. For instance, the evaluation of the energy consump- transition or , the second subscript refers to the
tion must not include the energy dissipated in the charging/dis- value of the other stable input signal or and the third
charging of the external output load, since it is a value solely subscript is related with the output behavior, which can remain
depending on the load dimension and not on the FF features. stable or or can vary or . In the
However, one should not simply subtract the magnitude of following, with no loss of generality, we will refer to a non-in-
the current flowing from and towards the load, otherwise, the verting Positive SET FF. The first four contributions are those
effect of some undesired output transitions would not be taken related with the clock transitions when the input and output are
into account. To be more specific, some topologies (e.g., some stable
semi-dynamic FFs) can suffer from glitches both on the internal
and output nodes. In this case, the energy dissipation due to (A2)
output glitches must be included, since it is a shortcoming that (A3)
worsens the FF features right dependently on the load value. (A4)
The energy spent to charge the data and clock inputs has to
be included in the computation [2], [5], because it is a feature (A5)
dependent on the FF characteristics. The replicas of the data-
The second four contributions are related with the clock tran-
and clock-driving buffers are inserted in the simulation setup
sitions when the input and output are different. In the case of a
(see Fig. 1), to subtract the energy due to the parasitic load of
Positive SET FF, this does (not) lead to an output change for the
the same driving inverters [2], [5].
clock transition (the situation is reversed for a
Summarizing, defining , and as
Negative SET FF)
the currents drawn from the power supply by the FF, by the data-
and clock-driving inverters close to the FF and by their unloaded (A6)
replicas, the generic contribution to the transient energy (at the
(A7)
moment including the energy on the load) is
(A8)
(A1) (A9)
where the definition of the integration limits and has to Finally come the data input transitions, which can occur
be properly done. during the high or low clock phase
In [2], the authors deal with the energy breakdown by re-
ferring to four energy contributions: and (A10)
. They are evaluated by considering a single clock pe- (A11)
riod during which a single event on data occurs (A12)
. The authors state that it is possible to infer (A13)
clocking, precharge and internal nodes energy contributions by
simply combining the four terms according to transition proba- These 12 contributions are evaluated by integrating the supply
bilities and subtracting the energy spent on load. current according to (A1), assuming to be the point of time
However, the simple approach shown in [2] does not allow to where the input experiences a transition, and to be the point
accurately separate and localize the various sources of energy of time where the slowest node within the FF reaches 99% of
consumption (clock, precharge, ), because the energy dissi- its steady value.3 In this way, the time window is suf-
pation related with the transition of one signal is influenced by ficiently wide to fully capture the dynamic and short-circuit en-
the values of the other signals. For instance, according to the FF ergy contributions, whereas it is sufficiently narrow to neglect
functionality, the transition of the data input can cause simply the impact of leakage.
the charging of the input gate capacitance or the transition of To determine the average transient energy in a clock cycle,
internal nodes according to the state of the clock (e.g., in MS the switching activity needs to be used. If at most one data
circuits). If, after the transition, the data always remained stable transition occurs for each clock period (i.e., ), the av-
waiting for being transferred through the FF, the argument made erage transient energy can be written as
in [2] would be completely correct and exhaustive. But, actually,
the data can change during the opaque phase of the FF and one
needs to account for all the possible transition scenarios, in order
to have the most general information about the transient energy.
Moreover, the integration of the supply current over the entire
3Sometimes, the nodes voltages can take long times to reach the 99% of the
clock period includes the static energy due to leakage, whereas
steady value. Anyhow, when not employing simple pass-transistors that cause
it should be separately evaluated and weighted according to the a threshold drop and when all transistors are properly sized according to the ar-
chosen logic depth . chitectural T =F O4 specification (as described in Section IV-A or [24]), the
For these reasons we suggest to consider all the possible tran- 99% value can be closely approached in practically acceptable times. Neverthe-
less, a good estimation of transient energy comes out also considering slightly
sitions that arise according to all possible inputs combinations. smaller values than 99% (e.g., 90%), and hence it is simply a matter of conven-
We adopt the following notation: the first subscript refers to the tion when characterizing an FF.
Authorized licensed use limited to: BANGALORE INSTITUTE OF TECHNOLOGY. Downloaded on March 30,2010 at 02:23:17 EDT from IEEE Xplore. Restrictions apply.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
ALIOTO et al.: ANALYSIS AND COMPARISON IN THE ENERGY-DELAY-AREA DOMAIN OF NANOMETER CMOS FLIP-FLOPS 11
Authorized licensed use limited to: BANGALORE INSTITUTE OF TECHNOLOGY. Downloaded on March 30,2010 at 02:23:17 EDT from IEEE Xplore. Restrictions apply.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
[35] S. Naffziger, G. Colon-Bonet, T. Fischer, R. Riedlinger, T. Sullivan, ences, and companies throughout the world. He has served as a member of var-
and T. Grutkowski, “The implementation of the Itanium 2 micropro- ious conference technical program committees (ISCAS, PATMOS, ICM, ICCD,
cessor,” IEEE J. Solid-State Circuit, vol. 37, no. 11, pp. 1448–1460, CSIE) and Track Chair (ICECS, ISCAS, ICM, ICCD). He serves as an Asso-
Nov. 2002. ciate Editor of the IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION
[36] B. Nikolic, V. Stojanovic, V. Oklobdzija, W. Jia, J. Chiu, and M. (VLSI) SYSTEMS, as well as of the Microelectronics Journal, the Integration
Leung, “Improved sense-amplifier-based flip-flop: Design and mea- The VLSI Journal and the Journal of Circuits, Systems, and Computers. He is
surements,” IEEE J. Solid-State Circuits, vol. 35, no. 6, pp. 876–884, Guest Editor of the Special Issue Advances in Oscillator Analysis and Design
Jun. 2000. of the Journal of Circuits, Systems, and Computers (2009).
[37] N. Nedovic, V. Oklobdzija, and W. Walker, “A clock skew absorbing
flip-flop,” in Proc. IEEE Int. Solid-State Circuit Conf., Feb. 2003, pp.
342–344.
[38] S. Shin and B. Kong, “Variable sampling window flip-flops for low- Elio Consoli was born in Catania, Italy, in 1983. He received the Master’s de-
power high-speed VLSI,” IEE Proc. IEE Circuits, Devices Syst., vol. gree in microelectronic engineering from the University of Catania, Catania,
152, no. 3, pp. 266–271, Jun. 2005. Italy, in 2008, where he is currently pursuing the Ph.D. degree in the Depart-
[39] R. Llopis and M. Sachdev, “Low power, testable dual edge triggered ment of Electrical, Electronic, and Systems Engineering (DIEES).
flip-flops,” in Proc. Int. Symp. Low Power Electron. Des., Aug. 1996, His primary research interests include clocking strategies and energy-efficient
pp. 341–345. design techniques for high-performance and low-power digital VLSI systems
[40] N. Nedovic, W. Walker, V. Oklobdzija, and M. Aleksic, “A low power in nanometer CMOS technologies. He is the co-author of scientific papers on
simmetrically pulsed dual edge-triggered flip-flop,” in Proc. IEEE Eur. referred international Journals and Conferences.
Solid-State Circuits Conf., Sep. 2002, pp. 399–402.
[41] P. Zhao, T. Darwish, and M. Bayoumi, “High-performance and low-
power conditional discharge flip-flop,” IEEE Trans. Very Large Scale
Integr. (VLSI) Syst., vol. 12, no. 5, pp. 477–484, May 2004.
[42] M. Alioto, E. Consoli, and G. Palumbo, “Analysis and comparison Gaetano Palumbo (F’07) was born in Catania, Italy, in 1964. He received the
in the energy-delay-area domain of nanometer CMOS flip-flops: Part Laurea degree in electrical engineering and the Ph.D. degree from the University
II–Results and figures of merit,” IEEE Trans. Very Large Scale Integr. of Catania, Catania, Italy, in 1988 and 1993, respectively.
(VLSI) Syst., accepted for publication. Since 1993, he conducts courses on Electronic Devices, Electronics for Dig-
ital Systems, and Basic Electronics. In 1994, he joined the Dipartimento Elet-
trico Elettronico e Sistemistico (DEES), now Dipartimento di Ingegneria Elet-
trica Elettronica e dei Sistemi (DIEES), University of Catania, as a Researcher,
Massimo Alioto (M’01–SM’07) was born in Brescia, Italy, in 1972. He received subsequently becoming an Associate Professor in 1998. Since 2000, he is a Full
the Laurea degree in electronics engineering and the Ph.D. degree in electrical Professor with the same department. His primary research interest has been
engineering from the University of Catania, Catania, Italy, in 1997 and 2001, analog circuits with particular emphasis on feedback circuits, compensation
respectively. techniques, current-mode approach, low-voltage circuits. Then, his research has
In 2002, he joined the Dipartimento di Ingegneria dellInformazione (DII), also embraced digital circuits with emphasis on bipolar and MOS current-mode
University of Siena, Siena, Italy, as a Research Associate and in the same year as digital circuits, adiabatic circuits, and high-performance building blocks fo-
an Assistant Professor. In 2005, he was appointed Associate Professor of Elec- cused on achieving optimum speed within the constraint of low power oper-
tronics, and was engaged in the same faculty in 2006. In the summer of 2007, ation. In all these fields he is developing some the research activities in col-
he was a Visiting Professor at EPFL—Lausanne, Switzerland. In 2009-2010, he laboration with STMicroelectronics of Catania. He was the co-author of three
is a Visiting Professor with BWRC, University of California at Berkeley, inves- books CMOS Current Amplifiers (Kluwer, 1999), Feedback Amplifiers: Theory
tigating on ultra-low power circuits and wireless sensor nodes. Since 2001, he and Design (Kluwer, 2001), and Model and Design of Bipolar and MOS Cur-
has been teaching undergraduate and graduate courses on advanced VLSI digital rent-Mode Logic (CML, ECL and SCL Digital Circuits) (Kluwer, 2005) and a
design, microelectronics and basic electronics. He has authored or co-authored textbook on electronic device in 2005. He is the author of 350 scientific papers
more than 140 publications on journals (over 50, mostly IEEE Transactions) on referred international journals (over 150) and in conferences. Moreover he is
and conference proceedings. Two of them are among the 25 most downloaded co-author of several patents.
TVLSI papers in 2007 (respectively, 10th and 13th). He is co-author of the book Prof. Palumbo was a recipient of the Darlington Award in 2003. Since June
Model and Design of Bipolar and MOS Current-Mode Logic: CML, ECL and 1999 to the end of 2001 and since 2004 to 2005, he served as an Associated Ed-
SCL Digital Circuits (Springer, 2005). His primary research interests include the itor of the IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS—PART I: REGULAR
modeling and the optimized design of CMOS high-performance, low-power and PAPERS for the topic analog circuits and filters and digital circuits and systems,
ultra low-power digital circuits, arithmetic and cryptographic circuits, intercon- respectively. Since 2006 to 2007, he served as an Associated Editor of the IEEE
nect modeling, design/modeling for variability-tolerant and low-leakage VLSI TRANSACTIONS ON CIRCUITS AND SYSTEMS—PART II: EXPRESS BRIEFS. Since
circuits, circuit techniques for emerging technologies. He is the director of the 2008, he is serving as an Associated Editor of the IEEE TRANSACTIONS ON
Electronics Lab at University of Siena (site of Arezzo). CIRCUITS AND SYSTEMS—PART I: REGULAR PAPERS. In 2005, he was one of
Prof. Alioto is a member of the HiPEAC Network of Excellence. He is the the 12 panelists in the scientific-disciplinare area 09-industrial and informa-
Chair Elect of the VLSI Systems and Applications Technical Committee of the tion engineering of the Committee for Evaluation of Italian Research (CIVR),
IEEE Circuits and Systems Society, for which he is also Distinguished Lecturer. which has the aim to evaluate the Italian research in the above area for the period
He is regularly invited to give talks and tutorials to academic institutions, confer- 2001-2003.
Authorized licensed use limited to: BANGALORE INSTITUTE OF TECHNOLOGY. Downloaded on March 30,2010 at 02:23:17 EDT from IEEE Xplore. Restrictions apply.