Professional Documents
Culture Documents
Low-Voltage High-Speed CML D-Latches in Nanometer CMOS Technologies
Low-Voltage High-Speed CML D-Latches in Nanometer CMOS Technologies
Abstract— This paper presents the design of a novel noise and current mode allows to achieve fast switching at
low-voltage high-speed D-latch circuit suitable for nanometer the cost of higher power consumption [13]–[15].
CMOS technologies. The proposed topology is compared against Nowadays, CMOS nanometer technologies offer fast
the low-voltage triple-tail D-latch and its advantages are demon-
strated both by simulations, under different performance/power devices, with transition frequencies even higher than 350 GHz
consumption tradeoffs with a 40-nm CMOS technology, and for n-channel and 200 GHz for p-channel devices, but these
theoretically, thanks to a simple model of the propagation delay technology nodes exhibit some critical drawbacks, which have
derived for both low-voltage topologies. In order to further raised the new design issues.
demonstrate the advantages of the proposed topology, it has The most important issue to cope with is the dramatic
also been used to design a D flip-flop (DFF), where thanks
to the feature to need just 1 clock differential pair; a further reduction of the supply voltage; in fact, in order to maintain the
speed improvement is achieved over the conventional triple-tail electric field under the critical value in the channel region,
topology. Indeed, by comparing a two-stage frequency divider the power supply has been reduced to very low values during
designed using both the triple-tail DFF and the proposed folded the last decade, reaching two or three times the device’s thresh-
DFF, a 54% improvement in the maximum operating frequency old voltage (below 1 V for sub-50-nm CMOS technology
is found when using the proposed folded DFF.
nodes).
Index Terms— Current mode logic (CML) D-latch, D flip- This very low-voltage constraint limits the possibility to
flop (DFF), low voltage, nanometer CMOS. have several stacked transistors in a CML gate making most of
the conventional CML topologies unsuitable from a practical
I. I NTRODUCTION point of view as we will point out in the following.
Novel modeling and design approaches for CML gates
D URING the last decade, the increasing interest for
high-speed communications and chip-to-chip intercon-
nect applications has rising the needs of high-performance
have been recently presented in [16]–[19] referring to CMOS
technologies, but the problem of stacking several levels of
logics, which can support both low-power and high data-rate transistors (as in the CML D-latch topology) under a very
applications. Fiber optics, wireline and backplane communi- low-voltage constraint has not been addressed in these works.
cations as well as equalizers, and millimeter-wave-sampling Another drawback which comes using nanometer CMOS
ADCs demand the use of tens of Gb/s capable logics that technologies is the degradation of important small-signal para-
cannot be supported using standard CMOS logic style [1]–[6]. meters, such as gm and rds due to short-channel effects [20]:
Basic building blocks, such as phase detectors, multiplexers, this limits the intrinsic gain of MOS transistors and worsen
decision circuits, frequency dividers, and prescalers, used in the analog performances and noise margins of the conventional
these applications, require high immunity to noise and are CML designs.
required to operate at very high clock rates [7]–[12]. Latch topologies suitable for low-voltage operation have
The D-latch is one of the main building blocks in high- been proposed in [21] and [22]. The circuit proposed in [21]
speed digital circuits. To satisfy stringent speed require- requires inductive load to implement the high pass feed-
ments, D-latches are usually designed in current-mode forward concept, and this results in a quite large silicon area
logic (CML) [13]–[15]. compared with resistive or active load. Furthermore, transistors
The idea behind CML is to use a MOS differential pair defining the tail current of differential pairs are directly driven
as core block for logical and sequential circuits, since the by the clock signals and this results in a tail current which is
differential signaling offers a good protection to switching strongly dependent on the swing of the clock signals and on
process, voltage and temperature (PVT) variations. The CML
Manuscript received January 13, 2017; revised May 10, 2017 and D-latch circuit presented in [22] has higher speed than the
July 21, 2017; accepted August 23, 2017. (Corresponding author: conventional CML D-latch, but as stated in the same paper,
Giuseppe Scotti.)
G. Scotti, D. Bellizia, and A. Trifiletti are with the Department of it suffers from some drawbacks.
Information Engineering, Electronics and Telecommunications, University of 1) This circuit is vulnerable to the common-mode noise
Rome “La Sapienza,” 00184 Rome, Italy (e-mail: scotti@diet.uniroma1.it; coming from the ground rail due to the operation of
bellizia@diet.uniroma1.it; trifiletti@diet.uniroma1.it).
G. Palumbo is with the Department of Electrical and Electron- current source transistors in triode region.
ics Engineering, University of Catania, 95125 Catania, Italy (e-mail: 2) The currents of transistors driven by the clock signals
gaetano.palumbo@dieei.unict.it). are sensitive to PVT variations. For example, those
Color versions of one or more of the figures in this paper are available
online at http://ieeexplore.ieee.org. currents are affected by any change in the amplitude
Digital Object Identifier 10.1109/TVLSI.2017.2750207 of the clock signals. Also, any change in the threshold
1063-8210 © 2017 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
Fig. 7. Equivalent circuit models for the folded D-latch. Computation of (a) τ A_FL , (b) τ B_FL , and (c) τC_FL , respectively.
In [28], a good approximation of the propagation delay while the capacitive contribution at the node is equal to
of the second-order system is reported, which after trivial
Ceq A_FL = 2C gd1,2 + Cdb1,2 + Cdb7,8 + C gs7,8
manipulations can be rewritten as
+ C gs9,10 + C gd9,10. (15)
τA + τB √
τpd = 1 + 0.35 · √ τAτB . (11)
τAτB From the drain of M9,10 now acting as a current source
Equation (11) is able to give an accurate approximation of the to the drain of M4,5 (which is now considered in short-
propagation delay if τA and τB are real poles and the following circuit condition), a current-to-current transfer with time
condition is satisfied2 : constant τ B_FL can be identified
τB τ BFL = Req BFL Ceq BFL (16)
< 10. (12)
τA
where
B. Propagation Delay Model of the Low-Voltage 1
Folded D-Latch Req B_FL = (17)
G m3 + G m4
The equivalent circuit model used for the computation of the Ceq B_FL = Cdb9,10 + C gd9,10 + 2C gs3,4 + 2C gb3,4. (18)
propagation delay of the low-voltage folded D-latch is shown
The third time constant τC_FL is due to the current-to-
in Fig. 7.
voltage transfer from the drain of M4,5 (which acts as a current
The propagation delay can be split into three components.
source) to the capacitive load at the output node (Q)
Each one of these components can be considered as a single-
pole system, and we will give a definition of the three time τC_FL = ReqC_FL CeqC_FL (19)
constants, as in the previous case. The first time constant
τ A_FL is related to the voltage–current transfer from the gate where
of M1,2 (vCK ) to the drain of M9,10 (i D9) in short-circuit ReqC_FL ≈ R D (20)
conditions
CeqC_FL = 2C gd3,5 + C gs5,6 + 2Cdb3,5 + C R D + C L . (21)
τ A_FL = Req A_FL Ceq A_FL . (13)
By using this approach, the folded D-latch is modeled as the
In this case, the equivalent resistance is the resistance third-order system. In order to further simplify the analysis,
offered at the drain of M7,8 we can notice that the time constant τ B_FL is typically smaller
1 1 than that of the other two. This can be shown numerically by
Req A_FL = //rds1,2 ≈ (14) comparing (18) against (15) and (21) and (17) against (14)
G m7,8 G m7,8
and (20). From a circuit perspective, the current-to-current
2 Condition (12) states that equation (11) is accurate when we are not transfer related with time constant τ B_FL can be analyzed
in dominant pole condition. If we consider the limit case τ B /τ A = 10 referring to a common gate transistor acting as a current buffer,
equation (11) gives: τ pd = 0.701τ B , which is in reasonable agreement with
equation (10). When (τ B /τ A ) > 10 equation (10) results more accurate than which is, as well known, faster than the other configurations.
equation (11). Hence, if we neglect the pole due to the time constant τ B_FL ,
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
TABLE I TABLE II
M AIN P ROCESS PARAMETERS FOR THE A DOPTED D ESIGN PARAMETERS FOR T RIPLE -TAIL AND F OLDED D-L ATCHES
40-nm CMOS T ECHNOLOGY
Fig. 9. Comparison of propagation delay model versus bias current against Fig. 10. Propagation delay plot versus bias current for triple-tail D-latch
simulations. (a) Triple-tail D-latch. (b) Folded D-latch (TB1). referring to TB2. Case (a) FO = 1 and (b) FO = 5, respectively.
Fig. 11. Propagation delay plot versus bias current for the folded D-latch Fig. 12. Propagation delay plot versus bias current for triple-tail D-latch
referring to TB2. Case (a) FO = 1 and (b) FO = 5, respectively. referring to TB3. Case (a) FO = 1 and (b) FO = 5, respectively.
TABLE IV
TABLE III CPM OF THE F OLDED AND T RIPLE -TAIL D-L ATCHES FOR D IFFERENT
C OMPARISON OF THE T RIPLE -TAIL AND F OLDED C LOCK F REQUENCIES (VSWING = 0.8 V I S A SSUMED B OTH
D-L ATCHES AT THE R EFERENCE ISS FOR D ATA AND C LOCK S IGNALS )
Fig. 13. Propagation delay plot versus bias current for the folded D-latch Fig. 14. Comparison of the folded versus triple-tail D-latches propagation
referring to TB3. Case (a) FO = 1 and (b) FO = 5, respectively. delay for TB2. (a) FO1. (b) FO5.
Fig. 18. Comparison of propagation delay for the folded and triple-tail DFFs.
Fig. 17. Detail of the improved clock switching part of the folded DFF.
Fig. 19. Simulation testbench based on clock frequency divider-by-4 to
evaluate the maximum toggle frequency.
In fact, M7B and M8B can be sized to be equal to M3 and M4
(see Fig. 16) and the bias voltage V B can be set to be equal to
the common-mode voltage of D signals to equalize the drain–
source voltage VDS of transistors M7 A , M9 , M9B , M8 A , M10 ,
and M10 B .
In this way, the channel length modulation effect is min-
imized and the accuracy of the current mirrors is strongly
improved.
In order to demonstrate the advantages provided by the
proposed DFF topology, we have compared the DFF imple-
mented using triple-tail latches against the proposed DFF
shown in Fig. 16 and using the clock switching stage shown
in Fig. 17.
Fig. 18 shows the comparison of the “clock to Q” delay
of the folded and triple-tail DFFs for different bias current
settings. The comparison has been carried out while consid-
ering the same power consumption for both DFFs, which has
been fixed to 6ISS including clock driving buffers.
The performance improvement has been found to be 42%
Fig. 20. Maximum output frequency of the clock frequency divider-by-4
on average and 35% minimum over the whole considered circuit for different ISS settings.
ISS range.
As a further comparison between the folded and triple-tail
DFFs, we have considered the simulation testbench shown The maximum output frequency of the clock frequency
in Fig. 19 and based on a clock frequency divider-by-4 circuit divider-by-4 circuit is reported in Fig. 20 as a function
in which the maximum output frequency has been evaluated. of ISS : the performance improvement in the maximum toggle
Also, in this case, the DFFs have been designed with the frequency has been found to be 48% on average and 54%
same power consumption fixed to 6ISS including clock driving maximum, showing how the folded DFF-based circuit outper-
buffers. forms the triple-tail-based one in all ISS conditions.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
[24] M. Alioto, R. Mita, and G. Palumbo, “Performance evaluation of the Davide Bellizia was born in 1989. He received the
low-voltage CML D-latch topology,” Integr., VLSI J., vol. 36, no. 4, bachelor’s degree in electronic engineering and the
pp. 191–209, 2003. master’s (summa cum laude) degree in electronic
[25] K. Gupta, N. Pandey, and M. Gupta, “Analysis and design of MOS design from the University “La Sapienza,” Rome,
current mode logic exclusive-OR gate using triple-tail cells,” Microelec- Italy, in 2011 and 2014, respectively, where he
tron. J., vol. 44, no. 6, pp. 561–567, 2013. is currently pursuing the Ph.D. degree with the
[26] K. Gupta, N. Pandey, and M. Gupta, “MCML D-latch using triple-tail Dipartimento di Ingegneria dell’Informazione,
cells: Analysis and design,” Active Passive Electron. Compon., vol. 2013, Elettronica e Telecomunicazioni.
pp. 1–9, 2013, doi: 10.1155/2013/217674. His current research interests include the design
[27] N. Pandey, K. Gupta, G. Bhatia, and B. Choudhary, “MOS current of cryptographic ICs for counteracting power
mode logic exclusive-OR gate using multi-threshold triple-tail cells,” analysis attacks and VLSI design for DSP algorithm
Microelectron. J., vol. 57, no. 11, pp. 13–20, Nov. 2016. implementations.
[28] B. Kuo, Automatic Control Systems, 3rd ed. Englewood Cliffs, NJ, USA: Mr. Bellizia received the “Laureato Eccellente” Award for the best
Prentice-Hall, 1975. graduated student of the year in 2014.
[29] S. K. Enam and A. A. Abidi, “NMOS IC’s for clock and data regener-
ation in gigabit-per-second optical-fiber receivers,” IEEE J. Solid-State Alessandro Trifiletti was born in Rome, Italy,
Circuits, vol. 27, no. 12, pp. 1763–1774, Dec. 1992. in 1959. He received the Laurea degree in elec-
[30] J. Ramirez-Angulo, R. G. Carvajal, and A. Torralba, “Low supply tronic engineering from the Università di Rome
voltage high-performance CMOS current mirror with low input and “La Sapienza,” Rome, Italy.
output voltage requirements,” IEEE Trans. Circuits Syst. II, Exp. Briefs, In 1991, he joined the Dipartimento di Ingegne-
vol. 51, no. 3, pp. 124–129, Mar. 2004. ria Elettronica, Università di Rome “La Sapienza,”
as a Research Assistant, where he is currently an
Associate Professor. He has authored over 70 inter-
national journal papers and 120 contributions in
conference proceedings. His current research inter-
ests include high-speed circuit design techniques,
III–V device modeling, DSP techniques to enhance analog circuit perfor-
mance, techniques to improve resilience to security attacks in VLSI ICs, and
robust design methodologies.
Giuseppe Scotti was born in Cagliari, Italy, in 1975. Gaetano Palumbo (F’07) was born in Catania,
He received the M.S. and Ph.D. degrees in elec- Italy, in 1964. He received the Laurea degree in
tronic engineering from the University of Rome electrical engineering and the Ph.D. degree from the
“La Sapienza,” Rome, Italy, in 1999 and 2003, University of Catania, Catania, in 1988 and 1993,
respectively. respectively.
In 2010, he joined the Department of Information In 1994, he joined the University of Catania,
Engineering, Electronics and Telecommunications, where he is currently a Full Professor. He has
University of Rome “La Sapienza,” as a Researcher co-authored four books (Kluwer Academic Publish-
(Assistant Professor), where he was appointed as an ers and Springer in 1999, 2001, 2005, and 2014),
Associate Professor in 2015. He teaches undergrad- a textbook on electronic devices in 2005, and several
uate and graduate courses on basic electronics and patents. He has authored more than 400 scientific
microelectronics. His research activity was mainly concerned with integrated papers in referred international journals (more than 170) and conferences.
circuits design and focused on design methodologies able to guarantee His current research interests include analog and digital circuits.
robustness with respect to parameter variations in both analog circuits and Mr. Palumbo served as a member of the Board of Governors of the IEEE
digital VLSI circuits. In the context of analog design, his research activity CAS Society, from 2011 to 2013. In 2003, he received the Darlington
was concerned with circuit topologies for the realization of low-voltage Award. He served as an Associate Editor of the IEEE T RANSACTIONS
analog building blocks using ultra-short channel CMOS technologies and ON C IRCUITS AND S YSTEMS PART I during 1999–2001, 2004–2005, and
with the development of current mode analog functions. He has been also 2008–2011, and the IEEE T RANSACTIONS ON C IRCUITS AND S YSTEMS
involved in research and development activities held in collaboration between PART II during 2006–2007. In 2005, he was one of the 12 panelists in
“La Sapienza” University and some industrial partners which led, between the scientific-disciplinary area 09—industrial and information engineering of
2000 and 2015, to the implementation of 13 application-specified integrated the CIVR (Committee for Italian Research Assessment). In 2015, he has
circuits. He has co-authored more than 45 publications in international journals been a panelist of the Group of Evaluation Experts in the scientific area
and 70 contributions in conference proceedings, and is a co-inventor of 09—industrial and information engineering of the ANVUR for the Assessment
two international patents. of Italian Research Quality during 2011–2014.