Professional Documents
Culture Documents
Yuan 2016
Yuan 2016
YUAN et al.: A 70 mW 25 Gb/s QUARTER-RATE SerDes TRANSMITTER AND RECEIVER CHIPSET WITH 40 dB OF EQUALIZATION 3
tH = tC−Q − tD . (1)
TABLE II
S IMULATION R ESULTS OF T ERMINATION I MPEDANCE
Fig. 7. Simulated duty cycle of PD’s output versus temperature across corners
when VB = 580 mV.
Fig. 10. Simulation environment setup.
YUAN et al.: A 70 mW 25 Gb/s QUARTER-RATE SerDes TRANSMITTER AND RECEIVER CHIPSET WITH 40 dB OF EQUALIZATION 5
Fig. 11. Simulated eye diagrams when T 3 : 0 = 1111. (a) TX’s output at
point A when the channel is disconnected. (b) TX’s output after the channel at
the receiver side B. Fig. 14. Schematic of proposed CTLE with inductive peaking.
Fig. 12. Simulated eye diagrams when T 3 : 0 = 0111. (a) TX’s output at
point A when the channel is disconnected. (b) TX’s output after the channel at
the receiver side B.
Fig. 15. Frequency response of CTLE. (a) Without inductive peaking. (b) With
inductive peaking.
B. CTLE
Fig. 13. Proposed quarter-rate RX architecture. The implementation of the CTLE realized with RC degen-
eration and inductive peaking is described in Fig. 14. The
III. R ECEIVER variable resistor consists of a NMOS transistor and a poly
resistor. By controlling the gate voltage VG of the transistor, the
A. Overall Architecture
CTLE’s boost gain at high frequency can been continuously cal-
Fig. 13 shows the overall block diagram of the proposed ibrated. The inductive peaking technology is adopted to enlarge
quarter-rate RX, which also includes a data path and a clock the boost gain without increasing the power. Fig. 15 shows the
path. The data path consists primarily of a continuous-time simulated frequency response of the proposed CTLE and the
linear equalizer (CTLE), a quarter-rate DFE and error sampler, traditional CTLE with no inductive peaking. When VG changes
a DFE adaption logic with 3 current digital-to-analog converters from 1 V to 400 mV, the traditional CTLE’s boost gain at
(IDACs) and a baud-rate CDR logic. Due to the benefit of baud- 12.5 GHz varies from 0.6 dB to 11 dB. By introducing the in-
rate CDR, the edge samplers used in the common oversampling ductive peaking, the boost gain ranges from 5.5 dB to 16 dB,
CDR are saved, which also induces the simplification in the which means that an extra 5 dB compensation for the channel
phase interpolator (PI) based clock path. As shown in Fig. 1, loss can be provided.
the traditional quarter-rate RX with the DFE adaption requires
the PI to generate 8-phase clocks. However, the PI in
C. Low Power DFE
Fig. 13 only needs to generate 4-phase clocks, the power of
which is reduced considerably. An additional 25%-duty-cycle Fig. 16 presents the implementation of the low-power
clock generator is necessary in the clock path to realize the quarter-rate soft-decision DFE with 4 quarter-lanes ( Quarter_
HACS with little penalty in power and area. The details of the Lane_0–3). In each quarter-lane, data after the equalization
proposed HACS will be described in the following paragraphs. of the CTLE is firstly sampled by the sample-and-hold (S/H)
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
YUAN et al.: A 70 mW 25 Gb/s QUARTER-RATE SerDes TRANSMITTER AND RECEIVER CHIPSET WITH 40 dB OF EQUALIZATION 7
Fig. 28. Measured eye diagrams of 22 Gb/s single-ended TX output with both
FFE and XTC. (a) VB = 500 mV. (b) VB = 600 mV.
Fig. 29. (a) Measured eye diagram of 24 Gb/s single-ended TX output with
both FFE and XTC. (b) Measured eye diagram of 25 Gb/s single-ended TX
Fig. 26. (a) Testing PCB. (b) Measured channel loss and eye diagram of output with both FFE and XTC.
20 Gb/s single-ended TX output without FFE and XTC.
DDJ is reduced by 7.2 ps because of the proposed XTC. On this
occasion, T 3 : 0 = 1000 and X1 : 0 = 01.
The measured 22 Gb/s eye diagrams are presented in
Fig. 28(a) and (b). When the VB of V/I is 500 mV [Fig. 28(a)],
the TJ is 31.3 ps with 1.57 ps of duty cycle distortion (DCD).
Then VB is calibrated to 600 mV (Fig. 28(b)), the DCD is
decreased to only 40 fs and the TJ is also improved by 0.8 ps.
Actually, this TX is able to operate up to 25 Gb/s. Fig. 29(a)
and (b) exhibit the eye diagrams at 24 Gb/s and 25 Gb/s,
respectively. Due to the single-ended output structure and more
serious channel loss and crosstalk, both the time margin perfor-
mance and the voltage margin performance are not very ideal
Fig. 27. (a) Measured eye diagram of 20 Gb/s single-ended TX output with even though the measured bit-error-rate (BER) is still smaller
FFE but without XTC. (b) Measured eye diagram of 20 Gb/s single-ended TX than 1e-12. If adopting the differential output structure with less
output with both FFE and XTC. severe channel, the eye diagram at 25 Gb/s will be obviously
improved.
that the average power consumption for each channel is less
than 22 mW. The proposed divider-less clock generation circuit
B. Receiver Chip
(including VCDL, PD, and V/I) consumes only 4.7 mW.
As shown in Fig. 26(a), the 2-inch coupled micro-strip lines Fig. 30 shows the RX die micrograph, which measures
are realized as the test channels with the spacing of 30 mil. 1.512 mm × 1.276 mm including all the test circuits, de-
Fig. 26(b) shows that the measured channel loss at 12.5GHz is coupling MOS capacitors and I/O pads. The active area is
8.9 dB. A Keysight DSOX93204A Infiniium oscilloscope with 0.52 mm × 0.35 mm. Fig. 31 is the simulated power breakdown
32 GHz bandwidth is used to measure the eye diagrams and when the RX operates at 25 Gb/s. The total power consumption
the jitter decomposition. Fig. 26(b) also shows the measured is 48 mW, among which the low-power DFE draws only 6 mW,
20 Gb/s eye diagram of the TX output without the FFE and XTC. thus the power efficiency of the DFE is 0.24 mW/Gb/s. A
The eye is completely closed. Fig. 27(a) presents the measured Keysight E5071C ENA network analyzer is used to measure the
eye diagram of the 20 Gb/s TX output when the FFE is enabled channel loss, and a Tektronix BSA286C 28.6 Gb/s BERTScope
but the XTC is disabled. The total jitter (TJ) is 36 ps and eye is used to test the BER performance.
height is 85 mV. The data dependent jitter (DDJ) is 19.4 ps. The As presented in Fig. 32, the test channel including the
measured eye diagram of the 20 Gb/s TX output when the FFE bonding wire, PCB trace, SMA connector, and cable has a loss
and XTC are both active is shown in Fig. 27(b). The TJ and of 33.8 dB at the Nyquist frequency (12.5 GHz). The eye of the
eye height is improved by 8.2 ps and 95 mV, respectively. The received 25 Gb/s PRBS-7 data after this channel is completely
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
YUAN et al.: A 70 mW 25 Gb/s QUARTER-RATE SerDes TRANSMITTER AND RECEIVER CHIPSET WITH 40 dB OF EQUALIZATION 9
Fig. 33. (a) Measured eye diagram of the recovered clock at 3.125 GHz.
(b) Measured eye diagram of the demuxed data at 6.25 Gb/s.
Fig. 32. Measured channel loss and eye diagram of received 25 Gb/s data after Fig. 35. Measured BER bathtub curve.
the channel.
C. Performance Comparison
closed. Fig. 33 shows the eye diagrams of the 3.125 GHz
recovered clock and the 6.25 Gb/s dumuxed data. The measure Table III summarizes the measured performance of the TX
TJ of the recovered clock is 23 ps, with a random RMS jitter of described in this paper, in comparison to some recently pub-
1 ps. The TJ of the 6.25 Gb/s dumuxed data is 52 ps. lished similar designs [5]–[8]. The one-channel power con-
The RX performance is also demonstrated by the jitter tol- sumption of the TX working at 25 Gb/s is 21.8 mW, and the
erance test at 25 Gb/s as shown in Fig. 34, along with the power efficiency is reduced by at least 17% compared with
CEI-25G/28G JTOL mask [21]. Measured at the BER threshold the other state-of-the-art designs. Additionally, the FFE and
of 1e-9, the out-of-band jitter tolerance at 100 MHz is 0.22 UI. XTC are merged together with the SST driver in this work
The BER bathtub curves are measured after turning off the CDR to compensate for both the ISI and FEXT. Table IV is the
logic. As indicated in Fig. 35, at first the DFE is disabled and the RX performance summary and comparison with other similar
CTLE is maximally opened (VG = 400 mV). The measured designs [14]–[16], [20], [22], which indicates that the RX
BER is above 1e-8. Then the adaptive DFE is active so that the implemented in this paper can compensate for the most serious
eye opening reaches 0.42 UI for BER = 1e − 12. channel loss. The eye opening and the DFE power efficiency
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
TABLE III [3] J. Bulzacchelli et al., “A 28-Gb/s 4-tap FFE/15-Tap DFE serial link trans-
TX P ERFORMANCE S UMMARY AND C OMPARISON ceiver in 32-nm SOI CMOS technology,” IEEE J. Solid-State Circuits,
vol. 47, no. 12, pp. 3232–3248, Dec. 2012.
[4] B. Zhang et al., “A 28 Gb/s multistandard serial link transceiver for
backplane applications in 28 nm CMOS,” IEEE J. Solid-State Circuits,
vol. 50, no. 12, pp. 3089–3100, Dec. 2015.
[5] C. Menolfi et al., “A 28 Gb/s source-series terminated TX in 32 nm CMOS
SOI,” in IEEE ISSCC Dig. Tech. Papers, 2012, pp. 334–336.
[6] Y. H. Song et al., “An 8-to-16 Gb/s 0.65-to-1.05 pJ/b 2-tap impedance-
modulated voltage-mode transmitter with fast power-state transitioning in
65 nm CMOS,” in IEEE ISSCC Dig. Tech. Papers, 2014, pp. 446–447.
[7] H. Wang et al., “A 21-Gb/s 87-mW transceiver with FFE/DFE/analog
equalizer in 65-nm CMOS technology,” IEEE J. Solid-State Circuits,
vol. 45, no. 4, pp. 909–920, Apr. 2010.
[8] J. Kim et al., “A 16-to-40 Gb/s quarter-rate NRZ/PAM4 dual-mode
transmitter in 14 nm CMOS,” in IEEE ISSCC Dig. Tech. Papers, 2015,
pp. 60–61.
[9] A. A. Hafez et al., “A 32–48 Gb/s serializing transmitter using multiphase
serialization in 65 nm CMOS technology,” IEEE J. Solid-State Circuits,
vol. 50, no. 3, pp. 763–775, Mar. 2015.
[10] S. Yuan et al., “A 4 × 20-Gb/s 0.86 pJ/b/lane 2-tap-FFE source-series-
TABLE IV terminated transmitter with far-end crosstalk cancellation and divider-less
RX P ERFORMANCE S UMMARY AND C OMPARISON clock generation in 65 nm CMOS,” in Proc. IEEE Custom Integr. Circuits
Conf. (CICC), 2015, pp. 1–4.
[11] S. Y. Kao et al., “A 7.5-Gb/s one-tap-FFE transmitter with adaptive far-
end crosstalk cancellation using duty cycle detection,” IEEE J. Solid-State
Circuits, vol. 48, no. 2, pp. 391–404, Feb. 2013.
[12] H. K. Jung et al., “A slew-rate controlled transmitter to compensate for
the crosstalk-induced jitter of coupled microstrip lines,” in Proc. IEEE
Custom Integr. Circuits Conf. (CICC), 2010, pp. 1–4.
[13] K. L. J. Wong et al., “A 5-mW 6-Gb/s quarter-rate sampling receiver with
a 2-tap DFE using soft decisions,” IEEE J. Solid-State Circuits, vol. 42,
no. 4, pp. 881–888, Apr. 2007.
[14] A. Agrawal et al., “A 19 Gb/s serial link receiver with both 4-tap FFE and
5-tap DFE functions in 45 nm SOI CMOS,” in IEEE ISSCC Dig. Tech.
Papers, 2012, pp. 134–135.
[15] J. W. Jung and B. Razavi, “A 25 Gb/s 5.8 mW CMOS equalizer,” in IEEE
ISSCC Dig. Tech. Papers, 2014, pp. 44–45.
[16] F. Spagna et al., “A 78 mW 11.8 Gb/s serial link transceiver with adaptive
RX equalization and baud-rate CDR in 32 nm CMOS,” in IEEE ISSCC
Dig. Tech. Papers, 2010, pp. 366–367.
[17] P. A. Francese et al., “A 16 Gb/s 3.7 mW/Gb/s 8-tap DFE receiver and
baud-rate CDR with 31 kppm tracking bandwidth,” IEEE J. Solid-State
Circuits, vol. 49, no. 11, pp. 2490–2502, Nov. 2014.
[18] S. Yuan et al., “A 48 mW 15-to-28 Gb/s source-synchronous receiver with
adaptive DFE using hybrid alternate clock scheme and baud-rate CDR in
are also comparable to the others. The power efficiency of the 65 nm CMOS,” in Proc. Eur. Solid-State Circuits Conf. (ESSCIRC), 2015,
pp. 144–147.
entire RX is 1.92 mW/Gb/s, which is improved by at least 47% [19] J. Y. Chang et al., “A 15–20 GHz delay-locked loop in 90 nm CMOS
compared with the other state-of-the-art designs. technology,” in Proc. IEEE Asian Solid-State Circuits Conf. (ASSCC),
2008, pp. 213–216.
[20] R. Bai et al., “A 0.25 pJ/b 0.7 V 16 Gb/s 3-tap decision-feedback equalizer
V. C ONCLUSION in 65 nm CMOS,” in IEEE ISSCC Dig. Tech. Papers, 2014, pp. 46–47.
[21] Optical Internetworking Forum, “IA title: Common electrical I/O
In this paper, we have demonstrated a 25 Gb/s quarter-rate (CEI)—Electrical and jitter interoperability agreements for 6G + bps, 11G
SerDes transmitter and receiver chipset fabricated in a 65 nm + bps and 25G + bps I/O, section 13.3.11.2.1.” [Online]. Available: http://
CMOS technology. Some novel structures and clock schemes www.oiforum.com/public/documents/OIF_CEI_03.1.pdf
[22] Y. Doi et al., “32 Gb/s data-interpolator receiver with 2-tap DFE in
are developed for the purpose of low power and intensive 28 nm CMOS,” in IEEE ISSCC Dig. Tech. Papers, 2013, pp. 36–37.
equalization ability. The total power consumption of the TX
and RX chipset is approximately 70 mW (TX: 21.8 mW, RX:
48 mW) at 25 Gb/s, which compensates for more than 40 dB
channel loss (TX: 8.9 dB, RX: 33.8 dB). The power efficiency
and equalization performance are better than the reported state-
of-the-art works.
Shuai Yuan (S’15) was born in Jilin, China. He
received the B.S. degree from the Department
R EFERENCES of Electronic Engineering, Tsinghua University,
[1] CISCO VNI forecast highlights. [Online]. Available: http://www.cisco. Beijing, China, in 2011. He is now working toward
com/c/en/us/solutions/service-provider/visual-networking-index-vni/ the Ph.D. degree at the Institute of Microelectron-
vni-forecast.html ics, Tsinghua University, China. His research inter-
[2] H. Kimura et al., “A 28 Gb/s 560 mW multi-standard SerDes with single- ests include high-speed wireline transceiver and low
stage analog front-end and 14-tap decision feedback equalizer in 28 nm power equalizer.
CMOS,” IEEE J. Solid-State Circuits, vol. 49, no. 12, pp. 3091–3103,
Dec. 2014.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
YUAN et al.: A 70 mW 25 Gb/s QUARTER-RATE SerDes TRANSMITTER AND RECEIVER CHIPSET WITH 40 dB OF EQUALIZATION 11
Liji Wu (M’95) received the B.S., M.S., and Ph.D. Chun Zhang was born in 1972 in Jilin province,
degrees in electronic engineering from Tsinghua China. He received the B.S. and Ph.D. degrees from
University, Beijing, China, in 1988, 1991, and 1997, the Department of Electronic Engineering, Tsinghua
respectively. University, Beijing, China, in 1995 and 2000, re-
From April 1997 to May 2000, he worked in the spectively. He works in Tsinghua University since
Center for Advanced Technology in Telecommunica- 2000 till now. He worked in the Department of Elec-
tions, Polytechnic Institute of New York University tronic Engineering from 2000 to 2004. From 2005,
(NYU-Poly), Brooklyn, NY, USA, as a Postdoctoral he is an Associate Professor of the Institute of Micro-
Fellow, worked on design and implementation of electronics. His research interests include mixed
high-speed control circuits and systems utilized in signal integrated circuits and systems, embedded
WDM ATM Multicast optical switching systems microprocessor design, digital signal processing, and
sponsored by DARPA. Then he worked in high-tech industry in the U.S. for radio frequency identification.
more than 4 years, including TyCom Laboratories (former AT&T Bell Labs on
Undersea Optical Fiber Communications), Eatontown, NJ, USA, as a Senior
Member of Technical Staff. He received Tsinghua University Outstanding Zhihua Wang (M’99–SM’04) received the B.S.,
Graduate Award and Medal in 1988. He joined Tsinghua University as a Full- M.S., and Ph.D. degrees in electronic engineering
Time Faculty since 2005. He is a Board Member of Shanghai Pudong Science from Tsinghua University, Beijing, China, in 1983,
& Technology Association, Shanghai, China since 2006. 1985, and 1990, respectively.
In 1983, he joined the faculty at Tsinghua Uni-
versity, where he is a full Professor since 1997 and
Ziqiang Wang was born in 1975 in Beijing, China. Deputy Director of Institute of Microelectronics
He received the B.S. and Ph.D. degrees from the since 2000. From 1992 to 1993, he was a visiting
Department of Electronic Engineering, Tsinghua scholar at Carnegie Mellon University. From 1993 to
University, Beijing, China, in 1999 and 2006, respec- 1994, he was a Visiting Researcher at KU Leuven,
tively. He works as a Research Assistant in the Insti- Belgium. His current research mainly focuses on
tute of Microelectronics, Tsinghua University, after CMOS RF IC and biomedical applications. His ongoing work includes RFID,
Doctor graduation. From 2015, he is an Associate PLL, low-power wireless transceivers, and smart clinic equipment with combi-
Professor of the Institute of Microelectronics. His nation of leading edge CMOS RFIC and digital imaging processing techniques.
research interest is analog circuit design. Prof. Wang has served as Deputy Chairman of Beijing Semiconductor
Industries Association and ASIC Society of Chinese Institute of Communi-
cation, as well as Deputy Secretary General of Integrated Circuit Society in
China Semiconductor Industries Association. He had been one of the chief
scientists of the China Ministry of Science and Technology serves on the
Xuqiang Zheng received the B.S. and M.S. degrees expert committee of the National High Technology Research and Development
from Central South University, Hunan, China. He is
Program of China (863 Program) in the area of information science and
now working as an engineer in Tsinghua University, technologies from 2007 to 2011. He had been an official member of China
China. His research interest is high-speed SerDes Committee for the Union Radio-Scientifique Internationale (URSI) during 2000
design and analog to digital converter. to 2010. He was the chairman of IEEE Solid-State Circuit Society Beijing
Chapter during 1999–2009. He has been a Technologies Program Committee
member of ISSCC (International Solid-State Circuit Conference) during 2005
to 2011. He is an Associate Editor for IEEE T RANSACTIONS ON B IOMEDICAL
C IRCUITS AND S YSTEMS and IEEE T RANSACTIONS ON C IRCUITS AND
S YSTEMS —PART II: E XPRESS B RIEFS .