Professional Documents
Culture Documents
Ieee 2016 PDF
Ieee 2016 PDF
Ieee 2016 PDF
ABSTRACT 16-Bit 1
WL1
sensing applications. This FIFO memory is designed and D115 D25415 D25715 D51015
D20
D2540 D2570 D5100
implemented using folded bit-interleaved 10T near-/sub-threshold D315 D25215 D25915 D50815 D30 D2520 D2590 D5080
512-Word
128
SRAM bit-cells, self-timed pointers and bank-level power control Memory
512
MSB
MSB
128
circuits. The 10T SRAM cell is proposed for the bit-interleaving Array
structure with 2.4X write static noise margin (SNM) improvement. D12515 D13015 D38115 D38615 D1250 D1300 D3810 D3860
and avoids long routing wires for the circular self-timed pointers.
MUX15 MUX0
Additionally, the event-driven self-timed pointers are designed to
LSB
reduce the power consumption of clock buffers. For further MSB
decreasing the overall power dissipation, bank-level column-based Figure 1. Folded bit-interleaving structure for 512x16 FIFO memory.
power control circuitry is proposed to switch the voltages for
different banks to achieve 60.5% power saving. A 512x16 FIFO signal sensing platform. This FIFO memory is designed using self-
memory is implemented in UMC 28nm HKMG CMOS technology. timed pointers and folded bit-interleaving structure without global
Compared with the prior arts, 47X power reduction and 2.7X area clock signal. For the control of different operation voltages and
efficiency can be achieved by the proposed design techniques. frequencies of FIFO memory, a bank-level dynamic voltage scaling
technique is utilized to further reduce the total power consumption
I. INTRODUCTION by providing adjustable voltages for different banks. The rest of this
paper is organized as follows. The folded bit-interleaving structure
FIFO memories are widely utilized for data buffers and flow for the 512x16 FIFO memory is presented in Section II. Section III
control in many bio-signal sensing applications, and dominate the describes the proposed 10 transistors (10T) SRAM bit-cell to
overall die area and power consumption [1]. For multi-bio-signal enhance read/write margin and performance. The design of event-
sensing applications, the sensing platforms have to be lightweight driven self-timed pointers is described in Section IV. Section V
with small form factor for wearable or implantable devices. Thus, elucidates the bank-level power control circuitry for further reducing
limited energy consumption and small area are the two critical design total power consumption. Section VI summaries the post-simulation
challenges of multi-bio-signal sensing platforms. As such, ultra-low- results and the conclusion is given in Section VII.
power FIFO memories become the significant building blocks for
bio-sensing applications.
A conventional FIFO memory consists of three major components:
II. FOLDED BIT-INTERLEAVING STRUCTURE
storage elements, read/write pointers and timing control circuitries. For the 256x16 FIFO memory, the folded bit-interleaving
For high density and low power concerns, the SRAM-based storage structure is adopted to prevent long wires of control signals and to
elements are usually adopted rather than register-based memories. reduce the number of shift registers as shown in Fig. 1. In this
Accordingly, near-/sub-threshold SRAMs are utilized to reduce the structure, each column with 256 bit-cells is folded into four sub-
power consumption significantly by decreasing the operation voltage columns controlled by circular column-based pointers. Therefore, the
to near-/sub-threshold region [2-4]. However, the degraded design 512x16 memory array is transformed to a 128x64 array. Hence, the
margin and increased transistor variations are the serious challenges bit-line capacitance is reduced to improve the read/write access time.
for near-/sub-threshold SRAMs [1]. The read/write pointers of FIFO Based on the folded structure, the number of pointers is decreased
memories are typically implemented by shift registers [2] or counter- from 512 to 192 (128+4x16). Moreover, the original 512 circular
based pointers [2]. Both types of pointers consume a relatively large pointers require a long wire to transfer the token signal from the
portion of the total power consumption due to large number of flip- bottom register to the top register. In the folded interleaving structure,
flops for shift-register-based pointers, and overhead of decoder the pointers are implemented by 128-bit up/down scanning shift
circuit for counter-based pointers. Additionally, these two types of registers and 4-bit circular scanning shift registers. The up/down
pointers also require a global clock for the timing control circuits scanning requires only about 1/4 wire length compared with the
with large power consumption by clock buffers. original un-folded design for transferring the token from the bottom
To realize ultra-low-power FIFO memories, an 8kb near-/sub- to the top. As such, the operation speed of pointers is significantly
threshold FIFO memory is presented in this paper for multi-bio- improved.
MN1 A Q D
FF_neg
Reset 1
Q Qb MN4 Replica
MN2 write pulse
MN3
B WLn-1 Q
MP1 WL0
MP2
WLn Reset
Q D
FF
A
Reset
WBL WWLAb WLn-1 Q
WL1
RBL WLn Reset Q D
FF
Regular Vt Low Vt
Reset
WLn-1 Q
WL2
WLn Reset
Figure 2. 10T bit-interleavable near-/sub-threshold SRAM bit-cell.
Pointer Column
28nm HKMG tech. @TT 25°C VDD=400mV Bank-Level WL17
MC Simulation (16,000 samples) MUXs Up/Down
2FFXUUHQFHWLPHV
WL110
Indicators
μ=108.4 10T Bit-cell [5]
ȱ=11.3 9T Bit-cell [6] WLn-1 Q
WL126
This work WLn Reset Q D
FF
μ=105.8 μ=110.9 Reset
WLn-1 Q
ȱ=14.0 ȱ=13.8 WL127
WLn Reset Q D
FF
Reset
Q D
A FF
Reset 0
B
5HDG610P9 Q D
FF_neg
Figure 3. Distribution of read SNM from Monte Carlo simulations. Reset 1
PSC
PSC
PSC
PSC
WLn-1 FF WLn
Reset Qn
WLn
Global Reset
Figure 6. Schematic and timing diagram of a self-timed pointer unit. VDDL VDDH
Bank 0
Normalized Pointer Power
32
-34% UHDGFRQWUROcircuitry
-59% 4PSC 4PSC įįį 4PSC
:ULWH'ULYHU
VDD4
VDD3
VDD2
4
Bank0
Bank1
Bank7
VDD1
' įįį
6$
16
įįį
ZULWHFRQWUROcircuitry
Shift-Register- Counter-Based Self-Timed
Based Pointer Pointer Pointer
Figure 8. Bank-level column-based power control.
Figure 7. P ower comparison of different pointers.
VDDL VDDH
write-1 margin at 0.4V, TT, 25oC from Monte Carlo simulations WWLAj
WWLB8i-1
PS_L
with 16,000 samples. The nominal write-1 margin of the proposed VCS
CEN PS_H
10T SRAM bit-cell is 1.9X and 2.4X higher than that of the WWLAj
RWL8i-1
previous 10T and 9T SRAM bit-cells, respectively. WWLAj PS_H DSCH
RWL16i-1
IV. EVENT-DRIVEN SELF-TIMED POINTER
Delay line
As the depth of FIFO increases, the number of flip-flops and the
lengths of clock/control wires increase. The access time of the FIFO PS_L
both access time and power consumption by avoiding long wire and VCS
VDDL VDDH
C D D C
LEV LEV
Write Track
C D D C
LEV LEV
SRAM Bank SRAM Bank
self-timed pointers are utilized to reduce both the access time and
PS W W PS
C D D C
R
R
LEV
199.16 μm
LEV
DIDO DIDO
Read/Write Horizontal Pointers Read/Write Horizontal Pointers power consumption. Moreover, bank-level power control is
DIDO DIDO
implemented to provide dynamic voltage scaling based on the
Column
Column
LEV LEV
Read Track
olumn
olumn
PS W W PS
l