Content Addressable Memory With Efficien PDF

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 7

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/333672454

Content Addressable Memory with Efficient Power Consumption and


Throughput

Article · June 2019

CITATIONS

3 authors, including:

Prasanna Venkatesan G K D
Karpagam Academy of Higher Education
107 PUBLICATIONS   91 CITATIONS   

SEE PROFILE

Some of the authors of this publication are also working on these related projects:

Prediction of cirrhosis disease from radiologist liver medical image using hybrid coupled dictionary pairs on longitudinal domain approach View project

An Efficient Cluster head Selection Strategy for Multicasting and Geocasting View project

All content following this page was uploaded by Prasanna Venkatesan G K D on 10 June 2019.

The user has requested enhancement of the downloaded file.


IJETST- Volume||01||Issue||03||Pages 399-404||May||ISSN 2348-9480 2014

International journal of Emerging Trends in Science and Technology

Content Addressable Memory with Efficient Power


Consumption and Throughput
Authors

Karthik.M1, R.R.Jegan2, Dr.G.K.D.Prasanna Venkatesan3


1
ME VLSI student PGP College of Engineering and Technology Nammakkal Tamilnadu India
2
AP,PGP College of Engineering and Technology Nammakkal Tamilnadu India
3
Vice Principal,Professor and Head/ECE, PGP College Of Engineering and Technology Namakkal.
Tamilnadu, India
Corresponding Author
Karthik.M
ME VLSI student
PGP College of Engineering and Technology
Nammakkal Tamilnadu,India
Email: Mkarthik.jan@gmail.com

Abstract:
Content-addressable memory (CAM) is a hardware table that can search and Store data.CAM is
actually considerable Power Consumption and parallel comparison feature where a large amount
of transistor are active on each lookup. Thus, robust speed and low-power sense amplifiers are
highly sought-after in CAM designs. In this paper, we introduce a modified parity bit matching that
leads to delay reduction and power overhead. The modified design minimizes the searching time by
matching the store bit from most significant bit instead of matching all the data's present in the
row. Furthermore, we propose an effective gated power techniques to decrease the peak and
average power consumption and enhance robustness of the design against the process variation.
Indexterms-CAM,ParityCAM,ATMController,VPI/VCI

I.INTRODUCTION
Content-addressable memories (CAMs) are reduced delay associated with the large amount of
hardware search engines that are much faster than parallel active circuitry, without sacrificing speed
algorithmic approaches for search- intensive or memory density.CAM Design technique at the
applications. CAMs are composed of conventional circuit level and architecture level. we review
semiconductor memory Static RAM with added low-power match line sensing techniques and
comparison circuitry that enables a search search line driving approaches. At the Architecture
operation to complete in a one clock cycle. level We review three methods for reducing the
We survey recent development in the design of power consumption.
large capacity CAM. A CAM is a memory that II. Proposed Method : PARITY CAM
implements the lookup-table function in a single Content-addressable memory (CAM) is frequently
clock cycle using comparison circuitry .CAMs are used in applications, such as lookup tables,
Especially popular in Network routers for packet databases, associative computing, and networking,
forwarding and packet classification, But they are that require high-speed searches due to its ability
also beneficial in variety of application that require to improve application performance by using
high speed lookup. The main CAM-design parallel comparison to reduce search time.
challenge is to reduce power consumption and to Although the use of parallel comparison results in

Karthik M, et al www.ijetst.in Page 399


IJETST- Volume||01||Issue||03||Pages 399-404||May||ISSN 2348-9480 2014
reduced search time, it also significantly increases considerable modification to the memory
power consumption. In this paper, we propose a architecture to achieve high performance.
Block-XOR approach to improve the efficiency of Therefore, it is beyond the scope of the general
low power pre computation-based CAM (PB- CAM design. Moreover, the disadvantage of the
CAM). The major contribution of this paper is that ones count PB-CAM system is that it adopts a
it presents theoretical and practical proofs to verify special memory cell design for reducing power
that our proposed Block-XOR PB-CAM system consumption, which is only applicable to the ones
can achieve greater power reduction without the count parameter extractor.
need for a special CAM cell design.

Fig2.1 parity bit based CAM


A content-addressable memory (CAM) is a
critical device for applications involving Fig1.2 ATM Controller Block Diagram
asynchronous transfer mode (ATM), CAMs can be used in Asynchronous Transfer
communication networks, LAN bridges/switches, Mode (ATM) switching network components as a
databases, lookup tables, and tag directories, due translation table. Since ATM networks are
to its high-speed data search capability. A CAM is connection-oriented, virtual circuits need to be set
a functional memory with a large amount of stored up across them prior to any data transfer. There are
data that simultaneously compares the input search two kinds of ATM virtual circuits: Virtual Path
data with the stored data. Once matching data are (identified by a virtual path identifier or VPI) and
found, their addresses are returned as output as Channel Path (identified by a channel path
shown in Fig. 1. The vast number of comparison identifier or VCI). VCI/VPI values are localized;
operations required by CAMs consumes a large each segment of the total connection has unique
amount of power. In the past decade, much VPI/VCI combinations. Whenever an ATM cell
research on energy reduction has focused on the travels through a switch, its VPI/VCI value has to
circuit and technology domains provides a be changed into the value used for the next
comprehensive survey on CAM designs from segment of connection. This process is called
circuit to architectural level. Several works on VPI/VCI translation. Since speed is an important
reducing CAM power consumption have focused factor in ATM network, the speed at which this
on reducing match- line power. Although there has translation occurs forms a critical part of the
been progress in this area in recent years, the network’s overall performance.
power consumption of CAMs is still high CAM can act as an address translator in an
compared with RAMs of similar size. At the same ATM switch and perform the VPI/VCI translation
time, research in associative cache system design very quickly. During the translation process, the
for power efficiency at the architectural level CAM takes incoming VPI/VCI values in ATM cell
continues to increase. The filter cache and headers and generates addresses that access data in
location cache techniques can effectively reduce an external RAM (since standard CAM
the power dissipation by adding a very small architectures cannot support the required capacity,
cache. However, the use of these caches requires a CAM/RAM combination enables the realization
major modifications to the memory structure and of multi- megabit translation tables with fully-
hierarchy. proposed a cache-CAM that reduces parallel search capability). VPI/VCI fields from
power consumption relative to the cache hit rate. the ATM cell header are compared against a list of
presented a ones-count pre computation-based current connections stored in the CAM array. As a
CAM (PB-CAM) that achieves low-power, low result of the comparison, CAM generates an
cost, low-voltage, and high-reliability features. address that is used to access an external RAM
Although further improved the efficiency of PB- where VPI/VCI mapping data and other
CAMs, the approach proposed requires connection information is stored. The ATM
Karthik M, et al www.ijetst.in Page 400
IJETST- Volume||01||Issue||03||Pages 399-404||May||ISSN 2348-9480 2014
controller modifies the cell header using the A. Memory Structure
VPI/VCI data from the RAM and the cell is sent to For the ATM function to operate
the switch. parameters are required for each channel.
There are address compression tables for
III ATM CONTROLLER the receiver to decide which channel any given cell
The basic functionality of the controller is to relates to, connection tables for both receiver and
receive ATM cells from the PHY via the utopia transmitter for defining all the protocol and data
interface, utilize the cell header to determine what routing information for each channel, pace control
channel related to this FCC the cell belongs to, and tables for directing the scheduling of cell to
transfer the data from the cell to the next available transmit, Interrupt queues for both receive and
location in the appropriate buffer. The reverse is transmit, and buffer descriptors and buffers for the
also true. When the PHY indicates that it has room data.
for a cell to transmit the controller must decide Some of these parameters are in DP RAM
which is the next channel to be transmitted, locate some in external memory.
the buffer containing the required cell, add the FCC buffering is an intermediate buffer to
appropriate header to the cell, and transfer the cell improve data throughput and where some
to the PHY. processing is handled by the CP and the register
This is a very simple description of the area is where the basic functionality of the FCC is
activity and as you progress through the course the defined.
process will be described in more detail. The
process of segmentation and re-assembly is simply
the transfer of the required data between the cells
and buffers, where the cell contains the fixed block
of data between 46 and 48 bytes depending on the
adaptation layer used, and the buffers contain the
user data of whatever size it is. Address Mapping
is the Address header and appropriate channel.
Buffer is the way of another protocol. ATM Many
Channel all the information handled by the
connection table. Below the Fig 3.1 to overall
explanation for the separate blocks.

Fig 3.2 Memory Structure

B. VPI/VCI Evaluation

VPI is the virtual path identifier and can be


considered like the outer case of a multi core
cable. It identifies the major route of the data. VCI
is the virtual circuit identifier and can be
considered like an inner core of the cable.
These values are used to route the cell data
to the required destination. However, the difficult
concept here is that when the cell is transmitted,
Fig 3.1ATM Controller
these values are not defining the final destination
but simply the first stage routing.

Karthik M, et al www.ijetst.in Page 401


IJETST- Volume||01||Issue||03||Pages 399-404||May||ISSN 2348-9480 2014
A very brief explanation of what happens Powe r I(mA) P(mW)
is that before any data is transmitted the system Summary
has to through a form of negotiation to find the Total power 0 118
best routing for the cell and in the process each Estimated Power
node or ATM switch must set up a table for the Consumption
VPI and VCI values, and priority and quality of Vccint l00v 62 111
service for the transfer. Vcce Quicien 2 7
This is all handled by higher level Clocks 38 69
applications software outside the FCC operation Input 8 15
and so is not a function of this training. The VPI Logic 0 0
and VCI is added into the header of the cell when Output - -
transmitted, and the receiver examines these values Signals 0 0
to determine which channel the cell belongs to. Quicient Vccint 15 27
l00v
Quicien Vcc33. 2 7
3,0

Table 4.1 Existing Method Power Analytical

Estimated Junction 27*c


Temperature
Ambient 25*c
Temperature
Case Temperature 27*c
Theta J-A 17c/W

Table 4.2 Existing Thermal Summary


Comparison table power consumption vs
Thermal power. Two types of Analytical progress
various types of Parameter should be Analyzed. In
this Existing method Estimate Power Consumption
118 P(mW) at the same time Temperature level
27*C. Clock pulse 69 clock P(mW).
In this Existing Method Concepts fully pre
Computation Based CAM. Is also Used Parity
Fig 3.3 ATM Network VPI/VCI CAM in the proposed system. Various Technology
is Used to find the Efficient Power Consumption.
IV SIMULATION RESULT ANALYSIS
The proper Operation of both power optimized Powe r I(mA) P(mW)
and Speed Optimized designs are verified by Summary
performing extensive Microwind and Dsch Total power 0 115
&Xilinx simulation. All simulation presented this Estimated
section compare ML0 against an ML1 on 144 bit Power
CAM Word. Consumption
A. Design Optimized for Power Consumption Vccint l00v 60 109
For accurate Comparison of the ML voltages, the Vcce Quicien 2 7
initial currents to all the ML Must be identical. For Clocks 37 67
minimum Power Consumption initial current must Input 8 15
be zero.and the pre charging node Vdd-Vtn.
Logic 0 0
Table 4.1 and Table 4.2 shows the Xilinx
Output - -
Simulation results for this design compares the
Signals 0 0
power level for 144 bit CAM Word.
Quicient Vccint 15 27

Karthik M, et al www.ijetst.in Page 402


IJETST- Volume||01||Issue||03||Pages 399-404||May||ISSN 2348-9480 2014
l00v To Compare Existing and Proposed System Clock
Quicien 2 7 Speed Differents of the Maximum input arrival
Vcc33.3,0 rate 1.416 nsec. But maximum Output required
time after clock is same.
Table 4.3 Proposed Method Power Analytical
V.CALCULATING RESULT ANALYSIS
Thermal Summary SPEED VS POWER CONSUMPTION
Ambient 25*c
Temperature
Case Temperature 27*c
Theta J-A 17c/W

Table 4.4 Proposed Method Power Analytical

B. Design Optimized for Search speed


To increase Search Speed ,the initial Current
supplied MLs is increased by Pre Charging the
VAR node.
Speed Grade: -7
Minimum period: 3.680ns (Maximum VI.CONCLUSION
Frequency: The Proposed CAM allocates power to match
271.739MHz) decision based on the number of mismatchd bits in
each CAM Word. This scheme result considerable
Minimum input 5.892ns power reduction. The Proposed System Power
arrival time Consumption 115 P(mW).Existing Method Power
before clock Consumption 118 P(mW). Then the input arrival
Maximum 8.129ns rate both are the Difference is 1.416nsec.Power
output required consumption Avg 0.9754% Variations and then
time after clock avg speed is 1.416nsec
Maximum No path found
combinational REFERENCES
path delay 1) T. Do, S. S. Chen, Z. H. Kong, and K. S.
Table 4.5 Existing Search Speed Yeo, “A low-power CAMwith efficient
Proposed system Search speed power and delay trade-off,” in Proc. IEEE
Speed Grade: -7 Int. Symp. CircuitsSyst. (ISCAS), 2011, pp.
2573–2576.
Minimum period 1.714ns (Maximum 2) N. Mohan and M. Sachdev, “Low- leakage
Frequency: storage cells for ternary content
583.431MHz) addressable memories,” IEEE Trans. Very
Large Scale Integr.(VLSI) Syst., vol. 17,
Minimum input 4.476ns no. 5, pp. 604–612, May 2009.
arrival time before 3) O. Tyshchenko and A. Sheikholeslami,
clock “Match sensing using match line stability
Maximum output 8.129ns in content addressable memorys (CAM),”
required time after IEEE J. Solid-State Circuits, vol. 43, no. 9,
clock pp. 1972–1981, Sep. 2008.
Maximum No path found 4) N. Mohan, W. Fung, D. Wright, and M.
combinational path Sachdev, “A low-powerternary CAM with
delay positive-feedback match-line sense
amplifiers,”IEEE Trans. Circuits Syst. I,
Table 4.6 Proposed Search Speed Reg. Papers, vol. 56, no.3,pp. 566–
573,Mar. 2009.

Karthik M, et al www.ijetst.in Page 403


IJETST- Volume||01||Issue||03||Pages 399-404||May||ISSN 2348-9480 2014
5) S. Baeg, “Low-power ternary content-
addressable memory design using a
segmented match line,” IEEE Trans.
Circuits Syst. I, Reg.Papers, vol. 55, no. 6,
pp. 1485–1494, Jul. 2008.
6) K. Pagiamtzis and A. Sheikholeslami, “A
low-power content-addressable memory
(CAM) using pipelined hierarchical search
scheme, ”IEEE J. Solid-State Circuits, vol.
39, no. 9, pp. 1512–1519, Sep. 2004

Karthik M, et al www.ijetst.in Page 404

View publication stats

You might also like