Professional Documents
Culture Documents
NHsurya Project-2
NHsurya Project-2
BACHELOR OF TECHNOLOGY
IN
ELECTRONICS AND COMMUNICATION ENGINEERING
By
i
DEPARTMENT OF ELECTRONICS ANDCOMMUNICATION ENGINEERING
(Autonomous)
CERTIFICATE
This is to certify that the project report entitled "LOW POWER AREA
EFFICIENT ALU USING GDI BASED FULL ADDER " that is being submitted by
Dr.S.VIJAYAKUMAR,Ph.D Dr.K.GOPI,M.Tech,Ph.D
Professor, Dept of ECE Head of the
Department Internal Guide Dept. of ECE
Submitted for University Examination (Viva-Voce) held on
The successful completion of this project work was made possible with the and
guidance received from various quarters. we would like to available this opportunity to
express oursincere thanks and gratitude to all of them.
We Finally, we thank all the teaching faculty and non-teaching staff of the
Department of Electronics and Communication Engineering for giving their advices and their
help during the course of our project work as well as studies.
iii
Course Outcomes for project work
CO-PO MAPPING
COs\
POs
CO1 √
CO2 √
CO3 √
CO4 √
CO5 √
CO6 √
CO7 √
CO8 √
CO9 √
CO10 √
CO11 √
CO12 √
iv
Evaluation Rubrics for Project work:
Rubric (CO) Excellent (Wt = 3) Good (Wt = 2) Fair (Wt = 1)
Select a latest topic through Select a topic through Select a topic through
Selection of Topic (CO1) complete knowledge partial knowledge improper knowledge of
offacts and concepts. offacts and facts and concepts.
concepts.
Reasonable Improper comprehension
Analysis and Synthesis Thorough comprehension
comprehension through through analysis/
(CO2) through analysis/
analysis/ synthesis. synthesis.
synthesis.
Reasonable
Thorough comprehension Improper comprehension
comprehension about
Problem Solving (CO3) about what is proposed about what is proposed in
what is proposed in
inthe literature papers. the literature.
the literature papers.
Considerable literature Incomplete literature
Extensive literature survey
Literature Survey (CO4) survey with standard survey with substandard
with standard references.
references. reference.
Clearly identified and has Identified and has Identified and has
Usage of Techniques complete knowledge of sufficient knowledge inadequate knowledge of
& Tools (CO5) techniques & tools used oftechniques & tools techniques & tools used
in the project work. in the project work. in project work.
Conclusion of Conclusion of project
Project work impact Conclusion of project
projectwork has work has feeble impact
on Society (CO6) workhas strong impact on
considerable on society.
society.
impact on society.
Conclusion of project work Conclusion of project Conclusion of project
Project work impact
has strong impact work has considerable work has feeble impact
on Environment (CO7) onEnvironment. impact environment. environment.
Moderate understanding Insufficient
Clearly understands ethical
Ethical attitude (CO8) of ethical and understanding ethical
and social practices. socialpractices. and social practices.
Did literature survey Did literature survey Selected a topic as
Independent Learning
andselected topic with a andselected topic with suggested by the
(CO9) considerable guidance
little guidance supervisor
Presentation in logical
Presentation with key Presentation with
sequence with key
Oral Presentation (CO10) points, conclusion insufficient key points
points,clear conclusion
andgood language and imprope conclusion
and excellent language
Status report with
Status report with clear and
logicalsequence of Status report not properly
Report Writing (CO10) logic sequence of chapters
chapterusing language organized
using excellent language
Time and Cost Analysis Comprehensive time Moderate time and Reasonable time and cost
(CO11) andcost analysis costanalysis analysis
Continuous learning Highly enthusiastic towards Interested in Inadequate interest in
(CO12) continuous learning continuouslearning continuous learning
v
ABSTRACT
vi
CONTENTS
LIST OF TABLES xi
LIST OF ABBREVIATIONS xii
1 INTRODUCTION 1
1.1 Need For low power 3
1.2 Need for low Power VLSI chips 3
2 LITERATURE SURVEY 6
2.1 Design of Low power ALU using 7
8T FA and PTL based Mux circuits
2.2 A review paper on 3T XOR cells 7
and 8T adder design in cadence
180nm
2.3 Design of low power high speed 7
ALU using feedback switch logic
2.4 Low power sub-threshold 7
asynchronous QDI static
logic transistor level
2.5 Design of a low power , sub- 8
threshold ,asynchronous ALU using
a bi-directional adder
2.6 Energy efficient,high 8
performance circuits for arithmetic
unit
2.7 A low power 10-transistor full- 9
adder cell for embedded
architecture
2.8 An implementation of 1-bit low 9
power Full Adder based on
multiplexer and pass transistor logic
2.9 A fast ALU design in CMOS for 9
low voltage operation
3 PROPOSED APPROACH 10
4 EXISTING SYSTEM 16
4.1 Power dissipation types 17
4.2 Gate diffusion input technique 19
4.3 ALU using GDI technique 21
4.4 Multiplexer 22
4.5 Design of arithmetic and logic 23
unit
5 VLSI DESIGN 24
5.1 Introduction 25
vii
5.2 Verilog advantages over VHDL 27
5.3 VLSI designs classification 27
5.3.1 Analog 27
5.3.2 ASICS or Application Specific 27
Integrated Circuits
5.3.3 SOC or System on a chip 28
5.4 VLSI design styles 28
5.4.1 Field Programmable Gate 28
Array (FPGA)
5.4.2 Gate array design 29
5.4.3 Standard-cells based design 30
5.4.4 Full custom design 31
5.5 Tool (DSCH & Ex-port Micro 32
wind)
5.6 Simulation environment 37
5.6.1 DSCH (Digital Schematic ) 37
5.6.2 Micro wind 38
6 SIMULATION RESULTS 43
6.1 Existing method 44
6.1.1 1-Bit ALU 44
6.1.2 4-Bit ALU 46
6.1.3 GDI cell 47
6.1.4 GDI 1-Bit ALU 48
6.1.5 GDI 4-Bit ALU 50
6.2 Proposed system 51
6.2.1 Modified GDI 1-Bit ALU 51
6.2.2 Modified GDI 4-Bit ALU 53
6.3 Comparision results 54
6.4 advantages of the proposed work 56
6.5 applications of the project 56
7 CONCLUSION AND FUTUR SCOPE 57
REFERENCES 59
v
LIST OF FIGURES
ix
6.1.2.2 4-Bit ALU Layout 46
6.1.2.3 4-Bit ALU output 47
6.1.3 GDI cell 47
6.1.3.1 Basic GDI cell 48
6.1.3.2 4-Bit ALU 48
6.1.4 GDI 1-Bit ALU 48
6.1.4.1 GDI 1-Bit ALU Schematic 48
6.1.4.2 GDI 1-Bit ALU Layout 49
6.1.4.3 GDI 1-Bit ALU Output 49
6.1.5 GDI 4-Bit ALU 50
6.1.5.1 GDI 4-Bit ALU Schematic 50
6.1.5.2 GDI 4-Bit ALU Layout 50
6.1.5.3 GDI 4-Bit ALU Output 51
6.2.1 Modified GDI 1-Bit ALU 51
6.2.1.1 Modified GDI 1-Bit ALU Schematic 51
6.2.1.2 Modified GDI 1-Bit ALU Layout 52
6.2.1.3 Modified GDI 1-Bit ALU Output 52
6.2.2 Modified GDI 4-Bit ALU 53
6.2.2.1 Modified GDI 4-Bit ALU Schematic 53
6.2.2.2 Modified GDI 4-Bit ALU Layout 53
6.2.2.3 Modified GDI 4-Bit ALU Output 54
x
LIST OF TABLES
xi
LIST OF ABBREVIATIONS
VLSI- Very Large Scale Integration
ALU- Arithmetic and Logical Unit
CMOS- Complementary Metal Oxide Semiconductor
DSCH- Digital Schematic
FPGA- Field Programmable Gate Array
HDL- Hardware Description Language
PLD- Programmable Logic Devices
PROM- Programmable Read Only Memory
CLB - Configurable Logic Blocks
LUT- Look Up Table
IC- Integrated Circuit
IEEE- Institute of Electrical and Electronics Engineers
VHSIC- Very High Speed Integrated Circuits
VHDL- Very High Speed Integrated Circuit Hardware Description Language
ASIC-Application Specific Integrated Circuit
SOC-System On Chip
CLB-Configurable Logic Blocks
RAM- Random Access Memory
MOS-Metal Oxide Semiconductor
GA- Gate Array
SOG- Sea Of Gates
PLA- Programmable Logic Array
DRC-Design Rule Check
CPL-Complementary Pass transistor Logic
GDI-Gate Diffusion Input
CPU-Central Processing Unit
DSP-Digital Signal Processors
AOI-And Or Invert
OAI-Or And Invert
xii
CHAPTER-1
INTRODUCTION
1|Page
Dept. of
CHAPTER-1
INTRODUCTION
The increasing demand for low-power very large scale integration (VLSI) can be
addressed at different design levels, such as the architectural, circuit, layout, and the
process technology level . At the circuit design level, considerable potential for power
savings exists by means of proper choice of a logic style for implementing combinational
circuits. This is because all the important parameters governing power dissipation
switching capacitance, transition activity, and short-circuit currents are strongly influenced
by the chosen logic style. Depending on the application, the kind of circuit to be
implemented, and the design technique used, different performance aspects become
important, disallowing the formulation of universal rules for optimal logic styles.
Investigations of low-power logic styles reported in the literature so far, however, have
mainly focused on particular logic cells, namely full-adders, used in some arithmetic
circuits. In this paper, these investigations are extended to a much wider set of logic gates,
and with that, to arbitrary combinational circuits. The power dissipation characteristics of
various existing logic styles are compared qualitatively and quantitatively by actual logic
gate implementations and simulations under realistic circuit arrangements and operating
conditions . Investigations of sequential elements, such as latches and flip-flops, were not
included in this work, but can be found elsewhere in the literature .
The increasing speed and complexity of today’s designs implies a significant
increase in the power consumption of very-large-scale integration circuits. Voltage is
increased to attain higher speed of operation with the penalty of increased power
consumption. To meet this challenge, researchers have developed many different design
techniques to reduce power. CMOS technology scaling has now moved to a power
constrained condition. Circuit techniques to reduce chip standby leakage have become a
key enabler. Scaling is a tradeoff between performance and leakage. One of the key
features that led to the success of complementary metal-oxide semiconductor technology
was its intrinsic low-power consumption . This meant that circuit designers and electronic
design automation (EDA) tools could afford to concentrate on maximizing circuit
performance and minimizing circuit area. Another interesting feature of CMOS technology
is its nice scaling properties, which has permitted a steady decrease in the feature size,
allowing for more and more complex systems on a single chip, working at higher clock
frequencies . In fact, power consumption is regarded as the limiting factor in the continuing
2|Page
Dept. of
scaling of CMOS technology.
3|Page
Dept. of
The tremendous growth in battery - operated and power-sensitive applications has
created a global demand for low-power semiconductors. Today’s power-sensitive
designers face stricter system power limits, specifications, and standards that put a cap on
the total power consumed by the system. Simultaneously, for these applications, the
demand for increased features, performance, and complexity continues to grow, but at the
expense of power. The art of power analysis and optimization of integrated circuits are
used to be a narrow specialty in analog circuit design. The newer generations of
semiconductor processing technologies present more stringent requirements to the power
dissipation of digital chips due to increased device density, speed and complexity. Power is
the rate at which energy is delivered or exchanged. In a VLSI chip, electrical energy is
converted to heat energy during operation. The rate at which energy is taken from the
source and converted in to heat is the power dissipation. Heat energy has to be completely
dissipated from the system in order to avoid increase in chip temperature, which can cause
permanent or temporary circuit failure.
1.1. Need For Low Power
As the demand for high speed, Low power consumption and high performance
continues to grow each year, there is a need to scale the devices to smaller dimensions. As
the market trend moves towards a greater scale of integration, the move toward a reduction
in supply voltage also have the advantage of improving the reliability of the chip. As the
complexity of the chip increases design problems occurs such as layout of a chip and
simulation for the circuit and these are all becomes more intense. Power consumption is
regarded as the limiting factor in the present technology. High power supply affects the
temperature and further influences the cooling cost of the chip. If the temperature increases
it directly reflects the environmental problems. Noise and reliability is the one more big
issue in the field of high power consumption devices. With all this, high power
consumption will defiantly degrade Battery life of the electronics devices.
1.2 NEED FOR LOW POWER VLSI CHIPS
As the scale of integration improves, more transistors, faster and smaller than their
predecessors, are being packed into chip. This leads to the steady growth of the operating
frequency and processing capacity per chip, resulting in increased power dissipation. The
Moore’s law predicts the growth rate of integrated circuits. New generation of processing
technology are being developed at a faster rate than Moore’s law. A need for low power
VLSI chips arises from such evolution forces integration circuits. Another factor that fuels
4|Page
Dept. of
the needs for low power chips is the increased market demand for portable consumer
electronics powered by batteries. The craving for smaller, lighter and more durable
electronic products indirectly translates to low power requirements. Ironically, high
performance computing system characterized by large power dissipation also drives the
low power needs. Power dissipation has a direct impact on the packaging cost of the chip
and the cooling cost of the system. Another major demand for low power chips and
systems comes from environmental concerns. Since electricity generation is a major source
of air pollution, insufficient energy usage in computing equipment indirectly contributes to
environmental Pollution.
A. Design Types
The VLSI low power design problems can be broadly classified into two types such
as Analysis and Optimization. Analysis problems are concerned about the accurate
estimation of the power or energy dissipation at different phases of the design process. The
purpose is to give assurance that the power consumption specifications are not violated.
The accuracy of analysis depends on the availability of design information. Here, better
accuracy is demanded and longer analysis time is allowed. Analysis techniques also serve
as the foundation of design optimization. Optimization is the process of generating the best
design, given an optimization goal, without violating design specifications. An automatic
design optimization algorithm requires a fast analysis engine to evaluate the merits of
design choices.
A design to apply a particular low power technique often tradeoffs from different
sources pulling in various directions. The major criteria to be considered are the impact to
the circuit delay, which affects the performance and throughput of the chip, and the chip
area, which directly translates to manufacturing costs. Other factors of the chip design such
has design cycle time, testability, quality, reliability, reusability, risk etc may all be
affected by a particular design to achieve low power requirement.
B. Introduction to FPGA
The Field Programmable Gate-Array is the most powerful programmable integrated
circuit. It consists of an array of logic blocks that can be connected by general
interconnection resources. FPGAs are programmed using a logic circuit diagram or a
source code in a hardware description language (HDL). The FPGA industry sprouted from
programmable read only memory (PROM) and programmable logic devices (PLDs).
Xilinx Co-Founders, Ross Freeman and Bernard Vonderschmitt, invented the first
commercially viable field programmable gate array in 1985 – the XC2064. The
5|Page
Dept. of
XC2064 had
6|Page
Dept. of
programmable gates and programmable interconnects between gates, the beginnings of a
new technology and market. The XC2064 boasted a mere 64 configurable logic blocks
(CLBs), with two 3-input lookup tables. The FPGA is an integrated circuit that contains
many (64 to over 10,000) identical logic cells that can be viewed as standard components.
7|Page
Dept. of
CHAPTER -2
LITERATURE SURVEY
8|Page
Dept. of
CHAPTER-2
LITERATURE SURVEY
2.1 Design of Low Power ALU using 8T FA and PTL Based Mux Circuits
In this paper they proposed an ALU using novel 8T FA and pass transistor logic
based multiplexers. A 4:l mux and a 2:l mux were used to design an ALU. FA is an
essential component for designing all types of processors like digital signal 5 processors
(DSP), microprocessors, etc. In existing method, FA and multiplexers were designed using
transmission gate logic. To reduce the number of transistors, multiplexers were designed
using pass transistor logic while FA is designed using 8 transistors logic in the
implementation of ALU. The power and the area were greatly reduces to more than 70%
compared to the existing method.
2.2 A Review Paper on 3T XOR cells and 8T Adder Design in Cadence 180nm
This paper gives a review of already existing 3T XOR cells and provides an
optimized value of width/length (W/L) on the basis simulation results obtained which
helped to improve the driving capability as to improve the threshold loss problems present
in the existing designs of 3T XOR cells. However, the driving capability obtained is not
sufficient for large circuits like multipliers. From their best improved version of 3T XOR
cell obtained, they designed a FA circuit. They implement all the basic circuits and their
improved versions in Cadence Virtuoso for 180nm technology and 1.8V source.
2.3 Design of Low Power High Speed ALU Using Feedback Switch Logic
A new dynamic like static circuit family called Feedback-Switch Logic (FSL) has
been proposed in this paper. The FSL is suitable for high speed and low power because it
offers fast switching, reduced capacitance and input-switching dependent activity factor
without the need of clock connection. This paper presents the design of low power high
speed 32-bit ALU based on static CMOS and FSL logics at 90nm CMOS process in
CADENCE design tool. Simulation results shows that the design of ALU using FSL
achieves 14% reduction in delay but at the cost of 8% increased power consumption
compared to static CMOS logic. This ALU combines adder, shifter and logical units which
are having low power consumption, less delay and uses lesser area. ALU using FSL
attained low power and high speed by optimal sizing of transistors.
2.4 Low Power Sub-Threshold Asynchronous QDI Static Logic Transistor Level
Implementation (SLTI) 32-bit ALU In, they proposed an asynchronous-logic
Quasi- Delay-Insensitive (QDI) static logic transistor-level implementation (SLTI)
9|Page
Dept. of
approach for
10 | P a g
Dept. of
low power sub-threshold operation. The approach is implemented to design 32-bit
pipelined ALUs, the primary computation core for microprocessors, and benchmarked
against the reported Pre-Charged Half-Buffer (PCHB). There are two key attributes in this
proposed design. First, the proposed SLTI ALU design can perform dynamic voltage
scaling seamless by only changing the Vdd from nominal (1V) to sub threshold (~0.2V)
regions for high speed/low power operation. Second, the ALU achieves ultra-low power
dissipation (3.5μW) at the lowest Vdd point (~0.15V). For fair of comparison, both
implemented ALUs have identical functionality and functional blocks, are implemented
using the same 65nm CMOS process. Based on the simulations, the minimum energy point
occurs at Vdd of 0.2V for SLTI-based ALU and at Vdd of 0.3V for PCHB based ALU.
The SLTI-based ALU have
~93% and ~89% lower energy on the arithmetic and logic operations respectively from
Vdd of 1V to Vdd of 0.2V. At Vdd of 0.2V, with 9 MHz input switching rate, the
asynchronous ALU based on their proposed SLTI approach dissipates ~51% and ~44%
lower power than the reported PCHB counterpart on the arithmetic and logic operations
respectively.
2.5 Design of a Low Power, Sub-Threshold, Asynchronous ALU Using a Bidirectional
Adder
A novel asynchronous bidirectional ALU is introduced in this paper. The adder in
the proposed design is a ripple carry adder with the bidirectional characteristic. The ALU
is designed with asynchronous dual rail circuit style. Several ALUs with sizes ranging
from 4 bits to 32 bits were built. Their power and performance metrics were compared
with the conventional ALUs built with the fast adders designed with dynamic logic style.
Significant power reduction with the sub-threshold operating voltage is achieved. Also the
design is compared with the ALU design proposed for reversible quantum computers in the
CMOS context to show the logic efficiency of 7 the proposed design around 30 % in area.
Power reduction of 9 - 26% was achieved for the addition operation and 19.5 - 75.1% for
the logical operation on the proposed 32 bit ALU, compared to the conventional dynamic
logic based ALU operated over the voltage range 0.2-0.3V.
2.6 Energy – Efficient, High Performance Circuits for Arithmetic Units
This paper present a new full adder structure based on complementary pass
transistor logic (CPL) which is faster and more energy efficient than the existing structures.
They also proposed a new technique of implementing multiplier circuit using
decomposition logic which improves speed and reduces power consumption by reducing
11 | P a g
Dept. of
the spurious transition on international nodes. There is substantial improvement in the
performance of the multiplier structure with the combination of the new adder structure
and the decomposition logic. The
12 | P a g
Dept. of
proposed circuits were implemented using TSPICE for simulation in TSMC 180nm
technology.
2.7 A Low Power 10-transistor Full Adder Cell for Embedded Architecture
This paper has proposed a full adder cell using 10T which has the advantage of low
power consumption and high operating speed. It occupied a small area due to the small
transistor count. The objective of low power is achieved at the circuit level by reducing the
number of internal node capacitances by eliminating direct paths between the supply
voltage and the ground, by maintaining low switching activity in the circuit. The proposed
cell is compared with standard transmission gate adder cell and a 16T adder cell and
characterized by its low power consumption compared to other adder cells. Using the
proposed adder cell, one 4-bit multiplier is constructed and used as a test vehicle to check
the performance of the new proposed design in embedded architecture. The circuit is
developed using 0.35μm CMOS technology using Cadence development tools and
simulated using HSPICE. The circuit consumed 75.2μW at a frequency of 500 MHz.
2.8 An Implementation of 1-bit Low Power Full Adder Based on Multiplexer and
Pass Transistor Logic
This paper presents the design of low power full adder based on XOR pass
transistor logic and transmission gate for carry. They have not connected power supply
rail directly, instead of that inputs are given directly to reduce the transition activity and
charge recycling capability and this result in great amount of reduction in power
consumption. The proposed 14T full adder has been developed using Tanner SPICE
simulation. Based on their result, there is saving of power supply by the factor of 30% as
compared to 10T full adder and a reduction in power by 26% as compared to the
conventional 28T CMOS adder.
2.9 A Fast ALU Design in CMOS for Low Voltage Operation
In this paper, a high-speed 4-bit ALU has been designed for 1V operation to
demonstrate the usefulness of the back-gate forward substrate bias (BGFSB) method in
1.2μm n-well CMOS technology. The 4-bit ALU employs a ripple carry adder and is
capable of performing eight operations which are four arithmetic and four logical
operations. The BGFSB has been limited to |0.4|V. The delay time measurements are
taken for all operations from the SPICE simulations with and without the back-gate
forward substrate bias.
13 | P a g
Dept. of
CHAPTER -3
PROPOSED APPROACH
14 | P a g
Dept. of
CHAPTER -3
PROPOSED APPROACH
In this paper the MGDI technique is used to realize the circuits. Components
A. 2x1 Multiplexer
A multiplexer is a digital switch which chooses the output from several inputs
based on a select signal , shown in Figure 3.1 , 2x1 multiplexer consists of 2 transistors.
15 | P a g
Dept. of
B. Full Adder
A full adder is a combinational circuit that performs the arithmetic sum of three
input bits. It consists of three inputs and two outputs. The adder cell used in this design
realized using full-swing AND, OR, and XOR gates. This design was has low power
operation, it has the lowest delay and with some modifications it performs the logic
operations such as AND,OR,XOR, & XNOR as well, these modifications will save large
area of the ALU design
VI. DESIGN OF
ALU
An ALU is a critical component in the Central Processing Unit (CPU) of any
computer; even the simplest microprocessors contain one. It performs arithmetic
operations such as addition, subtraction, increment, decrement and logic operations such as
AND, OR, XOR and XNOR .
16 | P a g
Dept. of
Figure 3.4 Schematic of 1-Bit ALU Stage using GDI
17 | P a g
Dept. of
The figure 3.5 shows the 4-bit ALU design using GDI logic, 4-bit ALU consists of
three selection bits, eight 4×1 MUX, eight 2×1 MUX & four processing units.
The proposed design of the 4-Bit Arithmetic Logic Unit consists of 4 stages, each
stage is an 1-Bit ALU block realized using the previously designed circuits as follows:
Each 1-Bit ALU stage consists of two 2x1 multiplexers, two 4x1 multiplexers and one full
adder cell, this design requires 48 transistors as depicted in Fig.3.5. Any desired operation
can be performed based on the selection line S0, S1, S2 code; Table II summarizes the
truth table of the proposed ALU.
The 4x1 multiplexer at the input is responsible for the selection of B input based on
the values of S0 and S1 selection lines it selects from logic 1, B, B' And logic 0 to perform
the Decrement, Addition, Subtraction and the Increment operations respectively, S2
chooses between the arithmetic and the logic operations.
18 | P a g
Dept. of
Figure 3.6 Schematic of 1-Bit ALU Stage using MGDI
To realize the 4-Bit ALU , four stages were used as shown in Figure 3.6, While the
carry input of ALU0 connected to selection line S1 to obtain logic 1 which needed for
subtraction and increment operations, however the other values don’t affect the results of
the logic operations.
19 | P a g
Dept. of
Figure 3.7 Schematic of 4-Bit ALU Stage using MGDI
20 | P a g
Dept. of
CHAPTER-4
EXISTING SYSTEM
21 | P a g
Dept. of
CHAPTER-4
EXISTING SYSTEM
The existing 4-bit ALU is designed using 3T XNOR Gates and multiplexers. The
total numbers of transistors used are 88 when compared to 96 Transistors using GDI
Technique or by using only 2T XOR gates. These further results in lower power
dissipation. Drawbacks:
The existing design uses 3T XNOR gates and multiplexers.
Remedy:
1. Use GDI Multiplexer based design for ALU.
2. Use transistor sizing for reducing power dissipation.
22 | P a g
Dept. of
f – Input Signal Frequency
CT – Total Capacitance
Effect
C. Basic types of low power design
• Reduce Switching Voltage:
The method used to reduce switching voltage is to reduce the operating voltage of
the circuit. Other method to reduce voltage swing by well known circuit techniques such as
charge sharing, transistor threshold voltage etc. There are many trade off to be considered
in voltage reduction. The main reason is that the threshold voltage of the transistor does not
scale accordingly with the operating voltage to avoid excessive leakage current.
• Reduce Capacitance:
Signals with high switching frequency should be routed with minimum parasitic
capacitance to conserve power. Conversely nodes with large parasitic capacitance should
not be allowed to switch at high frequency. Capacitance reduction can be achieved at most
design abstraction level materials, circuit techniques, transistor sizing, architectural
transformation and alternative computation algorithms.
• Reduce Switching Frequency:
The techniques are often applied to logic level design. Reductions of switching
frequency also have the side effects of improving the reliability of the chip as some failure
mechanism is related to switching frequency. One effective method of reducing switching
frequency is to eliminate logic switching that is not necessary for computation. Other
method involves logic implementation since there are many ways to design a logic network
to perform an identical function.
• Reduce Static Current:
Leakage power problem mainly appears in very low frequency circuits or ones with
sleep modes. Leakage reduction techniques are applied at low level design abstraction such
as process, device and circuit design. Static current can be reduced by transistor sizing,
layout techniques and careful circuit designing. Circuit modules that consume static current
should be turned off if not used.
23 | P a g
Dept. of
taken based on the results. The data required to perform the arithmetic and logical
functions are inputs
24 | P a g
Dept. of
from the designated CPU registers and operands. The ALU relies on basic items to perform
its operations.
The 4-bit ALU is designed using GDI based multiplexer design. Further the design is
optimized for power dissipation using transistor sizing
4.2 GATE DIFFUSION INPUT TECHNIQUE
Main concept of GDI is that gate of PMOS and NMOS is diffused. In these days in
digital circuits design, for a digital circuit designer’s a high speed, high throughput and
small silicon area and also low- power consumption in digital circuit is most essential
things for a designers. GDI is such a technique that we can use for design of low power
digital circuits. And it is also a novel design method of a low power digital circuit. In GDI
cell using only two transistors allows implementation of a wide range of complex logic
functions. In GDI by using only two transistor (PMOS AND NMOS) many function such
as AND, OR, XOR and XNOR can be implemented.GDI method is suitable for designing
of fast, low power circuits, using reduced number of transistor as compared to CMOS
techniques and also improving power characteristics. As compared to CMOS, fast, low
power circuits can be designed bu using gdi technique. The main advantage of GDI is that
it require less number of transistors as compared to CMOS. Therefore area is reduced.
Therefore delay is less and speed is increased. Morgenshtein has proposed basic GDI cell
shown in Fig 4.1.
Approach for designing low power digital combinational circuit is a GDI
Technique. GDI technique is basically two transistor implementation of complex logic
functions which provides in-cell swing restoration under certain operating condition. This
25 | P a g
Dept. of
approach leads to reduction in power consumption, propagation delay and area of digital
circuits. while having
26 | P a g
Dept. of
low complexity of logic design. An important feature of GDI cell is that the source of the
PMOS in a GDI cell is not connected to VDD and the source of the NMOS is not
connected to GND. Drain terminals of the two transistors are taken as output. Source
terminal of Pmos is acting as one input, and Source of Nmos is acting as another input.
Therefore GDI cell gives two extra input pins for use which makes the GDI design more
flexible than CMOS design.
There are three inputs in a GDI cell - G (common gate input of NMOS and PMOS),
P (input to the source/drain of PMOS) and N (input to the source/drain of NMOS).Bulks of
both NMOS and PMOS are connected to N and P respectively. Table 4.2 shows different
logic functions implemented by GDI logic based on different input values. So, various
logic functions can be implemented with less power and high speed with GDI technique as
compared to conventional CMOS design.
27 | P a g
Dept. of
Technique use less
28 | P a g
Dept. of
transistors and power dissipation is also less. A new technique of low power digital circuit
design is a GDI (Gate Diffusion Input). Power consumption, delay and area of digital
circuits is reduced by This technique, maintains low complexity of logic design. Since GDI
cell has no power supply connected to it, there will be a voltage drop at the output. This
drop will be negligible for small circuits. Implementations of GDI circuits in SOI or twin-
well CMOS processes are expected to supply more power- delay efficient design.
Due to the use of a complete cell library with reduced transistor count. Because of
less number of transistors, the switching is reduced and hence there will be a less power,
delay and also reduced area. Because of the less number of transistors the switching node
capacitance will be reduced that results in reduction of dynamic power 1.2 ALU An ALU
is one of the main components of a microprocessor.ALU also contribute to one of the
highest power- density locations on the processor, as it is clocked at the highest speed and
is busy mostly all the time which results in thermal hotspots and sharp temperature
gradients within the execution core. Therefore, this motivate us strongly for a energy
efficient ALU designs that satisfy the high- performance requirements, while reducing
peak and average power dissipation. Basically ALU is a combinational circuit that
performs arithmetic and logical operations on a pair of n bit operands. Arithmetic Unit
Employing fast and efficient adders in arithmetic logic unit will aid in the design of low
power high performance system. In this paper Arithmetic Unit consists of adder and
substractor and logical unit consists of AND, OR.
4.3 ALU USING GDI TECHNIQUE
29 | P a g
Dept. of
Logic Unit performs all logic operations asked to perform. A MUX operated by select
lines, for which particular logic operation to perform, is used inside this logic block
4.4 MULTIPLEXER
Multiplexer is a digital switch. The multiplexer has numbers of input data lines and
one output line. The selection of a particular input line is controlled by a set of selection
line. There are ‘2n’ input lines and ‘n’ selection lines whose bit combinations determine
which input is selected. Fig 4.3 shows implementation of basic 2x1multiplexer using GDI
cell. The 4x1 multiplexer has four inputs, two selection lines and one output. Depending on
thetwo selection lines, one output is selected at a time among the four input lines. Fig. 4.4.1
shows implementation of 4x1multiplexer using GDI cell.
XOR Gate The main building block of full adder circuit is XOR gate which gives
sum output. So the overall performance of full adder circuit can be improved by optimizing
XOR gate.Fig.4.4.2 shows the implementation of XOR gate using GDI technique . It uses
less number of transistors as compared to conventional design of XOR gate using CMOS
logic Units.
30 | P a g
Dept. of
Fig. 4.4.3 GDI Based XOR Gate Full Adder
The Full Adder circuit adds three one-bit binary numbers(A, B & C) and outputs
two one-bit binary numbers, a sum(S) and a carry (Cout). The full adder is usually a
component in cascade of adders, which add 4, 8, 16 etc. binary numbers. Implementation
of full adder circuit using GDI technique which is a basic building block of arithmetic and
logic unit has been shown in Fig. 4.4.3.
31 | P a g
Dept. of
CHAPTER-5
VLSI DESIGN
32 | P a g
Dept. of
CHAPTER-5
VLSI DESIGN
5.1 INTRODUCTION
The electronics industry has achieved a phenomenal growth over the last two
decades, mainly due to the rapid advances in integration technologies, large-scale systems
design - in short, due to the advent of VLSI. The number of applications of integrated
circuits in high-performance computing, telecommunications, and consumer electronics
has been rising steadily, and at a very fast pace. Typically, the required computational
power (or, in other words, the intelligence) of these applications is the driving force for the
fast development of this field. Gives an overview of the prominent trends in information
technologies over the next few decades. The current leading-edge technologies (such as
low bit-rate video and cellular communications) already provide the end-users a certain
amount of processing power and portability. This trend is expected to continue, with very
important implications on VLSI and systems design. One of the most important
characteristics of information services is their increasing need for very high processing
power and bandwidth (in order to handle real-time video, for example). The other
important characteristic is that the information services tend to become more and more
personalized (as opposed to collective services such as broadcasting), which means that
the devices must be more intelligent to answer individual demands, and at the same time
they must be portable to allow more flexibility/mobility.
33 | P a g
Dept. of
VLSI has been around for a long time, there is nothing new about it ... but as a side
effect of advances in the world of computers, there has been a dramatic proliferation of
tools that can be used to design VLSI circuits. Alongside, obeying Moore's law, the
capability of an IC has increased exponentially over the years, in terms of computation
power, utilization of available area, yield. The combined effect of these two advances is
that people can now put diverse functionality into the IC's, opening up new frontiers.
Examples are embedded systems, where intelligent devices are put inside everyday objects,
and ubiquitous computing where small computing devices proliferate to such an extent that
even the shoes you wear may actually do something useful like monitoring your heartbeats.
Verilog was developed at a time when designers were looking for tools to combine
different levels of simulation. In the early 1980s, there were switch-level simulators, gate-
level simulators, functional simulators (often written ad-hoc in software) and no simple
means to combine them. Further, the more-widespread, traditional programming languages
themselves were/are essentially sequential and thus "semantically challenged" when
modeling the concurrency of digital circuitry.
Verilog was created by Phil Moore in 1983 at Gateway Design Automation and the
first simulator was written a year later. It borrowed much from the existing languages of
the time: the concurrency aspects may be seen in both Modula and (earlier) Simulate; the
syntax is deliberately close to that of C; and the methods for combining different levels of
abstraction owe much to Hilo.
34 | P a g
Dept. of
specifications".
35 | P a g
Dept. of
While Verilog emerged from developments within private companies, its main rival
came from the American Department of Defense (DOD). In 1981, the DOD sponsored a
workshop on hardware description languages as part of its Very High Speed Integrated
Circuits (VHSIC) program, and the outcome formed a specification for the VHSIC
hardware description language (VHDL) in 1983. There is, of course, the question as to
which language is better. And this, of course, is a hard question to answer without causing
excitement and rebuttals from the marketing departments of the less preferred language.
However, the following points featured in a recent debate in the VHDL and Verilog news
groups.
The main factor is the language syntax - since Verilog is based on C and VHDL is
based on ADA. Verilog is easier to learn since C is a far simpler language. It also produces
more compact code: easier both to write and to read. Furthermore, the large number of
engineers who already know C (compared to those who know ADA) makes learning and
training easier. VHDL is very strongly typed, and allows programmer to define their own
types although, in practice, the main types used are either the basic types of the language
itself, or those defined by the IEEE. The benefit is that type checking is performed by the
compiler which can reduce errors; the disadvantage is that changing types must be done
explicitly.
It allows switch-level modeling - which some designers find useful for exploring
new circuits. It ensures that all signals are initialized to "unknown" who ensures that all
designers will produce the necessary logic to initialize their design - the base types in
VHDL initialize to zero and the "hasty" designer may omit a global reset.
5.3.1 Analog:
Small transistor count precision circuits such as Amplifiers, Data converters, filters,
Phase locked loops, Sensors etc.
Progress in the fabrication of IC's has enabled us to create fast and powerful circuits
36 | P a g
Dept. of
in smaller and smaller devices. This also means that we can pack a lot more of
functionality
37 | P a g
Dept. of
into the same area. The biggest application of this ability is found in the design of ASIC's.
These are IC's that are created for specific purposes - each device is created to do a
particular job, and do it well. The most common application area for this is DSP - signal
filters, image compression, etc. To go to extremes, consider the fact that the digital
wristwatch normally consists of a single IC doing all the time-keeping jobs as well as extra
features like games, calendar, etc.
These are highly complex mixed signal circuits (digital and analog all on the same
chip). A network processor chip or a wireless radio chip is an example of a SOC.
Fully fabricated FPGA chips containing thousands of logic gates or even more,
with programmable interconnects, are available to users for their custom hardware
programming to realize desired functionality. This design style provides a means for fast
prototyping and also for cost-effective chip design, especially for low-volume applications.
A typical field programmable gate array (FPGA) chip consists of I/O buffers, an array of
configurable logic blocks (CLBs), and programmable interconnect structures. The
programming of the interconnects is implemented by programming of RAM cells whose
output terminals are connected to the gates of MOS pass transistors. A more detailed view
showing the locations of switch matrices used for interconnect routing.
A simple CLB (model XC2000 from XILINX) It consists of four signal input
terminals (A, B, C, D), a clock signal terminal, user-programmable multiplexers, an SR-
latch, and a look-up table (LUT). The LUT is a digital memory that stores the truth table of
the Boolean function. Thus, it can generate any function of up to four variables or any two
functions of three variables.
38 | P a g
Dept. of
CLB is configured such that many different logic functions can be realized by
programming its array. More sophisticated CLBs have also been introduced to map
complex functions. The typical design flow of an FPGA chip starts with the behavioral
description of its functionality, using a hardware description language such as VHDL. The
synthesized architecture is then technology-mapped (or partitioned) into circuits or logic
cells. At this stage, the chip design is completely described in terms of available logic cells.
Next, the placement and routing step assigns individual logic cells to FPGA sites (CLBs)
and determines the routing patterns among the cells in accordance with the net list.
Performance of the design can be simulated and verified before downloading the
design for programming of the FPGA chip. The programming of the chip remains valid as
long as the chip is powered-on or until new programming is done. In most cases, full
utilization of the FPGA chip area is not possible - many cell sites may remain unused.
The largest advantage of FPGA-based design is the very short turn-around time,
i.e., the time required from the start of the design process until a functional chip is
available. Since no physical manufacturing step is necessary for customizing the FPGA
chip, a functional sample can be obtained almost as soon as the design is mapped into a
specific technology.
The typical price of FPGA chips are usually higher than other realization
alternatives (such as gate array or standard cells) of the same design, but for small-volume
production of ASIC chips and for fast prototyping, FPGA offers a very valuable option.
In view of the fast prototyping capability, the gate array (GA) comes after the
FPGA. While the design implementation of the FPGA chip is done with user
programming, that of the gate array is done with metal mask design and processing. Gate
array implementation requires a two-step manufacturing process: The first phase, which is
based on generic (standard) masks, results in an array of uncommitted transistors on each
GA chip. These uncommitted chips can be stored for later customization, which is
completed by defining the metal interconnects between the transistors of the array Since
the patterning of metallic interconnects is done at the end of the chip fabrication, the turn-
around time can be still short, a few days to a few weeks. a corner of a gate array chip
which contains bonding pads on its left and bottom edges, diodes for I/O protection,
NMOS transistors and PMOS transistors for chip output driver circuits in the neighboring
areas of bonding pads, arrays of
39 | P a g
Dept. of
NMOS transistors and PMOS transistors, underpass wire segments, and power and ground
buses along with contact windows.
Magnified portion of the internal array with metal mask design (metal lines
highlighted in dark) to realize a complex logic function. Typical gate array platforms allow
dedicated areas, called channels, for inter cell routing. The availability of these routing
channels simplifies the interconnections, even using one metal layer only. The
interconnection patterns to realize basic logic gates can be stored in a library, which can
then be used to customize rows of uncommitted transistors according to the net list. While
most gate array platforms only contain rows of uncommitted transistors separated by
routing channels, some other platforms also offer dedicated memory (RAM) arrays to
allow a higher density where memory functions are required. The layout views of a
conventional gate array and a gate array platform with two dedicated memory banks.
With the use of multiple interconnect layers, the routing can be achieved over the
active cell areas; thus, the routing channels can be removed as in Sea-of-Gates (SOG)
chips. Here, the entire chip surface is covered with uncommitted NMOS and PMOS
transistors. As in the gate array case, neighboring transistors can be customized using a
metal mask to form basic logic gates. For intercell routing, however, some of the
uncommitted transistors must be sacrificed. This approach results in more flexibility for
interconnections, and usually in a higher density. The basic platform of a SOG chip is
shown in offers a brief comparison between the channeled (GA) vs. the channel less (SOG)
approaches.
The standard-cells based design is one of the most prevalent full custom design
styles which require development of a full custom mask set. The standard cell is also called
the police. In this design style, all of the commonly used logic cells are developed,
characterized, and stored in a standard cell library. A typical library may contain a few
hundred cells including inverters, NAND gates, NOR gates, complex AOI, OAI gates, D-
latches, and flip- flops. Each gate type can have multiple implementations to provide
adequate driving capability for different fan outs. For instance, the inverter gate can have
standard size transistors, double size transistors, and quadruple size transistors so that the
chip designer can choose the proper size to achieve high circuit speed and layout density.
The characterization of each cell is done for several different categories. It consists of
40 | P a g
Dept. of
delay time vs. load capacitance
41 | P a g
Dept. of
and PLA blocks.
42 | P a g
Dept. of
5.5 TOOL (DSCH& Export Micro wind)
The DSCH3 program is a logic editor and simulator. DSCH3 is used to validate the
architecture of the logic circuit before the microelectronics design is started. DSCH3
provides a user-friendly environment for hierarchical logic design, and fast simulation with
delay analysis, which allows the design and validation of complex logic structures. Some
techniques for low power design are described in the manual. DSCH3 also features the
symbols, models and assembly support for 8051 and 18f64. DSCH3 also includes an
interface to SPICE. The MICROWIND3program allows the student to design and simulate
an integrated circuit at physical description level. The package contains a library of
common logic and analog ICs to view and simulate. MICROWIND3includes all the
commands for a mask editor as well as original tools never gathered before in a single
module (2D and 3D process view, Verilog compiler, tutorial on MOS devices). You can
gain access to Circuit Simulation by pressing one single key. The electric extraction of
your circuit is automatically performed and the analog simulator produces voltage and
current curves immediately.
Logic Levels:
The MOS transistor is basically a switch. When used in logic cell design, it can be
on or off. When on, a current can flow between drain and source. When off, no current
flow between drain and source. The MOS is turned on or off depending on the gate
voltage. In CMOS technology, both N-channel (and NMOS) and P-channel MOS (or
PMOS) devices exist. The NMOS and PMOS symbols are reported below. The symbols
for the ground voltage source (0 or VSS) and the supply (1 or VDD).
43 | P a g
Dept. of
Fig. 5.5 N-Channel and P-Channel.
The N-channel MOS device requires a logic value 1 (or a supply VDD) to be on. In
contrary, the P-channel MOS device requires a logic value 0 to be on. When the MSO
device is on, the link between the source and drain is equivalent to a resistance. The order
of range of this ‘on’ resistance is 100Ω-5KΩ. The ‘off’ resistance is considered infinite at
first order, as its value is several MΩ.
Both NMOS devices and PMOS devices exhibit poor performances when
transmitting one particular logic information. The NMOS degrades the logic level 1, the
PMOS degrades the logic level 0. Thus, a perfect pass gate can be constructed from the
combination of NMOS and PMOS devices working in a complementary way, leading to
improved switching performances. Such a circuit is called the transmission gate. In
DSCH3, the symbol may be found in the Advance menu in the palette. The transmission
gate includes one inverter, one NMOS and one PMOS.
In this section, an inverter circuit is loaded and simulated. Click File→ Open in the
main menu. Select INV.SCH in the list. In this circuit are one button situated on the left
side of the design, the inverter and a led. Click Simulate→ Start simulation in the main
menu.
44 | P a g
Dept. of
Fig.5.5.2 Logics of Inverter.
Now, click inside the buttons situated on the left part of the diagram. The result is
displayed on the leds. The red value indicates logic 1; the black value means logic 0. Click
the button Stop simulation shown in the picture below. You are back to the editor.
Click the chronogram icon to get access to the chronograms of the previous simulation.
As seen in the waveform, the value of the output is the logic opposite of that of the input.
Double click on the INV symbol; the symbol properties window is activated. In this
window appear the VERILOG description (left side) and the list of pins (right side). A set
of drawing options is also reported in the same window. Notice the gate delay (0.03ns in
the default technology), the fan out that represents the number of cells connected to the
output pin (1 cell connected), and the wire delay due to this cell connection (An extra
0.140ns delay).
BASIC GATES:
Table gives the corresponding symbol to each basic gate as it appears in the logic
editor window as well as the logic description. In this description, the symbol & refers to
the logical AND, | to OR, ~ to INVERT, NAND ^ to XOR.
45 | P a g
Dept. of
Table:5.5.3 Basic gates
The truth-table and logic symbol of the NAND gate with 2 inputs are shown below.
In DSCH select the NAND symbol in the palette, add two buttons and one lamp as shown
above. Add interconnects if necessary to link the button and lamps to the cell pins. Verify
the logic behavior of the cell.
The CMOS inverter design is detailed in the figure below. Here the P-channel MOS
and the N-channel MOS transistors function as switches. When the input signal is logic 0
the NMOS is switched off while PMOS passes VDD through the output. When the input
signal is logic 1, the PMOS is switched off while the NMOS passes VSS to the output.
The fan-out corresponds to the number of gates connected to the inverter output.
Physically, a large fan-out means a large number of connections, that is a large load
capacitance. If we simulate an inverter loaded with one single output, the switching delay
is small. Now, if we load the inverter by several outputs, the delay and the power
consumption are increased due to the current needed to charge and discharge that
46 | P a g
Dept. of
capacitance.
47 | P a g
Dept. of
Figure. 5.5.4 CMOS Inverter Circuits.
Inverter Simulation:
The command Simulate → Run Simulation gives access to the analog simulation.
Select the simulation mode Voltage vs. Time. The analog simulation of the circuit is
performed. The time domain waveform, proposed by default, details the evolution of the
voltages in1 and out1 versus time. This mode is also called transient simulation.
48 | P a g
Dept. of
Figure.5.5.6 CMOS Inverter Wave forms.
DSCH is software for logic design. Based on primitives, a hierarchical circuit can
be built and simulated. It also includes delay and power consumption evaluation. Silicon is
for 3D display of the atomic structure of silicon, with emphasis on the silicon lattice, the do
pants, and the silicon dioxide.
MOS as a Switch:
The MOS transistor is basically a switch. When used in logic cell design, it can be
on or off. When on, a current can flow between drain and source. When off, no current
flow between drain and source. The MOS is turned on or off depending on the gate
voltage. In CMOS technology, both N-channel (and NMOS) and P-channel MOS (or
PMOS) devices exist. The NMOS and PMOS symbols are reported below.
49 | P a g
Dept. of
The n-channel MOS device requires a logic value 1 (or a supply VDD) to be on. In
contrary, the p-channel MOS device requires a logic value 0 to be on. When the MSO
device is on, the link between the source and drain is equivalent to a resistance. The order
of range of this ‘on’ resistance is 100Ω-5KΩ. The ‘off’ resistance is considered infinite at
first order, as its value is several MΩ.
Connecting Procedure:
Instantiate NMOS or PMOS transistors from the symbol library and place them in
the editor window. Connect Vdd and GND to the schematic. Connect input button and
output LED. The simulation output can be observed as a waveform after the application of
the inputs as above. Click on the timing diagram icon in the icon menu to see the timing
diagram of the input and output waveforms. The Verilog, Hierarchy and Net list window
appears. This window neither shows the Verilog representation of NOR gate. Click OK to
save the Verilog as a .txt file.
5.6.2 MICROWIND:
Micro wind is a tool for designing and simulating circuits at layout level. The tool
features full editing facilities (copy, cut, past, duplicate, move), various views (MOS
characteristics, 2D cross section, 3D process viewer), and an analog simulator. The Micro
wind program allows designing and simulating an integrated circuit at physical description
level. The package contains a library of common logic and analog ICs to view and
simulate.
Micro wind includes all the commands for a mask editor as well as original tools
never gathered before in a single module (2D and 3D process view, Verilog compiler,
tutorial on MOS devices). You can gain access to Circuit Simulation by pressing one
single key. The electric extraction of your circuit is automatically performed and the
analog simulator produces voltage and current curves immediately.
MOS Layout:
Micro wind is used to draw the MOS layout and simulate its behavior. The Micro
wind display window includes four main windows:.
50 | P a g
Dept. of
4. The layer palette.
51 | P a g
Dept. of
The layout window features a grid, scaled in lambda (λ) units. The lambda unit is
fixed to half of the minimum available lithography of the technology. The default
technology is a CMOS 6-metal layers 0.12µm technology, consequently lambda is 0.06µm
(60nm). The palette is located in the lower right corner of the screen. A red color indicates
the current layer. Initially the selected layer in the palette is polysilicon. By using the
following procedure, you can create a manual design of the n-channel MOS.
1. Fix the first corner of the box with the mouse. While keeping the mouse button
pressed, move the mouse to the opposite corner of the box. Release the button. This
creates a box in polysilicon layer. The box width should not be inferior to 2 λ,
which is the minimum width of the polysilicon box.
2. Change the current layer into N+ diffusion by a click on the palette of the Diffusion
N+ button. Make sure that the red layer is now the N+ Diffusion. Draw a n-
diffusion box at the bottom of the drawing as in Figure. 6.2 N-diffusion boxes are
represented in green. The intersection between diffusion and polysilicon creates the
channel of the nMOS device.
The MOS size (width and length of the channel situated at the intersection of the
polysilicon gate and the diffusion) has a strong influence on the value of the current. In
Figure, the MOS width is 1.74µm and the length is 0.12µm. A high gate voltage (Vg
=1.2V) corresponds to the highest Id/Vdd curve. For Vg=0, no current flows. You may
change the voltage values of Vdd, Vg, Vs by using the voltage cursors situated on the right
side of the window. A maximum current around 1.5mA is obtained for Vg=1.2V,
Vdd=1.2V, with Vs=0.0. The MOS parameters correspond to SPICE Level 3.
52 | P a g
Dept. of
Fig. 5.6.3 N-Channel MOS characteristics.
53 | P a g
Dept. of
box of metal drawn across a box of polysilicon does not allow an electrical connection. To
build an electrical connection, a physical contact is needed. The corresponding layer is
called "contact". You may insert a metal-to-polysilicon contact in the layout using a direct
macro situated in the palette.
Supply Connections:
The next design step consists in adding supply connections, that is the positive
supply VDD and the ground supply VSS. We use the metal2 layer (Second level of
metallization) to create horizontal supply connections. Enlarging the supply metal lines
reduces the resistance and avoids electrical overstress. The simplest way to build the
physical connection is to add a metal/Metal2 contact that may be found in the palette. The
connection is created by a plug called "via" between metal2 and metal layers. The final
layout design step consists in adding polarization contacts. These contacts convey the VSS
and VDD voltage supply close to the bulk regions of the device. Remember that the n-well
region should always be polarized to a high voltage to avoid short-circuit between VDD
and VSS. Adding the VDD polarization in the n-well region is a very strict rule.
Interconnects:
Up to 6 metal layers are available for signal connection and supply purpose. A
significant gap exists between the 0.7µm 2-metal layer technology and the 0.12µm
technology in terms of interconnects efficiency.
Firstly, the contact size is 6 lambda in 0.7µm technology, and only 4 lambda in
0.12µm. This features a significant reduction of device connection to metal and metal2.
Notice that a MOS device generated using 0.7µm design rules is still compatible with
0.12µm technology. But a MOS device generated using 0.12µm design rules would violate
several rules if checked in 0.7µm technology. Secondly, the stacking of contacts is not
allowed in micro technologies. This means that a contact from poly to metal2 requires a
significant silicon area as contacts must be drawn in a separate location. In deep-submicron
technology (Starting 0.35µm and below), stacked contacts are allowed.
Simulation:
A simulation window appears with inputs and output, shows the tphl, tplh and tp of
the circuit. The power consumption is also shown on the right bottom portion of the
window. If you are unable to meet the specifications of the circuit change the transistor
sizes. Generate
54 | P a g
Dept. of
the layout again and run the simulations till you achieve your target delays. Depending on
the input sequences assigned at the input the output is observed in the simulation.
The Design Rule Checker (DRC) scans the design and verifies a set of design rules.
The errors are highlighted in the display window, with an appropriate message giving the
nature of the error. Details about the position and type of errors appear on the screen. Only
an error-free layout can be sent to fabrication.
55 | P a g
Dept. of
CHAPTER-6
SIMULATION RESULTS
56 | P a g
Dept. of
CHAPTER-6
SIMULATION RESULTS
57 | P a g
Dept. of
Fig.6.1.1.3 :1-bit ALU Layout
58 | P a g
Dept. of
6.1.2. 4-bit ALU
59 | P a g
Dept. of
Fig.6.1.2.3:4-bit ALU Output
60 | P a g
Dept. of
Fig 6.1.3.2: 4 bit ALU
61 | P a g
Dept. of
Fig.6.1.4.2: GDI 1-Bit ALU Layout
62 | P a g
Dept. of
6.1.5 GDI 4-bit ALU
63 | P a g
Dept. of
Fig.6.1.5.3: GDI 4-Bit ALU Output
64 | P a g
Dept. of
Fig.6.2.1.2: Modified GDI 1-Bit ALU Layout
65 | P a g
Dept. of
6.2.2. Modified GDI 4-bit ALU
66 | P a g
Dept. of
Fig.6.2.2.3: Modified GDI 4-Bit ALU Output
For 1-bit ALU, the number of transistors in modified GDI based ALU is less by
10% when compared with GDI based ALU and 66.67% when compared with CMOS based
ALU. The area utilized in modified GDI based ALU is less 7.46% when compared with
GDI based ALU and 37.05% when compared with CMOS based ALU. The power
dissipated by modified GDI based ALU is less 46.318% when compared with GDI based
ALU and 79.92% when compared with CMOS based ALU. But the trade off or
compromise is in delay i.e., delay is increased in modified GDI based ALU by 0.8% when
compared with GDI based ALU and 50.48% when compared with CMOS based ALU.
67 | P a g
Dept. of
Table 6.3.2: Comparison of 4 – bit ALU
For 4-bit ALU, the number of transistors in modified GDI based ALU are less 10%
when compared with GDI based ALU and 66.67% when compared with CMOS based
ALU. The area utilized in modified GDI based ALU is more by 0.2% when compared with
GDI based ALU and less by 10.216% when compared with CMOS based ALU. The power
dissipated by modified GDI based ALU is less 5.3% when compared with GDI based ALU
and is 7% more when compared with CMOS based ALU. The delay is decreased in
modified GDI based ALU by 0% when compared with GDI based ALU and 74.97% when
compared with CMOS based ALU.
68 | P a g
Dept. of
6.4 ADVANTAGES OF THE PROPOSED WORK
69 | P a g
Dept. of
CHAPTER-7
70 | P a g
Dept. of
CHAPTER-7
This work presents a 4-Bit ALU designed in 65nm CMOS Design using the MGDI
technique and simulated using the DSCH & MICROWIND simulator. Simulation results
showed an advantage of the proposed ALU design in terms of power consumption and
transistor count, while maintaining FullSwing Operation. The proposed design consists of
72 transistors and operates under 1V supply voltage.
This technology used for less area and low power with 3T XNOR full adder and Gate
Diffusion Input(GDI) multiplexer. Various methods of multiplexer and full Adder are implemented
and compared. Compared with previous paper, it reduces the power and area to 44% . the 2x1
multiplexer and 4x1 multiplexer are designed using Gate Diffusion Input technique. Full adder
based proposed technique consumes 10.190micro watts , but 3T XNOR technique consumes
8.639microwatts .Using these techniques, Full adder power leads to reduce, therefore the ALU
design also lends to reduced. Power dissipation, Propagation delay and number of transistors are
compared using GDI,PTL,2T XOR and 3T XNOR technique.
71 | P a g
Dept. of
REFERENCESS
1. A. Srivastava and D. Govindarajan. (2002). A Fast ALU Design in CMOS for Low
Voltage Operation, VLSI Design, pp. 315-327
2. Yang, Y., Chandrakasan, A.P., Sheng, S. and Brodersen, R.W. (1992). LowLower
CMOS Digital Design, IEEE Journal of Solid-State Circuits 27(4), pp. 822- 839
3. Basak, S., Saha, D., Mukherjee, S., Chatterjee, S. and Sarkar, C.K. (2012). Design and
Analysis of a Robust, High Speed, Energy Efficient 18 Transistor 1-bit Full Adder
Cell, Modified with the concept of MVT Scheme, International Symposium on
Electronic System Design, pp. 130-134
4. Nehru, K., Shanmugam, A. and Darmila Thenmozhi, G. (2012). Design of Low Power
ALU Using 8T FA and PTL Based MUX Circuits, IEEE-International Conference on
Advances in Engineering, Science and Management, pp. 145-149
5. Khan, A.A., Pandey, S. and Pathak, J. (2014). A Review Paper on 3-T Xor Cells and
8- T Adder Design in Cadence 180nm, International Conference for Convergence of
Technology, pp. 1-6
6. Prakash, P. and Saxena, A.K. (2009). Design of Low Power High Speed ALU Using
Feedback Switch Logic, International Conference on Advances in Recent
Technologies in Communication and Computing, pp. 899-902
7. Weng-Geng Ho, Kwen-Siong Chong, Bah-Hwee Gwee and Chang, J.S. (2013). Low
Power Sub-Threshold Asynchronous QDI Static Logic Transistor-level
Implementation (SLTI) 32-Bit ALU, IEEE International Symposium, pp. 353-356 43
8. Jiaoyan Chen, Vasudevan, D., Popovici, E. and Schellekens, M. (2011). Design of a
Low Power, Sub-Threshold, Asynchronous Arithmetic Logic Unit using a
Bidirectional Adder, 14th Euromicro Conference on Digital System Design, pp. 301-
308
9. Agarwal, S., Pavankumar, V.K. and Yokesh, R. (2008). Energy – Efficient, High
Performance Circuits for Arithmetic Units, 21st International Conference on VLSI
Design, pp. 371-376
10. Fayed, A.A. and Bayoumi, M.A. (2001). A Low Power 10-transistor Full Adder Cell
for Embedded Architecture, The 2001 IEEE International Symposium on Circuits and
Systems, pp. 226-229
11. [11] Parihar, R., Tiwari, N., Mandloi, A. and Kumar, B. (2014). An Implementation of
1-bit Low Power Full Adder based on Multiplexer and Pass Transistor Logic.
72 | P a g
Dept. of
Outcome attained and its justification
POs Justification
The knowledge on concept of Very large Scale Integration was gained through
PO1
thisproject work.
PO2 This problem analysis and literature had been done on existing work.
PO3 Analysis and test data to gives the results of classified output.
73 | P a g
Dept. of