Professional Documents
Culture Documents
J Chitra Conference
J Chitra Conference
RECONFIGURABLE MULTIPLY-ACCUMULATE
CO-PROCESSOR
1
K.Jayachitra1, T.Suresh 2
12
kjayachitra@ymail.com,2fiosuresh@yahoo.co.in
I INTRODUCTION
Multiplication and accumulations are the critical
operations in Digital Signal Processing applications.
High
speed and high throughput Multiply- Accumulate (MAC)
always a key to achieve a high performance digital signal
processing system. The main consideration of MAC
design is to enhance the speed of the system. Multiplyaccumulate (MAC) is a common accelerator used in
extensively used in microprocessors and digital signal
processors for data intensive applications. For example,
many filters, orthogonal frequency- division multiplexing,
channel
estimation
etc
require
FIR,FFT/IFFT
computations that can be efficiently accelerated by
dedicated MAC units.
Reconfigurable computing incorporates some degree
of flexibility in the arithmetic structures to serve different
purposes to minimize the wastage of resources.
Reconfigurable Computing uses reconfigurable devices
as piece of hardware able to dynamically adapt to
algorithms. Reconfigurable hardware devices offer bit
level reconfigurability and it accelerates the need for high
performance in complex and real time applications. Coarse
grain Reconfigurable architecture provides high flexibility
and tackle mainly Digital signal processing /multimedia
issues.
II RELATED WORK
The related work of this paper is discussed here.
Hardware structure and application of dynamically
reconfigurable
adder
and
accumulation
unit.
Reconfigurable MAC resolves the design conflict between
versatility, area and computation speed. The multiplication
part is implemented through the recursive decomposition
of a partial product matrix, repeated use of small mxm
multipliers and small adder circuit blocks. Adder is
implemented by using combined blocks of carry-save
adders. Appropriate number of pipeline stages can be
bypassed through multiplexers, trading throughput rate for
power consumption. This technique achieves high
performance and very low energy dissipation by adapting
its structure dynamically to computational requirements
over time.
Interface unit manipulates the incoming operands for
multiplication and addition and performs data transfers
from/to the bus. Arithmetic selection unit performs data
conversion between different arithmetic representations.
Reconfigurable multiplier performs multiplication for
various word lengths of input data and it affects the overall
latency, power consumption and area.
Addition unit performs addition for various members
of summands and word lengths of 8,16,32,64 bits and
adder unit is based on carry-save of multiple summands
for increased performance.
Reconfigurable hardware MAC co-processor is
designed to delegate critical tasks and it handles
simultaneous data streams of different bit widths.
Reconfigurable MAC co-processor provides opportunities
of power savings that will keep device feasible for batterypowered hand-held devices.MAC co-processor performs
the tasks parallel and thus it is expected to be more power
efficient than an equivalent implementation of MAC on
either microprocessor or FPGA.The architecture can
switch dynamically between different bit widths of
operands.
IV PROPOSED WORK
A. Reconfigurable Multiplication Unit
The unsigned multiplication can be expressed by
Pi=AiBj2i+j
D. Accumulation unit
The accumulation unit is shown in fig.3.Accumulation
unit consists of register and carry look ahead adder.
Recent sum is stored in the register and it will be
transferred to the adder to add it to next available input.
The maximum sum that comes out of the addition unit is
66 bits. Reset and clock are the signals used to transfer the
sum to accumulate the sum. When the multiplier
multiplies two 128 bit input it does not pass through the
AU because it is widest product and it is produced only
once in each time step.
8
bit
16
bit
32
bit
64 bit
17
17
17
Number of slices
10
10
10
25
41
73
Number of GCLKS
8
bit
16
bit
32
bit
64 bit
18
37
Number of slices
16
32
64
27
48
96
192
of
VI CONCLUSION
The design of a dynamically reconfigurable multiplyaccumulate co-processor is presented. It can be
reconfigured in terms of bit widths and throughput
rate. Reconfigurable MAC co-processor consists of
reconfigurable adder and reconfigurable multiplier to
perform efficient multiplication and addition by the
splitting mechanism. Reconfigurable MAC coprocessor is capable of handling parallel streams of
data of various bit widths. The superiority of the design
is achieved through the use of sub-multipliers,
repeatable parts and totally operation independent
units. Moreover the DRMP provides power saving
opportunities resulting from the reconfigurable MAC
and its concurrent tasks. DRMP handles tasks
concurrently by the request and grant services from/to
the arbiter and the RFUs reconfigures based on the
various bit widths of the operands and thus the DRMP
increases the speed and the performance of the
multiply accumulate operations in the digital
applications.
REFERENCES
[1]
[2]
[3]
[4]
[5]
[6]
[7]
[8]
[9]