Professional Documents
Culture Documents
ESZG512 ESD Assignment2 WILP
ESZG512 ESD Assignment2 WILP
ESZG512 ESD Assignment2 WILP
Fixed-Point Representation -
This representation has fixed number of bits for integer part and for fractional part. For
example, if given fixed-point representation is IIII.FFFF, then you can store minimum value
is 0000.0001 and maximum value is 9999.9999. There are three parts of a fixed-point number
representation: the sign field, integer field, and fractional field
Fixed-Point DSP
Fixed point DSP do vital arithmetic operation of type S xi * yi. It will have either multiplier
and ALU integrated or separate followed by operand registers, accumulators to store the
operands and results respectively. One or more shifter to handle result having size greater
than the basic size of DSP to shift the data before passing to ALU. Multiplier will support 2n
bits when n*n bits data as input. Accumulator will have guard bits to prevent overflow during
accumulation operations. When the result overflows, fixed point DSP handles using
saturation, truncation and convergent rounding.
1. Fixed point DSP hardware performs strictly integer arithmetic.
2. Fixed point DSPs usually represent each number with a minimum of 16 bits, although
a different length can be used.
3. Have single or multiple data paths, e.g. TI’s TMS320C62x™ have two data paths
operating in parallel,each with a 16-bit word width that provides signed integer values
within a range from –2^15 to 2^15.
4. In a fixed-point DSP, the programmer is responsible for performing the normalization
and scaling operation.
Floating-Point Representation -
This representation does not reserve a specific number of bits for the integer part or the
fractional part. Instead it reserves a certain number of bits for the number (called the mantissa
or significand) and a certain number of bits to say where within that number the decimal
place sits (called the exponent).
The floating number representation of a number has two part: the first part represents a
signed fixed point number called mantissa. The second part of designates the position of the
decimal (or binary) point and is called the exponent
Floating-Point DSP –
1. Floating-point DSPs support either integer or real arithmetic, the latter normalized in
the form of scientific notation.
2. Floating point DSPs typically use a minimum of 32 bits to store each value. They
divide a 32-bit data path into two parts: a 24-bit mantissa that can be used for either
for integer values or as the base of a real number, and an 8-bit exponent. The 16M
range of precision offered by 24 bits with the addition of an 8-bit exponent, thus
supporting a vastly greater dynamic range than is available with the fixed-point
format.
3. Most floating point data paths can also do fixed pt. functions but only one of them can
be carried out per inst cycle.
4. Some floating point processors provide two data paths ->one for fixed pt. and the
other for floating point numbers. In such processors the fixed point data path usually
does not include a multiplier.
5. Floating point DSPs do not usually follow law of conservation of bits. In case of 32-
bit nos. By law of conservation of bits resultant mantissa must be 48 but in floating pt
DSP product is larger only by few bits (8/12).
6. In Floating point DSPs normalization and scaling operation happens automatically
offering greater dynamic range and precision.
IEEE 754 –
The IEEE Standard for Floating-Point Arithmetic (IEEE 754) is a technical standard for
floating-point computation which was established in 1985 by IEEE.
IEEE 754 numbers are divided into two based on the above three components: single
precision and double precision.
According to IEEE 754 standard, the floating-point number is represented in following ways:
Single Precision (32 bit): 1 sign bit, 8 bit exponent, and 23 bit mantissa.
Double Precision (64 bit): 1 sign bit, 11 bit exponent, and 52 bit mantissa.
BIASED NORMALISED
TYPES SIGN BIAS
EXPONENT MANTISA
Single precision 1(31st bit) 8(30-23) 23(22-0) 127
Double
1(63rd bit) 11(62-52) 52(51-0) 1023
precision
IEEE 754r –
IEEE 754-2008 (previously known as IEEE 754r) was published in August 2008 and is a
significant revision to, and replaces, the IEEE 754-1985 floating-point standard, while in
2019 it was updated with a minor revision IEEE 754-2019. The 2008 revision extended the
previous standard where it was necessary, added decimal arithmetic and formats, tightened up
certain areas of the original standard which were left undefined, and merged in IEEE 854 (the
radix-independent floating-point standard).
BIASED NORMALISED
TYPES SIGN BIAS
EXPONENT MANTISA
Half precision 1(15th bit) 5(14-10) 10(9-0) 15
Double
1(127th bit) 14(126-113) 113(112-0) 8191
precision
One with a binary integer significand field encodes the significand as a large binary
integer between 0 and 10p−1. This is expected to be more convenient for software
implementations using a binary ALU.
Another with a densely packed decimal significand field encodes decimal digits more
directly. This makes conversion to and from binary floating-point form faster, but
requires specialized hardware to manipulate efficiently. This is expected to be more
convenient for hardware implementations.