Bits, Bytes, Integers, and Floats Notes

Bits, Bytes, Integers, and Floats
Contents
Binary Representation
Numeral Systems
Base Conversion
Integer
Signed vs unsigned
Two’s complement
Signed vs unsigned in Python
Python numbers.Integral
Real Numbers
Fixed-point numbers
Fixed-point numbers: accuracy problems
Floating-point numbers
Floating point reference for Intel Architecture
Floating-point numbers: support
Floating-point numbers: accuracy problems
Symbols and Characters
Scalar data type: symbols and characters
American Standard Code for Information Interchange
Several character set encodings
Handling Unicode
Logical Operations
Basic operation (addition)
Combinational circuits
Bit-level operations in Python
Logical operations in Python
Conclusion
Binary Representation
Numeral Systems
▪ Humans are accustomed to decimal (base 10) arithmetic
▪ A computer performs only binary (base 2) arithmetic
▪ N-bit registers impose limitations on the size of fields and require special treatment for large values
▪ Hexadecimal (base 16) allows a compact notation and a straightforward conversion from/to binary.
Base Conversion
▪ To convert a base 10 (decimal) number to base 16 (hexadecimal): divide repeatedly by 16, recording the remainder of each division, until the quotient is 0
▪ Read the remainders from bottom to top: 314156₁₀ = 4CB2C₁₆
▪ To convert a base 10 (decimal) number to base 2 (binary): divide repeatedly by 2 in the same way
▪ Read the remainders from bottom to top: 174₁₀ = 10101110₂
▪ To convert a number y in base b, with n digits, of the form yₙ₋₁yₙ₋₂ . . . y₁y₀ to base 10 (decimal): y = yₙ₋₁·bⁿ⁻¹ + yₙ₋₂·bⁿ⁻² + . . . + y₁·b¹ + y₀·b⁰
▪ In Python, constants starting with 0b or 0B are interpreted as binary
▪ In Python, constants starting with 0x or 0X are interpreted as hexadecimal
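The conversions above can be sketched in Python. This is a minimal illustration, not a reference implementation: the helper names to_base and from_base and the DIGITS table are assumptions introduced here and do not come from the notes.

```python
DIGITS = "0123456789ABCDEF"  # digit symbols for bases 2 to 16 (illustrative helper)

def to_base(value, base):
    """Convert a non-negative integer to a digit string by repeated division."""
    if value == 0:
        return "0"
    remainders = []
    while value > 0:
        value, remainder = divmod(value, base)
        remainders.append(DIGITS[remainder])
    # The last remainder is the most significant digit ("bottom to top").
    return "".join(reversed(remainders))

def from_base(digits, base):
    """Evaluate the positional sum y = sum(y_i * base**i)."""
    total = 0
    for position, digit in enumerate(reversed(digits)):
        total += DIGITS.index(digit) * base ** position
    return total

print(to_base(314156, 16))      # 4CB2C
print(to_base(174, 2))          # 10101110
print(from_base("4CB2C", 16))   # 314156
print(0b10101110, 0x4CB2C)      # 174 314156 (binary and hexadecimal literals)
```

Python's built-ins bin(), hex() and int(text, base) do the same conversions; the explicit loops are only there to mirror the manual method.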
Integer
Signed vs unsigned
▪ Unsigned: ordinary binary representation
o Range: from 0 to 2ⁿ − 1
▪ Signed: two’s complement
o Range: from −2ⁿ⁻¹ to 2ⁿ⁻¹ − 1

Two’s complement
➢ Converting to two’s complement representation: invert all the bits and add 1, ignoring any overflow.
➢ Examples, n = 4 bits:
5₁₀ = 0101₂; −5₁₀ = 1010₂ + 1₂ = 1011₂
2₁₀ = 0010₂; −2₁₀ = 1101₂ + 1₂ = 1110₂
➢ One special case: the two’s complement of the most negative representable number is itself.
➢ Example, n = 4 bits, where the most negative representable number is −2³ = −8:
−8₁₀ = 1000₂; negating it gives 0111₂ + 1₂ = 1000₂, which is −8₁₀ again
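The ranges and the invert-and-add-1 rule can be checked with a short Python sketch. The function names signed_range, twos_complement and negate are hypothetical helpers chosen for this illustration.

```python
def signed_range(bits):
    """Smallest and largest values of an n-bit two's complement integer."""
    return -(1 << (bits - 1)), (1 << (bits - 1)) - 1

def twos_complement(value, bits):
    """Return the n-bit two's complement pattern of value as a bit string."""
    lowest, highest = signed_range(bits)
    if not lowest <= value <= highest:
        raise ValueError(f"{value} does not fit in {bits} bits")
    return format(value & ((1 << bits) - 1), f"0{bits}b")

def negate(pattern):
    """Invert every bit, add 1, and drop any carry out of the top bit."""
    bits = len(pattern)
    mask = (1 << bits) - 1
    inverted = ~int(pattern, 2) & mask
    return format((inverted + 1) & mask, f"0{bits}b")

print(signed_range(4))          # (-8, 7)
print(twos_complement(-5, 4))   # 1011
print(negate("0101"))           # 1011
print(negate("0010"))           # 1110
print(negate("1000"))           # 1000 (the special case: -8 negates to itself)
```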
Signed vs unsigned in Python
➢ numbers.Integral represents numbers in an unlimited range, subject to available (virtual) memory only.
➢ Negative numbers are represented in a variant of 2’s complement which gives the illusion of an infinite string of sign bits extending to the left.
➢ For the purpose of shift and mask operations, a binary representation is assumed.
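A few interactive checks make this behaviour visible. This is only a sketch; the variable names are arbitrary.

```python
huge = 2 ** 100          # no overflow: the integer simply grows
low_nibble = -5 & 0xF    # masking exposes the low 4 bits of the "infinite" pattern
low_byte = -5 & 0xFF     # the same value seen through an 8-bit mask
shifted = -5 >> 1        # right shift keeps the sign (rounds toward -infinity)
all_ones = -1 >> 100     # -1 behaves like an endless run of 1 bits

print(huge.bit_length())          # 101
print(bin(-5))                    # -0b101 (printed as sign and magnitude)
print(format(low_nibble, "04b"))  # 1011
print(format(low_byte, "08b"))    # 11111011
print(shifted, all_ones)          # -3 -1
```

Note that bin() prints a minus sign and the magnitude; the two's complement view only appears once a mask selects a finite number of bits.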
Python numbers.Integral
Quoted from docs.python.org

➢ Booleans (bool)
o These represent the truth values False and True
o The two objects representing the values False and True are the only
Boolean objects.
o The Boolean type is a subtype of the integer type, and Boolean values
behave like the values 0 and 1, respectively, in almost all contexts.
o The exception being that when converted to a string, the strings “False”
or “True” are returned, respectively.
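The quoted behaviour can be observed directly. The snippet below is a small illustration; the variable names are arbitrary.

```python
import numbers

is_int = isinstance(True, int)                  # bool is a subtype of int
is_integral = isinstance(False, numbers.Integral)
total = True + True                             # behaves like 1 + 1
picked = ["no", "yes"][True]                    # True indexes like 1
even_count = sum(n % 2 == 0 for n in [1, 2, 3, 4])  # counts the True results
as_text = str(True), str(False)                 # the string exception

print(is_int, is_integral)   # True True
print(total, picked)         # 2 yes
print(even_count)            # 2
print(as_text)               # ('True', 'False')
```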
Real Numbers
➢ Numbers with a fractional component
➢ Two main representations:
o Fixed-point versus floating-point
o The radix point is either fixed or can float anywhere
o The radix point is the symbol that separates the integer and fractional parts of a real number
➢ Implementation: trade-off between cost and precision
o Lack of hardware resources → e.g. multimedia decoders
o Boosted performance at the cost of degraded precision → e.g. PlayStation, Doom
Fixed-point numbers
➢ Bits = 1 + m + n
o 1 bit for sign (if signed)
o m bits for integer component
o n bits for fraction component
➢ Notation: Qm.n
o Integer number without fraction component Qm.0
o Fractional number without integer component Qn
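As a rough sketch of the Qm.n idea, a real value can be stored as an integer scaled by 2ⁿ. The helper names to_fixed and from_fixed are assumptions made for this example, and rounding to nearest is one possible choice among several.

```python
def to_fixed(x, frac_bits):
    """Encode x as an integer scaled by 2**frac_bits (rounded to nearest)."""
    return round(x * (1 << frac_bits))

def from_fixed(raw, frac_bits):
    """Decode a scaled integer back to a float."""
    return raw / (1 << frac_bits)

# Q3.4: 1 sign bit, 3 integer bits, 4 fraction bits (resolution 1/16).
raw = to_fixed(2.75, 4)
print(raw, format(raw, "08b"))   # 44 00101100
print(from_fixed(raw, 4))        # 2.75 (exactly representable)

approx = to_fixed(0.1, 4)
print(approx, from_fixed(approx, 4))   # 2 0.125 (0.1 is not a multiple of 1/16)
```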
Fixed-point numbers: accuracy problems
➢ Precision loss and overflow
➢ Results can require more bits than operands
o Round or truncate
o Specify different size for result
➢ Bound the values to prevent overflow
➢ Exception: overflow flag
o If supported by hardware
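These effects show up in a fixed-point multiplication. The sketch below assumes an 8-bit signed format with 4 fraction bits, truncation of the extra fraction bits, and saturation as the way of bounding the result; all names and these design choices are illustrative assumptions, and the returned flag merely imitates a hardware overflow flag.

```python
FRAC_BITS = 4                       # assumed format: 1 sign + 3 integer + 4 fraction bits
TOTAL_BITS = 8
MAX_RAW = (1 << (TOTAL_BITS - 1)) - 1    # 127, i.e. 7.9375
MIN_RAW = -(1 << (TOTAL_BITS - 1))       # -128, i.e. -8.0

def fixed_mul(a_raw, b_raw):
    """Multiply two scaled integers; return (saturated result, overflow flag)."""
    wide = a_raw * b_raw            # the product carries 2 * FRAC_BITS fraction bits
    result = wide >> FRAC_BITS      # truncate back to FRAC_BITS fraction bits
    overflow = not (MIN_RAW <= result <= MAX_RAW)
    return max(MIN_RAW, min(MAX_RAW, result)), overflow

print(fixed_mul(24, 36))     # (54, False): 1.5 * 2.25 = 3.375, exact
print(fixed_mul(1, 1))       # (0, False): 0.0625 * 0.0625 is lost to truncation
print(fixed_mul(112, 112))   # (127, True): 7.0 * 7.0 saturates at 7.9375
```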
Floating-point numbers
➢ Bits = 1 + e + k
o 1 bit for sign
o e bits for exponent: stored values 1, . . ., (2ᵉ − 1) − 1 for normal numbers
o k bits for mantissa (fraction)
▪ There is an implicit leading bit equal to 1, unless the exponent field is equal to zero
➢ Most processors follow the IEEE floating-point standard (IEEE 754)
o First version in 1985
o Standardizes formats
o Special values
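To see the three fields, a Python float can be taken apart with the struct module. This sketch assumes that float is an IEEE 754 double (1 + 11 + 52 bits, exponent bias 1023), which is the usual case; decompose and rebuild are hypothetical helper names, and rebuild handles normal numbers only.

```python
import struct

def decompose(x):
    """Split a 64-bit float into its (sign, exponent, fraction) bit fields."""
    (raw,) = struct.unpack(">Q", struct.pack(">d", x))
    sign = raw >> 63
    exponent = (raw >> 52) & 0x7FF          # e = 11 bits, stored with bias 1023
    fraction = raw & ((1 << 52) - 1)        # k = 52 bits
    return sign, exponent, fraction

def rebuild(sign, exponent, fraction):
    """Recompute the value of a normal number from its fields."""
    significand = 1 + fraction / 2 ** 52    # restore the implicit leading 1
    return (-1) ** sign * significand * 2.0 ** (exponent - 1023)

print(decompose(1.0))                   # (0, 1023, 0)
print(decompose(-0.15625))              # (1, 1020, 1125899906842624)
print(rebuild(*decompose(-0.15625)))    # -0.15625
```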
Floating point reference for Intel Architecture

Floating-point numbers: support

➢ Float in Python
o float is usually implemented using double in C (64 bits). Check precision and implementation in sys.float_info
o decimal.Decimal for floating-point numbers with user-definable precision.
➢ Most 32-bit architectures include 64-bit support in the FPU (floating-point unit)
o IA-32 and x86-64 provide an 80-bit floating-point type (double-extended precision format)
➢ Quad-precision (128-bit)
o Software support
o Few architectures provide hardware support
o E.g. IBM POWER9 processors (MareNostrum 4)
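Both Python facilities can be inspected in a few lines. The values noted for sys.float_info are what a platform with IEEE 754 doubles typically reports, so treat them as an assumption about the machine; the precision of 50 digits is an arbitrary choice for the example.

```python
import sys
from decimal import Decimal, localcontext

print(sys.float_info.mant_dig)   # typically 53 (52 stored bits + implicit bit)
print(sys.float_info.max)        # largest finite float
print(sys.float_info.epsilon)    # gap between 1.0 and the next float

# decimal.Decimal: the working precision is chosen by the user.
with localcontext() as ctx:
    ctx.prec = 50
    third = Decimal(1) / Decimal(3)

print(third)   # 0. followed by fifty 3s
```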
Floating-point numbers: accuracy problems
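A well-known instance of such accuracy problems, given here as an illustrative sketch assuming IEEE 754 doubles: many decimal fractions have no exact binary representation, so small errors appear and accumulate.

```python
import math
from decimal import Decimal

a = 0.1 + 0.2
print(a == 0.3)                 # False
print(f"{a:.17f}")              # 0.30000000000000004
print(math.isclose(a, 0.3))     # True (compare with a tolerance instead)

big = 2.0 ** 53
print(big + 1 == big)           # True (integers above 2**53 are not all representable)

print(sum([0.1] * 10) == 1.0)         # False (rounding errors accumulate)
print(math.fsum([0.1] * 10) == 1.0)   # True (fsum tracks the lost digits)

print(Decimal("0.1") + Decimal("0.2") == Decimal("0.3"))   # True in decimal arithmetic
```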
Symbols and Characters

Scalar data type: symbols and characters
➢ Char data type: encodes alphanumeric data and symbols
➢ Several character set encodings
o ASCII code (American Standard Code for Information Interchange)
o Adopted by microcomputers
o The standard for early HTML
o Single byte using the bottom 7 bits: values from 0 to 127
o The 128 values include the printable Latin letters A-Z (65-90) and a-z (97-122), and the digits 0-9 (48-57)
o Many common punctuation characters
o Several non-displayable device control codes (0-31 and 127)
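The code ranges listed above can be explored with ord() and chr(). This is a minimal sketch; the variable names are arbitrary.

```python
upper_a = ord("A")                   # 65
lower_a = chr(97)                    # 'a'
digit_value = ord("7") - ord("0")    # digits are contiguous from 48
toggled = chr(ord("a") ^ 0x20)       # upper and lower case differ in one bit

# Code points below 128 that Python does not consider printable.
control_codes = [c for c in range(128) if not chr(c).isprintable()]

print(upper_a, lower_a, digit_value, toggled)   # 65 a 7 A
print(control_codes[:3], control_codes[-1])     # [0, 1, 2] 127
print(len(control_codes))                       # 33 (codes 0-31 and 127)
print("hola".isascii(), "niño".isascii())       # True False
```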
American Standard Code for Information Interchange

Several character set encodings
➢ 128 symbols are not enough!
➢ What about the last bit (128-255)?
o Windows-1252 code (CP-1252): also called ANSI
▪ It is a superset of ISO-8859-1 (more printable characters)
▪ Used by default on legacy components of Microsoft Windows
▪ ANSI comprises 8 bits: 1 additional bit compared to ASCII
➢ ISO-8859-1 code (Latin alphabet)
o 1 full byte (256 characters): extension to ASCII
o The standard from HTML 2.0 to HTML 4.01
o The Latin-1 code page defines many characters and symbols used by Latin-based languages
o IBM used code page 437, which defines glyphs for non-printable characters
o Shells allow the user to change code pages, which causes the terminal to display different characters
➢ Nevertheless, 256 symbols are not enough!
➢ The solution: Unicode
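The effect of choosing one code page or another can be sketched with Python's built-in codecs ("latin-1", "cp1252" and "cp437" are standard codec names). The sample characters are arbitrary choices for the illustration.

```python
text = "é"
latin1 = text.encode("latin-1")       # b'\xe9'
cp1252 = text.encode("cp1252")        # same byte: CP-1252 agrees with Latin-1 here

euro_cp1252 = "€".encode("cp1252")    # b'\x80': one of the extra CP-1252 characters
try:
    "€".encode("latin-1")
    euro_in_latin1 = True
except UnicodeEncodeError:
    euro_in_latin1 = False            # ISO-8859-1 has no euro sign

# The same byte read under a different code page shows a different character.
as_cp437 = latin1.decode("cp437")

print(latin1, cp1252, euro_cp1252)
print(euro_in_latin1)
print(as_cp437 == text)               # False
```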
Handling Unicode
➢ Unicode, or ISO/IEC 10646, is an international standard defining every character/glyph used in almost every writing system on Earth
➢ Unicode also defines several character encodings:
o UTF-8: 1 byte for the first 128 code points (maintaining compatibility with ASCII), and 1 to 3 additional bytes (4 bytes in total at most) for other characters
o UTF-16/UCS-2: UCS-2 (historically used internally by Windows) uses 2 bytes for each character and supports encoding only the first 65536 code points (known as the Basic Multilingual Plane, BMP). UTF-16 extends UCS-2 by incorporating a 4-byte encoding for the 16 additional planes of characters
o UTF-32: 4 bytes per character
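The byte counts can be compared by encoding a few sample characters. This is an illustrative sketch: the four samples are arbitrary, and the "-le" codec variants are used so that no byte order mark is added to the lengths.

```python
samples = ["A", "\u00e9", "\u20ac", "\U0001F600"]   # A, e-acute, euro sign, an emoji

utf8_lengths = [len(ch.encode("utf-8")) for ch in samples]
utf16_lengths = [len(ch.encode("utf-16-le")) for ch in samples]
utf32_lengths = [len(ch.encode("utf-32-le")) for ch in samples]

print([hex(ord(ch)) for ch in samples])   # ['0x41', '0xe9', '0x20ac', '0x1f600']
print(utf8_lengths)    # [1, 2, 3, 4]
print(utf16_lengths)   # [2, 2, 2, 4] (the emoji lies outside the BMP)
print(utf32_lengths)   # [4, 4, 4, 4]
```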

Logical Operations
Basic operation (addition)
➢ Operands: p and q, n bits each
➢ True sum: n + 1 bits
➢ Standard addition ignores the carry bit output.
➢ Implements modular arithmetic: (p + q) mod 2ⁿ
Example: p = 5, q = 4, n = 3: 101₂ + 100₂ = 1001₂; dropping the carry leaves 001₂ = 1 = (5 + 4) mod 8.
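The same n-bit addition can be imitated in Python with a mask. The function name add_n_bits is an assumption made for this sketch.

```python
def add_n_bits(p, q, n):
    """Add two n-bit unsigned values; return (n-bit result, carry out)."""
    true_sum = p + q                  # may need n + 1 bits
    mask = (1 << n) - 1
    carry = true_sum >> n             # the bit that standard addition ignores
    return true_sum & mask, carry     # masking is the same as taking mod 2**n

result, carry = add_n_bits(5, 4, 3)
print(result, carry)        # 1 1
print((5 + 4) % 2 ** 3)     # 1
```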
Combinational circuits
Bit-level operations in Python
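As a brief illustration of Python's bit-level operators (&, |, ^, ~, << and >>), with arbitrary example values:

```python
a, b = 0b1100, 0b1010

and_bits = a & b          # 0b1000
or_bits = a | b           # 0b1110
xor_bits = a ^ b          # 0b0110
inverted = ~a             # -13: equals -(a + 1) on unbounded integers
inverted_4bit = ~a & 0xF  # 0b0011 once masked to 4 bits
left = a << 2             # 48: multiply by 4
right = a >> 2            # 3: integer division by 4

# Common idioms: set, test and clear bit 3 of a flags word.
flags = 0
flags |= 1 << 3
bit3_set = bool(flags & (1 << 3))
flags &= ~(1 << 3)

print(format(and_bits, "04b"), format(or_bits, "04b"), format(xor_bits, "04b"))
print(inverted, format(inverted_4bit, "04b"), left, right)
print(bit3_set, flags)    # True 0
```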

Logical operations in Python
➢ Three Boolean operators: and, or, and not.
➢ bool type with two values: True and False.
➢ Any non-zero number is interpreted as True.
➢ Relational operators: ==, !=, <, >, <=, >=
➢ Boolean expressions combine relational and Boolean operators
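A few Boolean expressions, as an illustrative sketch with arbitrary values:

```python
x = 7

in_range = 0 < x <= 10                 # relational operators can be chained
both = x > 5 and x % 2 == 0            # False: 7 is not even
either = x < 0 or x == 7               # True
negated = not x                        # False: a non-zero number counts as True
truthy = [bool(v) for v in (0, -3, 0.0, 2.5)]

# and / or short-circuit, so the division is never evaluated here.
divisor = 0
safe = divisor != 0 and 10 / divisor > 1

# or returns one of its operands, which gives a simple fallback idiom.
fallback = 0 or "default"

print(in_range, both, either, negated)   # True False True False
print(truthy)                            # [False, True, False, True]
print(safe, fallback)                    # False default
```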
Conclusion
➢ Memory is full of binary digits
➢ Hardware and software interpret bits
o Following some kind of encoding
▪ Natural binary, two’s complement, fixed point, floating point, Unicode, . . .
➢ There are a lot of scalar data types
o Integers, floats, chars, Booleans
o Bitmaps (enumeration), pointers, . . .
➢ And even more aggregate data types
o Arrays, structures, classes, unions, strings, heaps, stacks, dictionaries, . . .
