0% found this document useful (0 votes)

22 views23 pages

Floating Point Numbers

Uploaded by

Khaled Alshurman

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

22 views23 pages

Floating Point Numbers

Uploaded by

Khaled Alshurman

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 23

Floating Point

Numbers
Topics Covered

 Fixed point Numbers

 Representation of Floating Point Numbers

IEEE 32-bit floating point number.

 Floating point Arithmetic

Fixed Point Numbers

 The binary (or decimal) point is assumed

to be in a fixed position

 Base 10 fixed point arithmetic:

7632135 763.2135
1794821 179.4821
9426956 942.6956
Fixed Point (Binary) Numbers
 Example: Add 3.625 and 6.5
1. Convert the numbers to 8-bit form (4-bit int, 4-bit fraction):
3.625  11.101  0011.1010
6.500  110.10  0110.1000

2. Consider the numbers having an imaginary binary point and

added in the normal way:

00111010 + 01101000 = 10100010

3. The integer part of the result is converted to 10, and the

fractional part is interpreted as .125. Therefore, the result is
10.125.
Problem with Fixed Point
(Binary) Numbers

 Some systems require a large range of

numbers:

1. Mass of sun:
1990000000000000000000000000000000
grams
Requires about 14 bytes

2. Mass of electron:
000000000000000000000000000910956
grams
Requires about 12 bytes
Floating Point Numbers
Definitions

 Range
 How small and how large the numbers can be.

 Precision
 The number of significant figures used to represent the
number.
 A measure of a number’s exactness.
 PI = 3.141592 is more precise that PI = 3.14

 Accuracy
 A measure of the correctness of a number.
 PI = 3.241592 is more precise than PI = 3.14, but
 PI = 3.14 is more accurate.
IEEE Floating Point Numbers
Single Precision Format

-1s * 2E-B * 1.F

B = 127
IEEE Floating Point Numbers
Range of Mantissa

 A floating point mantissa is limited to one of the three ranges:

-2 < x <= -1
x = 0
+1 <= x < +2
IEEE Floating Point Numbers
Exponent

Binary Value True Biased Special Numbers

Exponent Exponent
0000 0000 -127 0 zero
0000 0001 -126 1
0000 0010 -125 2
0000 0100 -124 3
. . .
1000 0000 0 128
. . .
1111 1100 125 252
1111 1101 126 253
1111 1110 127 254
1111 1111 128 255 +- Infinity
IEEE Floating Point Numbers
Excess - n

 The stored exponent is also called excess – n, or

excess 127, for the IEEE single precision format.

 The stored exponent exceeds the true exponent by

127, the bias.

 b’ = b + 127
where b’ is the biased exponent, and b is the true
exponent.
 Examples:
 If the true exponent is 2, the exponent is stored in biased form as
2 + 127 = 1000 0001.
 If the stored exponent is 0000 0001, the true exponent is
1 – 127 = -126.
IEEE Floating Point Numbers
Representation of Zero

 The smallest stored exponent 0000 0000 (in biased

form), corresponding to a true exponent of -127, is
used to represent zero.
IEEE Floating Point Numbers
Infinity and Not a Number (NaN)

 1111 1111  used as +- infinity.

 1111 1111 and Mantissa != 0  used as

NaN.
IEEE Floating Point Numbers
Example Representation

 Represent -2345.125 as a single precision IEEE

floating point number.

 -2345.12510 = -100100101001.0012

 -2345.12510 = -1.001001010010012 x 211

 S = 1 (negative)
 The biased exponent is 11 + 127 = 138 =
100010102
 The fractional part of the mantissa
is .00100101001001000000000
 Therefore, -2345.125
10 =
1 10001010 00100101001001000000000
Numbers
Addition and Subtraction
Flowchart
IEEE Floating Point Numbers
Arithmetic Example #1

1. Convert the decimal numbers 123.5 and 100.25 into the IEEE
32-bit floating point number representation. Then carry out the
subtraction of 123.5 – 100.25 and express the result as a
normalized 32-bit floating point number.

 123.510 = 1111011.12 = 1.1110111 x 26

 The mantissa is positive, and so S = 0.
 The exponent is +6, which is stored in biased
form as 6 + 127 = 13310 = 100001012.
 The mantissa is 1.1110111, which is stored in 23-
bits, with the leading ‘1’ suppressed.
 Therefore, 123.510 is stored as:
0 10000101 11101110000000000000000IEEE
IEEE Floating Point Numbers
Arithmetic Example #1 (Continued)

 100.2510 = 1100100.012 = 1.10010001 x 26

 The mantissa is positive, and so S = 0.
 The exponent is +6, which is stored in biased
form as 6 + 127 = 13310 = 100001012.
 The mantissa is 1.10010001, which is stored in
23-bits, with the leading ‘1’ suppressed.
 Therefore, 100.2510 is stored as:
0 10000101 10010001000000000000000IEEE
IEEE Floating Point Numbers
Arithmetic Example #1 (Continued)

 The two IEEE numbers are first unpacked: the

sign, exponent, and mantissa must be
reconstituted.
 The two exponents are compared. If they are the
same, the mantissas are added. If they are not,
the number with the smaller exponent is
denormalized by shifting its mantissa right
(i.e., dividing by 2) and incrementing its
exponent (i.e., multiplying by 2) until the two
exponents are equal. Then the numbers are added.
IEEE Floating Point Numbers
Arithmetic Example #1 (Continued)

1. Convert the decimal numbers 123.5 and 100.25 into the 32-bit
floating point number representation. Then carry out the
subtraction of 123.5 – 100.25 and express the result as a
normalized 32-bit floating point number. (Continued)

 After unpacking, insert the leading ‘1’

and perform the subtraction.
1.11101110000000000000000
-1.10010001000000000000000
0.01011101000000000000000
 Normalize the result:
1.01110100000000000000000
IEEE Floating Point Numbers
Arithmetic Example #1 (Continued)

 The exponent must be decreased by 2.

 10000101 – 210 = 10000011

 The result expressed in IEEE format is:

0 10000011 01110100000000000000000
IEEE Floating Point Numbers
Arithmetic Example #2

2. Convert the decimal numbers 42.6875 and -0.09375 into the

IEEE 32-bit floating point number representation. Then carry
out the addition of 42.6875 and – 0.09375 and express the
result as a normalized 32-bit floating point number.

 42.687510 = 101010.10112 = 1.010101011 x 25

 The mantissa is positive, and so S = 0.
 The exponent is +5, which is stored in biased
form as 5 + 127 = 13210 = 100001002.
 The mantissa is 1.010101011, which is stored in
23-bits, with the leading ‘1’ suppressed.
 Therefore, 42.687510 is stored as:
0 10000100 01010101100000000000000IEEE
IEEE Floating Point Numbers
Arithmetic Example #2 (Continued)

2. Convert the decimal numbers 42.6875 and -0.09375 into the

IEEE 32-bit floating point number representation. Then carry
out the addition of 42.6875 – 0.09375 and express the result as
a normalized 32-bit floating point number (continued).

 -0.0937510 = -0.000112 = -1.1 x 2-4

 The mantissa is negative, and so S = 1.
 The exponent is -4, which is stored in biased
form as -4 + 127 = 12310 = 011110112.
 The mantissa is 1.1, which is stored in 23-bits,
with the leading ‘1’ suppressed.
 Therefore, -0.0937510 is stored as:
1 01111011 10000000000000000000000IEEE
IEEE Floating Point Numbers
Arithmetic Example #2 (Continued)

2. …
+42.687510:0 10000100 101010101100000000000000
-0.0937510:1 01111011 110000000000000000000000

 In order to perform the addition, the

exponents must be the same.
 Increase the second exponent by 9 and shift
the mantissa right 9 times to get:
+42.687510:0 10000100 101010101100000000000000
-0.0937510:1 10000100 000000000110000000000000000000000
IEEE Floating Point Numbers
Arithmetic Example #2 (Continued)

2. …
+42.687510:0 10000100 101010101100000000000000
-0.0937510:1 10000100 000000000110000000000000000000000

 Adding the mantissas, we get:

101010100110000000000000

 The result is positive with a biased

exponent of 10000100.
 Therefore, the result is stored as:

0 10000100 0101010011000000000000

Lec07 - Computer Arithmetic - Floating-Point Representation and Arithmetic
No ratings yet
Lec07 - Computer Arithmetic - Floating-Point Representation and Arithmetic
42 pages
08m13e14 - CAM2 Measure v10.7 - FaroArm Training Workbook - September 2017
100% (1)
08m13e14 - CAM2 Measure v10.7 - FaroArm Training Workbook - September 2017
542 pages
SimSXCu Full Version 2.0
60% (5)
SimSXCu Full Version 2.0
345 pages
Floating Point Numbers 237045407 237045407
No ratings yet
Floating Point Numbers 237045407 237045407
20 pages
Floating Point Number Representation
No ratings yet
Floating Point Number Representation
21 pages
Floating Point Representation: Reading: B&O 2.4
No ratings yet
Floating Point Representation: Reading: B&O 2.4
44 pages
9-Algorithms For Floating Point Arithmetic Operations-22-01-2024
No ratings yet
9-Algorithms For Floating Point Arithmetic Operations-22-01-2024
49 pages
Floating Point
No ratings yet
Floating Point
26 pages
#3 - Floating Point
No ratings yet
#3 - Floating Point
38 pages
IEEE Standard 754
No ratings yet
IEEE Standard 754
10 pages
arch1-LECTURE-NUMBER REPRESENTATION
No ratings yet
arch1-LECTURE-NUMBER REPRESENTATION
42 pages
Module 2 - PART D Floating
No ratings yet
Module 2 - PART D Floating
30 pages
Floating Points
No ratings yet
Floating Points
31 pages
Fixed and Floating Point Numbers: Dr. Ashish GUPTA Sense, Vit-Ap Ashish - Gupta@vitap - Ac.in
No ratings yet
Fixed and Floating Point Numbers: Dr. Ashish GUPTA Sense, Vit-Ap Ashish - Gupta@vitap - Ac.in
34 pages
Floating Point Number Representation
No ratings yet
Floating Point Number Representation
27 pages
Unit-1 COA
No ratings yet
Unit-1 COA
26 pages
4.4 - 1 New Floating Point
No ratings yet
4.4 - 1 New Floating Point
22 pages
Module2.1 of Nothing
No ratings yet
Module2.1 of Nothing
7 pages
Floating Point Representation
No ratings yet
Floating Point Representation
17 pages
COA Module6 FloatingPoint
No ratings yet
COA Module6 FloatingPoint
17 pages
Floating Point
No ratings yet
Floating Point
26 pages
Floating Point Numbers: The Architecture of Computer Hardware and Systems Software
No ratings yet
Floating Point Numbers: The Architecture of Computer Hardware and Systems Software
28 pages
3-EED220 Lecture 3
No ratings yet
3-EED220 Lecture 3
22 pages
16-Algorithms For Floating Point Arithmetic Operations and Numericals-01-02-2024
No ratings yet
16-Algorithms For Floating Point Arithmetic Operations and Numericals-01-02-2024
21 pages
10 MIPS Floating Point Arithmetic
No ratings yet
10 MIPS Floating Point Arithmetic
28 pages
Floating Point Representation
No ratings yet
Floating Point Representation
19 pages
Floating-Point Arithmetic Operations (Aligning The Mantissas - Biased Exponent - Overflow)
No ratings yet
Floating-Point Arithmetic Operations (Aligning The Mantissas - Biased Exponent - Overflow)
18 pages
Part 5 Floating Point Add Sub Mul
No ratings yet
Part 5 Floating Point Add Sub Mul
20 pages
Lecture 05 - Floating Point Numbers
No ratings yet
Lecture 05 - Floating Point Numbers
28 pages
Floating Point Representation: Major: All Engineering Majors Authors: Autar Kaw, Matthew Emmons
No ratings yet
Floating Point Representation: Major: All Engineering Majors Authors: Autar Kaw, Matthew Emmons
21 pages
Bhanu CV (3) - Testing Latest
No ratings yet
Bhanu CV (3) - Testing Latest
8 pages
Floating Point
No ratings yet
Floating Point
16 pages
Lecture 4 - Computer Arithmetic
No ratings yet
Lecture 4 - Computer Arithmetic
18 pages
L-5 Floating Point Representation of Numbers
No ratings yet
L-5 Floating Point Representation of Numbers
12 pages
Week 5: IEEE Floating Point Revision Guide For Phase Test
No ratings yet
Week 5: IEEE Floating Point Revision Guide For Phase Test
23 pages
How To Represent Real Numbers: - in Decimal Scientific Notation
No ratings yet
How To Represent Real Numbers: - in Decimal Scientific Notation
16 pages
Floating Point Numbers: CS031 September 12, 2011
No ratings yet
Floating Point Numbers: CS031 September 12, 2011
22 pages
Floating-Point Arithmetic Floating-Point Arithmetic Floating-Point Arithmetic Floating-Point Arithmetic Floating-Point Arithmetic 33333
No ratings yet
Floating-Point Arithmetic Floating-Point Arithmetic Floating-Point Arithmetic Floating-Point Arithmetic Floating-Point Arithmetic 33333
18 pages
COA
No ratings yet
COA
14 pages
BCS302 Unit-2 (Part-III)
No ratings yet
BCS302 Unit-2 (Part-III)
7 pages
Floating Point Numbers: Do You Have Your Laptop Here?
No ratings yet
Floating Point Numbers: Do You Have Your Laptop Here?
10 pages
Floating Point Alu
No ratings yet
Floating Point Alu
11 pages
Floating Point Tutorial
No ratings yet
Floating Point Tutorial
15 pages
Floating-Point Numbers and Operations Representation
No ratings yet
Floating-Point Numbers and Operations Representation
8 pages
Fuzzy Logic Practice Question
No ratings yet
Fuzzy Logic Practice Question
5 pages
Ieee Standard For Floating Point Numbers
No ratings yet
Ieee Standard For Floating Point Numbers
5 pages
EC-502 - Aritra Dutta
No ratings yet
EC-502 - Aritra Dutta
6 pages
The IEEE Standard For Floating Point Arithmetic
No ratings yet
The IEEE Standard For Floating Point Arithmetic
9 pages
8.1.4 Data Representation - Floatng Point Numbers
No ratings yet
8.1.4 Data Representation - Floatng Point Numbers
3 pages
The Conversion Procedure (Decimal To Floating Point)
No ratings yet
The Conversion Procedure (Decimal To Floating Point)
8 pages
Floating Point Numbers
No ratings yet
Floating Point Numbers
26 pages
Data Representation Workbook
No ratings yet
Data Representation Workbook
8 pages
Computer Organisation
No ratings yet
Computer Organisation
4 pages
Decimal To Floating-Point Conversions: The Conversion Procedure
No ratings yet
Decimal To Floating-Point Conversions: The Conversion Procedure
5 pages
IEEE Paper On Floating Point
No ratings yet
IEEE Paper On Floating Point
28 pages
Floating Point Representation: Major: All Engineering Majors Authors: Autar Kaw, Matthew Emmons
No ratings yet
Floating Point Representation: Major: All Engineering Majors Authors: Autar Kaw, Matthew Emmons
21 pages
IEEE FP Representation
No ratings yet
IEEE FP Representation
3 pages
Scientific Computation (Floating Point Numbers)
No ratings yet
Scientific Computation (Floating Point Numbers)
4 pages
Wolfram Von Eschenbach-Parzival
No ratings yet
Wolfram Von Eschenbach-Parzival
358 pages
Unicast Routing - Mukesh
No ratings yet
Unicast Routing - Mukesh
29 pages
Project Report e Governance
0% (2)
Project Report e Governance
19 pages
Fast Track Item Creation Process in Oracle Inevntory Using Workflow
100% (8)
Fast Track Item Creation Process in Oracle Inevntory Using Workflow
43 pages
Decimal To Binary Conversion
No ratings yet
Decimal To Binary Conversion
4 pages
Floating Point Numbers
No ratings yet
Floating Point Numbers
27 pages
MRP Document
100% (1)
MRP Document
103 pages
Dokumen - Tips - Project Management Fundamentals 5925c114d0ff3
No ratings yet
Dokumen - Tips - Project Management Fundamentals 5925c114d0ff3
35 pages
Mobile Application Design & Development
No ratings yet
Mobile Application Design & Development
206 pages
Lab 1
100% (1)
Lab 1
10 pages
7BCEE2A-Digital Principles and Computer Organization
No ratings yet
7BCEE2A-Digital Principles and Computer Organization
111 pages
Monkey Banana Problem Solution in AI
No ratings yet
Monkey Banana Problem Solution in AI
8 pages
EET206-M3 Ktunotes - in
No ratings yet
EET206-M3 Ktunotes - in
119 pages
EET206 M4 Ktunotes - in
No ratings yet
EET206 M4 Ktunotes - in
195 pages
Floating Point Numbers
No ratings yet
Floating Point Numbers
7 pages
Impact of E-Commerce and Use of ICT
No ratings yet
Impact of E-Commerce and Use of ICT
48 pages
Sharp - Le810-Le820 Series Update
67% (3)
Sharp - Le810-Le820 Series Update
4 pages
Pyregex
No ratings yet
Pyregex
71 pages
EET206-M2 Ktunotes - in
No ratings yet
EET206-M2 Ktunotes - in
127 pages
Maybank Services
0% (1)
Maybank Services
3 pages
EET206-M5 Ktunotes - in
No ratings yet
EET206-M5 Ktunotes - in
115 pages
Lectures On Power System Analysis: Dr. Kassim Al-Anbarri
No ratings yet
Lectures On Power System Analysis: Dr. Kassim Al-Anbarri
31 pages
Craft Software For Dummies
No ratings yet
Craft Software For Dummies
18 pages
Guide For Foreign Instructors - NKU&HBPU
No ratings yet
Guide For Foreign Instructors - NKU&HBPU
22 pages
Bis Eccn
No ratings yet
Bis Eccn
14 pages
Chapter 2
No ratings yet
Chapter 2
15 pages
Foliage Design Tutorial
No ratings yet
Foliage Design Tutorial
45 pages
Secugen Biometric Device Installation & Configuration of Java Settings
No ratings yet
Secugen Biometric Device Installation & Configuration of Java Settings
19 pages
Answers Siebel Set I
No ratings yet
Answers Siebel Set I
5 pages
Design & Implementation of Image Compression Using Huffman Coding Through VHDL, Kumar Keshamoni
No ratings yet
Design & Implementation of Image Compression Using Huffman Coding Through VHDL, Kumar Keshamoni
7 pages
Zfsboottalk Solaris Boot Zfsboot
No ratings yet
Zfsboottalk Solaris Boot Zfsboot
38 pages
2150708
No ratings yet
2150708
3 pages
Unit 2 Question Bank
No ratings yet
Unit 2 Question Bank
4 pages
All About Cropping
No ratings yet
All About Cropping
4 pages
SPL Ree Fracas
No ratings yet
SPL Ree Fracas
1 page
Pl1 - DB2 Examples How To Use Cursors. Example 1
No ratings yet
Pl1 - DB2 Examples How To Use Cursors. Example 1
2 pages
Yosua Prima Gultom Kno Oerjxd Bdo Flight - Originating
No ratings yet
Yosua Prima Gultom Kno Oerjxd Bdo Flight - Originating
2 pages
Apexion Dental Products and Services
No ratings yet
Apexion Dental Products and Services
1 page
Principles of Digital Electronics
From Everand
Principles of Digital Electronics
Sapana Rane
No ratings yet
GCSE Maths Teachers Pack V11
From Everand
GCSE Maths Teachers Pack V11
Clive W. Humphris
No ratings yet

Floating Point Numbers

Uploaded by

Floating Point Numbers

Uploaded by

Floating Point

 Fixed point Numbers

 Representation of Floating Point Numbers

 Floating point Arithmetic

 The binary (or decimal) point is assumed

 Base 10 fixed point arithmetic:

2. Consider the numbers having an imaginary binary point and

00111010 + 01101000 = 10100010

3. The integer part of the result is converted to 10, and the

 Some systems require a large range of

-1s * 2E-B * 1.F

 A floating point mantissa is limited to one of the three ranges:

Binary Value True Biased Special Numbers

 The stored exponent is also called excess – n, or

 The stored exponent exceeds the true exponent by

 The smallest stored exponent 0000 0000 (in biased

 1111 1111  used as +- infinity.

 1111 1111 and Mantissa != 0  used as

 Represent -2345.125 as a single precision IEEE

 -2345.12510 = -1.001001010010012 x 211

 123.510 = 1111011.12 = 1.1110111 x 26

 100.2510 = 1100100.012 = 1.10010001 x 26

 The two IEEE numbers are first unpacked: the

 After unpacking, insert the leading ‘1’

 The exponent must be decreased by 2.

 The result expressed in IEEE format is:

2. Convert the decimal numbers 42.6875 and -0.09375 into the

 42.687510 = 101010.10112 = 1.010101011 x 25

2. Convert the decimal numbers 42.6875 and -0.09375 into the

 -0.0937510 = -0.000112 = -1.1 x 2-4

 In order to perform the addition, the

 Adding the mantissas, we get:

 The result is positive with a biased

You might also like