0% found this document useful (0 votes)
21 views

Floating Point Representation of Data: By-Astha Jain Class-It1 0827IT171019

This document discusses floating point numbers and their representation. It covers: - Floating point numbers use scientific notation to represent very large and small numbers. - They are represented by a sign bit, exponent field, and fraction field according to the IEEE 754 standard. - Normalized floating point numbers have a hidden leading 1 for the fraction, while de-normalized numbers use a leading 0 to represent zero. - Floating point addition, subtraction, and multiplication may require re-aligning operands, normalizing results, and can result in errors like overflow or underprecision.

Uploaded by

Ashi Jain
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
21 views

Floating Point Representation of Data: By-Astha Jain Class-It1 0827IT171019

This document discusses floating point numbers and their representation. It covers: - Floating point numbers use scientific notation to represent very large and small numbers. - They are represented by a sign bit, exponent field, and fraction field according to the IEEE 754 standard. - Normalized floating point numbers have a hidden leading 1 for the fraction, while de-normalized numbers use a leading 0 to represent zero. - Floating point addition, subtraction, and multiplication may require re-aligning operands, normalizing results, and can result in errors like overflow or underprecision.

Uploaded by

Ashi Jain
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 16

Floating Point

Representation of Data

BY-ASTHA JAIN
CLASS-IT1
0827IT171019
Contents….
 Floating Point Numbers
 Floating Point
Representation
 IEEE Floating Point
Standard
 Normalised Form
 De-Normalised Form
 Floating Point operations
and their Flowchart
 Addition
 Subtraction
 Multiplication
 Thank-You
The World is Not Just Integers

 Programming languages support numbers with fraction


Called floating-point numbers
Examples:
3.14159265… (π)
2.71828… (e)
0.000000001 or 1.0 × 10–9 (seconds in a nanosecond)
86,400,000,000,000 or 8.64 × 1013 (nanoseconds in a day)
last number is a large integer that cannot fit in a 32-bit integer
 We use a scientific notation to represent
Very small numbers (e.g. 1.0 × 10–9)
Very large numbers (e.g. 8.64 × 1013)
Scientific notation: ± d. f1f2f3f4 … × 10 ± e1e2e3 The World is Not Just Integers
Floating Point Numbers?

 Examples of floating-point numbers in base 10 …


5.341×103 , 0.05341×105 , –2.013×10–1 , –201.3×10–3
Examples of floating-point numbers in base 2 …
1.00101×223 , 0.0100101×225 , –1.101101×2–3 , –1101.101×2–6
Exponents are kept in decimal for clarity
The binary number (1101.101)2 = 23+22+20+2–1+2–3 = 13.625
Floating-point numbers should be normalized
Exactly one non-zero digit should appear before the point
In a decimal number, this digit can be from 1 to 9
In a binary number, this digit should be 1
Normalized FP Numbers: 5.341×103 and –1.101101×2–3
NOT Normalized: 0.05341×105 and –1101.101×2–6
Floating-Point Representation

 A floating-point number is represented by the triple


S is the Sign bit (0 is positive and 1 is negative)
Representation is called sign and magnitude
E is the Exponent field (signed)
Very large numbers have large positive exponents
Very small close-to-zero numbers have negative exponents
More bits in exponent field increases range of values
F is the Fraction field (fraction after binary point)
More bits in fraction field improves the precision of FP numbers
Value of a floating-point number = (-1)S × val(F) × 2val€
IEEE 754 Floating-Point Standard

 Found in virtually every computer invented since 1980


Simplified porting of floating-point numbers
Unified the development of floating-point algorithms
Increased the accuracy of floating-point numbers
 Single Precision Floating Point Numbers (32 bits)
1-bit sign + 8-bit exponent + 23-bit fraction
 Double Precision Floating Point Numbers (64 bits)
1-bit sign + 11-bit exponent + 52-bit fraction
Normalized Floating Point Numbers

 For a normalized floating point number (S, E, F)


 Significand is equal to (1.F)2 = (1.f1f2f3f4…)2
IEEE 754 assumes hidden 1. (not stored) for normalized numbers
 Significand is 1 bit longer than fraction
 Value of a Normalized Floating Point Number is
(–1)S × (1.F)2 × 2val(E)
(–1)S × (1.f1f2f3f4 …)2 × 2val(E)
(–1)S × (1 + f1×2-1 + f2×2-2 + f3×2-3 + f4×2-4 …)2 × 2val(E)
(–1)S is 1 when S is 0 (positive), and –1 when S is 1 (negative)
De-Normalized Form

 Normalized form has a serious problem, with an implicit leading 1 for the fraction,
it cannot represent the number zero!
 De-normalized form was devised to represent zero and other numbers.
 For E=0, the numbers are in the de-normalized form. An implicit leading 0
(instead of 1) is used for the fraction; and the actual exponent is always -126.
 Hence, the number zero can be represented with E=0 and F=0 (because 0.0×2^-126=0).
Floating Point Operations

 Addition
 Subtraction
 Multiplication
Floating Point Addition

 Floating Point Addition


• To add two floating point values, they have to be aligned so that they
have the same exponent.
• After addition, the sum may need to be normalized.
• Potential errors include overflow, underflow and inexact results.
• Examples:
2.34 ´ 103 6.22´ 108 + 0.88 ´ 103 + 3.93 ´108 3.22 ´ 103 10.15 ´108 = 1.015 ´10^8
Flowchart
Floating Point Subtraction

• Subtracting floating point values also requires re-alignment so that they


have the same exponent.
• After subtraction, the difference may need to be normalized.
• Potential errors include overflow, underflow and inexact results, and the
difference may have one signficant bit less than the operands..
• Examples: 2.34 ´ 103 6.44 ´ 104 -0.88 ´ 103 - 6.23 ´104 1.46 ´ 103 0.21 ´104 =
2.1´103
Flowchart
Floating Point Multiplication

• Multiplying floating point values does not requires realignment - realigning


may lead to loss of significance.
• After multiplication, the product may need to be normalized.
• Potential errors include overflow, underflow and inexact results.
• Examples: 2.4 ´ 10-3 ´ 6.3 ´ 102 15.12 ´101 = 1.512´102
Flowchart

You might also like