
IEEE 754 and machine numbers

I've been trying to wrap my head around machine numbers like the unit roundoff (u) and epsilon (e) in combination with the IEEE 754 standard. My textbook states some things that don't really make sense to me.

Unit roundoff according to my textbook is:

  • for single precision (23-bit mantissa): u = 6e-8
  • for double precision (52-bit mantissa): u = 2e-16

I've been trying to derive a formula for these results with two relations:

  • my textbook states: "In binary arithmetic with rounding we usually have e = 2*u"
  • e = 2^-n, n being the number of mantissa bits

Combining these gives u = 2^-(n+1), again with n being the number of mantissa bits. Checking this formula against the given values of u for the two precisions:

for single: u = 2^-(23+1) = 5.96e-8, which checks out.
for double: u = 2^-(52+1) = 1.11e-16, which doesn't check out.
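The two checks can be reproduced numerically, e.g. in Python:

```python
# Candidate formula u = 2^-(n+1), checked against the textbook's values.
u_single = 2.0 ** -(23 + 1)
u_double = 2.0 ** -(52 + 1)

print(u_single)   # ≈ 5.96e-8, matches the stated 6e-8
print(u_double)   # ≈ 1.11e-16, does not match the stated 2e-16
```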

Could someone please help me derive a correct formula for the unit roundoff, or point out the mistakes I have been making? All help is appreciated.

This appears to be an error in your textbook.

The significands of the IEEE-754 basic 32- and 64-bit binary floating-point formats are 24 and 53 bits, respectively.¹ It is sometimes stated that the significands are 23 bits and 52 bits, but this is a mistake. Those are the sizes of the main fields for encoding the significands: the full 24-bit significand is encoded with 23 bits in the main significand field and 1 bit in the exponent field. Similarly, the full 53-bit significand is encoded with 52 bits in the main significand field and 1 bit in the exponent field. (The leading bit of the full significand comes from the exponent field: if the exponent field is zero, the leading significand bit is 0. If the exponent field is neither zero nor all ones, the leading significand bit is 1. If the exponent field is all ones, the floating-point object is a special value, either an infinity or a NaN.)
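As a sketch of how that encoding can be inspected, here is a small Python helper (written for this answer, not a library function) that unpacks the three fields of a binary32 value and reconstructs the implicit leading significand bit:

```python
import struct

def decode_float32(x):
    # Re-interpret x as an IEEE-754 binary32 bit pattern.
    bits = struct.unpack(">I", struct.pack(">f", x))[0]
    sign = bits >> 31
    exponent = (bits >> 23) & 0xFF   # 8-bit exponent field
    fraction = bits & 0x7FFFFF       # 23-bit main significand field
    # The 24th (leading) significand bit is not stored: it is 1 for
    # normal numbers (exponent field neither all zeros nor all ones)
    # and 0 for zeros and subnormals (exponent field all zeros).
    leading = 1 if 0 < exponent < 0xFF else 0
    return sign, exponent, leading, fraction

print(decode_float32(1.0))   # (0, 127, 1, 0): significand is 1.000...0
```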

When the leading bit of the 24-bit significand represents the value 1, the least significant bit represents the value 2^-23. That is the so-called epsilon. When a real number is being rounded to the nearest representable floating-point value, the maximum error is half the value of the least significant bit. (Because, if it were more than half the distance between two numbers, we would choose the number in the other direction, since it is closer.)
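That epsilon can be checked directly: stepping 1.0 up by one unit in the last place of binary32 changes it by exactly 2^-23. A quick sketch using Python's struct module:

```python
import struct

# Bit pattern of 1.0 as binary32, then the very next pattern up.
one_bits = struct.unpack(">I", struct.pack(">f", 1.0))[0]
next_up = struct.unpack(">f", struct.pack(">I", one_bits + 1))[0]

print(next_up - 1.0 == 2.0 ** -23)   # True: the gap at 1.0 is 2^-23
```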

For a 53-bit significand, the least significant bit represents the value 2^-52 relative to the leading bit, and the maximum error when rounding to nearest is half that. So, for a leading bit of 1, the maximum rounding error should be 2^-53, which is about 1.11e-16. If your book says it is 2e-16, it is incorrect.
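In Python, whose float is binary64 on virtually all platforms, this can be confirmed with sys.float_info:

```python
import sys

eps = sys.float_info.epsilon   # gap between 1.0 and the next double: 2^-52
u = eps / 2                    # max round-to-nearest error at 1.0: 2^-53

print(eps == 2.0 ** -52)   # True
print(u == 2.0 ** -53)     # True
print(u)                   # ≈ 1.11e-16, not 2e-16
```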

Footnote

¹ “Significand” is the preferred term. “Mantissa” is an old term for the fraction portion of a logarithm. Significands are linear. Mantissas are logarithmic.
