Content deleted Content added
→See also: Extended precision: more general than 80-bit. Removed "long double" since there are no cross references between the articles. |
→IEEE 754 octuple-precision binary floating-point format: binary256: various corrections (there was confusion with quadruple precision!) - Please check! |
||
Line 9:
== IEEE 754 octuple-precision binary floating-point format: binary256 ==
The [[IEEE 754]] standard specifies a '''binary256''' format among the ''interchange formats'' (it is not a basic format), as having:
* [[Sign bit]]: 1 bit
* [[Exponent]] width:
* [[Significand]] [[precision (arithmetic)|precision]]:
<!-- "significand", with a d at the end, is a technical term, please do not confuse with "significant" -->
The format is written with an implicit lead bit with value 1 unless the exponent is all zeros. Thus only
<!-- (Commented out since the image is incorrect; it could be re-added once corrected.)
The bits are laid out as follows:
[[File:Octuple
▲The format is written with an implicit lead bit with value 1 unless the exponent is all zeros. Thus only 112 bits of the [[significand]] appear in the memory format, but the total precision is 113 bits (approximately 34 decimal digits: {{nowrap|log<sub>10</sub>(2<sup>113</sup>) ≈ 34.016}}). The bits are laid out as follows:
-->
▲[[File:Octuple persision visual demontration.png|1000px|Octuple precision visual demonstration]]
=== Exponent encoding ===
Line 25 ⟶ 26:
The octuple-precision binary floating-point exponent is encoded using an [[offset binary]] representation, with the zero offset being 262143; also known as exponent bias in the IEEE 754 standard.
* E<sub>min</sub> =
* E<sub>max</sub> = 262143
* [[Exponent bias]] =
Thus, as defined by the offset binary representation, in order to get the true exponent the offset of 16383 has to be subtracted from the stored exponent.
The stored exponents
{|class="wikitable" style="text-align:center"
! Exponent !! Significand zero !! Significand non-zero !! Equation
|-
|
|-
|
|-
|
|}
The minimum strictly positive (subnormal) value is {{nowrap|2<sup>
The minimum positive normal value is 2<sup>
The maximum representable value is 2<sup>
=== Octuple-precision examples ===
Line 55 ⟶ 56:
8000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 = −0
7fff
ffff
By default, 1/3 rounds down like [[double precision]], because of the odd number of bits in the significand.
|