Talk:Floating-point arithmetic/Archive 4: Difference between revisions

Revision as of 01:34, 30 September 2012 by MiszaBot I (minor edit; Robot: Archiving 1 thread from Talk:Floating point.)
Latest revision as of 20:21, 9 August 2017 by Deacon Vorbis (minor edit; Deacon Vorbis moved page Talk:Floating point/Archive 4 to Talk:Floating-point arithmetic/Archive 4: Talk archive wasn't moved with rest of page)
(5 intermediate revisions by 2 users not shown)
:::It would stop FA status. Have a look at the articles about the individual formats. They describe the format in quite enough detail. Any particular algorithm is up to the user; they are not interesting or discussed in secondary sources. [[User:Dmcq|Dmcq]] ([[User talk:Dmcq|talk]]) 10:01, 25 February 2012 (UTC)
:::The closest in Wikipedia for the sort of stuff you're talking about is if somebody wrote something for wikibooks. Have you had a look at the various external sites? Really to me what you're talking about sounds like some homework exercise and we shouldn't help with those except perhaps to give hints. [[User:Dmcq|Dmcq]] ([[User talk:Dmcq|talk]]) 10:20, 25 February 2012 (UTC)

== imho, "real numbers" is didactically misleading ==

I'd like to propose to change the beginning of the first sentence, because the limited amount of bits in the significand only allows for storing rational binary numbers. Because two is a prime factor of ten, this means only rational decimal numbers can be stored as well. Concluding, I'd like to propose to replace "real" by "rational" there. [[User:Drgst|Drgst]] ([[User talk:Drgst|talk]]) 13:17, 25 February 2012 (UTC)
:Definitely not. That is a bad idea. They are approximations to real numbers. The concept of rational number just doesn't come into it. That they are rational is just a side effect. [[User:Dmcq|Dmcq]] ([[User talk:Dmcq|talk]]) 14:32, 25 February 2012 (UTC)
::In the section 'Some other computer representations for non-integral numbers' there are some systems that can represent some irrational numbers. For instance, a logarithmic system does not necessarily represent rational numbers. [[User:Dmcq|Dmcq]] ([[User talk:Dmcq|talk]]) 14:36, 25 February 2012 (UTC)
:::Sorry for the delayed answer, Dmcq, it seems I forgot to tick the "watch page" checkbox... now for the content: IEEE FP numbers definitely are rational numbers. For example, even the simplest irrational number in the world, sqrt(2), cannot be represented. Any mathematical theorem that really depends on the existence of irrational numbers does not hold for the set of FP numbers. Nevertheless, you are right in stating that FP numbers are meant to approximate real numbers. Yet, as no non-rational number can be represented, transcendental numbers are far from being representable. Of course, this has serious consequences: for example, none of these nice trigonometric identities involving pi or pi/2 can be used naively without introducing large errors. This is just a simple example of why I think people should be warned against associating floating point numbers with real numbers. [[User:Drgst|Drgst]] ([[User talk:Drgst|talk]]) 21:14, 27 June 2012 (UTC)
::::"Irrational numbers are those real numbers that cannot be represented as terminating or repeating decimals." --[[Irrational number]] Therefore, irrational numbers ''cannot be exactly represented on any digital computer''. However, you can get arbitrarily close. It really doesn't take all that many bits to handle a Planck length (~10^-35m) and the estimated size of the universe (~10^26m) in the same calculation.
::::The key point here is that floating point really is a method of representing (not perfectly but arbitrarily close) real numbers. Yes, it just so happens that some of them are represented exactly and others are not, but that's not relevant to the fact that FP is a method of representing (imperfectly) real numbers. All of this is covered quite nicely in the "Representable numbers, conversion and rounding" section. No need to make the lead confusing and misleading. --[[User:Guy Macon|Guy Macon]] ([[User talk:Guy Macon|talk]]) 22:48, 27 June 2012 (UTC)
:::::I don't think this is correct: "floating point really is a method of representing (not perfectly but arbitrarily close) real numbers". We talk about the "representable numbers" as those real numbers which can be represented exactly within the system. Other real numbers are rounded to some representable number. So I think we should either speak in terms of "working with real numbers" (which seems a little vague) or "representing approximations to real numbers" (as we do later in the article). --[[User:JakeVortex|Jake]] ([[User talk:JakeVortex|talk]]) 08:50, 22 October 2012 (UTC)
::::::You make a good point, but while "working with real numbers" is inexact and vague, "representing approximations to real numbers" is wordy and clumsy. Perhaps we can devise a third alternative? --[[User:Guy Macon|Guy Macon]] ([[User talk:Guy Macon|talk]]) 12:57, 22 October 2012 (UTC)
:::::::What about "approximating real numbers"? But IMHO, "real numbers" is slightly incorrect, because floating point can also be used for complex arithmetic (though a complex number is here seen as a pair of two real numbers). Moreover, a floating-point arithmetic is not just about the representation, but also the behavior when doing an operation (e.g. how the result is rounded). So, I would prefer something like: "a method of doing numerical computations" [[User:Vincent Lefèvre|Vincent Lefèvre]] ([[User talk:Vincent Lefèvre|talk]]) 22:09, 22 October 2012 (UTC)

== Guard bits ==

Anybody know where the business of needing three extra bits comes from? For addition one only needs a guard/round digit plus a sticky bit, as the sticky bit will always be zero if subtraction means you have to shift up. And for multiplication one needs the double length to cope with carry properly before rounding - but one can still cut that down to two bits before applying the particular rounding. The literature talks about guard and round and sticky so I'm not disputing putting it in the text, just wondering why people got the idea in their heads in the first place. [[User:Dmcq|Dmcq]] ([[User talk:Dmcq|talk]]) 13:03, 8 March 2012 (UTC)
:Somewhat related: Take a look at "2 vs 3 guard bits" here:
:http://www.engineering.uiowa.edu/~carch/lectures07/55035-070404-prn.pdf
:Also interesting:
:http://www.google.com/patents/US4282582.pdf
:These two searches turn up some interesting pages:
:[http://www.google.com/search?q=%22floating+point%22+%2240+bits%22 <nowiki>http://www.google.com/search?q="floating+point"+"40+bits"</nowiki>]
:[http://www.google.com/search?q=%22floating+point%22+%22eight+guard+bits%22+%22DSP%22 <nowiki>http://www.google.com/search?q="floating+point"+"eight+guard+bits"+"DSP"</nowiki>]
:--[[User:Guy Macon|Guy Macon]] ([[User talk:Guy Macon|talk]]) 00:39, 9 March 2012 (UTC)
::Goldberg gives a discussion of the need for two guard digits in http://www.validlab.com/goldberg/paper.pdf (page 195). There is a very clear description with example cases in: Michael L. Overton (2001). Numerical Computing with IEEE Floating Point Arithmetic. SIAM. [[User:Brianbjparker|Brianbjparker]] ([[User talk:Brianbjparker|talk]]) 06:17, 9 March 2012 (UTC)
:::Very good reference. It should be noted that he not only covers base 10 and guard (decimal) digits but also base 2 and guard bits. --[[User:Guy Macon|Guy Macon]] ([[User talk:Guy Macon|talk]]) 07:02, 9 March 2012 (UTC)
:::I just looked at an implementation of the whole business that I did ages ago and I did actually use three bits! Just me forgetting what I'd done, sorry. Yes, the subtraction does actually require them all. [[User:Dmcq|Dmcq]] ([[User talk:Dmcq|talk]]) 11:33, 9 March 2012 (UTC)

== edit : computation in page is correct after all ==

Sorry for the confusion: I used t_(i+1) instead of t_i. For that reason I missed a factor of 2: 2^(i+1) = 2 * 2^i. <small><span class="autosigned">— Preceding [[Wikipedia:Signatures|unsigned]] comment added by [[User:KeesLem|KeesLem]] ([[User talk:KeesLem|talk]] • [[Special:Contributions/KeesLem|contribs]]) 14:36, 21 February 2013 (UTC)</span></small><!-- Template:Unsigned --> <!--Autosigned by SineBot-->

== Justification for division by zero definition ==

I [http://en.wikipedia.org/w/index.php?title=Division_by_zero&diff=511812597&oldid=510158610 recently added] to [[division by zero]] this statement with an appropriate source:
:"The justification for this definition is to preserve the sign of the result in case of [[arithmetic underflow]]. For example, in the double-precision computation 1/(''x''/2), where ''x'' = ±2<sup>−149</sup>, the computation ''x''/2 underflows and produces ±0 with sign matching ''x'', and the result will be ±∞ with sign matching ''x''. The sign will match that of the exact result ±2<sup>150</sup>, but the magnitude of the exact result is too large to represent, so infinity is used to indicate overflow."
Provided this is valid, I wonder if it could also be added in some relevant location in the body of floating point related articles. In general I'd like to see more information on design rationales. Thanks! [[User:Dcoetzee|Dcoetzee]] 07:42, 11 September 2012 (UTC)

== Signed zero section, branch cuts ==

The section on signed zero (under Internal representation >> Special values >> Signed zero) says the following: "The difference between +0 and −0 is mostly noticeable for complex operations at so-called [[Branch cut|branch cuts]]." In a strictly mathematical sense, +0/-0 ''can'' be interpreted as describing the limiting behaviors of a function, but that's not actually what's happening here. Moreover, branch cuts are not the only situation where these exceptional limiting behaviors appear, one can have branch cuts without exceptional limiting behaviors of this sort, and none of the examples given in the section are actually branch cuts. As far as I can tell, there is absolutely no significance to the relationship between branch cuts in complex analysis and signed zero in floating point numerical representations, but I wanted to make sure there wasn't a good reason for this being here. Thoughts? [[Special:Contributions/71.227.119.236|71.227.119.236]] ([[User talk:71.227.119.236|talk]]) 15:25, 29 September 2012 (UTC)
:Result of a quick Google search:
:"A system with signed zero can distinguish between asin(5+0i) and asin(5-0i) and pick the appropriate branch cut continuous with quadrant I or quadrant IV, respectively. A system without signed zero cannot distinguish and, according to the choses the branch cut such that it is continuous with quadrant IV (consistent with the rule of CCC). So, for asin(5+0i) it will return the same value as a system with signed zero would for asin(5-0i)." -Richard B. Kreckel ( [ http://www.ginac.de/~kreckel/ ] [ http://lists.gnu.org/archive/html/bug-gsl/2011-12/msg00004.html ] ).
:I think that when he wrote "according to the" he meant "accordingly" (probably not a native English speaker). --[[User:Guy Macon|Guy Macon]] ([[User talk:Guy Macon|talk]]) 23:34, 29 September 2012 (UTC)
::Somewhat straying from the subject but still quite interesting: the "Signed Zero" section of "What Every Computer Scientist Should Know About Floating-Point Arithmetic" ( [ http://docs.oracle.com/cd/E19957-01/806-3568/ncg_goldberg.html ] ) --[[User:Guy Macon|Guy Macon]] ([[User talk:Guy Macon|talk]]) 23:41, 29 September 2012 (UTC)

== imho, the computation for Pi as shown actually computes only Pi/2 ==

The algorithm as shown to compute an approximation of Pi actually computes, imo, only Pi/2 in this form, even though the output shown contains an approximation of Pi. I think either the values should be halved or the formula should be changed into: 12 * 2^i * t_i [[User:KeesLem|KeesLem]] ([[User talk:KeesLem|talk]]) 15:16, 21 February 2013 (UTC) <span style="font-size: smaller;" class="autosigned">— Preceding [[Wikipedia:Signatures|unsigned]] comment added by [[Special:Contributions/130.161.210.156|130.161.210.156]] ([[User talk:130.161.210.156|talk]]) </span><!-- Template:Unsigned IP --> <!--Autosigned by SineBot-->
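
The factor of 2 in question can be checked numerically. What follows is only a minimal sketch, not the article's code. It assumes the recurrence under discussion is the half-angle iteration t_0 = 1/sqrt(3), t_(i+1) = t_i / (sqrt(t_i^2 + 1) + 1), so that t_i = tan(pi / (6 * 2^i)). The function name <code>pi_estimates</code> is made up for illustration. Under that assumption, 6 * 2^i * t_i tends to Pi, whereas pairing the same 2^i with t_(i+1) tends to Pi/2.

```python
import math


def pi_estimates(steps):
    """Yield (i, 6*2^i*t_i, 6*2^i*t_(i+1)) for the assumed half-angle recurrence.

    Hypothetical helper for illustration; the recurrence is an assumption
    about what the article uses, not taken from this discussion.
    """
    t = 1.0 / math.sqrt(3.0)  # t_0 = tan(pi/6)
    for i in range(steps):
        # Numerically stable rewriting of (sqrt(t^2 + 1) - 1) / t,
        # which avoids cancellation when t is small.
        t_next = t / (math.sqrt(t * t + 1.0) + 1.0)
        scale = 6.0 * 2.0 ** i
        yield i, scale * t, scale * t_next
        t = t_next


if __name__ == "__main__":
    for i, with_t_i, with_t_next in pi_estimates(25):
        # Second column approaches Pi, third column approaches Pi/2.
        print(f"{i:2d}  {with_t_i:.15f}  {with_t_next:.15f}")
```

Using t_(i+1) with the multiplier 6 * 2^i is off by exactly the missing factor 2^(i+1) / 2^i = 2, so 12 * 2^i * t_(i+1) and 6 * 2^i * t_i describe the same limit.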