Two envelopes problem: Difference between revisions

Browse history interactively

← Previous edit

Content deleted Content added

VisualWikitext

Revision as of 06:28, 3 July 2024 edit Headbomb (talk \| contribs) Edit filter managers, Autopatrolled, Extended confirmed users, Page movers, File movers, New page reviewers, Pending changes reviewers, Rollbackers, Template editors 473,514 edits publisher link better ← Previous edit		Latest revision as of 04:28, 14 August 2025 edit undo Chris the speller (talk \| contribs) Autopatrolled, Extended confirmed users, Pending changes reviewers 894,247 edits m →Other simple resolutions: replaced: widely-discussed → widely discussed Tag: AWB
(17 intermediate revisions by 9 users not shown)
Line 6: {{Cquote\|Imagine you are given two identical [[envelope]]s, each containing money. One contains twice as much as the other. You may pick one envelope and keep the money it contains. Having chosen an envelope at will, but before inspecting it, you are given the chance to switch envelopes. Should you switch? }} Since the situation is symmetric, it seems obvious that there is no point in switching envelopes. On the other hand, a simple calculation using expected values suggests the opposite conclusion, that it is always beneficial to swap envelopes, since the person stands to gain twice as much money if they switch, while the only risk is halving what they currently have.<ref name=":5" /> ==Introduction== Line 24: ===The puzzle=== The puzzle is to find the flaw in the line of reasoning in the switching argument. This includes determining exactly ''why'' and under ''what conditions'' that step is not correct, to be sure not to make this mistake in a situation where the misstep may not be so obvious. In short, the problem is to solve the paradox. The puzzle is ''not'' solved by finding another way to calculate the probabilities that does not lead to a contradiction. == History of the paradox ==▼ [[File:Greater Manchester Metrolink neckties.jpg\|150px\|thumb\|Two [[necktie]]s.]] The envelope paradox dates back at least to 1943, when Belgian mathematician [[Maurice Kraitchik]] proposed a puzzle in his book ''Recreational Mathematics'' concerning two men who meet and compare their fine neckties.<ref name=kraitchik>{{cite book \|first=Maurice \|last=Kraitchik \|authorlink=Maurice Kraitchik \|title="Mathematical Recreations" \|publisher=George Allen & Unwin \|___location=London \|year=1943\|url=https://archive.org/details/mathematicalrecr0000maur}}</ref><ref name=brown>{{Cite journal \|last=Brown \|first=Aaron C. \|year=1995 \|title=Neckties, Wallets, and Money for Nothing \|journal=[[Journal of Recreational Mathematics]] \|volume=27 \|issue=2 \|pages=116–122 }}</ref> Each of them knows what his own necktie is worth and agrees for the winner to give his necktie to the loser as consolation. Kraitchik also discusses a variant in which the two men compare the contents of their purses. He assumes that each purse is equally likely to contain 1 up to some large number ''x'' of pennies, the total number of pennies minted to date.<ref name=kraitchik/> The puzzle is also mentioned in a 1953 book on elementary mathematics and mathematical puzzles by the mathematician [[John Edensor Littlewood]], who credited it to the physicist [[Erwin Schrödinger]], where it concerns a pack of cards, each card has two numbers written on it, the player gets to see a random side of a random card, and the question is whether one should turn over the card. Littlewood's pack of cards is infinitely large and his paradox is a paradox of improper prior distributions. [[Martin Gardner]] popularized Kraitchik's puzzle in his 1982 book ''Aha! Gotcha'', in the form of a wallet game:▼ {{blockquote\|Two people, equally rich, meet to compare the contents of their wallets. Each is ignorant of the contents of the two wallets. The game is as follows: whoever has the least money receives the contents of the wallet of the other (in the case where the amounts are equal, nothing happens). One of the two men can reason: "I have the amount ''A'' in my wallet. That's the maximum that I could lose. If I win (probability 0.5), the amount that I'll have in my possession at the end of the game will be more than 2''A''. Therefore the game is favourable to me." The other man can reason in exactly the same way. In fact, by symmetry, the game is fair. Where is the mistake in the reasoning of each man?▼ \| author = [[Martin Gardner]]▼ \| source = ''Aha! Gotcha''▼ }}▼ Gardner confessed that though, like Kraitchik, he could give a sound analysis leading to the right answer (there is no point in switching), he could not clearly put his finger on what was wrong with the reasoning for switching, and Kraitchik did not give any help in this direction, either.▼ In 1988 and 1989, [[Barry Nalebuff]] presented two different two-envelope problems, each with one envelope containing twice what is in the other, and each with computation of the expectation value 5''A''/4. The first paper just presents the two problems. The second discusses many solutions to both of them. The second of his two problems is nowadays the more common, and is presented in this article. According to this version, the two envelopes are filled first, then one is chosen at random and called Envelope A. [[Martin Gardner]] independently mentioned this same version in his 1989 book ''Penrose Tiles to Trapdoor Ciphers and the Return of Dr Matrix''. Barry Nalebuff's asymmetric variant, often known as the Ali Baba problem, has one envelope filled first, called Envelope A, and given to Ali. Then a fair coin is tossed to decide whether Envelope B should contain half or twice that amount, and only then given to Baba.▼ Broome in 1995 called ~~the~~a probability distribution 'paradoxical' if for any given first-envelope amount ''x'', the expectation of the other envelope conditional on ''x'' is greater than ''x''. The literature contains dozens of commentaries on the problem, much of which observes that a distribution of finite values can have an infinite expected value.<ref>{{cite journal \|last1=Syverson \|first1=Paul \|title=Opening Two Envelopes \|journal=Acta Analytica \|date=1 April 2010 \|volume=25 \|issue=4 \|pages=479–498 \|doi=10.1007/s12136-010-0096-7\|s2cid=12344371 }}</ref>▼ ==Multiplicity of proposed solutions== Line 31 ⟶ 50: == Example resolution == Suppose that the total amount in both envelopes is a constant <math>c = 3x</math>, with <math>x</math> in one envelope and <math>2x</math> in the other. If you select the envelope with <math>x</math> first you gain the amount <math>x</math> by swapping. If you select the envelope with <math>2x</math> first you lose the amount <math>x</math> by swapping. So you gain on average <math>G = {1 \over 2} (x) + {1 \over 2} (-x) = {1 \over 2}(x - x) = 0</math> by swapping. Line 42 ⟶ 60: == Other simple resolutions == A widely- discussed way to resolve the paradox, both in popular literature and part of the academic literature, especially in philosophy, is to assume that the 'A' in step 7 is intended to be the [[expected value]] in envelope A and that we intended to write down a formula for the expected value in envelope B. Step 7 states that the expected value in B = 1/2(2A + A/2). It is pointed out that the 'A' in the first part of the formula is the expected value, given that envelope A contains less than envelope B, but the 'A', in the second part of the formula is the expected value in A, given that envelope A contains more than envelope B. The flaw in the argument is that the same symbol is used with two different meanings in both parts of the same calculation but is assumed to have the same value in both cases. This line of argument is introduced by McGrew, Shier and Silverstein (1997).<ref>{{cite journal \|last1=McGrew \|first1=Timothy \|last2=Shier \|first2=David \|last3=Silverstein \|first3=Harry \|title=The Two-Envelope Problem Resolved \|journal=Analysis \|date=1997 \|volume=57 \|issue=1 \|pages=28–33 \|doi=10.1093/analys/57.1.28 \|url=https://academic.oup.com/analysis/article-abstract/57/1/28/139339\|url-access=subscription }}</ref> A correct calculation would be: Line 58 ⟶ 76: which is equal to the expected sum in A. In non-technical language, what goes wrong ~~(see [[Necktie paradox]])~~ is that, in the scenario provided, the mathematics use relative values of A and B (that is, it assumes that one would gain more money if A is less than B than one would lose if the opposite were true). However, the two values of money are fixed (one envelope contains, say, $20 and the other $40). If the values of the envelopes are restated as ''x'' and 2''x'', it's much easier to see that, if A were greater, one would lose ''x'' by switching and, if B were greater, one would gain ''x'' by switching. One does not gain a greater amount of money by switching because the total ''T'' of A and B (3''x'') remains the same, and the difference ''x'' is fixed to ''T/3''. Line 7 should have been worked out more carefully as follows: Line 94 ⟶ 112: === Nalebuff asymmetric variant === The mechanism by which the amounts of the two envelopes are determined is crucial for the decision of the player to switch ~~their~~her envelope.<ref name="Tsikogiannopoulos"/><ref>{{citation \|last1=Priest \|first1=Graham \|last2=Restall \|first2= Greg \|year=2007 \|title=Envelopes and Indifference \|url= http://consequently.org/papers/envelopes.pdf \|journal= Dialogues, Logics and Other Strange Things \|publisher=College Publications \|pages=135–140}}</ref> Suppose that the amounts in the two envelopes A and B were not determined by first fixing the contents of two envelopes E1 and E2, and then naming them A and B at random (for instance, by the toss of a fair coin<ref name=":0">{{Cite journal\|last1=Nickerson\|first1=Raymond S.\|last2=Falk\|first2=Ruma\|date=2006-05-01\|title=The exchange paradox: Probabilistic and cognitive analysis of a psychological conundrum\|url=https://doi.org/10.1080/13576500500200049\|journal=Thinking & Reasoning\|volume=12\|issue=2\|pages=181–213\|doi=10.1080/13576500500200049\|s2cid=143472998\|issn=1354-6783\|url-access=subscription}}</ref>). Instead, we start right at the beginning by putting some amount in envelope A and then fill B in a way which depends both on chance (the toss of a coin) and on what we put in A. Suppose that first of all the amount ''a'' in envelope A is fixed in some way or other, and then the amount in Envelope B is fixed, dependent on what is already in A, according to the outcome of a fair coin. If the coin fell Heads then 2''a'' is put in Envelope B, if the coin fell Tails then ''a''/2 is put in Envelope B. If the player was aware of this mechanism, and knows that ~~they~~she ~~hold~~holds Envelope A, but do not know the outcome of the coin toss, and do not know ''a'', then the switching argument is correct and ~~they~~she ~~are~~is recommended to switch envelopes. This version of the problem was introduced by Nalebuff (1988) and is often called the Ali-Baba problem. Notice that there is no need to look in envelope A in order to decide whether or not to switch. Many more variants of the problem have been introduced. Nickerson and [[Ruma Falk\|Falk]] systematically survey a total of 8.<ref name=":0" /> Line 102 ⟶ 120: This interpretation of the two envelopes problem appears in the first publications in which the paradox was introduced in its present-day form, Gardner (1989) and Nalebuff (1988).<ref>{{Cite journal\|last1=Nalebuff\|first1=Barry\|date=Spring 1988\|title=Puzzles: Cider in Your Ear, Continuing Dilemma, The Last Shall Be First, and More\|journal = Journal of Economic Perspectives\|volume = 2\|issue=2\|pages=149–156\|doi = 10.1257/jep.2.2.149 \|doi-access=free}} and Gardner, Martin (1989) '' Penrose Tiles to Trapdoor Ciphers: And the Return of Dr Matrix. ''</ref>) It is common in the more mathematical literature on the problem. It also applies to the modification of the problem (which seems to have started with Nalebuff) in which the owner of envelope A does actually look in his envelope before deciding whether or not to switch; though Nalebuff does also emphasize that there is no need to have the owner of envelope A look in his envelope. If he imagines looking in it, and if for any amount which he can imagine being in there, he has an argument to switch, then he will decide to switch anyway. Finally, this interpretation was also the core of earlier versions of the two envelopes problem (Littlewood's, Schrödinger's, and Kraitchik's switching paradoxes); see [[Two envelopes problem#History of the paradox\|the ~~concluding~~history section~~, on history of TEP~~]]. This kind of interpretation is often called "Bayesian" because it assumes the writer is also incorporating a prior probability distribution of possible amounts of money in the two envelopes in the switching argument. Line 114 ⟶ 132: Suppose for the sake of argument, we start by imagining an amount of 32 in Envelope A. In order that the reasoning in steps 6 and 7 is correct ''whatever'' amount happened to be in Envelope A, we apparently believe in advance that all the following ten amounts are all equally likely to be the smaller of the two amounts in the two envelopes: 1, 2, 4, 8, 16, 32, 64, 128, 256, 512 (equally likely powers of 2<ref name=":1" />). But going to even larger or even smaller amounts, the "equally likely" assumption starts to appear a bit unreasonable. Suppose we stop, just with these ten equally likely possibilities for the smaller amount in the two envelopes. In that case, the reasoning in steps 6 and 7 was entirely correct if envelope A happened to contain any of the amounts 2, 4, ... 512: switching envelopes would give an expected (average) gain of 25%. If envelope A happened to contain the amount 1, then the expected gain is actually 100%. But if it happened to contain the amount 1024, a massive loss of 50% (of a rather large amount) would have been incurred. That only happens once in twenty times, but it is exactly enough to balance the expected gains in the other 19 out of 20 times. Alternatively, we do go on ad infinitum but now we are working with a quite ludicrous assumption, implying for instance, that it is infinitely more likely for the amount in envelope A to be smaller than 1, ''and'' infinitely more likely to be larger than 1024, than between those two values. This is a so-called [[~~Prior~~prior probability#Improper priors\|improper prior distribution]]: probability calculus breaks down; expectation values are not even defined.<ref name=":1" /> Many authors have also pointed out that if a maximum sum that can be put in the envelope with the smaller amount exists, then it is very easy to see that Step 6 breaks down, since if the player holds more than the maximum sum that can be put into the "smaller" envelope they must hold the envelope containing the larger sum, and are thus certain to lose by switching. This may not occur often, but when it does, the heavy loss the player incurs means that, on average, there is no advantage in switching. Some writers consider that this resolves all practical cases of the problem.<ref name=":2">{{Citation \| first = Barry \| last = Nalebuff \|title = Puzzles: The Other Person's Envelope is Always Greener\| journal = Journal of Economic Perspectives \| volume = 3 \| issue = 1 \| pages = 171–181 \| doi=10.1257/jep.3.1.171\| year = 1989 \| doi-access = free }}.</ref> Line 167 ⟶ 185: Under dominance reasoning, the fact that we strictly prefer ''A'' to ''B'' for all possible observed values ''a'' should imply that we strictly prefer ''A'' to ''B'' without observing ''a''; however, as already shown, that is not true because <math>E(B)=E(A)=\infty</math>. To salvage dominance reasoning while allowing <math>E(B)=E(A)=\infty</math>, one would have to replace expected value as the decision criterion, thereby employing a more sophisticated argument from mathematical economics. For example, we could assume the decision maker is an [[expected utility]] maximizer with initial wealth ''W'' whose utility function, <math>u(w)</math>, is chosen to satisfy <math>E(u(W+B)\|A=a)<u(W+a)</math> for at least some values of ''a'' (that is, holding onto <math>A=a</math> is strictly preferred to switching to ''B'' for some ''a''). Although this is not true for all utility functions, it would be true if <math>u(w)</math> had an upper bound, <math>\beta<\infty</math>, as ''w'' increased toward infinity (a common assumption in mathematical economics and decision theory).<ref>{{cite book\|last1=DeGroot\|first1=Morris H.\|title=Optimal Statistical Decisions\|date=1970\|publisher=McGraw-Hill\|pages=109}}</ref> [[Michael R. Powers]] provides necessary and sufficient conditions for the utility function to resolve the paradox, and notes that neither <math>u(w)<\beta</math> nor <math>E(u(W+A))=E(u(W+B))<\infty</math> is required.<ref>{{cite journal\|last1=Powers\|first1=Michael R.\|title=Paradox-Proof Utility Functions for Heavy-Tailed Payoffs: Two Instructive Two-Envelope Problems\|journal=Risks\|date=2015\|volume=3\|issue=1\|pages=26–34\|doi=10.3390/risks3010026\|doi-access=free\|hdl=10419/167837\|hdl-access=free}}</ref> Some writers would prefer to argue that in a real-life situation, <math>u(W+A)</math> and <math>u(W+B)</math> are bounded simply because the amount of money in an envelope is bounded by the total amount of money in the world (''M''), implying <math>u(W+A) \leq u(W+M)</math> and <math>u(W+B) \leq u(W+M)</math>. From this perspective, the second paradox is resolved because the postulated probability distribution for ''X'' (with <math>E(X)=\infty</math>) cannot arise in a real-life situation. Similar arguments are often used to resolve the [[St. Petersburg paradox]]. Line 189 ⟶ 207: Byeong-Uk Yi, on the other hand, argues that comparing the amount you would gain if you would gain by switching with the amount you would lose if you would lose by switching is a meaningless exercise from the outset.<ref>{{cite journal \|author=Byeong-Uk Yi \|year=2009 \|title=The Two-envelope Paradox With No Probability \|url=http://philosophy.utoronto.ca/people/linked-documents-people/c%20two%20envelope%20with%20no%20probability.pdf \|url-status=dead \|archive-url=https://web.archive.org/web/20110929034017/http://philosophy.utoronto.ca/people/linked-documents-people/c%20two%20envelope%20with%20no%20probability.pdf \|archive-date=2011-09-29 }}</ref> According to his analysis, all three implications (switch, indifferent, do not switch) are incorrect. He analyses Smullyan's arguments in detail, showing that intermediate steps are being taken, and pinpointing exactly where an incorrect inference is made according to his formalization of counterfactual inference. An important difference with Chase's analysis is that he does not take account of the part of the story where we are told that the envelope called envelope A is decided completely at random. Thus, Chase puts probability back into the problem description in order to conclude that arguments 1 and 3 are incorrect, argument 2 is correct, while Yi keeps "two envelope problem without probability" completely free of probability and comes to the conclusion that there are no reasons to prefer any action. This corresponds to the view of Albers et al., that without a probability ingredient, there is no way to argue that one action is better than another, anyway. Bliss argues that the source of the paradox is that when one mistakenly believes in the possibility of a larger payoff that does not, in actuality, exist, one is mistaken by a larger margin than when one believes in the possibility of a smaller payoff that does not actually exist.<ref>{{cite ~~journal~~arXiv \|author=Bliss \|year=2012 \|title=A Concise Resolution to the Two Envelope Paradox \|~~arxiv~~class=~~1202~~stat.~~4669~~OT \|~~bibcode~~eprint=~~2012arXiv1202~~1202.~~4669B~~ 4669}}</ref> If, for example, the envelopes contained $5.00 and $10.00 respectively, a player who opened the $10.00 envelope would expect the possibility of a $20.00 payout that simply does not exist. Were that player to open the $5.00 envelope instead, he would believe in the possibility of a $2.50 payout, which constitutes a smaller deviation from the true value; this results in the paradoxical discrepancy. Albers, Kooi, and Schaafsma consider that without adding probability (or other) ingredients to the problem,<ref name=":4" /> Smullyan's arguments do not give any reason to swap or not to swap, in any case. Thus, there is no paradox. This dismissive attitude is common among writers from probability and economics: Smullyan's paradox arises precisely because he takes no account whatever of probability or utility. Line 195 ⟶ 213: ==Conditional switching== As an extension to the problem, consider the case where the player is allowed to look in envelope A before deciding whether to switch. In this "conditional switching" problem, it is often possible to generate a gain over the "never switching" strategy", depending on the probability distribution of the envelopes.<ref name="rspa">{{cite journal \|last1=McDonnell \|first1=M. D. \|last2=Abott \|first2=D. \|title=Randomized switching in the two-envelope problem \|journal=[[Proceedings of the Royal Society A]] \|volume=465 \|issue=2111 \|pages=3309–3322 \|year=2009 \|doi=10.1098/rspa.2009.0312 \|bibcode=2009RSPSA.465.3309M }}</ref> ▲== History of the paradox == The envelope paradox dates back at least to 1953, when Belgian mathematician [[Maurice Kraitchik]] proposed a puzzle in his book ''Recreational Mathematics'' concerning two equally rich men who meet and compare their beautiful neckties, presents from their wives, wondering which tie actually cost more money. He also introduces a variant in which the two men compare the contents of their purses. He assumes that each purse is equally likely to contain 1 up to some large number ''x'' of pennies, the total number of pennies minted to date. The men do not look in their purses but each reason that they should switch. He does not explain what is the error in their reasoning. It is not clear whether the puzzle already appeared in an earlier 1942 edition of his book. It is also mentioned in a 1953 book on elementary mathematics and mathematical puzzles by the mathematician [[John Edensor Littlewood]], who credited it to the physicist [[Erwin Schrödinger]], where it concerns a pack of cards, each card has two numbers written on it, the player gets to see a random side of a random card, and the question is whether one should turn over the card. Littlewood's pack of cards is infinitely large and his paradox is a paradox of improper prior distributions. ▲[[Martin Gardner]] popularized Kraitchik's puzzle in his 1982 book ''Aha! Gotcha'', in the form of a wallet game: ▲{{blockquote\|Two people, equally rich, meet to compare the contents of their wallets. Each is ignorant of the contents of the two wallets. The game is as follows: whoever has the least money receives the contents of the wallet of the other (in the case where the amounts are equal, nothing happens). One of the two men can reason: "I have the amount ''A'' in my wallet. That's the maximum that I could lose. If I win (probability 0.5), the amount that I'll have in my possession at the end of the game will be more than 2''A''. Therefore the game is favourable to me." The other man can reason in exactly the same way. In fact, by symmetry, the game is fair. Where is the mistake in the reasoning of each man? ▲\| author = [[Martin Gardner]] ▲\| source = ''Aha! Gotcha'' ▲}} ▲Gardner confessed that though, like Kraitchik, he could give a sound analysis leading to the right answer (there is no point in switching), he could not clearly put his finger on what was wrong with the reasoning for switching, and Kraitchik did not give any help in this direction, either. ▲In 1988 and 1989, [[Barry Nalebuff]] presented two different two-envelope problems, each with one envelope containing twice what is in the other, and each with computation of the expectation value 5''A''/4. The first paper just presents the two problems. The second discusses many solutions to both of them. The second of his two problems is nowadays the more common, and is presented in this article. According to this version, the two envelopes are filled first, then one is chosen at random and called Envelope A. [[Martin Gardner]] independently mentioned this same version in his 1989 book ''Penrose Tiles to Trapdoor Ciphers and the Return of Dr Matrix''. Barry Nalebuff's asymmetric variant, often known as the Ali Baba problem, has one envelope filled first, called Envelope A, and given to Ali. Then a fair coin is tossed to decide whether Envelope B should contain half or twice that amount, and only then given to Baba. ▲Broome in 1995 called the probability distribution 'paradoxical' if for any given first-envelope amount ''x'', the expectation of the other envelope conditional on ''x'' is greater than ''x''. The literature contains dozens of commentaries on the problem, much of which observes that a distribution of finite values can have an infinite expected value.<ref>{{cite journal \|last1=Syverson \|first1=Paul \|title=Opening Two Envelopes \|journal=Acta Analytica \|date=1 April 2010 \|volume=25 \|issue=4 \|pages=479–498 \|doi=10.1007/s12136-010-0096-7\|s2cid=12344371 }}</ref> == See also == {{div col\|colwidth=30em}}