William Briggs, Statistician, posted an interesting decision theory problem a few day ago. As if you didn’t guess from the title, it’s called Newcomb’s Paradox, and as Briggs laid it out, it goes a little something like this:
“You will play a game against an evil entity that can perfectly predict your future actions. Just for the sake of giving the evil one a name, let’s call him “Bill”. Bill is evil because he likes to taunt you with the possibility of making money, only to jerk that chance away from you at the last moment.
“Here’s what Bill does, as traditionally described. Later, we add clarifications and twists. Bill puts $1,000 in a clear box, and then also presents you with an opaque box inside of which might contain $1 million. You will be allowed to take both boxes, or just the opaque one. To clarify: you may not take the $1,000 without also taking the opaque box. Obviously, you may keep the contents of whatever box or boxes you take.
“Now for the evil part. If Bill predicts—and remember, he’s never wrong—that you will take just the opaque box, he will leave it full of money. But if he predicts that you will greedily take both boxes, then he will put nada in the opaque box[...]“
This game is totally bizarre from a game theoretic standpoint, because Bill has no incentive to play such a game, but that does not mean the tools of game theory cannot shed a little light on the issue.
Basically, there are two things you can do: Take the opaque box, or take both boxes. Because Bill can perfectly predict what you will do, if you take the opaque box you will get $1 million, but if you take both you’ll only get $1000. Now, there is a single way to rationally take both: greed, paired with a belief in free will. If you believe you can walk into the room planning to take only the opaque box, the opaque box should be full. But if the opaque box is full, why not take both boxes? After all, once you’re in the room, Bill can’t change his mind about you and magically decide to remove the million.
This perspective is the Nash equilibrium. However, it fundamentally underestimates Bill’s powers. When it is stated his predictive powers are 100%, that means there’s no way around them. The person who takes both boxes by planning to trick Bill will always get the sucker’s payoff. But what if there was a way to get both boxes out of the room, with the million in the opaque? You’d need to plan to follow the determination of a random process. Bill can read you perfectly, but he can’t, for instance, predict the outcome of a coin flip.
So, you decide to walk into the room with the boxes, and flip a coin: heads, you take both; tails, you take only the opaque. Because both are equally likely, Bill doesn’t know what to do. He could put the million in every time, but then you’d just take both every time. He could take the million out, but then you’d just take both every time. The best Bill can do is devise a strategy to compel you to flip a coin, so he pulls out his own coin. Heads, he pulls the million; tails, he leaves it.
The payoffs, (as seen in the matrix to the right) are all equally likely. That means you can assign a .25 probability to each, and calculate the expected value of using this coin-flipping method.
EU(coin) = .25(1000) + .25(0) + .25(1,001,000) + .25(1,000,000) = $505,000
So the expected value of using the coin is $505,000. Not too shabby, but a real downer when you could have an expected value of $1,000,000 just by committing to picking only the opaque box. So your optimal strategy is to just pick the opaque box.
Or is it?
The coin has a fixed probability of .5 for each outcome {heads, tails}. Perhaps there is some probability which will allow you to force your expected value over $1,000,000. That would definitely be a superior strategy. Briggs explained this a Quantum Number Maker Upper (QNMU). While the name is cute, I’ll stick with the more orthodox Random Number Generator (RNG). It’s basically a machine that you tell the probability with which you want to defect (take both boxes), and it will tell you to do so, or not. In other words, tell it .99, and it will tell you to take the boxes 99 times out of 100. Tell it .5, and it starts doing exactly the same thing the coin did: take both half the time, and opaque half of the time.
In order to check whether there is any value which will give us a better expected value than just ignoring this whole randomness business, we need to define a probability p as the probability that you’ll defect and take both. Since Bill’s strategy is to use the same p-value as you to determine whether he should include the money or not, we’ll not be coy and just call both probabilities p. The probability matrix includes terms of p (probability that you will take both, probability Bill will empty the box), and 1-p (the probability the you and he won’t, respectively).
If we couple this probability with the payoffs for each, we get a fascinating expected utility function:
EU(p) = p2(1000) + p(1-p)(0) + (1-p)(p)(1,001,000) + (1-p)2(1,000,000)
Which, if we expand, gives us:
EU(p) = 1000p2 + 1,001,000p – 1,001,000p2 + 1,000,000 – 2,000,000p + 1,000,000p2
And, canceling terms out, ultimately yields:
EU(p) = 1,000,000 – 990,000p
So, if we wanted a value of p which made the expected utility rise above 1,000,000, We just have a simple inequality to tackle:
1,000,000 < 1,000,000 – 990,000p
0 < -990,000p
0 > 990,000p
0 > p
Hold up! p is a probability! And a probability cannot be negative! Clearly there does not exist any value of p which satisfies the constraints of our argument, meaning that no value of p will help you expect to get better than the million. Thus the optimal strategy is to enter 0 in the RNG, or just let the bloody thing collect dust, while you collect your cool, even million dollars.
It turns out that most people get this. I did a little survey experiment to see how people tended to respond to each of these situations (pure choice, coin tossing, RNG). The survey wasn’t terribly scientific, but I think it served its purpose. Of 24 respondents, 5 selected both boxes straightaway, meaning that these (approximately) 20% believed they could trick Bill, or that because the boxes were already filled or not, they had nothing to lose by taking both.
The coin option attracted only one respondent, while 18 chose to go straight for the opaque box, and again, five took both. Suffice to say there was only one gambler in the crowd. Likewise, when given the choice to set the probability of taking both boxes, only one person took the chance, while 16 still chose the opaque box and five picked both. This suggests both popular risk-aversion, and trust for the game. On that topic, one respondent wrote back: “if schrodinger’s cat is in the box instead, I’m going to be really upset…and slightly impressed[sic]“. Very like Schrodinger’s cat, indeed.
Ultimately, I assert that people will tend to favor the safer behavior. The Nash equilibrium (remove the million, take both boxes) is inadequate for predicting the probable response, either because of the fallacious existence of a perfect predictor, or the absent understanding and consideration of Bill’s preferences, or perhaps because of the predictive insufficiencies of the Nash equilibrium in the real world. Whatever the reason, most people’s inclination to take the money and head for the hills is spot on. Incidentally, if you ever should find yourself faced with this game, kindly remember who taught you this optimal strategy.
(Big shout-out to all my dear friends who braved the survey. Thanks a lot, guys!)