This example from the book Subjective Probability: The Real Thing by Richard Jeffrey was pointed out to me by a friend of mine yesterday. Let's draw a little diagram of possible outcomes and their utility:
B\A | death | life |
dishonour | 1 | * |
honour | 2 | 3 |
The '*' indicates an outcome with probability zero and the other outcomes are considered to be equally likely. A is the variable describing your state of liveness and B is the variable representing your honour.
We have:
E(utility|A=death) = 1.5
E(utility|B=dishonour) = 1
E(utility|B=honour) = 2.5
E(utility|A=life) = 3
With this set of utilities one does indeed rate death before dishonour but not honour before life.
This gets mixed reactions. Some people think that it's a nice counterintuitive result. Some people think it's so trivial they don't see why anyone would even mention it. If you're in the latter group, sorry for wasting your time. :-)
(I leave the Haskell program using a probability monad as an exercise...)
That's funny, when people say "death before dishonor", they usually mean that the options are death+honor vs. life+dishonor, not death+dishonor vs. death+honor vs. life+honor. Of course nobody would want dishonor if it also guarantees you death! :)
ReplyDeleteTo be honest, I'm having a hard time getting intuition about the utility matrix you've set up, particularly because you've made a life of dishonor impossible. Since that is actually one of the horns of the original dilemma, let's keep it in there and see if we can get the same result with more sensible probabilities.
Dying dishonorably: never
An honorable death: 2
A life of dishonor: 1
Honor and survival: 3
(Serendipitously, they all line up in monospace!)
* 1
2 3
So... U(death) = 2, U(dishonor) = 1, U(life) = 2, U(honor) = 2.5! That doesn't get us what we want -- and with these probabilities, no tweaking of U(life+honor) will change that as long as U(death+honor) > U(life+dishonor) <=> U(death) > U(dishonor).
So maybe the counter-intuitive result only holds when the probabilities are counter-intuitive? It's got to do with the fact that you took out an off-diagonal entry while I took out a diagonal one, I bet, but I've never played with utility matrices before so I don't have a strong intuition. It might be interesting to figure out the general conditions on the utility values and probabilities under which this sort of thing happens.
The assignment of utilities seems reasonable. We have:
ReplyDeleteLife and Honour>Death and Honour>Death and Dishonour
The probability assignment simply says that dying dishonourably never happens (and it will work the same if it just happens fairly rarely). One can imagine some people with an ethic that says that any death in battle is honourable.
What I think isn't natural about this setup is that I don't think it models any kind of reasonable decision process. There's nothing in that table that depends on any kind of choice I might make. Normally when peopke say "death before dishonour" they are making some statement about how they would act in certain situations. So this death before dishonour may be a bit of a distraction. Nonetheless, it is true statement about probability theory that A>B =/=> ~B>~A.
I don't think it's a red herring. The utilities simply describe your decision process -- or rather, how desirable you see the different possible outcomes, which then dictates what course of action you should take. If I, for example, prefer dishonor over death, you can model my behavior by having U_life_dishonor > U_death_honor in my utility matrix. Decision-making is then a simple matter of estimating the probabilities of the results of different courses of action, computing and comparing e.g. E(utility|run_to_the_hills) and E(utility|never_retreat_never_surrender), and picking the highest value. In various situations, then, you and I may then choose different courses of action even we estimate the same probabilities for the outcomes.
ReplyDeleteOf course, whether subjective utility is linear in probability is another question entirely... But hey, it's a reasonable first approximation. :)
Or you can take the next step and make it into a full inferential structure: a set S which is the range of some random variable, representing the actual experimental outcome, a class of distributions \Omega which could have produced this outcome, a space of decisions D, which could be an action to take or a position to assert ("epsilon has value 3.5"), and a cost function W :: \Omega -> D -> R. Then your utility matrix just becomes a cost function, and the task is to construct a sensible statistical procedure, that is a map t :: S -> D given that cost function. See Kiefer's lovely statistics book for details.
ReplyDelete