|
3. Thoughts on the balanced range model, +Now & Next theorem
In my first post in this column, I defined a balanced range as one where on average you didn't mind how your opponent played, since they could in no way exploit you, and left it at that. Today I'm going to elaborate on this and discuss in more detail the relevance of the balance range model to real poker. I'm also going to try to clarify some misinterpretations of the concept which lead to faulty or incomplete reasoning in decision making.
Another way to look at the balanced range model is as a theoretical construct in which you attempt to maximise your expected value vs an opponent who is perfectly and instantaneously adjusted to the optimal play vs your true range. Such an opponent forces you to play in a "balanced" way, because he will immediately be exploiting any imbalance. But to what extent is such an opponent realistic?
The key here is "instantaneously" adjusted. What I mean by this is that he knows when you decide to raise a hand that it's in your raising range. He knows if you fold that it's not (or sometimes isn't). If you normally raise only AA utg but this time decide to also raise 72o if you get dealt it then he's calculating on a range of AA +72o. This is obviously impossible. Your opponent can only judge your range against previous actions in previous hands, he can't know if you've suddenly changed your range (unless you give away some kind of tell).
Thus the balanced range model, apart from assuming your opponent to know everything there is to know about your game, takes no account for the dynamic nature of poker (more specifically of dynamic ranges).
What other flaws does the balanced range model have? A pretty major one is that it focuses on minimising the amount your opponent is able to exploit you, rather than exploiting your opponent's mistakes. This can be fine if you believe that your opponent is going to be one step ahead of you, but if this is the case you should perhaps find a new opponent. Of course that doesn't mean the balanced range model isn't useful to consider in general, but it is important to remember that the most balanced play is unlikely to be the optimal one, and that maximising EV, not
being balanced, is the ultimate goal in poker.
As a simple example of this I'm going to return to the simple scenario outlined in my first post, of the river decision with one player having a polarised range (50%nuts 50%air), and one player a condensed one(100%bluff catchers). (You should probably read this now if you haven't already.) As we established, the polarised player's most balanced range is to bluff half as often as his value range. It can similarly be determined that a balanced calling range is to call half of the time in this situation. (You can check this, but I figured doing the maths again would be boring.)
Now imagine you're playing this exact situation a million times against the same opponent, half of the time with the polarised range, half with the condensed one. If you do the balanced thing every single time (ie making decisions by appropriately weighted chance) then you will break-even, irrespective of your opponent's actions. We know this because by definition, we don't care what he does. If he is being exactly balanced then we're doing just the same thing as him, therefore we must break-even, and if he's doing something different then it makes no difference to us, therefore we must still break-even. Obviously the smart thing to do is to perhaps start out balanced if we don't know anything about the player, then unbalance ourselves appropriately to take advantage of his patterns of play.
Of course this trivial example sounds obvious, but the point is that the point of balance is something to be aware of, but not to strive towards. Knowing what the balanced percentages are allows you to know which side of this your opponent is on, which side you are on, and in what way you are each vulnerable to exploitation. Being on the point of balance wins you nothing.
A lot of the value of the balanced range model is in optimising things like betsizing. I've spent some time on this in the two previous posts without really explaining what I was doing. What I was really doing there was finding the optimal bet size for the worst case scenario of playing against a perfectly balanced opponent. This bet size will not necessarily be best for playing against a real, exploitable opponent, but it's going to be a pretty good starting point against anyone reasonably competent and capable of thinking beyond first level. ie. Someone who is going to notice and start thinking why if you bet different amounts depending on your hand strength.
I saw an amazing thread on 2+2 related to this idea (linked from an article in the high stakes forum on here) discussing the relative merits of different betsizes on a river bluff vs a very weak opponent. The crux of the debate was one poster proposing that a small bet was the best play, since it folded out all the same hands that a shove would, with another insisting that the bluff had to be a shove, since this was the optimised balanced play and since this is what the player would have done were he value betting.
Now although this second poster was in some ways thinking on a much higher level, his thinking is not joined up. He has failed to relate the more advanced concept of balance back to what it's all about: Expected value. Of course in this kind of situation, against a decent opponent you would shove all-in. Ask a good player why and they'll usually say something like "balance" or "I'd shove all-in if I was value betting". But that isn't actually a reason. There is only one possible reason to make a larger bluff, and that is because you think they'd call a smaller bet more often. But then you have to ask why they'll call more often. The reason for this is twofold, one reason is pot odds, the other is that your line doesn't make sense for a value betting hand. Why doesn't it make sense? "balance" or "Because you'd shove all-in if you were value betting". So what our good player says is true, but it misses out a couple of simple but important stages in reasoning.
What our 2+2er has forgotten then is to think through these stages. He hasn't thought about whether the opponent will work out that your line makes no sense. He's so used to playing against people who will spot an obviously unbalanced play a mile off that he's forgotten that this stage of reasoning even exists and has somehow come to think of playing in a balanced range optimising way as the ultimate goal, rather than just a way of increasing EV vs
well balanced opponents.
I feel like what I've written in the last couple of paragraphs is not totally clear, so I'm going to go over it again in a slightly different way, since i think it's important:
Poster 2 is advocating the balanced range optimising play. Using the kind of techniques I used in my first two posts to determine optimal play vs a perfectly-adjusted opponent you would find that you should bluff here a certain % of the time and the optimal betsizing would be all-in (Consistently betting smaller could also be profitable, but less so as discussed in 1st entry). Against this theoretical opponent if you bluff half pot and shove for value then you will have exactly defined your hand and you opponent will be able to call all your bluffs and fold to all your value-bets thus totally destroying you. A real, but thinking, opponent will also be able to work out what you're thinking a lot of the time and will also win a lot of money against you. However the opponent you're playing against here is far from perfectly adjusted and may not be thinking at all about whether your bet makes any sense. If he is equally likely to call bets of different sizes then of course it is better to bet more for value and less as a bluff.
I hope at least one of those explanations makes sense to you. Of course the point I'm trying to make here is not what is the best bet-size. It's quite possible that the opponent in question is smart enough to work out that the smaller bet makes no sense. It's also possible that he'll call a lot more just because it's cheaper. The point I'm trying to make is that considerations of balanced ranges and in this case balanced range optimisation can make even very good players lose sight of what they're really trying to achieve.
Judging by the amount of arse-kissing that went on further down the thread this guy is probably successful at very high stakes and yet appears to have a fundamental misunderstanding about basic poker theory.
This kind of misunderstanding is really common and it arises due to to vaguaries in the way poker players express themselves (see above comments by "our good player") and also due to the complications driven by metagame considerations.
What I mean by this is the fact that sometimes our decision in a particular hand has an effect on not only our expected value in this hand, but also the way opponents will react to us and hence expected value in future hands.
In general I think the importance of this is over-estimated, but of course it cannot be completely neglected. Resolving the kind of confusion and misconceptions that are caused by all this lead me to conceive:
Now and Next Theorem
There are two times you can make money:
1. Now, in this hand, with the actual hand you are holding.
2. Next, in all future hands, with all possible holdings.
And making more money is better than making less money
Ok so it's a bit of a joke calling this a theorem since it's more of a truism, but I wanted to have a theorem. The point of this "theorem" is that any time you make a decision you have to relate all of your reasoning back to either NowEV or NextEV.
If you plan to take a NowEV hit in order to increase NextEV, remember to consider:
1. Your whole range of future hands
2. Whether your opponent is really paying any attention
3. Whether you're going to play enough hands with the opponent (or other observing opponents) to make the short-term loss worthwhile.
Also remember that if you think a certain action increases your NextEV relative to another, what you're saying is that either one action makes your opponent a better player who is harder to exploit, or the other action tilts the player into adjusting badly and being easier to exploit. ie. a player adjusting against your probably isn't going to be any easier to exploit, unless they make a horrible job of it.
So people please don't make statements like "I bluff in order to make money from my good hands" without backing them up with an EV statement. "I bluff because you believe your opponent will overadjust to this and start to call down ridiculously light, thust giving me a ton of value on my better hands" is much better (NextEV). But remember in this case you also lose some NextEV where bluffing becomes less profitable. More likely to be true: "I bluff because my opponent expects me not to bluff often and therefore I will take advantage of this and take the pot down a lot" (NowEV). "If he works this out and seems likely to over-adjust then I will exploit him by value-betting more thinly and bluffing less."
Apologies for the length of this post. I promise the next one will be shorter! If I had to shorten this all down to one sentence I'd go with "Whatever theoretical tools you use to make your decisions in poker they must relate back to expected value in this hand, with your actual holdings or to a lesser extent in future hands . And you'd better have a damn good reason for sacrificing any value on this hand."
Damn, that was two sentences. I'm really not good at this conciseness thing...
|