Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Okay.

Your opponents strategy in poker is fixed; they will always play the nash equilibrium strategy, they will never play another strategy. Their strategy is fixed. They will never change it from this setting.

You've claimed that subgame perfect play can be calculated without respect to the subgame you aren't in, because you can make a choice on the basis of the EV of the subgame you are in without respect to the subgames you aren't in.

I disagree. I think you still need to account for every subgame you are in as if you are in all of them.

Let the subgame you are in be you having KK and your opponent having AA. However, obviously - you only know that you have KK.

Therefore, you should be able to compute the strategy which is the best response to 37 suited and according to your logic it should be equal to the best response to AA. After all, you have no means of determining which subgame you are in. So you have to have the same response in both subgames.

So compute the best response for KK to AA and prove that this is also the best response to 37 suited.

However, you've claimed you don't need to calculate this with respect to other subgames. So your computation of 37 suited and your computation for AA must not be equal to each other - if they are, then you share terms. You calculated them with respect to each other.

Let Br = Best response.

Write a program which shows Br(p1, p2, I[KK]) != Br(p1, p2, I[KK]) and Br(p1, p2, I[KK]) = Br(p1, p2, I[KK]) simultaneously. (That is to say, both your policy and your opponents policy are fixed)

My contention is that you can't do this. You claim you can. You are free to use a simpler variant of poker - Kuhn poker - so that the computation becomes more tractable.



Ah, looks like I misunderstood the definition of the term "subgame". I thought that this would be one subgame: "I have KK preflop on the button, and my opponent's range is X". And another subgame might be: "I have 75o preflop on the button, and my opponent's range is X". From my opponent's perspective, these 2 situations would occur on the same level of the game tree and my opponent has no way of distinguishing between these (at this stage of the game tree) due to the hidden information, thus I thought they would be called subgames. But you're telling me that the concept of subgame applies in both directions (not only imperfect information by my opponent's perspective, but also imperfect information in my perspective). So when I say "my opponent's presumed range is X", that sentence doesn't describe 1 subgame, that sentence actually describes multiple subgames.

My earlier point was that if an opponent's strategy is fixed, then I don't have to balance how I play that KK compared to how I play that 75o. That I can play each hand "in a vacuum", in a strategy where the EV of each hand is maximized. Note that this idea is only relevant if my opponent is not playing perfect GTO (contrary to your example). For example, my opponent may have a weakness where their preflop fold frequency is the same regardless of how whether I raise 3BB or 2BB. Against this weakness the optimal play would be to raise 3BB with KK and raise 2BB with 75o. Obviously, this would be a bad strategy if my opponent was allowed to adapt their strategy to me, but it's the optimal play if my opponent's strategy is locked.

To clarify further:

- "exploitative style" poker strategy is made at the level "My hand is 96s, and my opponent's range is presumed to be Y"

- "GTO style" poker strategy is made at the level "My range is X, and my opponent's range is presumed to be Y"

If the opponent's strategy is fixed, then our optimal strategy is to play 100% exploitative. If the opponent's strategy is not fixed, then the optimal strategy will be a mix with elements from both exploitative and GTO strategies. (Again, note that this only makes sense if the opponent is not playing perfect GTO. If the opponent is playing perfect GTO, then obviously the optimal strategy is to also play GTO.)

I thought that Noam Brown was making a point related to this concept. It seems that I was mistaken and his point was related to something else entirely.


> But you're telling me that the concept of subgame applies in both directions (not only imperfect information by my opponent's perspective, but also imperfect information in my perspective). So when I say "my opponent's presumed range is X", that sentence doesn't describe 1 subgame, that sentence actually describes multiple subgames.

Yes; consider a chess move, every move is a subgame - a subtree of that game. He just can't safely say tree, because there are games that are better described as a graph than a tree. Actually, Two Envelope is such a game. The action switch, in Two Envelope, leads to the subgame in which you are at I[null] and need to decide whether to keep or switch - the same situation as you started in. Switch again and you arrive at the subgame I[null]. It contains itself.

> If the opponent's strategy is fixed, then our optimal strategy is to play 100% exploitative. If the opponent's strategy is not fixed, then the optimal strategy will be a mix with elements from both exploitative and GTO strategies. (Again, note that this only makes sense if the opponent is not playing perfect GTO. If the opponent is playing perfect GTO, then obviously the optimal strategy is to also play GTO.)

I agree.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: