Article

Finding equilibria in large sequential games of imperfect information

Authors:
Andrew Gilpin

Carnegie Mellon University, Pittsburgh, PA, USA

Carnegie Mellon University, Pittsburgh, PA, USA
View Profile

,
Tuomas Sandholm

Carnegie Mellon University, Pittsburgh, PA, USA

Carnegie Mellon University, Pittsburgh, PA, USA
View Profile

EC '06: Proceedings of the 7th ACM conference on Electronic commerceJune 2006Pages 160–169https://doi.org/10.1145/1134707.1134725

Published:11 June 2006Publication History

EC '06: Proceedings of the 7th ACM conference on Electronic commerce

Pages 160–169

ABSTRACT

Finding an equilibrium of an extensive form game of imperfect information is a fundamental problem in computational game theory, but current techniques do not scale to large games. To address this, we introduce the ordered game isomorphism and the related ordered game isomorphic abstraction transformation. For a multi-player sequential game of imperfect information with observable actions and an ordered signal space, we prove that any Nash equilibrium in an abstracted smaller game, obtained by one or more applications of the transformation, can be easily converted into a Nash equilibrium in the original game. We present an algorithm, GameShrink, for abstracting the game using our isomorphism exhaustively. Its complexity is Õ(n²), where n is the number of nodes in a structure we call the signal tree. It is no larger than the game tree, and on nontrivial games it is drastically smaller, so GameShrink has time and space complexity sublinear in the size of the game tree. Using GameShrink, we find an equilibrium to a poker game with 3.1 billion nodes--over four orders of magnitude more than in the largest poker game solved previously. We discuss several electronic commerce applications for GameShrink. To address even larger games, we introduce approximation methods that do not preserve equilibrium, but nevertheless yield (ex post) provably close-to-optimal strategies.

References

W. Ackermann. Zum Hilbertschen Aufbau der reellen Zahlen. Math. Annalen, 99:118--133, 1928.]]Google ScholarCross Ref
D. Applegate, G. Jacobson, and D. Sleator. Computer analysis of sprouts. Technical Report CMU-CS-91-144, 1991.]]Google Scholar
R. Bellman and D. Blackwell. Some two-person games involving bluffing. PNAS, 35:600--605, 1949.]]Google ScholarCross Ref
D. Billings, N. Burch, A. Davidson, R. Holte, J. Schaeffer, T. Schauenberg, and D. Szafron. Approximating game-theoretic optimal strategies for full-scale poker. In IJCAI, 2003.]]Google ScholarDigital Library
D. Billings, A. Davidson, J. Schaeffer, and D. Szafron. The challenge of poker. Artificial Intelligence, 134:201--240, 2002.]] Google ScholarDigital Library
B. Bollobàs. Combinatorics. Cambridge University Press, 1986.]]Google Scholar
A. Casajus. Weak isomorphism of extensive games. Mathematical Social Sciences, 46:267--290, 2003.]]Google ScholarCross Ref
X. Chen and X. Deng. Settling the complexity of 2-player Nash equilibrium. ECCC, Report No. 150, 2005.]]Google Scholar
V. Chvátal. Linear Programming. W. H. Freeman & Co., 1983.]]Google Scholar
B. P. de Bruin. Game transformations and game equivalence. Technical note x-1999-01, University of Amsterdam, Institute for Logic, Language, and Computation, 1999.]]Google Scholar
S. Elmes and P. J. Reny. On the strategic equivalence of extensive form games. J. of Economic Theory, 62:1--23, 1994.]]Google ScholarCross Ref
L. R. Ford, Jr. and D. R. Fulkerson. Flows in Networks. Princeton University Press, 1962.]]Google Scholar
A. Gilpin and T. Sandholm. Finding equilibria in large sequential games of imperfect information. Technical Report CMU-CS-05-158, Carnegie Mellon University, 2005.]]Google Scholar
A. Gilpin and T. Sandholm. Optimal Rhode Island Hold'em poker. In AAAI, pages 1684--1685, Pittsburgh, PA, USA, 2005.]] Google ScholarDigital Library
A. Gilpin and T. Sandholm. A competitive Texas Hold'em poker player via automated abstraction and real-time equilibrium computation. Mimeo, 2006.]]Google Scholar
A. Gilpin and T. Sandholm. A Texas Hold'em poker player based on automated abstraction and real-time equilibrium computation. In AAMAS, Hakodate, Japan, 2006.]] Google ScholarDigital Library
M. L. Ginsberg. Partition search. In AAAI, pages 228--233, Portland, OR, 1996.]]Google Scholar
M. L. Ginsberg. GIB: Steps toward an expert-level bridge-playing program. In IJCAI, Stockholm, Sweden, 1999.]] Google ScholarDigital Library
S. Govindan and R. Wilson. A global Newton method to compute Nash equilibria. J. of Econ. Theory, 110:65--86, 2003.]]Google ScholarCross Ref
C. A. Knoblock. Automatically generating abstractions for planning. Artificial Intelligence, 68(2):243--302, 1994.]] Google ScholarDigital Library
E. Kohlberg and J.-F. Mertens. On the strategic stability of equilibria. Econometrica, 54:1003--1037, 1986.]]Google ScholarCross Ref
D. Koller and N. Megiddo. The complexity of two-person zero-sum games in extensive form. Games and Economic Behavior, 4(4):528--552, Oct. 1992.]]Google ScholarCross Ref
D. Koller and N. Megiddo. Finding mixed strategies with small supports in extensive form games. International Journal of Game Theory, 25:73--92, 1996.]]Google ScholarCross Ref
D. Koller, N. Megiddo, and B. von Stengel. Efficient computation of equilibria for extensive two-person games. Games and Economic Behavior, 14(2):247--259, 1996.]]Google ScholarCross Ref
D. Koller and A. Pfeffer. Representations and solutions for game-theoretic problems. Artificial Intelligence, 94(1):167--215, July 1997.]] Google ScholarDigital Library
D. M. Kreps and R. Wilson. Sequential equilibria. Econometrica, 50(4):863--894, 1982.]]Google ScholarCross Ref
H. W. Kuhn. Extensive games. PNAS, 36:570--576, 1950.]]Google ScholarCross Ref
H. W. Kuhn. A simplified two-person poker. In Contributions to the Theory of Games, volume 1 of Annals of Mathematics Studies, 24, pages 97--103. Princeton University Press, 1950.]]Google Scholar
H. W. Kuhn. Extensive games and the problem of information. In Contributions to the Theory of Games, volume 2 of Annals of Mathematics Studies, 28, pages 193--216. Princeton University Press, 1953.]]Google Scholar
C. Lemke and J. Howson. Equilibrium points of bimatrix games. Journal of the Society for Industrial and Applied Mathematics, 12:413--423, 1964.]]Google ScholarCross Ref
R. Lipton, E. Markakis, and A. Mehta. Playing large games using simple strategies. In ACM-EC, pages 36--41, 2003.]] Google ScholarDigital Library
C.-L. Liu and M. Wellman. On state-space abstraction for anytime evaluation of Bayesian networks. SIGART Bulletin, 7(2):50--57, 1996.]] Google ScholarDigital Library
A. Mas-Colell, M. Whinston, and J. R. Green. Microeconomic Theory. Oxford University Press, 1995.]]Google Scholar
R. D. McKelvey and A. McLennan. Computation of equilibria in finite games. In Handbook of Computational Economics, volume 1, pages 87--142. Elsevier, 1996.]]Google ScholarDigital Library
P. B. Miltersen and T. B. Sørensen. Computing sequential equilibria for two-player games. In SODA, pages 107--116, 2006.]] Google ScholarDigital Library
J. Nash. Equilibrium points in n-person games. Proc. of the National Academy of Sciences, 36:48--49, 1950.]]Google ScholarCross Ref
J. F. Nash and L. S. Shapley. A simple three-person poker game. In Contributions to the Theory of Games, volume 1, pages 105--116. Princeton University Press, 1950.]]Google Scholar
A. Perea. Rationality in extensive form games. Kluwer Academic Publishers, 2001.]]Google ScholarCross Ref
A. Pfeffer, D. Koller, and K. Takusagawa. State-space approximations for extensive form games, July 2000. Talk given at the First International Congress of the Game Theory Society, Bilbao, Spain.]]Google Scholar
R. Porter, E. Nudelman, and Y. Shoham. Simple search methods for finding a Nash equilibrium. In AAAI, pages 664--669, San Jose, CA, USA, 2004.]]Google ScholarDigital Library
I. Romanovskii. Reduction of a game with complete memory to a matrix game. Soviet Mathematics, 3:678--681, 1962.]]Google Scholar
T. Sandholm and A. Gilpin. Sequences of take-it-or-leave-it offers: Near-optimal auctions without full valuation revelation. In AAMAS, Hakodate, Japan, 2006.]] Google ScholarDigital Library
T. Sandholm, A. Gilpin, and V. Conitzer. Mixed-integer programming methods for finding Nash equilibria. In AAAI, pages 495--501, Pittsburgh, PA, USA, 2005.]] Google ScholarDigital Library
R. Savani and B. von Stengel. Exponentially many steps for finding a Nash equilibrium in a bimatrix game. In FOCS, pages 258--267, 2004.]] Google ScholarDigital Library
R. Selten. Spieltheoretische behandlung eines oligopolmodells mit nachfrageträgheit. Zeitschrift fòr die gesamte Staatswissenschaft, 12:301--324, 1965.]]Google Scholar
R. Selten. Evolutionary stability in extensive two-person games -- correction and further development. Mathematical Social Sciences, 16:223--266, 1988.]]Google ScholarCross Ref
J. Shi and M. Littman. Abstraction methods for game theoretic poker. In Computers and Games, pages 333--345. Springer-Verlag, 2001.]] Google ScholarDigital Library
S. J. J. Smith, D. S. Nau, and T. Throop. Computer bridge: A big win for AI planning. AI Magazine, 19(2):93--105, 1998.]]Google ScholarDigital Library
R. E. Tarjan. Efficiency of a good but not linear set union algorithm. Journal of the ACM, 22(2):215--225, 1975.]] Google ScholarDigital Library
F. Thompson. Equivalence of games in extensive form. RAND Memo RM-759, The RAND Corporation, Jan. 1952.]]Google Scholar
J. von Neumann and O. Morgenstern. Theory of games and economic behavior. Princeton University Press, 1947.]]Google Scholar
B. von Stengel. Efficient computation of behavior strategies. Games and Economic Behavior, 14(2):220--246, 1996.]]Google ScholarCross Ref
B. von Stengel. Computing equilibria for two-person games. In Handbook of Game Theory, volume 3. North Holland, Amsterdam, 2002.]]Google Scholar
R. Wilson. Computing equilibria of two-person games from the extensive form. Management Science, 18(7):448--460, 1972.]]Google ScholarDigital Library
S. J. Wright. Primal-Dual Interior-Point Methods. SIAM, 1997.]] Google ScholarDigital Library

Index Terms

Finding equilibria in large sequential games of imperfect information

Recommendations

Lossless abstraction of imperfect information games

Finding an equilibrium of an extensive form game of imperfect information is a fundamental problem in computational game theory, but current techniques do not scale to large games. To address this, we introduce the ordered game isomorphism and the ...
Read More
Better automated abstraction techniques for imperfect information games, with application to Texas Hold'em poker
AAMAS '07: Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems

We present new approximation methods for computing game-theoretic strategies for sequential games of imperfect information. At a high level, we contribute two new ideas. First, we introduce a new state-space abstraction algorithm. In each round of the ...
Read More
GS3 and Tartanian: game theory-based heads-up limit and no-limit Texas Hold'em poker-playing programs
AAMAS '08: Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems: demo papers

We demonstrate two game theory-based programs for heads-up limit and no-limit Texas Hold'em poker. The first player, GS3, is designed for playing limit Texas Hold'em, in which all bets are a fixed amount. The second player, Tartanian, is designed for ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
EC '06: Proceedings of the 7th ACM conference on Electronic commerce
June 2006
342 pages
ISBN:1595932364
DOI:10.1145/1134707
General Chair:
Joan Feigenbaum
Yale University, USA
,
Program Chairs:
John Chuang
University of California Berkeley, USA
,
David M. Pennock
Yahoo! Research, USA
Copyright © 2006 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 11 June 2006
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
automated abstraction
computer poker
equilibrium finding
game theory
sequential games of imperfect information
Qualifiers
- Article
Conference

Acceptance Rates
Overall Acceptance Rate664of2,389submissions,28%
Upcoming Conference
EC '24

Sponsor:

sigecom

The 25th ACM Conference on Economics and Computation

July 8 - 11, 2024

New Haven , CT , USA
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 17
  Total Citations
  View Citations
- 797
  Total Downloads
- Downloads (Last 12 months)15
- Downloads (Last 6 weeks)3
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Finding equilibria in large sequential games of imperfect information

EC '06: Proceedings of the 7th ACM conference on Electronic commerce

ABSTRACT

References

Cited By

Index Terms

Recommendations

Lossless abstraction of imperfect information games

Better automated abstraction techniques for imperfect information games, with application to Texas Hold'em poker

GS3 and Tartanian: game theory-based heads-up limit and no-limit Texas Hold'em poker-playing programs