Playing the matching-shoulders lob-pass game with logarithmic regret

Authors:
Joe Kilian (NEC Research Institute), Kevin J. Lang (NEC Research Institute), Barak A. Pearlmutter (Siemens Corporate Research)
COLT '94: Proceedings of the seventh annual conference on Computational learning theory, July 1994, Pages 159–164. https://doi.org/10.1145/180139.181094

Published: 16 July 1994

ABSTRACT

The best previous algorithm for the matching-shoulders lob-pass game, ARTHUR (Abe and Takeuchi 1993), suffered O(t^(1/2)) regret. We prove that this is the best possible performance for any algorithm that works by accurately estimating the opponent's payoff lines. We then describe an algorithm that beats that bound and meets the information-theoretic lower bound of O(log t) regret by converging to the best lob rate without accurately estimating the payoff lines. The noise-tolerant binary search procedure that we develop is of independent interest.
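The abstract does not spell out the noise-tolerant binary search, so the following is only a generic illustration of the underlying problem, not the authors' procedure. It is a minimal sketch that assumes a comparison oracle whose answer is flipped independently with probability below 1/2, and it uses the simplest remedy, repeating each comparison and taking a majority vote. The names `make_noisy_oracle`, `majority_vote_search`, `noise`, and `repeats` are hypothetical.

```python
import random


def make_noisy_oracle(threshold, noise, rng):
    """Build an oracle for 'is x >= threshold?' that lies with probability `noise`.

    Assumption for illustration only: errors are independent across queries.
    """
    def oracle(x):
        truth = x >= threshold
        # Flip the true answer with probability `noise`.
        return (not truth) if rng.random() < noise else truth
    return oracle


def majority_vote_search(oracle, lo, hi, repeats):
    """Return the smallest x in [lo, hi] that the oracle (by majority) accepts.

    Each midpoint is queried `repeats` times; a strict majority of "yes"
    answers moves the upper end down, anything else moves the lower end up.
    If no midpoint is accepted, the result is `hi`.
    """
    while lo < hi:
        mid = (lo + hi) // 2
        yes_votes = sum(1 for _ in range(repeats) if oracle(mid))
        if 2 * yes_votes > repeats:
            hi = mid
        else:
            lo = mid + 1
    return lo


if __name__ == "__main__":
    rng = random.Random(0)
    oracle = make_noisy_oracle(threshold=37, noise=0.2, rng=rng)
    print(majority_vote_search(oracle, 0, 100, repeats=15))
```

Repeating every comparison multiplies the number of queries by `repeats`, which is the kind of overhead that more careful error-tolerant searches, such as those in the Rivest et al. and Borgstrom and Kosaraju references, are designed to reduce.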

References

Abe, N. and Takeuchi, J. (1993). The lob-pass problem and an on-line learning model of rational choice. In Workshop on Computational Learning Theory, pp. 422-428.
Borgstrom, R. S. and Kosaraju, S. R. (1993). Comparison-Based Search in the Presence of Errors. In Proceedings of the Twenty-Fifth Annual ACM Symposium on Theory of Computing, pp. 130-136.
Herrnstein, R. (1990). Rational Choice Theory. American Psychologist, 45(3), 356-367.
Rivest, R., Meyer, A., Kleitman, D., Winklmann, K., and Spencer, J. (1980). Coping with errors in binary search procedures. Journal of Computer and System Sciences, 33, 85-94.

Index Terms

1. Computing methodologies
  1. Machine learning
2. Theory of computation
  1. Computational complexity and cryptography
    1. Complexity classes
  2. Models of computation

Published in
COLT '94: Proceedings of the seventh annual conference on Computational learning theory
July 1994
376 pages
ISBN:0897916557
DOI:10.1145/180139
Chairman:
Manfred Warmuth
Univ. of California, Santa Cruz
Copyright © 1994 ACM
Publisher
Association for Computing Machinery
New York, NY, United States
Acceptance Rates
Overall Acceptance Rate: 35 of 71 submissions, 49%