| Hedged learning: regret-minimization with learning experts |
| Full text |
Pdf
(888 KB)
|
| Source
|
ACM International Conference Proceeding Series; Vol. 119
archive
Proceedings of the 22nd international conference on Machine learning
table of contents
Bonn, Germany
Pages: 121 - 128
Year of Publication: 2005
ISBN:1-59593-180-5
|
|
Authors
|
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 0, Downloads (12 Months): 7, Citation Count: 1
|
|
|
ABSTRACT
In non-cooperative multi-agent situations, there cannot exist a globally optimal, yet opponent-independent learning algorithm. Regret-minimization over a set of strategies optimized for potential opponent models is proposed as a good framework for deciding how to behave in such situations. Using longer playing horizons and experts that learn as they play, the regret-minimization framework can be extended to overcome several shortcomings of earlier approaches to the problem of multi-agent learning.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
Chang, Y., & Kaelbling, L. P. (2001). Playing is believing: The role of beliefs in multi-agent learning. NIPS.
|
| |
3
|
de Farias, D. P., & Meggido, N. (2004). How to combine expert (or novice) advice when actions impact the environment. Proceedings of NIPS.
|
| |
4
|
Freund, Y., & Schapire, R. E. (1999). Adaptive game playing using multiplicative weights. Games and Economic Behavior, 29, 79--103.
|
| |
5
|
Fudenburg, D., & Levine, D. K. (1995). Consistency and cautious fictitious play. Journal of Economic Dynamics and Control, 19, 1065--1089.
|
| |
6
|
|
| |
7
|
|
| |
8
|
Littman, M. L. (1994). Markov games as a framework for multi-agent reinforcement learning. ICML.
|
| |
9
|
|
| |
10
|
Nachbar, J., & Zame, W. (1996). Non-computable strategies and discounted repeated games. Economic Theory.
|
|