ACM Home Page
Please provide us with feedback. Feedback
Hedged learning: regret-minimization with learning experts
Full text PdfPdf (888 KB)
Source ACM International Conference Proceeding Series; Vol. 119 archive
Proceedings of the 22nd international conference on Machine learning table of contents
Bonn, Germany
Pages: 121 - 128  
Year of Publication: 2005
ISBN:1-59593-180-5
Authors
Yu-Han Chang  Massachusetts Institute of Technology, Cambridge, MA
Leslie Pack Kaelbling  Massachusetts Institute of Technology, Cambridge, MA
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 0,   Downloads (12 Months): 7,   Citation Count: 1
Additional Information:

abstract   references   cited by   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1102351.1102367
What is a DOI?

ABSTRACT

In non-cooperative multi-agent situations, there cannot exist a globally optimal, yet opponent-independent learning algorithm. Regret-minimization over a set of strategies optimized for potential opponent models is proposed as a good framework for deciding how to behave in such situations. Using longer playing horizons and experts that learn as they play, the regret-minimization framework can be extended to overcome several shortcomings of earlier approaches to the problem of multi-agent learning.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
Chang, Y., & Kaelbling, L. P. (2001). Playing is believing: The role of beliefs in multi-agent learning. NIPS.
 
3
de Farias, D. P., & Meggido, N. (2004). How to combine expert (or novice) advice when actions impact the environment. Proceedings of NIPS.
 
4
Freund, Y., & Schapire, R. E. (1999). Adaptive game playing using multiplicative weights. Games and Economic Behavior, 29, 79--103.
 
5
Fudenburg, D., & Levine, D. K. (1995). Consistency and cautious fictitious play. Journal of Economic Dynamics and Control, 19, 1065--1089.
 
6
 
7
 
8
Littman, M. L. (1994). Markov games as a framework for multi-agent reinforcement learning. ICML.
 
9
 
10
Nachbar, J., & Zame, W. (1996). Non-computable strategies and discounted repeated games. Economic Theory.

Collaborative Colleagues:
Yu-Han Chang: colleagues
Leslie Pack Kaelbling: colleagues