skip to main content
10.5555/1838206.1838223acmotherconferencesArticle/Chapter ViewAbstractPublication PagesaamasConference Proceedingsconference-collections
research-article

To teach or not to teach?: decision making under uncertainty in ad hoc teams

Authors Info & Claims
Published:10 May 2010Publication History

ABSTRACT

In typical multiagent teamwork settings, the teammates are either programmed together, or are otherwise provided with standard communication languages and coordination protocols. In contrast, this paper presents an ad hoc team setting in which the teammates are not pre-coordinated, yet still must work together in order to achieve their common goal(s). We represent a specific instance of this scenario, in which a teammate has limited action capabilities and a fixed and known behavior, as a finite-horizon, cooperative k-armed bandit. In addition to motivating and studying this novel ad hoc teamwork scenario, the paper contributes to the k-armed bandits literature by characterizing the conditions under which certain actions are potentially optimal, and by presenting a polynomial dynamic programming algorithm that solves for the optimal action when the arm payoffs come from a discrete distribution.

References

  1. D. Bergemann and J. Valimaki. Bandit problems. Technical report, Cowles Foundation Discussion Paper, 2006.Google ScholarGoogle Scholar
  2. P. Bolton and C. Harris. Strategic experimentation. Econometrica, 67:349--374, 1999.Google ScholarGoogle ScholarCross RefCross Ref
  3. R. I. Brafman and M. Tennenholtz. On partially controlled multi-agent systems. JAIR, 4:477--507, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. H. Chalupsky, Y. Gil, C. Knoblock, K. Lerman, J. Oh, D. Pynadath, T. Russ, and M. Tambe. Electric elves: Applying agent technology to support human organizations. In IAAI, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. C. Claus and C. Boutilier. The dynamics of reinforcement learning in cooperative multiagent systems. In AAAI, pages 746--752, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. M. Cripps, G. Keller, and S. Rady. Strategic experimentation with exponential bandits. ECONOMETRICA, 73:39--68, 2005.Google ScholarGoogle ScholarCross RefCross Ref
  7. J. A. Giampapa, K. Sycara, and G. Sukthankar. Toward identifying process models in ad hoc and distributed teams. In K. V. Hindriks and W.-P. Brinkman, editors, HuCom, pages 55--62, December 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. B. J. Grosz and S. Kraus. Collaborative plans for complex group actions. AIJ, 86:269--358, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. L. ji Lin. Self-improving reactive agents based on reinforcement learning, planning and teaching. MLJ, 8(3/4):293--321, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. J. Just, M. Cornwell, and M. Huhns. Agents for establishing ad hoc cross-organizational teams. In International Conference on Intelligent Agent Technology, pages 526--30, September 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. A. Kayay. When does it pay to get informed? International Economic Review, 2009. forthcoming.Google ScholarGoogle Scholar
  12. R. Kildare. Ad-hoc online teams as complex systems: agents that cater for team interaction rules. In Proceedings of the 7th Asia-Pacific Conference on Complex Systems, December 2004.Google ScholarGoogle Scholar
  13. R. D. Kleinberg. Online Decision Problems. PhD thesis, Department of Mathematics, MIT 2005.Google ScholarGoogle Scholar
  14. L. Panait and S. Luke. Cooperative multi-agent learning: The state of the art. JAAMAS, 11:387--434, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. H. Robbins. Some aspects of the sequential design of experiments. Bulletin American Mathematical Society, 55:527--535, 1952.Google ScholarGoogle ScholarCross RefCross Ref
  16. A. Schaerf, Y. Shoham, and M. Tennenholtz. Adaptive load balancing: A study in multi-agent learning. JAIR, 2:475--500, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. P. Stone and M. Veloso. Task decomposition, dynamic role assignment, and low-bandwidth communication for real-time strategic teamwork. AIJ, 110(2):241--273, June 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. P. Stone and M. Veloso. Multiagent systems: A survey from a machine learning perspective. Autonomous Robots, 8(3):345--383, July 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. R. S. Sutton and A. G. Barto. Reinforcement Learning: An Introduction. MIT Press, Cambridge, MA, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. K. Sycara, K. Decker, A. Pannu, M. Williamson, and D. Zeng. Distributed intelligent agents. IEEE Expert, 11(6), December 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. H. Zhang, Y. Chen, and D. Parkes. A general approach to environment design with one agent. In IJCAI, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. To teach or not to teach?: decision making under uncertainty in ad hoc teams

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Other conferences
        AAMAS '10: Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
        May 2010
        1578 pages
        ISBN:9780982657119

        Publisher

        International Foundation for Autonomous Agents and Multiagent Systems

        Richland, SC

        Publication History

        • Published: 10 May 2010

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        Overall Acceptance Rate1,155of5,036submissions,23%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader