research-article

Adaptive Kanerva-based function approximation for multi-agent systems

Authors:

Cheng Wu,

Waleed M. Meleis

AAMAS '08: Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 3

Pages 1361 - 1364

Published: 12 May 2008

Abstract

In this paper, we show how adaptive prototype optimization can be used to improve the performance of function approximation based on Kanerva Coding when solving large-scale instances of classic multi-agent problems. We apply our techniques to the predator-prey pursuit problem. We first demonstrate that Kanerva Coding applied within a reinforcement learner does not give good results. We then describe our new adaptive Kanerva-based function approximation algorithm, based on prototype deletion and generation. We show that probabilistic prototype deletion with random prototype generation increases the fraction of test instances that are solved from 45% to 90%, and that prototype splitting increases that fraction to 94%. We also show that optimizing prototypes reduces the number of prototypes, and therefore the number of features, needed to achieve a 90% solution rate by up to 87%. These results demonstrate that our approach can dramatically improve the quality of the results obtained and reduce the number of prototypes required. We conclude that adaptive prototype optimization can greatly improve a Kanerva-based reinforcement learner's ability to solve large-scale multi-agent problems.
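The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes a state is a tuple of discrete values, that a prototype is active when its Hamming distance to the state is within a fixed radius, and that a state's value is the sum of the weights of its active prototypes. The class name `KanervaApproximator`, the `radius` parameter, and the deletion rule (delete with probability `exp(-visits)`) are hypothetical choices made for illustration. Prototype splitting is not shown.

```python
import math
import random


def hamming(a, b):
    """Number of positions at which two equal-length tuples differ."""
    return sum(x != y for x, y in zip(a, b))


class KanervaApproximator:
    """Illustrative Kanerva-coding value approximator with adaptive prototypes."""

    def __init__(self, num_prototypes, dims, values, radius, rng=None):
        self.rng = rng or random.Random(0)
        self.dims = dims          # number of state variables
        self.values = values      # allowed values of each state variable
        self.radius = radius      # activation radius (Hamming distance)
        self.prototypes = [self._random_prototype() for _ in range(num_prototypes)]
        self.weights = [0.0] * num_prototypes
        self.visits = [0] * num_prototypes

    def _random_prototype(self):
        return tuple(self.rng.choice(self.values) for _ in range(self.dims))

    def active(self, state):
        """Indices of prototypes within `radius` of the state."""
        return [i for i, p in enumerate(self.prototypes)
                if hamming(state, p) <= self.radius]

    def value(self, state):
        """Approximate value: sum of the weights of the active prototypes."""
        return sum(self.weights[i] for i in self.active(state))

    def update(self, state, target, alpha=0.1):
        """Move the estimate for `state` toward `target`, sharing the error."""
        idx = self.active(state)
        if not idx:
            return
        error = target - sum(self.weights[i] for i in idx)
        for i in idx:
            self.weights[i] += alpha * error / len(idx)
            self.visits[i] += 1

    def adapt(self):
        """Probabilistic deletion with random regeneration.

        ASSUMPTION: a prototype is deleted with probability exp(-visits),
        so rarely visited prototypes are the most likely to be replaced.
        """
        for i in range(len(self.prototypes)):
            if self.rng.random() < math.exp(-self.visits[i]):
                self.prototypes[i] = self._random_prototype()
                self.weights[i] = 0.0
            self.visits[i] = 0
```

In a reinforcement learner, `update` would typically be called with a temporal-difference target after each step, and `adapt` would be called periodically (for example, every few episodes) so that the set of prototypes tracks the states the agents actually visit.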

Cited By

Li W, Zhou F, Meleis W, Chowdhury K, Larson K, Winikoff M, Das S, Durfee E (2017). Dynamic Generalization Kanerva Coding in Reinforcement Learning for TCP Congestion Control Design. Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, pp. 1598-1600. doi:10.5555/3091125.3091375. Online publication date: 8-May-2017.
https://dl.acm.org/doi/10.5555/3091125.3091375

Wu C, Meleis W, Sierra C, Castelfranchi C, Decker K, Sichman J (2009). Fuzzy Kanerva-based function approximation for reinforcement learning. Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2, pp. 1257-1258. doi:10.5555/1558109.1558240. Online publication date: 10-May-2009.
https://dl.acm.org/doi/10.5555/1558109.1558240

Wu C, Meleis W (2009). Adaptive Fuzzy Function Approximation for Multi-agent Reinforcement Learning. Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 02, pp. 169-176. doi:10.1109/WI-IAT.2009.147. Online publication date: 15-Sep-2009.
https://dl.acm.org/doi/10.1109/WI-IAT.2009.147

Index Terms

Adaptive Kanerva-based function approximation for multi-agent systems
1. Computing methodologies
  1. Artificial intelligence
    1. Control methods
    2. Search methodologies
2. Theory of computation
  1. Design and analysis of algorithms
    1. Algorithm design techniques
      1. Dynamic programming

Information

Published In

AAMAS '08: Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 3

May 2008

503 pages

ISBN:9780981738123

Publisher

International Foundation for Autonomous Agents and Multiagent Systems

Richland, SC

Qualifiers

Research-article

Conference

AAMAS08

Sponsor:

ACM
AAAI

AAMAS08: 7th International Conference on Autonomous Agents and Multi Agent Systems

May 12 - 16, 2008

Estoril, Portugal

Acceptance Rates

Overall Acceptance Rate 1,155 of 5,036 submissions, 23%

Bibliometrics

Total citations: 3
Total downloads: 155
Downloads (last 12 months): 1
Downloads (last 6 weeks): 0

Reflects downloads up to 31 Jan 2025
