Improving reinforcement learning function approximators via neuroevolution

Author:
Shimon Whiteson

University of Texas at Austin, Austin, TX

AAMAS '05: Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems, July 2005, Page 1386. https://doi.org/10.1145/1082473.1082794

Published: 25 July 2005

ABSTRACT

Reinforcement learning problems are commonly tackled with temporal difference methods, which estimate the long-term value of taking each action in each state. In most problems of real-world interest, learning this value function requires a function approximator. However, the feasibility of using function approximators depends on the ability of the human designer to select an appropriate representation for the value function. My thesis presents a new approach to function approximation that automates some of these difficult design choices by coupling temporal difference methods with policy search methods such as evolutionary computation. It also presents a particular implementation which combines NEAT, a neuroevolutionary policy search method, and Q-learning, a popular temporal difference method, to yield a new method called NEAT+Q that automatically learns effective representations for neural network function approximators. Empirical results in a server job scheduling task demonstrate that NEAT+Q can outperform both NEAT and Q-learning with manually designed neural networks.
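
As a rough illustration of the temporal difference update the abstract refers to, the following is a minimal sketch of a single Q-learning step with a linear function approximator. It shows the standard textbook update only and is not the NEAT+Q implementation; per the abstract, NEAT+Q uses a neural network whose representation is evolved by NEAT. The function names, the one-weight-vector-per-action layout, and the default values of `alpha` and `gamma` are assumptions made for this example.

```python
from typing import List, Sequence


def q_values(weights: List[List[float]], features: Sequence[float]) -> List[float]:
    """Linear approximator (assumed for illustration): Q(s, a) = w_a . phi(s)."""
    return [sum(w * x for w, x in zip(w_a, features)) for w_a in weights]


def q_learning_update(
    weights: List[List[float]],
    features: Sequence[float],
    action: int,
    reward: float,
    next_features: Sequence[float],
    done: bool,
    alpha: float = 0.1,   # learning rate (illustrative default)
    gamma: float = 0.9,   # discount factor (illustrative default)
) -> float:
    """Apply one Q-learning step in place and return the TD error."""
    q_sa = q_values(weights, features)[action]
    # Bootstrapped target: immediate reward plus discounted best next value.
    if done:
        target = reward
    else:
        target = reward + gamma * max(q_values(weights, next_features))
    td_error = target - q_sa
    # Gradient of a linear Q(s, a) with respect to w_a is just phi(s).
    for i, x in enumerate(features):
        weights[action][i] += alpha * td_error * x
    return td_error


if __name__ == "__main__":
    w = [[0.0, 0.0], [0.0, 0.0]]  # two actions, two state features
    err = q_learning_update(w, [1.0, 0.0], 0, 1.0, [0.0, 1.0], done=False, alpha=0.5)
    print(err, w)
```

In this sketch the designer fixes the feature vector and the linear form by hand, which is the kind of representation choice the abstract says the evolutionary search is meant to automate.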

References

K. O. Stanley and R. Miikkulainen. Evolving neural networks through augmenting topologies. Evolutionary Computation, 10(2):99–127, 2002.
R. S. Sutton and A. G. Barto. Reinforcement Learning: An Introduction. MIT Press, Cambridge, MA, 1998.

Index Terms

Computing methodologies > Machine learning > Machine learning approaches > Neural networks

Published in
AAMAS '05: Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems
July 2005
1407 pages
ISBN: 1595930930
DOI: 10.1145/1082473
Program Chairs: Michal Pechoucek (Czech Republic), Donald Steiner (USA), Simon Thompson (UK)
Copyright © 2005 ACM
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
genetic algorithms
neural networks
reinforcement learning
Qualifiers
- Article
Conference

Acceptance Rates
Overall Acceptance Rate: 1,155 of 5,036 submissions, 23%