DOI: 10.1145/1102351.1102401
Article

Interactive learning of mappings from visual percepts to actions

Published: 07 August 2005

ABSTRACT

We introduce flexible algorithms that can automatically learn mappings from images to actions by interacting with their environment. They work by introducing an image classifier in front of a Reinforcement Learning algorithm. The classifier partitions the visual space according to the presence or absence of highly informative local descriptors. The image classifier is incrementally refined by selecting new local descriptors when perceptual aliasing is detected. Thus, we reduce the visual input domain down to a size manageable by Reinforcement Learning, permitting us to learn direct percept-to-action mappings. Experimental results on a continuous visual navigation task illustrate the applicability of the framework.
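
The pipeline described in the abstract, a presence/absence classifier feeding a Reinforcement Learning algorithm and refined when perceptual aliasing is detected, can be illustrated with a small sketch. Everything below is an assumption made for illustration: the abstract does not specify the learning rule, the aliasing test, or the descriptor selection criterion. Here tabular Q-learning stands in for the Reinforcement Learning algorithm, percepts are modelled as sets of descriptor ids, and a class is treated as aliased when the TD errors collected in it have high variance. The names `PresenceClassifier`, `q_update` and `pick_splitting_feature` are hypothetical.

```python
from collections import defaultdict
from statistics import pvariance


class PresenceClassifier:
    """Maps a percept (a set of detected descriptor ids) to a discrete class."""

    def __init__(self):
        self.features = []  # descriptors selected so far (hypothetical representation)

    def classify(self, percept):
        # The class label is the presence/absence pattern of the selected descriptors.
        return tuple(f in percept for f in self.features)

    def refine(self, feature):
        # Adding a descriptor splits existing classes into finer ones.
        if feature not in self.features:
            self.features.append(feature)


def q_update(q, s, a, r, s_next, actions, alpha=0.5, gamma=0.9):
    """One tabular Q-learning step; returns the TD error for (s, a)."""
    target = r + gamma * max(q[(s_next, b)] for b in actions)
    delta = target - q[(s, a)]
    q[(s, a)] += alpha * delta
    return delta


def pick_splitting_feature(samples, min_variance=1e-3):
    """samples: (percept, td_error) pairs gathered in one class for one action.

    Assumed aliasing test: high variance of the TD errors. Returns the
    descriptor whose presence/absence split most reduces that variance,
    or None if the class does not look aliased.
    """
    errors = [e for _, e in samples]
    if len(errors) < 2 or pvariance(errors) < min_variance:
        return None
    best, best_score = None, pvariance(errors)
    candidates = set().union(*(p for p, _ in samples))
    for f in sorted(candidates):
        with_f = [e for p, e in samples if f in p]
        without_f = [e for p, e in samples if f not in p]
        if not with_f or not without_f:
            continue  # this descriptor does not split the class
        score = (len(with_f) * pvariance(with_f)
                 + len(without_f) * pvariance(without_f)) / len(errors)
        if score < best_score:
            best, best_score = f, score
    return best


# Toy usage: the descriptor "door" separates positive from negative TD errors.
samples = [
    (frozenset({"door", "wall"}), 1.0),
    (frozenset({"wall"}), -1.0),
    (frozenset({"door"}), 1.0),
    (frozenset({"wall", "window"}), -1.0),
]
classifier = PresenceClassifier()
chosen = pick_splitting_feature(samples)
if chosen is not None:
    classifier.refine(chosen)
```

In this toy setting the classifier starts with a single class containing every percept and grows only when the learner's own errors suggest that distinct situations are being confused, which keeps the number of states seen by the learner small.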

    • Published in

      ICML '05: Proceedings of the 22nd international conference on Machine learning
      August 2005
      1113 pages
      ISBN:1595931805
      DOI:10.1145/1102351

      Copyright © 2005 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Qualifiers

      • Article

      Acceptance Rates

      Overall Acceptance Rate: 140 of 548 submissions, 26%
