Article

Efficient model learning for dialog management

Authors:

Finale Doshi,

Nicholas Roy

HRI '07: Proceedings of the ACM/IEEE international conference on Human-robot interaction

Pages 65 - 72

https://doi.org/10.1145/1228716.1228726

Published: 10 March 2007

Abstract

Intelligent planning algorithms such as the Partially Observable Markov Decision Process (POMDP) have succeeded in dialog management applications [10, 11, 12] because they are robust to the inherent uncertainty of human interaction. Like all dialog planning systems, however, POMDPs require an accurate model of the user (e.g., what the user might say or want). POMDPs are generally specified using a large probabilistic model with many parameters. These parameters are difficult to specify from domain knowledge, and gathering enough data to estimate the parameters accurately a priori is expensive.

In this paper, we take a Bayesian approach to learning the user model simultaneously with the dialog manager policy. At the heart of our approach is an efficient incremental update algorithm that allows the dialog manager to replan just long enough to improve the current dialog policy given data from recent interactions. The update process has a relatively small computational cost, preventing long delays in the interaction. We are able to demonstrate a robust dialog manager that learns from interaction data, outperforming a hand-coded model in simulation and in a robotic wheelchair application.
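To make the idea of Bayesian user-model learning concrete, the following is a minimal sketch and not the paper's algorithm. It assumes a Dirichlet prior over the observation probabilities P(observation | user goal), which the abstract does not specify, and a toy two-goal domain in which the user's true goal is known once a dialog ends. The state names, observation names, and the `DirichletObservationModel` class are all hypothetical, and the incremental replanning step described in the abstract is omitted.

```python
# Hypothetical toy domain: the user wants one of two destinations, and the
# speech recognizer returns a keyword that may be wrong.
STATES = ["kitchen", "office"]
OBSERVATIONS = ["heard_kitchen", "heard_office"]


class DirichletObservationModel:
    """Dirichlet pseudo-counts over P(observation | user goal) (an assumption)."""

    def __init__(self, prior_count=1.0):
        # Symmetric prior: every (goal, observation) pair starts equally likely.
        self.counts = {s: {o: prior_count for o in OBSERVATIONS} for s in STATES}

    def update(self, state, observation, weight=1.0):
        # Conjugate Bayesian update: add the observed count to the pseudo-count.
        self.counts[state][observation] += weight

    def mean(self, state, observation):
        # Posterior mean of the multinomial parameter.
        total = sum(self.counts[state].values())
        return self.counts[state][observation] / total


def belief_update(belief, observation, model):
    """Bayes filter step for a goal that does not change: b'(s) is
    proportional to P(o | s) * b(s), using the posterior-mean model."""
    unnormalized = {s: model.mean(s, observation) * belief[s] for s in STATES}
    norm = sum(unnormalized.values())
    return {s: p / norm for s, p in unnormalized.items()}


model = DirichletObservationModel(prior_count=1.0)

# Made-up interaction log of (confirmed goal, keyword heard) pairs.
log = [("kitchen", "heard_kitchen")] * 8 + [("kitchen", "heard_office")] * 2
for state, obs in log:
    model.update(state, obs)

# Belief over the user's goal after hearing one keyword, starting from uniform.
belief = belief_update({"kitchen": 0.5, "office": 0.5}, "heard_kitchen", model)
print(model.mean("kitchen", "heard_kitchen"), belief)
```

In this sketch the learned model sharpens the belief as data accumulates. A dialog manager in the spirit of the abstract would additionally replan its policy for a bounded amount of time after each model update.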

References

[1] R. Dearden, N. Friedman, and D. Andre. Model based Bayesian exploration. Pages 150-159, 1999.

[2] G. J. Gordon. Stable function approximation in dynamic programming. In Proceedings of the Twelfth International Conference on Machine Learning, San Francisco, CA, 1995. Morgan Kaufmann.

[3] R. Jaulmes, J. Pineau, and D. Precup. Learning in non-stationary partially observable Markov decision processes. Workshop on Non-Stationarity in Reinforcement Learning at the ECML, 2005.

[4] D. Litman, S. Singh, M. Kearns, and M. Walker. NJFun: a reinforcement learning spoken dialogue system. In Proceedings of the ANLP/NAACL 2000 Workshop on Conversational Systems, Seattle, 2000.

[5] A. Nilim and L. Ghaoui. Robustness in Markov decision problems with uncertain transition matrices, 2004.

[6] J. Pineau, G. Gordon, and S. Thrun. Point-based value iteration: An anytime algorithm for POMDPs, 2003.

[7] J. Pineau, N. Roy, and S. Thrun. A hierarchical approach to POMDP planning and execution. In Workshop on Hierarchy and Memory in Reinforcement Learning (ICML), June 2001.

[8] L. R. Rabiner. A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE, 77(2):257-286, 1989.

[9] M. Ravishankar. Efficient Algorithms for Speech Recognition. PhD thesis, Carnegie Mellon, 1996.

[10] N. Roy, J. Pineau, and S. Thrun. Spoken dialogue management using probabilistic reasoning. In Proceedings of the 38th Annual Meeting of the ACL, Hong Kong, 2000.

[11] J. Williams and S. Young. Scaling up POMDPs for dialogue management: The "summary POMDP" method. In Proceedings of the IEEE ASRU Workshop, 2005.

[12] J. D. Williams, P. Poupart, and S. Young. Partially observable Markov decision processes with continuous observations for dialogue management. In Proceedings of SIGdial Workshop on Discourse and Dialogue 2005, 2005.

Index Terms

Efficient model learning for dialog management
1. Computing methodologies
  1. Machine learning
    1. Learning paradigms
    2. Machine learning approaches
      1. Markov decision processes
2. Theory of computation
  1. Theory and algorithms for application domains
    1. Machine learning theory
      1. Markov decision processes


Information

Published In

HRI '07: Proceedings of the ACM/IEEE international conference on Human-robot interaction

March 2007

392 pages

ISBN: 9781595936172

DOI: 10.1145/1228716

General Chairs: Cynthia Breazeal (Massachusetts Institute of Technology, USA), Alan C. Schultz (Naval Research Laboratory, USA)

Program Chairs: Terry Fong (NASA Ames Research Center, USA), Sara Kiesler (Carnegie Mellon University, USA)

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Article

Conference

HRI07: International Conference on Human Robot Interaction

March 10 - 12, 2007

Arlington, Virginia, USA

Acceptance Rates

HRI '07 Paper Acceptance Rate 22 of 101 submissions, 22%;

Overall Acceptance Rate 268 of 1,124 submissions, 24%

Bibliometrics

Total Citations: 35
Total Downloads: 503
Downloads (last 12 months): 5
Downloads (last 6 weeks): 1

Reflects downloads up to 16 Feb 2025

Cited By

Reimann M, Kunneman F, Oertel C, Hindriks K (2024). A Survey on Dialogue Management in Human-robot Interaction. ACM Transactions on Human-Robot Interaction 13:2 (1-22). Online publication date: 14-Jun-2024. https://dl.acm.org/doi/10.1145/3648605

Idrees I, Yun T, Sharma N, Deng Y, Gopalan N, Konidaris G, Tellex S (2023). Improved Inference of Human Intent by Combining Plan Recognition and Language Feedback. 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (7976-7983). Online publication date: 1-Oct-2023. https://doi.org/10.1109/IROS55552.2023.10342380

Fontaine M, Nikolaidis S (2022). Evaluating Human–Robot Interaction Algorithms in Shared Autonomy via Quality Diversity Scenario Generation. ACM Transactions on Human-Robot Interaction 11:3 (1-30). Online publication date: 2-Sep-2022. https://dl.acm.org/doi/10.1145/3476412

Li M, Kwon M, Sadigh D (2021). Influencing leading and following in human–robot teams. Autonomous Robots 45:7 (959-978). Online publication date: 1-Oct-2021. https://dl.acm.org/doi/10.1007/s10514-021-10016-7

Fan Z, Meng L, Chen T, Li J, Mitchell I (2018). Learning Motion Predictors for Smart Wheelchair Using Autoregressive Sparse Gaussian Process. 2018 IEEE International Conference on Robotics and Automation (ICRA) (713-718). Online publication date: 21-May-2018. https://dl.acm.org/doi/10.1109/ICRA.2018.8460502

Lokesh S, Kanisha B, Nalini S, Ramya Devi M, Kumar R (2018). RETRACTED ARTICLE: Speech to speech interaction system using Multimedia Tools and Partially Observable Markov Decision Process for visually impaired students. Multimedia Tools and Applications 79:7-8 (5023-5042). Online publication date: 23-Jun-2018. https://doi.org/10.1007/s11042-018-6264-2

Nikolaidis S, Hsu D, Srinivasa S (2017). Human-robot mutual adaptation in collaborative tasks: Models and experiments. The International Journal of Robotics Research 36:5-7 (618-634). Online publication date: 14-Feb-2017. https://doi.org/10.1177/0278364917690593

Schwesinger D, Shariati A, Montella C, Spletzer J (2017). A smart wheelchair ecosystem for autonomous navigation in urban environments. Autonomous Robots 41:3 (519-538). Online publication date: 1-Mar-2017. https://dl.acm.org/doi/10.1007/s10514-016-9549-1

Vokhmintsev A, Timchenko M, Yakovlev K (2016). Simultaneous localization and mapping in unknown environment using dynamic matching of images and registration of point clouds. 2016 2nd International Conference on Industrial Engineering, Applications and Manufacturing (ICIEAM) (1-6). Online publication date: 2016. https://doi.org/10.1109/ICIEAM.2016.7910967

Aida-zade K, Rustamov S (2016). Learning User Intentions in Natural Language Call Routing Systems. Recent Developments and New Direction in Soft-Computing Foundations and Applications (37-46). Online publication date: 26-May-2016. https://doi.org/10.1007/978-3-319-32229-2_4
