DOI: 10.5555/1734454.1734553
research-article

Toward understanding natural language directions

Published: 02 March 2010

Abstract

Speaking in unconstrained natural language is an intuitive and flexible way for humans to interact with robots. Understanding this kind of linguistic input is challenging because diverse words and phrases must be mapped into structures that the robot can understand, and elements in those structures must be grounded in an uncertain environment. We present a system that follows natural language directions by extracting a sequence of spatial description clauses from the linguistic input and then inferring the most probable path through the environment given only information about the environmental geometry and detected visible objects. We use a probabilistic graphical model that factors into three key components. First, a grounding component maps landmark phrases such as "the computers" into the perceptual frame of the robot by exploiting co-occurrence statistics from a database of tagged images such as Flickr. Second, a spatial reasoning component judges how well spatial relations such as "past the computers" describe a path. Finally, verb phrases such as "turn right" are modeled according to the amount of change in orientation in the path. Our system follows 60% of the directions in our corpus to within 15 meters of the true destination, significantly outperforming other approaches.
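
The factored model described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the map, the co-occurrence table, the scoring functions, and every name below (`landmark_score`, `turn_score`, `follow_directions`) are hypothetical, and the spatial-relation component is left out for brevity. The abstract does not state the inference method; the sketch assumes a Viterbi-style dynamic program, which is one standard way to find the most probable sequence in such a model.

```python
import math

def landmark_score(landmark, visible, cooc):
    """Assumed grounding factor: 1.0 if the landmark is detected directly,
    else the best co-occurrence probability with any visible object."""
    if landmark in visible:
        return 1.0
    return max((cooc.get((landmark, obj), 0.0) for obj in visible), default=0.0)

def turn_score(verb, heading_change_deg):
    """Assumed verb factor: high when the change in orientation matches
    the turn the verb implies (counter-clockwise angles are positive)."""
    expected = {"straight": 0.0, "left": 90.0, "right": -90.0}[verb]
    diff = abs((heading_change_deg - expected + 180.0) % 360.0 - 180.0)
    return math.exp(-((diff / 45.0) ** 2))

def follow_directions(sdcs, start, heading, pos, edges, visible, cooc):
    """Viterbi-style search: one map transition per (verb, landmark) clause.
    A state is (node, heading on arrival); values are (log score, path)."""
    best = {(start, heading): (0.0, [start])}
    for verb, landmark in sdcs:
        nxt = {}
        for (node, h), (logp, path) in best.items():
            for w in edges[node]:
                dx = pos[w][0] - pos[node][0]
                dy = pos[w][1] - pos[node][1]
                new_h = math.degrees(math.atan2(dy, dx))
                score = (turn_score(verb, new_h - h)
                         * landmark_score(landmark, visible[w], cooc))
                cand = logp + math.log(score + 1e-9)  # smooth zero scores
                key = (w, new_h)
                if key not in nxt or cand > nxt[key][0]:
                    nxt[key] = (cand, path + [w])
        best = nxt
    return max(best.values(), key=lambda v: v[0])[1]

# Toy map (hypothetical): the robot starts at A facing along +x.
pos = {"A": (0, 0), "B": (1, 0), "C": (1, -1), "D": (1, 1)}
edges = {"A": ["B"], "B": ["A", "C", "D"], "C": ["B"], "D": ["B"]}
visible = {"A": set(), "B": {"monitor"}, "C": {"sink"}, "D": {"desk"}}
# Made-up co-occurrence probabilities standing in for tagged-image statistics.
cooc = {("computers", "monitor"): 0.8, ("kitchen", "sink"): 0.7}

# "Go straight past the computers, then turn right to the kitchen."
sdcs = [("straight", "computers"), ("right", "kitchen")]
path = follow_directions(sdcs, "A", 0.0, pos, edges, visible, cooc)
print(path)  # ['A', 'B', 'C']
```

In this toy setting the landmark "computers" is never detected directly; it is grounded through the visible "monitor", which mirrors the role the abstract assigns to co-occurrence statistics.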



Published In

HRI '10: Proceedings of the 5th ACM/IEEE international conference on Human-robot interaction
March 2010
400 pages
ISBN:9781424448937

Publisher

IEEE Press


Author Tags

  1. direction understanding
  2. route instructions
  3. spatial language

Qualifiers

  • Research-article

Conference

HRI '10

Acceptance Rates

HRI '10 Paper Acceptance Rate 26 of 124 submissions, 21%;
Overall Acceptance Rate 268 of 1,124 submissions, 24%


