skip to main content
10.1145/1133265.1133338acmotherconferencesArticle/Chapter ViewAbstractPublication PagesaviConference Proceedingsconference-collections
Article

The prospects for unrestricted speech input for TV content search

Published: 23 May 2006 Publication History

Abstract

The need for effective search for television content is growing as the number of choices for TV viewing and/or recording explodes. In this paper we describe a preliminary prototype of a multimodal Speech-In List-Out (SILO) interface in which users' input is unrestricted by vocabulary or grammar. We report on usability testing with a sample of six users. The prototype enables search through video content metadata downloaded from an electronic program guide (EPG) service. Our setup for testing included adding a microphone to a TV remote control and running an application on a PC whose visual interface was displayed on a TV.

References

[1]
Berglund, A., and Qvarfordt, P. Error Resolution Strategies for Interactive Television Speech Interfaces. In Proceedings of International Conference on Human-Computer Interaction (INTERACT '03) (Zurich, Switzerland, September 1--5, 2003). IFIP, Amsterdam, 2003, 105--112.
[2]
CMUSphinx: The Carnegie Mellon Sphinx Project. http://cmusphinx.sourceforge.net/html/cmusphinx.php.
[3]
Divi, V., Forlines, C., van Gemert, J. V., Raj, B., Schmidt-Nielsen, B., Wittenburg, K., Woelfel, J., Wolf, P.; and Zhang, F. A Speech-In List-Out Approach to Spoken User Interfaces. In Proceedings of Human Language Technology Conference (HLT 2004) (Boston, Massachusetts May 2--7, 2004). Association for Computational Linguistics, 2004, 113--116.
[4]
Forlines, C., Schmidt-Nielsen, B., Raj, B., Wittenburg, K., and Wolf, P. A Comparison between Spoken Queries and Menu-based Interfaces for In-Car Digital Music Selection. In Proceedings of International Conference on Human-Computer Interaction (INTERACT '05) (Rome, Italy, September 12--16, 2005). IFIP, Amsterdam, 2005, 536--549.
[5]
Ibrahim A., Lundberg J. and Johansson J. Speech Enhanced Remote Control for Media Terminal. In Proceedings of Euro-speech '01 (Aalborg, Denmark, September 2001). International Speech Communcation Association, Bonn, Germany, 2001, Volume 4, 2685--2688.
[6]
Johansson, P. MADFILM--A Multimodal Approach to Handle Search and Organization in a Movie Recommendation System. In Proceedings of the 1st Nordic Symposium on Multimodal Communication (Helsingör, Denmark, September 25--26, 2003). Nordic Network For Multimodal Interfaces, 3003, 53--65.
[7]
Nielsen, J. Usability Engineering. Morgan Kaufmann, 1st edition, 1994.
[8]
O' Sullivan, D., Smyth, B., and Winson, D. Improving the Quality of the Personalized Electronic Program Guide. User Modeling and User-Adapted Interaction, 14, 1 (2004), 4--36.
[9]
Stone, B. I Want a Movie! Now. Newsweek Magazine, Sept. 13, 2005.
[10]
Wahlster, W. SmartKom: Symmetric Multimodality in an Adaptive and Reusable Dialogue Shell. In Krahl, R., Guenther, D. (eds), Proceedings of the Human Computer Interaction Status Conference 2003 (Berlin, Germain, June 2003). DLR, 2003, 47--62.
[11]
Welcome to Promptu. http://www.promptu.com.
[12]
Wolf, P., and Raj, B. The MERL SpokenQuery Information Retrieval System: A System for Retrieving Pertinent Documents from a Spoken Query. In Proceedings of IEEE International Conference on Multimedia and Expo (ICME)(Lusanne, Switzerland, August 26--29, 2002). IEEE, 2002, Vol. 2, 317--320.
[13]
Wolf, P., Woelfel, J., van Gemert, J., Raj, B., and Wong, D. SpokenQuery: An Alternate Approach to Choosing Items with Speech. In Proceedings of International Conference on Speech and Language Processing (ICSLP) (Jeju Island, South Korea, October 4--8, 2004). ISCA, 2004, 221--224.

Cited By

View all
  • (2013)Designing natural speech interactions for the living roomCHI '13 Extended Abstracts on Human Factors in Computing Systems10.1145/2468356.2468574(1215-1220)Online publication date: 27-Apr-2013
  • (2011)Using paper and pen to control home-ITProceedings of the 9th European Conference on Interactive TV and Video10.1145/2000119.2000162(203-212)Online publication date: 29-Jun-2011
  • (2011)A multimodal interaction component for digital televisionProceedings of the 2011 ACM Symposium on Applied Computing10.1145/1982185.1982459(1253-1258)Online publication date: 21-Mar-2011
  • Show More Cited By

Index Terms

  1. The prospects for unrestricted speech input for TV content search

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    AVI '06: Proceedings of the working conference on Advanced visual interfaces
    May 2006
    512 pages
    ISBN:1595933530
    DOI:10.1145/1133265
    • General Chair:
    • Augusto Celentano
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 23 May 2006

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. electronic program guides
    2. information retrieval
    3. multi-modal interfaces
    4. speech interfaces
    5. television interfaces

    Qualifiers

    • Article

    Conference

    AVI06

    Acceptance Rates

    Overall Acceptance Rate 128 of 490 submissions, 26%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)2
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 15 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2013)Designing natural speech interactions for the living roomCHI '13 Extended Abstracts on Human Factors in Computing Systems10.1145/2468356.2468574(1215-1220)Online publication date: 27-Apr-2013
    • (2011)Using paper and pen to control home-ITProceedings of the 9th European Conference on Interactive TV and Video10.1145/2000119.2000162(203-212)Online publication date: 29-Jun-2011
    • (2011)A multimodal interaction component for digital televisionProceedings of the 2011 ACM Symposium on Applied Computing10.1145/1982185.1982459(1253-1258)Online publication date: 21-Mar-2011
    • (2011)Constructing n-gram rules for natural language models through exploring the limitation of the Zipf–Mandelbrot lawComputing10.1007/s00607-010-0116-x91:3(241-264)Online publication date: 1-Mar-2011
    • (2010)Conceptual modeling of online entertainment programming guide for natural language interfaceProceedings of the Natural language processing and information systems, and 15th international conference on Applications of natural language to information systems10.5555/1894525.1894551(188-195)Online publication date: 23-Jun-2010
    • (2010)"TV answers" - using the wisdom of crowds to facilitate searches with rich media contextProceedings of the 7th IEEE conference on Consumer communications and networking conference10.5555/1834217.1834385(749-753)Online publication date: 9-Jan-2010
    • (2010)Accessible Multimodal Media Center Application for Blind and Partially Sighted PeopleComputers in Entertainment10.1145/1902593.19025958:3(1-30)Online publication date: 1-Dec-2010
    • (2010)An approach based on multiple text input modes for interactive digital TV applicationsProceedings of the 28th ACM International Conference on Design of Communication10.1145/1878450.1878483(191-198)Online publication date: 27-Sep-2010
    • (2010)Natural language-based user interface for mobile devices with limited resourcesIEEE Transactions on Consumer Electronics10.1109/TCE.2010.568107656:4(2086-2092)Online publication date: 1-Nov-2010
    • (2010)Adjustable interactive rings for iDTVIEEE Transactions on Consumer Electronics10.1109/TCE.2010.560635656:3(1988-1996)Online publication date: 1-Aug-2010
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media