review-article

Multimedia with a speech track: searching spontaneous conversational speech

Authors:
Martha Larson

Delft University of Technology, Netherlands

Delft University of Technology, Netherlands
View Profile

,
Roeland Ordelman

University of Twente, Netherlands

University of Twente, Netherlands
View Profile

,
Franciska de Jong

University of Twente, Netherlands

University of Twente, Netherlands
View Profile

,
Joachim Kohler

Fraunhofer IAIS, Germany

Fraunhofer IAIS, Germany
View Profile

,
Wessel Kraaij

Radboud University Nijmegen, TNO ICT, Netherlands

Radboud University Nijmegen, TNO ICT, Netherlands
View Profile

Authors Info & Claims

ACM SIGIR Forum Volume 44 Issue 1June 2010pp 76–81https://doi.org/10.1145/1842890.1842901

Published:18 August 2010Publication History

ACM SIGIR Forum

Abstract

After two successful years at SIGIR in 2007 and 2008, the third workshop on Searching Spontaneous Conversational Speech (SSCS 2009) was held conjunction with the ACM Multimedia 2009. The goal of the SSCS series is to serve as a forum that brings together the disciplines that collaborate on spoken content retrieval, including information retrieval, speech recognition and multimedia analysis. Multimedia collections often contain a speech track, but in many cases it is ignored or not fully exploited for information retrieval. Currently, spoken content retrieval research is expanding beyond highly-conventionalized domains such as broadcast news in to domains involving speech that is produced spontaneously and in conversational settings. Such speech is characterized by wide variability of speaking styles, subject matter and recording conditions. The work presented at SSCS 2009 included techniques for searching meetings, interviews, telephone conversations, podcasts and spoken annotations. The work encompassed a large range of approaches including using subword units, exploiting dialogue structure, fusing retrieval models, modeling topics and integrating visual features. Taken in sum, the workshop demonstrated the high potential of new ideas emerging in the area of speech search and also reinforced the need for concentrated research devoted to the classic challenges of spoken content retrieval, many of which remain yet unsolved.

References

F.M.G. de Jong, D. Oard, R. Ordelman, and S. Raaijmakers. Searching spontaneous conversational speech. SIGIR Forum, 41(2):104--108, 2007. Google ScholarDigital Library
G. Friedland, L. Gottlieb, and A. Janin. Joke-o-mat: Browsing sitcoms punchline by punchline. In MM '09: Proceedings of the 17th ACM international conference on Multimedia, pages 1115--1116, New York, NY, USA, 2009. ACM. Google ScholarDigital Library
W. Heeren, L. van der Werff, R. Ordelman, A. van Hessen, and F. de Jong. Radio Oranje: Searching the Queen's speech(es). In SIGIR '07: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval, pages 903--903, New York, NY, USA, 2007. ACM. Google ScholarDigital Library
J. Kohler, M. Larson, F.M.G. de Jong, W. Kraaij, and R.J.F. Ordelman. Spoken content retrieval: Searching spontaneous conversational speech. SIGIR Forum, 42(2):66--75, 2008. Google ScholarDigital Library
M. Larson, R.J.F. Ordelman, F.M.G. de Jong, W. Kraaij, and J. Kohler. Searching multimedia content with a spontaneous conversational speech track. In MM '09: Proceedings of the 17th ACM international conference on Multimedia, pages 1159--1160, New York, NY, USA, 2009. ACM. Google ScholarDigital Library
S. Rudinac, M. Larson, and A. Hanjalic. Exploiting visual reranking to improve pseudorelevance feedback for spoken-content-based video retrieval. In Image Analysis for Multimedia Interactive Services, 2009. WIAMIS '09. 10th Workshop on, pages 17--20, 2009.Google ScholarCross Ref
F. Seide, K. Thambiratnam, L. Lu, and R. P. Yu. Multimedia retrieval through indexing speech: An enterprise perspective. In SSCS '09: Proceedings of the third workshop on Searching spontaneous conversational speech, pages 1--2, New York, NY, USA, 2009. ACM. Google ScholarDigital Library

Index Terms

Multimedia with a speech track: searching spontaneous conversational speech
1. Human-centered computing
  1. Human computer interaction (HCI)
2. Information systems
  1. Information retrieval

Index terms have been assigned to the content through auto-classification.

Recommendations

Improving Acoustic Models with Captioned Multimedia Speech
ICMCS '99: Proceedings of the IEEE International Conference on Multimedia Computing and Systems - Volume 2

Speech recognition can be used to create searchable transcripts for audio indexing in digital video libraries. Large amounts of hand-transcribed speech training data are required to build or improve acoustic models of highly accurate speech recognition ...
Read More
Multimedia content with a speech track: ACM multimedia 2010 workshop on searching spontaneous conversational speech
MM '10: Proceedings of the 18th ACM international conference on Multimedia
Read More
Improving Acoustic Models with Captioned Multimedia Speech
ICMCS '99: Proceedings of the 1999 IEEE International Conference on Multimedia Computing and Systems - Volume 02

Speech recognition can be used to create searchable transcripts for audio indexing in digital video libraries. Large amounts of hand-transcribed speech training data are required to build or improve acoustic models of highly accurate speech recognition ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in

ACM SIGIR Forum Volume 44, Issue 1
June 2010
88 pages
ISSN:0163-5840
DOI:10.1145/1842890
Issue’s Table of Contents

Copyright © 2010 Authors
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 18 August 2010
Check for updates
Qualifiers
- review-article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 1
  Total Citations
  View Citations
- 108
  Total Downloads
- Downloads (Last 12 months)0
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Multimedia with a speech track: searching spontaneous conversational speech

ACM SIGIR Forum

Abstract

References

Cited By

Index Terms

Recommendations

Improving Acoustic Models with Captioned Multimedia Speech

Multimedia content with a speech track: ACM multimedia 2010 workshop on searching spontaneous conversational speech

Improving Acoustic Models with Captioned Multimedia Speech

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Check for updates

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media