DOI: 10.1145/1459359.1459419
research-article

Evaluation of video browser features and user interaction with VAST MM

Published: 26 October 2008

Abstract

In this paper, we present extensive user studies on browsing and information retrieval in the domain of unstructured videos using the VAST MM (Video Audio Structure Text MultiMedia) video library browser. Our studies were performed over a 3-year period with more than 1,000 participants in a university setting. The majority of students use the video library to retrieve student presentations in a large engineering design course. Through iterative analysis of context-specific audio, visual, and textual cues, we are able to measure significant improvements on typical retrieval tasks, such as searching for unfamiliar content in a large database with over 300 hours of video. We also present user studies conducted in two videotaped core computer science courses to measure the usefulness of the VAST MM resource for final exam preparation. We find that students who use the lecture video library experience significant improvement in final exam scores.
To better compare video browsers featuring rich content cues with standard video players without cues, we performed a large experiment to collect measurable data on search tasks. In general, when index cues are absent, the time required to find matching video content is inversely related to the amount of such content. When index cues are available, the relationship is constant; that is, rare content is found in the same time as common content. We evaluate these data and provide additional insight into two common user interaction techniques: audio-visual browsing and visual-only browsing. We show that user preference is uniform, but that audio-visual browsing is significantly more effective for search and retrieval of video data.
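The contrast between search with and without index cues can be illustrated with a toy model. The sketch below is not the authors' experiment or analysis. It assumes, purely for illustration, that a user without cues probes video segments uniformly at random, so the number of probes until the first match follows a geometric distribution with mean 1/p, where p is the fraction of matching content. It further assumes that a user with index cues pays a fixed lookup cost regardless of p. All function names and parameters are hypothetical.

```python
import random


def expected_probes_without_cues(match_fraction: float) -> float:
    """Expected probes until the first match under random probing.

    Assumption (not from the paper): each probe hits matching content
    independently with probability `match_fraction`, so the expected
    number of probes is the geometric mean 1 / match_fraction.
    """
    if not 0.0 < match_fraction <= 1.0:
        raise ValueError("match_fraction must be in (0, 1]")
    return 1.0 / match_fraction


def simulate_probes_without_cues(match_fraction: float,
                                 trials: int = 20000,
                                 seed: int = 0) -> float:
    """Monte Carlo estimate of the same quantity, as a sanity check."""
    rng = random.Random(seed)
    total = 0
    for _ in range(trials):
        probes = 1
        # Keep probing until a randomly chosen segment matches.
        while rng.random() >= match_fraction:
            probes += 1
        total += probes
    return total / trials


def probes_with_cues(match_fraction: float, lookup_cost: float = 1.0) -> float:
    """Hypothetical constant cost when an index points at the content."""
    return lookup_cost


if __name__ == "__main__":
    for p in (0.5, 0.1, 0.02):
        print(p, expected_probes_without_cues(p), probes_with_cues(p))
```

Under these assumptions, halving the amount of matching content doubles the expected search effort without cues, while the indexed cost stays flat. The measured search times reported in the paper are not modeled here.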

Cited By

  • (2012) Towards a Video Browser for the Digital Native. Proceedings of the 2012 IEEE International Conference on Multimedia and Expo Workshops, 127-132. DOI: 10.1109/ICMEW.2012.29. Online publication date: 9-Jul-2012
  • (2010) Toward more efficient user interfaces for mobile video browsing. Proceedings of the 18th ACM international conference on Multimedia, 341-350. DOI: 10.1145/1873951.1873999. Online publication date: 25-Oct-2010
  • (2009) Are Visual Informatics Actually Useful in Practice. Proceedings of the 1st International Visual Informatics Conference on Visual Informatics: Bridging Research and Practice, 811-821. DOI: 10.1007/978-3-642-05036-7_77. Online publication date: 15-Nov-2009

Published In

MM '08: Proceedings of the 16th ACM international conference on Multimedia
October 2008
1206 pages
ISBN:9781605583037
DOI:10.1145/1459359
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. automatic speech recognition
  2. evaluation
  3. measures
  4. presentation video
  5. speaker index
  6. speaker segmentation
  7. streaming video
  8. structure in videos
  9. text augmentation
  10. transcript analysis
  11. user studies
  12. video library
  13. visual segmentation

Qualifiers

  • Research-article

Conference

MM08: ACM Multimedia Conference 2008
October 26 - 31, 2008
Vancouver, British Columbia, Canada

Acceptance Rates

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

