DOI: 10.1145/1449715.1449720
Research article

Creating map-based storyboards for browsing tour videos

Published: 19 October 2008

Abstract

Watching a long unedited video is usually a boring experience. In this paper we examine a particular subset of videos, tour videos, in which the video is captured by walking about with a running camera with the goal of conveying the essence of some place. We present a system that makes the process of sharing and watching a long tour video easier, less boring, and more informative. To achieve this, we augment the tour video with a map-based storyboard, where the tour path is reconstructed, and coherent shots at different locations are directly visualized on the map. This allows the viewer to navigate the video in the joint location-time space. To create such a storyboard we employ an automatic pre-processing component to parse the video into coherent shots, and an authoring tool to enable the user to tie the shots to landmarks on the map. The browser-based viewing tool allows users to navigate the video in a variety of creative modes with a rich set of controls, giving each viewer a unique, personal viewing experience. Informal evaluation shows that our approach works well for tour videos compared with conventional media players.
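
As a rough illustration of what navigating "in the joint location-time space" could look like, the sketch below models a storyboard as a list of shots, each tied to a landmark position on the map. It is not the authors' implementation: the names `Shot` and `Storyboard`, the fields, the map coordinates, and the radius-based lookup are all assumptions made for this example. It only shows that the same shot list can be queried either by playback time or by map location.

```python
from bisect import bisect_right
from dataclasses import dataclass
from math import hypot


@dataclass(frozen=True)
class Shot:
    """One coherent shot tied to a landmark (all fields are hypothetical)."""
    start: float    # seconds into the video (inclusive)
    end: float      # seconds into the video (exclusive)
    landmark: str   # label assigned in an authoring step
    x: float        # landmark position on the map, arbitrary map units
    y: float


class Storyboard:
    """Shots ordered by time, queryable by time or by map location."""

    def __init__(self, shots):
        self.shots = sorted(shots, key=lambda s: s.start)
        self._starts = [s.start for s in self.shots]

    def shot_at_time(self, t):
        """Return the shot playing at time t, or None if t is outside all shots."""
        i = bisect_right(self._starts, t) - 1
        if i >= 0 and t < self.shots[i].end:
            return self.shots[i]
        return None

    def shots_near(self, x, y, radius):
        """Return shots whose landmark lies within radius of (x, y), nearest first."""
        def dist(s):
            return hypot(s.x - x, s.y - y)
        return sorted((s for s in self.shots if dist(s) <= radius), key=dist)

    def seek_time_for_location(self, x, y, radius):
        """Map a click on the map to a playback time, or None if nothing is near."""
        hits = self.shots_near(x, y, radius)
        return hits[0].start if hits else None


# Made-up example data: three shots along a short tour.
board = Storyboard([
    Shot(0.0, 42.0, "Main gate", 10.0, 5.0),
    Shot(42.0, 95.5, "Fountain", 14.0, 9.0),
    Shot(95.5, 180.0, "Clock tower", 30.0, 22.0),
])
```

With this structure, a timeline scrub would call `board.shot_at_time(t)` to highlight the matching landmark on the map, while a click on the map would call `board.seek_time_for_location(x, y, radius)` to jump the player to the nearest shot.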


    Published In

    UIST '08: Proceedings of the 21st annual ACM symposium on User interface software and technology
    October 2008
    308 pages
    ISBN:9781595939753
    DOI:10.1145/1449715
    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. map
    2. storyboard
    3. video browsing
    4. video summarization

    Acceptance Rates

    Overall Acceptance Rate 561 of 2,567 submissions, 22%
