ABSTRACT
Due to the rapid increase in video capture technology, more and more tourist videos are captured every day, creating a challenge for organization and association with metadata. In this paper, we present a novel system for annotating and navigating tourist videos. Placing annotations in a video is difficult because of the need to track the movement of the camera. Navigation of a regular video is also challenging due to the sequential nature of the media. To overcome these challenges, we introduce a system for registering videos to geo-referenced 3D models and analyzing the video contents. We also introduce a novel scheduling algorithm for showing annotations in video. We show results in automatically annotated videos and in a map-based application for browsing videos. Our user study indicates the system is very useful.
- S. A. Ay, R. Zimmermann, and S. H. Kim. Viewable scene modeling for geospatial video search. ACM International Conference on Multimedia, pages 309--318, 2008. Google ScholarDigital Library
- R. E. Bellman. Dynamic programming. Princeton University Press, 1957.Google ScholarDigital Library
- B. Chen, G. Ramos, E. Ofek, M. Cohen, S. Drucker, and D. Nister. Interactive techniques for registering images to digital terrain and building models. Technical report, Microsoft Research, 2008.Google Scholar
- L. Cheong and H. Huo. Shot change detection using scenebased constraint. Multimedia Tools and Applications, 2001. Google ScholarDigital Library
- P. L. Cho. 3D organization of 2D urban imagery. Applied Image Pattern Recognition Workshop, 0:3--8, 2007. Google ScholarDigital Library
- B. Epshtein, E. Ofek, Y. Wexler, and P. Zhang. Hierarchical photo organization using geo-relevance. In ACM International Symposium on Advances in Geographic Information Systems, pages 1--7, New York, NY, USA, 2007. ACM. Google ScholarDigital Library
- Google. Google Maps. http://maps.google.com.Google Scholar
- W. Heng and K. Ngan. An object-based shot boundary detection using edge tracing and tracking. Journal of Visual Communication and Image Representation, 2001.Google ScholarDigital Library
- J. Kopf, B. Neubert, B. Chen, M. Cohen, D. Cohen-Or, O. Deussen, M. Uyttendaele, and D. Lischinski. Deep photo: Model-based photograph enhancement and viewing. ACM Trans. on Graphics (Proceedings of SIGGRAPH Asia 2008), 2008. Google ScholarDigital Library
- M. Kroepfl, Y. Wexler, and E. Ofek. Efficiently locating photographs in many panoramas. submitted to ACM GIS 2010, 2010. Google ScholarDigital Library
- D. G. Lowe. Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vision, 60(2):91--110, 2004. Google ScholarDigital Library
- Q. Luan, S. M. Drucker, J. Kopf, Y.-Q. Xu, and M. F. Cohen. Annotating gigapixel images. In ACM Symposium on User Interface Software and Technology, 2008. Google ScholarDigital Library
- Microsoft. Bing Maps. http://maps.bing.com/.Google Scholar
- H. Nicolas, A. Manury, J. Benois-Pineau, W. Dupuy, and D. Barba. Grouping video shots into scenes based on 1D mosaic descriptors. Proc. Intl. Conf. on Image Proc., 2004.Google ScholarCross Ref
- S. Pongnumkul, J. Wang, and M. Cohen. Creating map-based storyboards for browsing tour videos. In ACM symposium on User Interface Software and Technology, pages 13--22, New York, NY, USA, 2008. ACM. Google ScholarDigital Library
- T. Rattenbury and M. Naaman. Methods for extracting place semantics from Flickr tags. ACM Trans. on Web, 3(1):1--30, 2009. Google ScholarDigital Library
- I. Simon, N. Snavely, and S. M. Seitz. Scene summarization for online image collections. IEEE International Conference on Computer Vision, 2007.Google ScholarCross Ref
- N. Snavely, S. M. Seitz, and R. Szeliski. Photo tourism: exploring photo collections in 3D. In ACM SIGGRAPH 2006 Papers, pages 835--846, New York, NY, USA, 2006. Google ScholarDigital Library
- K. Toyama, R. Logan, and A. Roseway. Geographic location tags on digital images. ACM International Conference on Multimedia, 2003. Google ScholarDigital Library
Index Terms
- Annotating and navigating tourist videos
Recommendations
Annotating Objects and Relations in User-Generated Videos
ICMR '19: Proceedings of the 2019 on International Conference on Multimedia RetrievalUnderstanding the objects and relations between them is indispensable to fine-grained video content analysis, which is widely studied in recent research works in multimedia and computer vision. However, existing works are limited to evaluating with ...
A Video Annotation Tool Using Vision-based AR Technology
CW '12: Proceedings of the 2012 International Conference on CyberworldsIn this paper, we present a video annotation tool using vision-based Augmented Reality (AR) technology. We apply AR technology and computer vision method for making videos with 3D annotations such as image textures, video clips, 3D objects and 3D text. ...
Using temporal video annotation as a navigational aid for video browsing
UIST '10: Adjunct proceedings of the 23nd annual ACM symposium on User interface software and technologyVideo is a complex information space that requires advanced navigational aids for effective browsing. The increasing number of temporal video annotations offers new opportunities to provide video navigation according to a user's needs. We present a ...
Comments