research-article

Time slice video synthesis by robust video alignment

Authors:

Jue WangAuthors Info & Claims

ACM Transactions on Graphics (TOG), Volume 36, Issue 4

Article No.: 131, Pages 1 - 10

https://doi.org/10.1145/3072959.3073612

Published: 20 July 2017 Publication History

Abstract

Time slice photography is a popular effect that visualizes the passing of time by aligning and stitching multiple images capturing the same scene at different times together into a single image. Extending this effect to video is a difficult problem, and one where existing solutions have only had limited success. In this paper, we propose an easy-to-use and robust system for creating time slice videos from a wide variety of consumer videos. The main technical challenge we address is how to align videos taken at different times with substantially different appearances, in the presence of moving objects and moving cameras with slightly different trajectories. To achieve a temporally stable alignment, we perform a mixed 2D-3D alignment, where a rough 3D reconstruction is used to generate sparse constraints that are integrated into a pixelwise 2D registration. We apply our method to a number of challenging scenarios, and show that we can achieve a higher quality registration than prior work. We propose a 3D user interface that allows the user to easily specify how multiple videos should be composited in space and time. Finally, we show that our alignment method can be applied in more general video editing and compositing tasks, such as object removal.

Supplementary Material

ZIP File (a131-cui.zip)

Supplemental files.

Download
624.12 MB

MP4 File (papers-0150.mp4)

Download
433.48 MB

References

[1]

Robert Anderson, David Gallup, Jonathan T Barron, Janne Kontkanen, Noah Snavely, Carlos Hernández, Sameer Agarwal, and Steven M Seitz. 2016. Jump: virtual reality video. ACM Transactions on Graphics (TOG) 35, 6 (2016), 198.

Digital Library

[2]

Peter J Burt and Edward H Adelson. 1983. A multiresolution spline with application to image mosaics. ACM Transactions on Graphics (TOG) 2, 4 (1983), 217--236.

Digital Library

[3]

Zhaopeng Cui and Ping Tan. 2015. Global Structure-from-Motion by Similarity Averaging. In Proceedings of the IEEE International Conference on Computer Vision. 864--872.

Digital Library

[4]

Ido Freeman, Patrick Wieschollek, and Hendrik Lensch. 2016. Robust Video Synchronization using Unsupervised Deep Learning. arXiv preprint arXiv:1610.05985 (2016).

[5]

Heng Guo, Shuaicheng Liu, Tong He, Shuyuan Zhu, Bing Zeng, and Moncef Gabbouj. 2016. Joint Video Stitching and Stabilization From Moving Cameras. IEEE Transactions on Image Processing 25, 11 (2016), 5491.

[6]

Berthold KP Horn and Brian G Schunck. 1981. Determining optical flow. Artificial intelligence 17, 1--3 (1981), 185--203.

[7]

Felix Klose, Oliver Wang, Jean-Charles Bazin, Marcus Magnor, and Alexander Sorkine-Hornung. 2015. Sampling based scene-space video processing. ACM Transactions on Graphics (TOG) 34, 4 (2015), 67.

Digital Library

[8]

Johannes Kopf, Michael F Cohen, and Richard Szeliski. 2014. First-person hyper-lapse videos. ACM Transactions on Graphics (TOG) 33, 4 (2014), 78.

Digital Library

[9]

Till Kroeger, Radu Timofte, Dengxin Dai, and Luc Van Gool. 2016. Fast Optical Flow using Dense Inverse Search. In European Conference on Computer Vision. Springer.

[10]

Pierre-Yves Laffont, Zhile Ren, Xiaofeng Tao, Chao Qian, and James Hays. 2014. Transient attributes for high-level understanding and editing of outdoor scenes. ACM Transactions on Graphics (TOG) 33, 4 (2014), 149.

Digital Library

[11]

Jungjin Lee, Bumki Kim, Kyehyun Kim, Younghui Kim, and Junyong Noh. 2016. Rich360: optimized spherical representation from structured panoramic camera arrays. ACM Transactions on Graphics (TOG) 35, 4 (2016), 63.

Digital Library

[12]

Wenbin Li, Fabio Viola, Jonathan Starck, Gabriel J. Brostow, and Neill D.F. Campbell. 2016. Roto++: Accelerating Professional Rotoscoping using Shape Manifolds. ACM Transactions on Graphics (In proceeding of ACM SIGGRAPH'16) 35, 4 (2016).

[13]

Kaimo Lin, Shuaicheng Liu, Loong-Fah Cheong, and Bing Zeng. 2016. Seamless Video Stitching from Hand-held Camera Inputs. In Computer Graphics Forum, Vol. 35. Wiley Online Library, 479--487.

Digital Library

[14]

Ce Liu, Jenny Yuen, and Antonio Torralba. 2011. Sift flow: Dense correspondence across scenes and its applications. IEEE transactions on pattern analysis and machine intelligence 33, 5 (2011), 978--994.

Digital Library

[15]

Feng Liu, Michael Gleicher, Hailin Jin, and Aseem Agarwala. 2009. Content-preserving warps for 3D video stabilization. ACM Transactions on Graphics (TOG) 28, 3 (2009), 44.

Digital Library

[16]

Shuaicheng Liu, Lu Yuan, Ping Tan, and Jian Sun. 2013. Bundled camera paths for video stabilization. ACM Transactions on Graphics (TOG) 32, 4 (2013), 78.

Digital Library

[17]

David G Lowe. 1999. Object recognition from local scale-invariant features. In Computer vision, 1999. The proceedings of the seventh IEEE international conference on, Vol. 2. Ieee, 1150--1157.

Digital Library

[18]

Nicolas Märki, Federico Perazzi, Oliver Wang, and Alexander Sorkine-Hornung. 2016. Bilateral space video segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 743--751.

[19]

Meinard Müller. 2007. Information retrieval for music and motion. Vol. 2. Springer.

[20]

Federico Perazzi, Alexander Sorkine-Hornung, Henning Zimmer, Peter Kaufmann, Oliver Wang, S. Watson, and Markus H. Gross. 2015. Panoramic Video from Unstructured Camera Arrays. Comput. Graph. Forum 34, 2 (2015), 57--68.

Digital Library

[21]

Yael Pritch, Alex Rav-Acha, and Shmuel Peleg. 2008. Nonchronological video synopsis and indexing. IEEE Transactions on Pattern Analysis and Machine Intelligence 30, 11 (2008), 1971--1984.

Digital Library

[22]

Carsten Rother, Vladimir Kolmogorov, and Andrew Blake. 2004. Grabcut: Interactive foreground extraction using iterated graph cuts. In ACM transactions on graphics (TOG), Vol. 23. ACM, 309--314.

Digital Library

[23]

Jan Rüegg, Oliver Wang, Aljoscha Smolic, and Markus Gross. 2013. Ducttake: Spatiotemporal video compositing. In Computer Graphics Forum, Vol. 32. Wiley Online Library, 51--61.

[24]

Peter Sand and Seth Teller. 2004. Video matching. ACM Transactions on Graphics (TOG) 23, 3 (2004), 592--599.

Digital Library

[25]

Yichang Shih, Sylvain Paris, Frédo Durand, and William T Freeman. 2013. Data-driven hallucination of different times of day from a single outdoor photo. ACM Transactions on Graphics (TOG) 32, 6 (2013), 200.

Digital Library

[26]

Oliver Wang, Christopher Schroers, Henning Zimmer, Markus Gross, and Alexander Sorkine-Hornung. 2014. Videosnapping: Interactive synchronization of multiple videos. ACM Transactions on Graphics (TOG) 33, 4 (2014), 77.

Digital Library

[27]

Guofeng Zhang, Zilong Dong, Jiaya Jia, Liang Wan, Tien-Tsin Wong, and Hujun Bao. 2009. Refilming with depth-inferred videos. IEEE Transactions on Visualization and Computer Graphics 15, 5 (2009), 828--840.

Digital Library

[28]

Fan Zhong, Song Yang, Xueying Qin, Dani Lischinski, Daniel Cohen-Or, and Baoquan Chen. 2014. Slippage-free background replacement for hand-held video. ACM Transactions on Graphics (TOG) 33, 6 (2014), 199.

Digital Library

[29]

Danping Zou and Ping Tan. 2013. Coslam: Collaborative visual slam in dynamic environments. IEEE transactions on pattern analysis and machine intelligence 35, 2 (2013), 354--366.

Digital Library

Cited By

Bianchi ESakib NWoolsey CHebdon M(2022)Bridge inspection component registration for damage evolutionStructural Health Monitoring10.1177/1475921722108364722:1(472-495)Online publication date: 19-May-2022
https://doi.org/10.1177/14759217221083647
Evans M(2022)Nonprofit Journals Publication Patterns: Visibility or Invisibility of Gender?VOLUNTAS: International Journal of Voluntary and Nonprofit Organizations10.1007/s11266-022-00470-xOnline publication date: 16-Mar-2022
https://doi.org/10.1007/s11266-022-00470-x
Wehrwein SBala KSnavely N(2021)Scene Summarization via Motion NormalizationIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2020.299319527:4(2495-2501)Online publication date: 1-Apr-2021
https://doi.org/10.1109/TVCG.2020.2993195
Show More Cited By

Index Terms

Time slice video synthesis by robust video alignment
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Image and video acquisition
        Computational photography
  2. Computer graphics
    1. Image manipulation

Recommendations

Efficient video sequences alignment using unbiased bidirectional dynamic time warping

In this paper, we propose an efficient technique to synchronize video sequences of events that are acquired via uncalibrated cameras at unknown and dynamically varying temporal offsets. Unlike other existing techniques that just take unidirectional ...
Video-based interactive storytelling using real-time video compositing techniques

Interactive storytelling systems usually adopt computer graphics to represent virtual story worlds, which facilitates the dynamic generation of visual content. However, the quality of the images and motion produced by these systems is still inferior ...
Video puppetry: a performative interface for cutout animation
SIGGRAPH Asia '08: ACM SIGGRAPH Asia 2008 papers

We present a video-based interface that allows users of all skill levels to quickly create cutout-style animations by performing the character motions. The puppeteer first creates a cast of physical puppets using paper, markers and scissors. He then ...

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Graphics

ACM Transactions on Graphics Volume 36, Issue 4

August 2017

2155 pages

ISSN:0730-0301

EISSN:1557-7368

DOI:10.1145/3072959

Issue’s Table of Contents

Copyright © 2017 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 20 July 2017

Published in TOG Volume 36, Issue 4

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

Canada NSERC Discovery
Discovery Accelerator Supplement
Adobe

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

8
Total Citations
View Citations
615
Total Downloads

Downloads (Last 12 months)19
Downloads (Last 6 weeks)2

Reflects downloads up to 01 Mar 2025

Other Metrics

View Author Metrics

Citations

Cited By

Bianchi ESakib NWoolsey CHebdon M(2022)Bridge inspection component registration for damage evolutionStructural Health Monitoring10.1177/1475921722108364722:1(472-495)Online publication date: 19-May-2022
https://doi.org/10.1177/14759217221083647
Evans M(2022)Nonprofit Journals Publication Patterns: Visibility or Invisibility of Gender?VOLUNTAS: International Journal of Voluntary and Nonprofit Organizations10.1007/s11266-022-00470-xOnline publication date: 16-Mar-2022
https://doi.org/10.1007/s11266-022-00470-x
Wehrwein SBala KSnavely N(2021)Scene Summarization via Motion NormalizationIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2020.299319527:4(2495-2501)Online publication date: 1-Apr-2021
https://doi.org/10.1109/TVCG.2020.2993195
Wang MFang XYang GShamir AHu S(2021)Prominent Structures for Video Analysis and EditingIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2020.297004527:7(3305-3317)Online publication date: 1-Jul-2021
https://doi.org/10.1109/TVCG.2020.2970045
Li RLiu SLiu GSun TGuo J(2020)Multi-exposure photomontage with hand-held camerasComputer Vision and Image Understanding10.1016/j.cviu.2020.102929(102929)Online publication date: Feb-2020
https://doi.org/10.1016/j.cviu.2020.102929
Zhang FBarnes CZhang HZhao JSalas G(2020)Coherent video generation for multiple hand-held cameras with dynamic foregroundComputational Visual Media10.1007/s41095-020-0187-36:3(291-306)Online publication date: 3-Sep-2020
https://doi.org/10.1007/s41095-020-0187-3
Chen KYao JXiang BTu J(2018)Video Stitching with Extended-MeshFlow2018 24th International Conference on Pattern Recognition (ICPR)10.1109/ICPR.2018.8545663(3049-3054)Online publication date: Aug-2018
https://doi.org/10.1109/ICPR.2018.8545663
Li RHe XLiu SLiu GZeng B(2018)Photomontage for Robust HDR Imaging with Hand-Held Cameras2018 25th IEEE International Conference on Image Processing (ICIP)10.1109/ICIP.2018.8451138(1708-1712)Online publication date: Oct-2018
https://doi.org/10.1109/ICIP.2018.8451138

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Issue’s Table of Contents