skip to main content
research-article

Time slice video synthesis by robust video alignment

Published: 20 July 2017 Publication History

Abstract

Time slice photography is a popular effect that visualizes the passing of time by aligning and stitching multiple images capturing the same scene at different times together into a single image. Extending this effect to video is a difficult problem, and one where existing solutions have only had limited success. In this paper, we propose an easy-to-use and robust system for creating time slice videos from a wide variety of consumer videos. The main technical challenge we address is how to align videos taken at different times with substantially different appearances, in the presence of moving objects and moving cameras with slightly different trajectories. To achieve a temporally stable alignment, we perform a mixed 2D-3D alignment, where a rough 3D reconstruction is used to generate sparse constraints that are integrated into a pixelwise 2D registration. We apply our method to a number of challenging scenarios, and show that we can achieve a higher quality registration than prior work. We propose a 3D user interface that allows the user to easily specify how multiple videos should be composited in space and time. Finally, we show that our alignment method can be applied in more general video editing and compositing tasks, such as object removal.

Supplementary Material

ZIP File (a131-cui.zip)
Supplemental files.
MP4 File (papers-0150.mp4)

References

[1]
Robert Anderson, David Gallup, Jonathan T Barron, Janne Kontkanen, Noah Snavely, Carlos Hernández, Sameer Agarwal, and Steven M Seitz. 2016. Jump: virtual reality video. ACM Transactions on Graphics (TOG) 35, 6 (2016), 198.
[2]
Peter J Burt and Edward H Adelson. 1983. A multiresolution spline with application to image mosaics. ACM Transactions on Graphics (TOG) 2, 4 (1983), 217--236.
[3]
Zhaopeng Cui and Ping Tan. 2015. Global Structure-from-Motion by Similarity Averaging. In Proceedings of the IEEE International Conference on Computer Vision. 864--872.
[4]
Ido Freeman, Patrick Wieschollek, and Hendrik Lensch. 2016. Robust Video Synchronization using Unsupervised Deep Learning. arXiv preprint arXiv:1610.05985 (2016).
[5]
Heng Guo, Shuaicheng Liu, Tong He, Shuyuan Zhu, Bing Zeng, and Moncef Gabbouj. 2016. Joint Video Stitching and Stabilization From Moving Cameras. IEEE Transactions on Image Processing 25, 11 (2016), 5491.
[6]
Berthold KP Horn and Brian G Schunck. 1981. Determining optical flow. Artificial intelligence 17, 1--3 (1981), 185--203.
[7]
Felix Klose, Oliver Wang, Jean-Charles Bazin, Marcus Magnor, and Alexander Sorkine-Hornung. 2015. Sampling based scene-space video processing. ACM Transactions on Graphics (TOG) 34, 4 (2015), 67.
[8]
Johannes Kopf, Michael F Cohen, and Richard Szeliski. 2014. First-person hyper-lapse videos. ACM Transactions on Graphics (TOG) 33, 4 (2014), 78.
[9]
Till Kroeger, Radu Timofte, Dengxin Dai, and Luc Van Gool. 2016. Fast Optical Flow using Dense Inverse Search. In European Conference on Computer Vision. Springer.
[10]
Pierre-Yves Laffont, Zhile Ren, Xiaofeng Tao, Chao Qian, and James Hays. 2014. Transient attributes for high-level understanding and editing of outdoor scenes. ACM Transactions on Graphics (TOG) 33, 4 (2014), 149.
[11]
Jungjin Lee, Bumki Kim, Kyehyun Kim, Younghui Kim, and Junyong Noh. 2016. Rich360: optimized spherical representation from structured panoramic camera arrays. ACM Transactions on Graphics (TOG) 35, 4 (2016), 63.
[12]
Wenbin Li, Fabio Viola, Jonathan Starck, Gabriel J. Brostow, and Neill D.F. Campbell. 2016. Roto++: Accelerating Professional Rotoscoping using Shape Manifolds. ACM Transactions on Graphics (In proceeding of ACM SIGGRAPH'16) 35, 4 (2016).
[13]
Kaimo Lin, Shuaicheng Liu, Loong-Fah Cheong, and Bing Zeng. 2016. Seamless Video Stitching from Hand-held Camera Inputs. In Computer Graphics Forum, Vol. 35. Wiley Online Library, 479--487.
[14]
Ce Liu, Jenny Yuen, and Antonio Torralba. 2011. Sift flow: Dense correspondence across scenes and its applications. IEEE transactions on pattern analysis and machine intelligence 33, 5 (2011), 978--994.
[15]
Feng Liu, Michael Gleicher, Hailin Jin, and Aseem Agarwala. 2009. Content-preserving warps for 3D video stabilization. ACM Transactions on Graphics (TOG) 28, 3 (2009), 44.
[16]
Shuaicheng Liu, Lu Yuan, Ping Tan, and Jian Sun. 2013. Bundled camera paths for video stabilization. ACM Transactions on Graphics (TOG) 32, 4 (2013), 78.
[17]
David G Lowe. 1999. Object recognition from local scale-invariant features. In Computer vision, 1999. The proceedings of the seventh IEEE international conference on, Vol. 2. Ieee, 1150--1157.
[18]
Nicolas Märki, Federico Perazzi, Oliver Wang, and Alexander Sorkine-Hornung. 2016. Bilateral space video segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 743--751.
[19]
Meinard Müller. 2007. Information retrieval for music and motion. Vol. 2. Springer.
[20]
Federico Perazzi, Alexander Sorkine-Hornung, Henning Zimmer, Peter Kaufmann, Oliver Wang, S. Watson, and Markus H. Gross. 2015. Panoramic Video from Unstructured Camera Arrays. Comput. Graph. Forum 34, 2 (2015), 57--68.
[21]
Yael Pritch, Alex Rav-Acha, and Shmuel Peleg. 2008. Nonchronological video synopsis and indexing. IEEE Transactions on Pattern Analysis and Machine Intelligence 30, 11 (2008), 1971--1984.
[22]
Carsten Rother, Vladimir Kolmogorov, and Andrew Blake. 2004. Grabcut: Interactive foreground extraction using iterated graph cuts. In ACM transactions on graphics (TOG), Vol. 23. ACM, 309--314.
[23]
Jan Rüegg, Oliver Wang, Aljoscha Smolic, and Markus Gross. 2013. Ducttake: Spatiotemporal video compositing. In Computer Graphics Forum, Vol. 32. Wiley Online Library, 51--61.
[24]
Peter Sand and Seth Teller. 2004. Video matching. ACM Transactions on Graphics (TOG) 23, 3 (2004), 592--599.
[25]
Yichang Shih, Sylvain Paris, Frédo Durand, and William T Freeman. 2013. Data-driven hallucination of different times of day from a single outdoor photo. ACM Transactions on Graphics (TOG) 32, 6 (2013), 200.
[26]
Oliver Wang, Christopher Schroers, Henning Zimmer, Markus Gross, and Alexander Sorkine-Hornung. 2014. Videosnapping: Interactive synchronization of multiple videos. ACM Transactions on Graphics (TOG) 33, 4 (2014), 77.
[27]
Guofeng Zhang, Zilong Dong, Jiaya Jia, Liang Wan, Tien-Tsin Wong, and Hujun Bao. 2009. Refilming with depth-inferred videos. IEEE Transactions on Visualization and Computer Graphics 15, 5 (2009), 828--840.
[28]
Fan Zhong, Song Yang, Xueying Qin, Dani Lischinski, Daniel Cohen-Or, and Baoquan Chen. 2014. Slippage-free background replacement for hand-held video. ACM Transactions on Graphics (TOG) 33, 6 (2014), 199.
[29]
Danping Zou and Ping Tan. 2013. Coslam: Collaborative visual slam in dynamic environments. IEEE transactions on pattern analysis and machine intelligence 35, 2 (2013), 354--366.

Cited By

View all
  • (2022)Bridge inspection component registration for damage evolutionStructural Health Monitoring10.1177/1475921722108364722:1(472-495)Online publication date: 19-May-2022
  • (2022)Nonprofit Journals Publication Patterns: Visibility or Invisibility of Gender?VOLUNTAS: International Journal of Voluntary and Nonprofit Organizations10.1007/s11266-022-00470-xOnline publication date: 16-Mar-2022
  • (2021)Scene Summarization via Motion NormalizationIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2020.299319527:4(2495-2501)Online publication date: 1-Apr-2021
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Graphics
ACM Transactions on Graphics  Volume 36, Issue 4
August 2017
2155 pages
ISSN:0730-0301
EISSN:1557-7368
DOI:10.1145/3072959
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 20 July 2017
Published in TOG Volume 36, Issue 4

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. 3D reconstruction
  2. SIFT flow
  3. time slice
  4. video alignment
  5. video compositing

Qualifiers

  • Research-article

Funding Sources

  • Canada NSERC Discovery
  • Discovery Accelerator Supplement
  • Adobe

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)19
  • Downloads (Last 6 weeks)2
Reflects downloads up to 01 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2022)Bridge inspection component registration for damage evolutionStructural Health Monitoring10.1177/1475921722108364722:1(472-495)Online publication date: 19-May-2022
  • (2022)Nonprofit Journals Publication Patterns: Visibility or Invisibility of Gender?VOLUNTAS: International Journal of Voluntary and Nonprofit Organizations10.1007/s11266-022-00470-xOnline publication date: 16-Mar-2022
  • (2021)Scene Summarization via Motion NormalizationIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2020.299319527:4(2495-2501)Online publication date: 1-Apr-2021
  • (2021)Prominent Structures for Video Analysis and EditingIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2020.297004527:7(3305-3317)Online publication date: 1-Jul-2021
  • (2020)Multi-exposure photomontage with hand-held camerasComputer Vision and Image Understanding10.1016/j.cviu.2020.102929(102929)Online publication date: Feb-2020
  • (2020)Coherent video generation for multiple hand-held cameras with dynamic foregroundComputational Visual Media10.1007/s41095-020-0187-36:3(291-306)Online publication date: 3-Sep-2020
  • (2018)Video Stitching with Extended-MeshFlow2018 24th International Conference on Pattern Recognition (ICPR)10.1109/ICPR.2018.8545663(3049-3054)Online publication date: Aug-2018
  • (2018)Photomontage for Robust HDR Imaging with Hand-Held Cameras2018 25th IEEE International Conference on Image Processing (ICIP)10.1109/ICIP.2018.8451138(1708-1712)Online publication date: Oct-2018

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media