research-article

Geodesic image and video editing

Authors:
Antonio Criminisi

Microsoft Research Ltd, Cambridge, UK

Microsoft Research Ltd, Cambridge, UK
View Profile

,
Toby Sharp

Microsoft Research Ltd, Cambridge, UK

Microsoft Research Ltd, Cambridge, UK
View Profile

,
Carsten Rother

Microsoft Research Ltd, Cambridge, UK

Microsoft Research Ltd, Cambridge, UK
View Profile

,
Patrick P'erez

Technicolor Research and Innovation, Cesson-Sévigné, France

Technicolor Research and Innovation, Cesson-Sévigné, France
View Profile

Authors Info & Claims

ACM Transactions on Graphics Volume 29 Issue 5Article No.: 134pp 1–15https://doi.org/10.1145/1857907.1857910

Published:05 November 2010Publication History

ACM Transactions on Graphics

Abstract

This article presents a new, unified technique to perform general edge-sensitive editing operations on n-dimensional images and videos efficiently.

The first contribution of the article is the introduction of a Generalized Geodesic Distance Transform (GGDT), based on soft masks. This provides a unified framework to address several edge-aware editing operations. Diverse tasks such as denoising and nonphotorealistic rendering are all dealt with fundamentally the same, fast algorithm. Second, a new Geodesic Symmetric Filter (GSF) is presented which imposes contrast-sensitive spatial smoothness into segmentation and segmentation-based editing tasks (cutout, object highlighting, colorization, panorama stitching). The effect of the filter is controlled by two intuitive, geometric parameters. In contrast to existing techniques, the GSF filter is applied to real-valued pixel likelihoods (soft masks), thanks to GGDTs and it can be used for both interactive and automatic editing. Complex object topologies are dealt with effortlessly. Finally, the parallelism of GGDTs enables us to exploit modern multicore CPU architectures as well as powerful new GPUs, thus providing great flexibility of implementation and deployment. Our technique operates on both images and videos, and generalizes naturally to n-dimensional data.

The proposed algorithm is validated via quantitative and qualitative comparisons with existing, state-of-the-art approaches. Numerous results on a variety of image and video editing tasks further demonstrate the effectiveness of our method.

Supplemental Material

tp104_11.mp4

mp4

82.9 MB

Download

References

Agarwala, A., Dontcheva, M., Agrawala, M., Druker, A., Colburn, A., Curless, B., Salesin, D., and Cohen, M. 2004. Interactive digital photomontage. In Proceedings of ACM SIGGRAPH. Google ScholarDigital Library
Bai, X. and Sapiro, G. 2007. A geodesic framework for fast interactive image and video segmentation and matting. In Proceedings of the IEEE International Conference on Computer Vision.Google Scholar
Borgefors, G. 1986. Distance transformations in digital images. In Proceedings of Conference on Computer Vision, Graphics and Image Processing.Google ScholarDigital Library
Bousseau, A., Neyret, F., Thollot, J., and Salesin, D. 2007. Video watercolorization using bidirectional texture advection. In Proceedings of ACM SIGGRAPH. Google ScholarDigital Library
Boykov, J. and Jolly, M.-P. 2001. Interactive graph cuts for optimal boundary and region segmentation of objects in n-D images. In Proceedings of the IEEE International Conference on Computer Vision.Google Scholar
Brown, M., Szeliski, R., and Winder, S. 2005. Multi-image matching using multi-scale oriented patches. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 510--517. Google ScholarDigital Library
Buades, A., Coll, B., and Morel, J.-M. 2005. A non-local algorithm for image denoising. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Google ScholarDigital Library
Chen, J., Paris, J., and Durand, F. 2007. Real-time edge-aware image processing with the bilateral grid. In Proceedings of ACM SIGGRAPH. Google ScholarDigital Library
Couprie, C., Grady, L. amd Najman, L., and Talbot, H. 2009. Power watersheds: A new image segmentation framework extending graph cuts, random walker and optimal spanning forest. In Proceedings of the IEEE International Conference on Computer Vision.Google Scholar
Criminisi, A., Cross, G., Blake, A., and kolmogorov, V. 2006. Bilayer segmentation of live video. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Google ScholarDigital Library
Criminisi, A., Sharp, T., and Blake, A. 2008. GeoS: Geodesic image segmentation. In Proceedings of the European Conference on Computer Vision. Google ScholarDigital Library
Dijkstra, E. 1959. A note on two problems in connexion with graphs. Numer. Math. 1, 269--271.Google ScholarDigital Library
Durand, F. and Dorsey, J. 2002. Fast bilateral filtering for the display of high-dynamic-range images. In Proceedings of ACM SIGGRAPH. Google ScholarDigital Library
Fabbri, R., Costa, L., Torrelli, J., and Bruno, O. 2008. 2D euclidean distance transform algorithms: A comparative survey. ACM Comput. Surv. 40, 1. Google ScholarDigital Library
Felsberg, M., Forssen, P.-E., and Scharr, H. 2006. Efficient robust smoothing of low-level signal features. IEEE Trans. Pattern Anal. Mach. Intell. 28, 2, 209--222. Google ScholarDigital Library
Felzenszwalb, P. and Huttenlocher, D. P. 2004. Efficient belief propagation for early vision. Int. J. Comput. Vision 70, 1, 41--54. Google ScholarDigital Library
Grady, L. 2006. Random walks for image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 28, 11. Google ScholarDigital Library
Grady, L. and Sinop, A. K. 2008. Fast approximate random walker segmentation using eigenvector precomputation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Google Scholar
Heijmans, H. J. A. M. 1995. Mathematical morphology: A modern approach in image processing based on algebra and geometry. SIAM Rev. 37, 1, 1--36. Google ScholarDigital Library
Jones, M., Baerentzen, J., and Sramek, M. 2006. 3D distance fields: a survey of techniques and applications. IEEE Trans. Visualiz. Comput. Graph. 12. Google ScholarDigital Library
Juan, O. and Boykov, J. 2006. Active graph cuts. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Google ScholarDigital Library
Kohli, P. and Torr, P. H. S. 2007. Dynamic graph cuts for efficient inference in Markov Random Fields. IEEE Trans. Pattern Anal. Mach. Intell. 29, 12, 2079--2088. Google ScholarDigital Library
Kolmogorov, V., Criminisi, A., Blake, A., Cross, G., and Rother, C. 2005. Bilayer segmentation of binocular stereo video. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Google ScholarDigital Library
Kolmogorov, V. and Zabih, R. 2004. What energy functions can be minimized via graph cuts&quest; IEEE Trans. Pattern Anal. Mach. Intell. 26, 2. Google ScholarDigital Library
Kopf, J., Cohen, M., Lischinski, D., and Uyttendaele, M. 2007. Joint bilateral upsampling. ACM Trans. Graph. 26, 3. Google ScholarDigital Library
Levin, A., Lischinski, D., and Weiss, Y. 2004. Colorization using optimization. ACM Trans. Graph. Google ScholarDigital Library
Li, Y., Sun, J., Tang, C.-K., and H.-Y., S. 2004. Lazy snapping. ACM Trans. Graph. 23, 3. Google ScholarDigital Library
Lischinski, D., Farbman, Z., Uyttendaele, M., and Szeliski, R. 2006. Interactive local adjustment of tonal values. ACM Trans. Graph. 25, 3, 646--653. Google ScholarDigital Library
Liu, J., Sun, J., and Shum, H.-Y. 2009. Paint selection. ACM Trans. Graph. 28, 3. Google ScholarDigital Library
Lombaert, H., Sun, Y., Grady, L., and Xu, C. 2005. A multilevel banded graph cuts method for fast image segmentation. In Proceedings of the IEEE International Conference on Computer Vision. Google ScholarDigital Library
Luan, Q., Wen, F., Cohen-Or, D., Liang, L., Xu, Y. Q., and Shum, H. Y. 2007. Natural image colorization. In Proceedings of the Eurographics Symposium on Rendering. J. Kautz and S. Pattanaik. Eds. Eurographics. Google ScholarDigital Library
Paris, S. and Durand, F. 2009. A fast approximation of the bilateral filter. Int. J. Comput. Vision. Google ScholarDigital Library
Perona, P. and Malik, J. 1990. Scale-space and edge detection using anisotropic diffusion. IEEE Trans. Pattern Anal. Mach. Intell. 12, 7. Google ScholarDigital Library
Roth, S. and Black, M. 2005. Fields of experts: A framework for learning image priors. In Proceedings of the IEEE Computer Conference on Vision and Pattern Recognition. Google ScholarDigital Library
Rother, C., Kolmogorov, V., and Blake, A. 2004. GrabCut: Interactive foreground extraction using iterated graph cuts. In ACM Trans. Graph. Google ScholarDigital Library
Sethian, J. A. 1999. Fast marching methods. SIAM Rev. 41, 2. Google ScholarDigital Library
Shotton, J., Winn, J., Rother, C., and Criminisi, A. 2007. Textonboost for image understanding: Multi-class object recognition and segmentation by jointly modeling appearance, shape and context. Int. J. Comput. Vision. Google ScholarDigital Library
Sinop, A. and Grady, L. 2007. A seeded image segmentation framework unifying graph cuts and random walker which yields a new algorithm. In Proceedings of the IEEE International Conference on Computer Vision.Google Scholar
Soille, P. 1999. Morphological Image Analysis. Springer. Google ScholarDigital Library
Szeliski, R. 2006. Locally adapted hierarchical basis preconditioning. ACM Trans. Graph. 25, 3, 1135--1143. Google ScholarDigital Library
Szeliski, R., Zabih, R., Scharstein, D., Veksler, O., Kolmogorov, V., Agarwala, A., Tappen, M., and Rother, C. 2007. A comparative study of energy minimization methods for Markov Random Fields with smoothness-based priors. Int. J. Comput. Vision. 30, 6, 1068--1080. Google ScholarDigital Library
Toivanen, P. J. 1996. New geodesic distance transforms for gray-scale images. Pattern Recogn. Lett. 17, 5, 437--450. Google ScholarDigital Library
Tomasi, C. and Manduchi, R. 1998. Bilateral filtering for gray and color images. In Proceeding of the IEEE International Conference on Computer Vision. 839--846. Google ScholarDigital Library
Wang, J., Bhat, P., Colburn, R. A., Agrawala, M., and Cohen, M. F. 2005. Interactive video cut out. ACM Trans. Graph. 24, 585--594. Google ScholarDigital Library
Wang, J., Xu, Y., Shum, H.-Y., and Cohen, M. 2004. Video tooning. In Proceedings of ACM SIGGRAPH. Google ScholarDigital Library
Weber, O., Devir, Y. S., Bronstein, A. M., Bronstein, M. M., and Kimmel, R. 2008. Parallel algorithms for approximation of distance maps on parametric surfaces. In Proceedings of ACM SIGGRAPH. Google ScholarDigital Library
Weiss, B. 2006. Fast median and bilateral filtering. In ACM SIGGRAPH. Google ScholarDigital Library
Weiss, Y. and Freeman, W. T. 2007. What makes a good model of natural images&quest; In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Google Scholar
Winnemoller, H., Olsen, S. C., and Gooch, B. 2006. Real time video abstraction. In Proceedings of ACM SIGGRAPH. Google ScholarDigital Library
Yatziv, L., Bartesaghi, A., and Sapiro, G. 2006. O(n) implementation of the fast marching algorithm. J. Computat. Phys. 212, 393--399. Google ScholarDigital Library
Yatziv, L. and Sapiro, G. 2006. Fast image and video colorization using chrominance blending. IEEE Trans. Image Proces. 15, 5. Google ScholarDigital Library

Index Terms

Geodesic image and video editing
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision problems
        Image segmentation
        Video segmentation
  2. Computer graphics
    1. Image manipulation
    2. Rendering

Recommendations

An Improved Image Denoising and Segmentation Approach for Detecting Tumor from 2-D MRI Brain Images
ACSAT '12: Proceedings of the 2012 International Conference on Advanced Computer Science Applications and Technologies

Image denoising and segmentation are the two most challenging fields in medical image processing particularly when it is application specific. The presence of noise not only degrades the visual quality but also immensely affects the accuracies of ...
Read More
Segmentation of brain magnetic resonance images through morphological operators and geodesic distance

When segmenting magnetic resonance (MR) images, a wide range of useless information arises, which has to be discarded as a step prior to classifying the different cerebral cortex areas. To obtain effective results during the classification process, it ...
Read More
Authoring and animating painterly characters

Artists explore the visual style of animated characters through 2D concept art, since it affords them a nearly unlimited degree of creative freedom. Realizing the desired visual style, however, within the 3D character animation pipeline is often ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in

ACM Transactions on Graphics Volume 29, Issue 5
October 2010
58 pages
ISSN:0730-0301
EISSN:1557-7368
DOI:10.1145/1857907
Issue’s Table of Contents

Copyright © 2010 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 5 November 2010
- Accepted: 1 August 2010
- Revised: 1 April 2010
- Received: 1 September 2008
Published in tog Volume 29, Issue 5

Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
Image and video
denoising
geodesic distance
geodestic segmentation
nonphotorealistic rendering
restoration
segmentation
tooning
Qualifiers
- research-article
- Research
- Refereed
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 144
  Total Citations
  View Citations
- 1,491
  Total Downloads
- Downloads (Last 12 months)11
- Downloads (Last 6 weeks)1
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Geodesic image and video editing

ACM Transactions on Graphics

Abstract

Supplemental Material

References

Cited By

Index Terms

Recommendations

An Improved Image Denoising and Segmentation Approach for Detecting Tumor from 2-D MRI Brain Images

Segmentation of brain magnetic resonance images through morphological operators and geodesic distance

Authoring and animating painterly characters

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Geodesic image and video editing

ACM Transactions on Graphics

Abstract

Supplemental Material

References

Cited By

Index Terms

Recommendations

An Improved Image Denoising and Segmentation Approach for Detecting Tumor from 2-D MRI Brain Images

Segmentation of brain magnetic resonance images through morphological operators and geodesic distance

Authoring and animating painterly characters

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media