skip to main content
10.1145/1531743.1531765acmconferencesArticle/Chapter ViewAbstractPublication PagescfConference Proceedingsconference-collections
research-article

Accelerating total variation regularization for matrix-valued images on GPUs

Authors Info & Claims
Published:18 May 2009Publication History

ABSTRACT

The advent of new matrix-valued magnetic resonance imaging modalities such as Diffusion Tensor Imaging (DTI) requires extensive computational acceleration. Computational acceleration on graphics processing units (GPUs) can make the regularization (denoising) of DTI images attractive in clinical settings, hence improving the quality of DTI images in a broad range of applications. Construction of DTI images consists of direction-specific Magnetic Resonance (MR) measurements. Compared with conventional MR, direction-sensitive acquisition has a lower signal-to-noise ratio (SNR). Therefore, high noise levels often limit DTI imaging. Advanced post-processing of imaging data can improve the quality of estimated tensors. However, the post-processing problem is only made more computationally difficult when considering matrix-valued imaging data.

This paper describes the acceleration of a Total Variation regularization method for matrix-valued images, in particular, for DTI images on NVIDIA Quadro FX 5600. The TV regularization of a 3-D image with 1283 voxels ultimately achieves 266X speedup and requires 1 minute and 30 seconds on the Quadro, while this algorithm on a dual-core CPU completes in more than 3 hours. In this application study we are aimed at analyzing the effective of excessive synchronization, which provides an insight into generally adapting Variational methods to the GPU architecture for other image processing algorithms designed for matrix-valued images.

References

  1. D. Manocha, M.C. Lin, N. Govindaraju. GPGPU to Many-Core Processing: Higher Performance for Mass Market Applications. Manycore Computing Workshop, 2007.Google ScholarGoogle Scholar
  2. NVIDIA Corporation. NVIDIA CUDA Programming Guide, version 1.1, 2007.Google ScholarGoogle Scholar
  3. AMD Stream Processor. http://ati.amd.com/products/streamprocessor/index.html.Google ScholarGoogle Scholar
  4. M. Segal and K. Akeley. The OpenGL Graphics System: A Specification (Version 2.0). Silicon Graphics, Inc., October 2004.Google ScholarGoogle Scholar
  5. DirectX Developer Center. http://www.msdn.com/directx/.Google ScholarGoogle Scholar
  6. Cg. http://developer.nvidia.com/page/cg main.html.Google ScholarGoogle Scholar
  7. D. Tarditi, S. Puri, and J. Oglesby. Accelerator: Using data parallelism to program GPUs for general-purpose uses. In Proceedings of the 12th International Conference on Architectural Support for Programming Languages and Operating Systems, pages 325--335, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. I. Buck. Brook Specification v0.2, October 2003.Google ScholarGoogle Scholar
  9. P.J. Basser, J. Mattiello, and D. LeBihan, "MR diffusion tensor spectroscopy and imaging," Biophysical Journal, vol. 66, no. 1, pp. 259--267, 1994.Google ScholarGoogle ScholarCross RefCross Ref
  10. D. Le Bihan, J.-F. Mangin, C. Poupon, et al., "Diffusion tensor imaging: concepts and applications," Journal of Magnetic Resonance Imaging, vol. 13, no. 4, pp. 534--546, 2001.Google ScholarGoogle ScholarCross RefCross Ref
  11. C.-F. Westin, S.E. Maier, H. Mamata, A. Nabavi, F.A. Jolesz, and R. Kikinis, "Processing and visualization for diffusion tensor MRI," Medical Image Analysis, vol. 6, no. 2, pp. 93--108, 2002.Google ScholarGoogle ScholarCross RefCross Ref
  12. S. Mori and P.B. Barker, "Diffusion magnetic resonance imaging: its principle and applications," The Anatomical Record vol. 257, no. 3, pp. 102--109, 1999.Google ScholarGoogle ScholarCross RefCross Ref
  13. S. Mori and P.C.M. van Zijl, "Fiber tracking: principles and strategies'a technical review," NMR in Biomedicine, vol. 15, no. 7-8, pp. 468--480, 2002.Google ScholarGoogle ScholarCross RefCross Ref
  14. R. Bammer, "Basic principles of diffusion-weighted imaging," European Journal of Radiology, vol. 45, no. 3, pp. 169--184, 2003.Google ScholarGoogle ScholarCross RefCross Ref
  15. O. Christiansen, T.M. Lee, J. Lie, U. Sinha, and T.F. Chan, "Total Variation Regularization of Matrix-Valued Images," International Journal of Biomedical Imaging, vol. 2007, Article ID 27432, 11 pages, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  16. M. Lysaker, S. Osher, and X.-C. Tai, "Noise removal using smoothed normals and surface fitting," IEEE Transactions on Image Processing, vol. 13, no. 10, pp. 1345--1357, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. T.F. Chan and S. Esedoglu, "Aspects of total variation regularized L1 function approximation," SIAM Journal on Applied Mathematics, vol. 65, no. 10, pp. 1345--1357, 2005.Google ScholarGoogle Scholar
  18. J. Weickert and T. Brox, "Diffusion and regularization of vector- and matrix-valued images," Tech. Rep. preprint no. 58, Fachrichtung 6.1 Mathematik, Universitat des Saarlandes, Saarbrucken, Germany, 2002.Google ScholarGoogle ScholarCross RefCross Ref
  19. L.I. Rudin, S. Osher, and E. Fatemi, "Nonlinear total variation based noise removal algorithms," Physica D, vol. 60, no. 1-4, pp. 259--268, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. S. Ryoo, C. Rodrigues, S. Stone, S. Baghsorkhi, S. Ueng, and W. Hwu. Program optimization study on a 128-core GPU. First Workshop on General Purpose Processing on Graphics Processing Units (GPGPU), 2007.Google ScholarGoogle Scholar
  21. S.S. Stone, J.P. Haldar, S.C. Tsao, W.W. Hwu, Z.P. Liang, B.P. Sutton, "Accelerating Advanced MRI Reconstructions on GPUs, " Proceedings of the 2008 International Conference on Computing Frontiers, May 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Che, S., Boyer, M., Meng, J., Tarjan, D., Sheaffer, J.W., and Skadron, K. A performance study of general-purpose applications on graphics processors using CUDA. J. Parallel Distrib. Comput. 68, 10 (Oct. 2008), 1370--1380. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. J. Owens, D. Luebke, N. Govindaraju, M. Harris, J. Kruger, A. Lefohn, and T. Purcell. A survey of general-purpose computation on graphics hardware. Computer Graphics Forum, 26(1):80--113, March 2007.Google ScholarGoogle ScholarCross RefCross Ref
  24. B. Cabral, N. Cam, and J. Foran. Accelerated volume rendering and tomographic reconstruction using texture mapping hardware. In 1994 Symposium on Volume Visualization, 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Pock, T.; Unger, M.; Cremers, D.; Bischof, H., "Fast and exact solution of Total Variation models on the GPU," Computer Vision and Pattern Recognition Workshops, 2008. CVPRW '08. IEEE Computer Society Conference on , vol., no., pp.1--8, 23-28 June 2008.Google ScholarGoogle Scholar
  26. T. Pock, M. Grabner, and H. Bischof. "Real-time Computation of Variational Methods on Graphics Hardware," Computer Vision Winter Workshop 2007, Michael Grabner, Helmut Grabner, St. Lambrecht, Austria, February 6-8.Google ScholarGoogle Scholar

Index Terms

  1. Accelerating total variation regularization for matrix-valued images on GPUs

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        CF '09: Proceedings of the 6th ACM conference on Computing frontiers
        May 2009
        238 pages
        ISBN:9781605584133
        DOI:10.1145/1531743

        Copyright © 2009 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 18 May 2009

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        CF '09 Paper Acceptance Rate26of113submissions,23%Overall Acceptance Rate240of680submissions,35%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader