research-article

Accelerating total variation regularization for matrix-valued images on GPUs

Authors:
Maryam Moazeni

University of California, Los Angeles, Los Angeles, CA, USA

University of California, Los Angeles, Los Angeles, CA, USA
View Profile

,
Alex Bui

University of California, Los Angeles, Los Angeles, CA, USA

University of California, Los Angeles, Los Angeles, CA, USA
View Profile

,
Majid Sarrafzadeh

University of California, Los Angeles, Los Angeles, CA, USA

University of California, Los Angeles, Los Angeles, CA, USA
View Profile

CF '09: Proceedings of the 6th ACM conference on Computing frontiersMay 2009Pages 137–146https://doi.org/10.1145/1531743.1531765

Published:18 May 2009Publication History

CF '09: Proceedings of the 6th ACM conference on Computing frontiers

Pages 137–146

ABSTRACT

The advent of new matrix-valued magnetic resonance imaging modalities such as Diffusion Tensor Imaging (DTI) requires extensive computational acceleration. Computational acceleration on graphics processing units (GPUs) can make the regularization (denoising) of DTI images attractive in clinical settings, hence improving the quality of DTI images in a broad range of applications. Construction of DTI images consists of direction-specific Magnetic Resonance (MR) measurements. Compared with conventional MR, direction-sensitive acquisition has a lower signal-to-noise ratio (SNR). Therefore, high noise levels often limit DTI imaging. Advanced post-processing of imaging data can improve the quality of estimated tensors. However, the post-processing problem is only made more computationally difficult when considering matrix-valued imaging data.

This paper describes the acceleration of a Total Variation regularization method for matrix-valued images, in particular, for DTI images on NVIDIA Quadro FX 5600. The TV regularization of a 3-D image with 128³ voxels ultimately achieves 266X speedup and requires 1 minute and 30 seconds on the Quadro, while this algorithm on a dual-core CPU completes in more than 3 hours. In this application study we are aimed at analyzing the effective of excessive synchronization, which provides an insight into generally adapting Variational methods to the GPU architecture for other image processing algorithms designed for matrix-valued images.

References

D. Manocha, M.C. Lin, N. Govindaraju. GPGPU to Many-Core Processing: Higher Performance for Mass Market Applications. Manycore Computing Workshop, 2007.Google Scholar
NVIDIA Corporation. NVIDIA CUDA Programming Guide, version 1.1, 2007.Google Scholar
AMD Stream Processor. http://ati.amd.com/products/streamprocessor/index.html.Google Scholar
M. Segal and K. Akeley. The OpenGL Graphics System: A Specification (Version 2.0). Silicon Graphics, Inc., October 2004.Google Scholar
DirectX Developer Center. http://www.msdn.com/directx/.Google Scholar
Cg. http://developer.nvidia.com/page/cg main.html.Google Scholar
D. Tarditi, S. Puri, and J. Oglesby. Accelerator: Using data parallelism to program GPUs for general-purpose uses. In Proceedings of the 12th International Conference on Architectural Support for Programming Languages and Operating Systems, pages 325--335, 2006. Google ScholarDigital Library
I. Buck. Brook Specification v0.2, October 2003.Google Scholar
P.J. Basser, J. Mattiello, and D. LeBihan, "MR diffusion tensor spectroscopy and imaging," Biophysical Journal, vol. 66, no. 1, pp. 259--267, 1994.Google ScholarCross Ref
D. Le Bihan, J.-F. Mangin, C. Poupon, et al., "Diffusion tensor imaging: concepts and applications," Journal of Magnetic Resonance Imaging, vol. 13, no. 4, pp. 534--546, 2001.Google ScholarCross Ref
C.-F. Westin, S.E. Maier, H. Mamata, A. Nabavi, F.A. Jolesz, and R. Kikinis, "Processing and visualization for diffusion tensor MRI," Medical Image Analysis, vol. 6, no. 2, pp. 93--108, 2002.Google ScholarCross Ref
S. Mori and P.B. Barker, "Diffusion magnetic resonance imaging: its principle and applications," The Anatomical Record vol. 257, no. 3, pp. 102--109, 1999.Google ScholarCross Ref
S. Mori and P.C.M. van Zijl, "Fiber tracking: principles and strategies'a technical review," NMR in Biomedicine, vol. 15, no. 7-8, pp. 468--480, 2002.Google ScholarCross Ref
R. Bammer, "Basic principles of diffusion-weighted imaging," European Journal of Radiology, vol. 45, no. 3, pp. 169--184, 2003.Google ScholarCross Ref
O. Christiansen, T.M. Lee, J. Lie, U. Sinha, and T.F. Chan, "Total Variation Regularization of Matrix-Valued Images," International Journal of Biomedical Imaging, vol. 2007, Article ID 27432, 11 pages, 2007.Google ScholarCross Ref
M. Lysaker, S. Osher, and X.-C. Tai, "Noise removal using smoothed normals and surface fitting," IEEE Transactions on Image Processing, vol. 13, no. 10, pp. 1345--1357, 2004. Google ScholarDigital Library
T.F. Chan and S. Esedoglu, "Aspects of total variation regularized L1 function approximation," SIAM Journal on Applied Mathematics, vol. 65, no. 10, pp. 1345--1357, 2005.Google Scholar
J. Weickert and T. Brox, "Diffusion and regularization of vector- and matrix-valued images," Tech. Rep. preprint no. 58, Fachrichtung 6.1 Mathematik, Universitat des Saarlandes, Saarbrucken, Germany, 2002.Google ScholarCross Ref
L.I. Rudin, S. Osher, and E. Fatemi, "Nonlinear total variation based noise removal algorithms," Physica D, vol. 60, no. 1-4, pp. 259--268, 1992. Google ScholarDigital Library
S. Ryoo, C. Rodrigues, S. Stone, S. Baghsorkhi, S. Ueng, and W. Hwu. Program optimization study on a 128-core GPU. First Workshop on General Purpose Processing on Graphics Processing Units (GPGPU), 2007.Google Scholar
S.S. Stone, J.P. Haldar, S.C. Tsao, W.W. Hwu, Z.P. Liang, B.P. Sutton, "Accelerating Advanced MRI Reconstructions on GPUs, " Proceedings of the 2008 International Conference on Computing Frontiers, May 2008. Google ScholarDigital Library
Che, S., Boyer, M., Meng, J., Tarjan, D., Sheaffer, J.W., and Skadron, K. A performance study of general-purpose applications on graphics processors using CUDA. J. Parallel Distrib. Comput. 68, 10 (Oct. 2008), 1370--1380. Google ScholarDigital Library
J. Owens, D. Luebke, N. Govindaraju, M. Harris, J. Kruger, A. Lefohn, and T. Purcell. A survey of general-purpose computation on graphics hardware. Computer Graphics Forum, 26(1):80--113, March 2007.Google ScholarCross Ref
B. Cabral, N. Cam, and J. Foran. Accelerated volume rendering and tomographic reconstruction using texture mapping hardware. In 1994 Symposium on Volume Visualization, 1994. Google ScholarDigital Library
Pock, T.; Unger, M.; Cremers, D.; Bischof, H., "Fast and exact solution of Total Variation models on the GPU," Computer Vision and Pattern Recognition Workshops, 2008. CVPRW '08. IEEE Computer Society Conference on , vol., no., pp.1--8, 23-28 June 2008.Google Scholar
T. Pock, M. Grabner, and H. Bischof. "Real-time Computation of Variational Methods on Graphics Hardware," Computer Vision Winter Workshop 2007, Michael Grabner, Helmut Grabner, St. Lambrecht, Austria, February 6-8.Google Scholar

Index Terms

Accelerating total variation regularization for matrix-valued images on GPUs
1. Computer systems organization
  1. Architectures
    1. Distributed architectures
2. Computing methodologies
  1. Computer graphics
    1. Graphics systems and interfaces
      1. Graphics processors

Recommendations

Accelerating advanced mri reconstructions on gpus
CF '08: Proceedings of the 5th conference on Computing frontiers

Computational acceleration on graphics processing units

(GPUs) can make advanced magnetic resonance imaging

(MRI) reconstruction algorithms attractive in clinical settings, thereby improving the quality of MR images across a broad spectrum of ...
Read More
Accelerating advanced MRI reconstructions on GPUs

Computational acceleration on graphics processing units (GPUs) can make advanced magnetic resonance imaging (MRI) reconstruction algorithms attractive in clinical settings, thereby improving the quality of MR images across a broad spectrum of ...
Read More
A unified optimizing compiler framework for different GPGPU architectures

This article presents a novel optimizing compiler for general purpose computation on graphics processing units (GPGPU). It addresses two major challenges of developing high performance GPGPU programs: effective utilization of GPU memory hierarchy and ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
CF '09: Proceedings of the 6th ACM conference on Computing frontiers
May 2009
238 pages
ISBN:9781605584133
DOI:10.1145/1531743
General Chairs:
Gearold Johnson
Colorado State University, USA
,
Cartsen Trinitis
TU München, Germany
,
Program Chairs:
Georgi N. Gaydadjiev
TU Delft, The Nederland
,
Alex Veidenbaum
University of California, USA
Copyright © 2009 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 18 May 2009
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
cuda
gpgpu
gpu computing
tv regularization
variational methods
Qualifiers
- research-article
Conference

Acceptance Rates
CF '09 Paper Acceptance Rate26of113submissions,23%Overall Acceptance Rate240of680submissions,35%
More
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 2
  Total Citations
  View Citations
- 252
  Total Downloads
- Downloads (Last 12 months)0
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Accelerating total variation regularization for matrix-valued images on GPUs

CF '09: Proceedings of the 6th ACM conference on Computing frontiers

ABSTRACT

References

Cited By

Index Terms

Recommendations

Accelerating advanced mri reconstructions on gpus

Accelerating advanced MRI reconstructions on GPUs

A unified optimizing compiler framework for different GPGPU architectures

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Accelerating total variation regularization for matrix-valued images on GPUs

CF '09: Proceedings of the 6th ACM conference on Computing frontiers

ABSTRACT

References

Cited By

Index Terms

Recommendations

Accelerating advanced mri reconstructions on gpus

Accelerating advanced MRI reconstructions on GPUs

A unified optimizing compiler framework for different GPGPU architectures

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media