ABSTRACT
The advent of new matrix-valued magnetic resonance imaging modalities such as Diffusion Tensor Imaging (DTI) requires extensive computational acceleration. Computational acceleration on graphics processing units (GPUs) can make the regularization (denoising) of DTI images attractive in clinical settings, hence improving the quality of DTI images in a broad range of applications. Construction of DTI images consists of direction-specific Magnetic Resonance (MR) measurements. Compared with conventional MR, direction-sensitive acquisition has a lower signal-to-noise ratio (SNR). Therefore, high noise levels often limit DTI imaging. Advanced post-processing of imaging data can improve the quality of estimated tensors. However, the post-processing problem is only made more computationally difficult when considering matrix-valued imaging data.
This paper describes the acceleration of a Total Variation regularization method for matrix-valued images, in particular, for DTI images on NVIDIA Quadro FX 5600. The TV regularization of a 3-D image with 1283 voxels ultimately achieves 266X speedup and requires 1 minute and 30 seconds on the Quadro, while this algorithm on a dual-core CPU completes in more than 3 hours. In this application study we are aimed at analyzing the effective of excessive synchronization, which provides an insight into generally adapting Variational methods to the GPU architecture for other image processing algorithms designed for matrix-valued images.
- D. Manocha, M.C. Lin, N. Govindaraju. GPGPU to Many-Core Processing: Higher Performance for Mass Market Applications. Manycore Computing Workshop, 2007.Google Scholar
- NVIDIA Corporation. NVIDIA CUDA Programming Guide, version 1.1, 2007.Google Scholar
- AMD Stream Processor. http://ati.amd.com/products/streamprocessor/index.html.Google Scholar
- M. Segal and K. Akeley. The OpenGL Graphics System: A Specification (Version 2.0). Silicon Graphics, Inc., October 2004.Google Scholar
- DirectX Developer Center. http://www.msdn.com/directx/.Google Scholar
- Cg. http://developer.nvidia.com/page/cg main.html.Google Scholar
- D. Tarditi, S. Puri, and J. Oglesby. Accelerator: Using data parallelism to program GPUs for general-purpose uses. In Proceedings of the 12th International Conference on Architectural Support for Programming Languages and Operating Systems, pages 325--335, 2006. Google ScholarDigital Library
- I. Buck. Brook Specification v0.2, October 2003.Google Scholar
- P.J. Basser, J. Mattiello, and D. LeBihan, "MR diffusion tensor spectroscopy and imaging," Biophysical Journal, vol. 66, no. 1, pp. 259--267, 1994.Google ScholarCross Ref
- D. Le Bihan, J.-F. Mangin, C. Poupon, et al., "Diffusion tensor imaging: concepts and applications," Journal of Magnetic Resonance Imaging, vol. 13, no. 4, pp. 534--546, 2001.Google ScholarCross Ref
- C.-F. Westin, S.E. Maier, H. Mamata, A. Nabavi, F.A. Jolesz, and R. Kikinis, "Processing and visualization for diffusion tensor MRI," Medical Image Analysis, vol. 6, no. 2, pp. 93--108, 2002.Google ScholarCross Ref
- S. Mori and P.B. Barker, "Diffusion magnetic resonance imaging: its principle and applications," The Anatomical Record vol. 257, no. 3, pp. 102--109, 1999.Google ScholarCross Ref
- S. Mori and P.C.M. van Zijl, "Fiber tracking: principles and strategies'a technical review," NMR in Biomedicine, vol. 15, no. 7-8, pp. 468--480, 2002.Google ScholarCross Ref
- R. Bammer, "Basic principles of diffusion-weighted imaging," European Journal of Radiology, vol. 45, no. 3, pp. 169--184, 2003.Google ScholarCross Ref
- O. Christiansen, T.M. Lee, J. Lie, U. Sinha, and T.F. Chan, "Total Variation Regularization of Matrix-Valued Images," International Journal of Biomedical Imaging, vol. 2007, Article ID 27432, 11 pages, 2007.Google ScholarCross Ref
- M. Lysaker, S. Osher, and X.-C. Tai, "Noise removal using smoothed normals and surface fitting," IEEE Transactions on Image Processing, vol. 13, no. 10, pp. 1345--1357, 2004. Google ScholarDigital Library
- T.F. Chan and S. Esedoglu, "Aspects of total variation regularized L1 function approximation," SIAM Journal on Applied Mathematics, vol. 65, no. 10, pp. 1345--1357, 2005.Google Scholar
- J. Weickert and T. Brox, "Diffusion and regularization of vector- and matrix-valued images," Tech. Rep. preprint no. 58, Fachrichtung 6.1 Mathematik, Universitat des Saarlandes, Saarbrucken, Germany, 2002.Google ScholarCross Ref
- L.I. Rudin, S. Osher, and E. Fatemi, "Nonlinear total variation based noise removal algorithms," Physica D, vol. 60, no. 1-4, pp. 259--268, 1992. Google ScholarDigital Library
- S. Ryoo, C. Rodrigues, S. Stone, S. Baghsorkhi, S. Ueng, and W. Hwu. Program optimization study on a 128-core GPU. First Workshop on General Purpose Processing on Graphics Processing Units (GPGPU), 2007.Google Scholar
- S.S. Stone, J.P. Haldar, S.C. Tsao, W.W. Hwu, Z.P. Liang, B.P. Sutton, "Accelerating Advanced MRI Reconstructions on GPUs, " Proceedings of the 2008 International Conference on Computing Frontiers, May 2008. Google ScholarDigital Library
- Che, S., Boyer, M., Meng, J., Tarjan, D., Sheaffer, J.W., and Skadron, K. A performance study of general-purpose applications on graphics processors using CUDA. J. Parallel Distrib. Comput. 68, 10 (Oct. 2008), 1370--1380. Google ScholarDigital Library
- J. Owens, D. Luebke, N. Govindaraju, M. Harris, J. Kruger, A. Lefohn, and T. Purcell. A survey of general-purpose computation on graphics hardware. Computer Graphics Forum, 26(1):80--113, March 2007.Google ScholarCross Ref
- B. Cabral, N. Cam, and J. Foran. Accelerated volume rendering and tomographic reconstruction using texture mapping hardware. In 1994 Symposium on Volume Visualization, 1994. Google ScholarDigital Library
- Pock, T.; Unger, M.; Cremers, D.; Bischof, H., "Fast and exact solution of Total Variation models on the GPU," Computer Vision and Pattern Recognition Workshops, 2008. CVPRW '08. IEEE Computer Society Conference on , vol., no., pp.1--8, 23-28 June 2008.Google Scholar
- T. Pock, M. Grabner, and H. Bischof. "Real-time Computation of Variational Methods on Graphics Hardware," Computer Vision Winter Workshop 2007, Michael Grabner, Helmut Grabner, St. Lambrecht, Austria, February 6-8.Google Scholar
Index Terms
- Accelerating total variation regularization for matrix-valued images on GPUs
Recommendations
Accelerating advanced mri reconstructions on gpus
CF '08: Proceedings of the 5th conference on Computing frontiersComputational acceleration on graphics processing units
(GPUs) can make advanced magnetic resonance imaging
(MRI) reconstruction algorithms attractive in clinical settings, thereby improving the quality of MR images across a broad spectrum of ...
Accelerating advanced MRI reconstructions on GPUs
Computational acceleration on graphics processing units (GPUs) can make advanced magnetic resonance imaging (MRI) reconstruction algorithms attractive in clinical settings, thereby improving the quality of MR images across a broad spectrum of ...
A unified optimizing compiler framework for different GPGPU architectures
This article presents a novel optimizing compiler for general purpose computation on graphics processing units (GPGPU). It addresses two major challenges of developing high performance GPGPU programs: effective utilization of GPU memory hierarchy and ...
Comments