research-article

Towards tangent-linear GPU programs using OpenACC

Authors:
Bui Tat Minh

King Mongkut's University of Technology North Bangkok (KMUTNB), Bangkok, Thailand

King Mongkut's University of Technology North Bangkok (KMUTNB), Bangkok, Thailand
View Profile

,
Michael Förster

RWTH Aachen University, Aachen, Germany

RWTH Aachen University, Aachen, Germany
View Profile

,
Uwe Naumann

RWTH Aachen University, Aachen, Germany

RWTH Aachen University, Aachen, Germany
View Profile

SoICT '13: Proceedings of the 4th Symposium on Information and Communication TechnologyDecember 2013Pages 27–34https://doi.org/10.1145/2542050.2542059

Published:05 December 2013Publication History

SoICT '13: Proceedings of the 4th Symposium on Information and Communication Technology

Pages 27–34

ABSTRACT

Recently, Graphics Processing Units(GPUs) have emerged as a very promisingly powerful resource in scientific computing. Algorithmic Differentiation is a technique to numerically evaluate first and higher derivatives of a function specified by a computer program efficiently up to machine precision. Derivative programs which are used to compute derivatives of functions are so-called tangent-linear program and adjoint program. This paper aims to offload any particular independent loop in tangent-linear program to GPUs. The proposed technique is OpenACC APIs for annotating an independent loop to be executed in parallel on GPUs. Our case study for OpenACC tangent-linear code shows an enormous speedup. OpenACC shows its simplicity of accelerating tangent-linear code by hiding the data movement between CPU and GPU memory.

References

The OpenACC#8482; Application Programming Interface version 1.0, November 2011.Google Scholar
M. Förster, U. Naumann, and J. Utke. Toward Adjoint OpenMP. Technical Report AIB-2011-13, RWTH Aachen, July 2011.Google Scholar
T. P. Group. OpenACC Kernels and Parallel Constructs. http://www.pgroup.com/lit/articles/insider/v4n2a1.htm, August 2012. {Online; accessed 29-July-2013}.Google Scholar
T. P. Group. Userforum: Initialize global variables with OpenACC pragma. www.pgroup.com/userforum/viewtopic.php?t=3869, May 2013. {Online; accessed 03-August-2013}.Google Scholar
B. T. Minh. Tangent-Linear and Adjoint GPU Code. diploma thesis, The Sirindhorn International Thai-German Graduate School of Engineering, King Mongkut's University of Technology North Bangkok, May 2013.Google Scholar
U. Naumann. The Art of Differentiating Computer Programs: An Introduction to Algorithmic Differentiation. SIAM, 2012. Google ScholarDigital Library

Index Terms

Towards tangent-linear GPU programs using OpenACC

Recommendations

OpenACC Execution Models for Manycore Processor with ARM SVE
HPCAsia '23 Workshops: Proceedings of the HPC Asia 2023 Workshops

OpenACC is designed to offer performance portability across CPUs with SIMD extensions and accelerators based on GPU or manycore architecture. We are working on the design of OpenACC compiler for A64FX manycore processor with Arm SVE. We use a source-to-...
Read More
OpenACC acceleration of the Nek5000 spectral element code

We present a case study of porting NekBone, a skeleton version of the Nek5000 code, to a parallel GPU-accelerated system. Nek5000 is a computational fluid dynamics code based on the spectral element method used for the simulation of incompressible flow. ...
Read More
Hybridizing S3D into an Exascale application using OpenACC: An approach for moving to multi-petaflops and beyond
SC '12: Proceedings of the 2012 International Conference for High Performance Computing, Networking, Storage and Analysis

Hybridization is the process of converting an application with a single level of parallelism to an application with multiple levels of parallelism. Over the past 15 years a majority of the applications that run on High Performance Computing systems have ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
SoICT '13: Proceedings of the 4th Symposium on Information and Communication Technology
December 2013
345 pages
ISBN:9781450324540
DOI:10.1145/2542050
General Chairs:
Thang Huynh Quyet
HUST, Vietnam
,
Binh Nguyen Thanh
DUT, Vietnam
,
Program Chairs:
Tien Do Van
BME, Hungary
,
Marc Bui
EPHE, France
,
Son Ngo Hong
HUST, Vietnam
Copyright © 2013 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 5 December 2013
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
OpenACC
SIMD
arithmetic differentiation
data parallelism
tangent-linear model
Qualifiers
- research-article
Conference

Acceptance Rates
SoICT '13 Paper Acceptance Rate40of80submissions,50%Overall Acceptance Rate147of318submissions,46%
More
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 67
  Total Downloads
- Downloads (Last 12 months)2
- Downloads (Last 6 weeks)1
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Towards tangent-linear GPU programs using OpenACC

SoICT '13: Proceedings of the 4th Symposium on Information and Communication Technology

ABSTRACT

References

Cited By

Index Terms

Recommendations

OpenACC Execution Models for Manycore Processor with ARM SVE

OpenACC acceleration of the Nek5000 spectral element code

Hybridizing S3D into an Exascale application using OpenACC: An approach for moving to multi-petaflops and beyond

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Towards tangent-linear GPU programs using OpenACC

SoICT '13: Proceedings of the 4th Symposium on Information and Communication Technology

ABSTRACT

References

Cited By

Index Terms

Recommendations

OpenACC Execution Models for Manycore Processor with ARM SVE

OpenACC acceleration of the Nek5000 spectral element code

Hybridizing S3D into an Exascale application using OpenACC: An approach for moving to multi-petaflops and beyond

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media