skip to main content
10.1145/2148600.2148629acmconferencesArticle/Chapter ViewAbstractPublication PagesscConference Proceedingsconference-collections
poster

Poster: determining code segments that can benefit from execution on GPUs

Published:12 November 2011Publication History

ABSTRACT

Graphics Processing Units (GPUs) are a low cost, low power means of exploiting large-scale parallelism. Source-to-source transformation tools for mapping CPU code to GPU code (e.g. PGI Accelerator) are available. But identification of those code segments in an application that, when run on a GPU will attain significant performance enhancement, requires expert knowledge of algorithms, architectures, compilers and the program structure which many application developers may not possess. This poster demonstrates a process for identifying the code segments in programs optimized for multicore chip execution that are candidates for GPU execution and ranking these code segments by probable speedup. The identification and ranking are based on measurements of the programs by the PerfExpert tool and a new tool MACPO, which measures execution properties of data structures. The poster describes the identification and ranking process, gives the results of applying the process to the Rodina parallel benchmarks and gives the underlying assumptions for and the limitations of the process.

References

  1. M. Wolfe, "Implementing the pgi accelerator model." in GPGPU, 2010. pp. 43--50. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. M. Burtscher, B. D. Kim, J. Diamond, J. Mccalpin, L. Koesterke, and J. Browne. "PerfExpert: An Easy-to-Use Performance Diagnosis Tool for HPC Applications," in Computer. IEEE, 2010, pp. 1--11. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. O. A. Sopeju, M. Burtscher, A. Rane, and J. Browne, "AutoSCOPE : Automatic Suggestions for Code Optimizations using PerfExpert," Evaluation.Google ScholarGoogle Scholar
  4. A. Rane and J. Browne, "Performance optimization of data structures using memory access characterization," in CLUSTER. IEEE, 2011, pp. 570--574. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. S. Che, M. Boyer, J. Meng, D. Tarjan, J. W. Sheaffer, S.-H. Lee, and K. Skadron, "Rodinia: A benchmark suite for heterogeneous computing,". 2009 IEEE International Symposium on Workload Characterization IISWC. vol. 2009, no. c, pp. 44--54, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Poster: determining code segments that can benefit from execution on GPUs

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      SC '11 Companion: Proceedings of the 2011 companion on High Performance Computing Networking, Storage and Analysis Companion
      November 2011
      166 pages
      ISBN:9781450310307
      DOI:10.1145/2148600

      Copyright © 2011 Authors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 12 November 2011

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Author Tags

      Qualifiers

      • poster

      Acceptance Rates

      Overall Acceptance Rate1,516of6,373submissions,24%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader