research-article

LLVM Compiler Implementation for Explicit Parallelization and SIMD Vectorization

Authors:
Xinmin Tian

Intel Corporation, Santa Clara, CA, US

Intel Corporation, Santa Clara, CA, US
View Profile

,
Hideki Saito

Intel Corporation, Santa Clara, CA, US

Intel Corporation, Santa Clara, CA, US
View Profile

,
Ernesto Su

Intel Corporation, Santa Clara, CA, US

Intel Corporation, Santa Clara, CA, US
View Profile

,
Jin Lin

Intel Corporation, Santa Clara, CA, US

Intel Corporation, Santa Clara, CA, US
View Profile

,
Satish Guggilla

Intel Corporation, Santa Clara, CA, US

Intel Corporation, Santa Clara, CA, US
View Profile

,
Diego Caballero

Intel Corporation, Santa Clara, CA, US

Intel Corporation, Santa Clara, CA, US
View Profile

,
Matt Masten

Intel Corporation, Santa Clara, CA, US

Intel Corporation, Santa Clara, CA, US
View Profile

,
Andrew Savonichev

Intel Corporation, Santa Clara, CA, US

Intel Corporation, Santa Clara, CA, US
View Profile

,
Michael Rice

Intel Corporation, Santa Clara, CA, US

Intel Corporation, Santa Clara, CA, US
View Profile

,
Elena Demikhovsky

Intel Corporation, Santa Clara, CA, US

Intel Corporation, Santa Clara, CA, US
View Profile

,
Ayal Zaks

Intel Corporation, Santa Clara, CA, US

Intel Corporation, Santa Clara, CA, US
View Profile

,
Gil Rapaport

Intel Corporation, Santa Clara, CA, US

Intel Corporation, Santa Clara, CA, US
View Profile

,
Abhinav Gaba

Intel Corporation, Santa Clara, CA, US

Intel Corporation, Santa Clara, CA, US
View Profile

,
Vasileios Porpodas

Intel Corporation, Santa Clara, CA, US

Intel Corporation, Santa Clara, CA, US
View Profile

,
Eric Garcia

Intel Corporation, Santa Clara, CA, US

Intel Corporation, Santa Clara, CA, US
View Profile

LLVM-HPC'17: Proceedings of the Fourth Workshop on the LLVM Compiler Infrastructure in HPCNovember 2017Article No.: 4Pages 1–11https://doi.org/10.1145/3148173.3148191

Published:12 November 2017Publication History

LLVM-HPC'17: Proceedings of the Fourth Workshop on the LLVM Compiler Infrastructure in HPC

Pages 1–11

ABSTRACT

With advances of modern multi-core processors and accelerators, many modern applications are increasingly turning to compiler-assisted parallel and vector programming models such as OpenMP, OpenCL, Halide, Python and TensorFlow. It is crucial to ensure that LLVM-based compilers can optimize parallel and vector code as effectively as possible. In this paper, we first present a set of updated LLVM IR extensions for explicitly parallel, vector, and offloading program constructs in the context of C/C++/OpenCL. Secondly, we describe our LLVM design and implementation for advanced features in OpenMP such as parallel loop reduction, task and taskloop, SIMD loop and functions, and we discuss the impact of our updated implementation on existing LLVM optimization passes. Finally, we present a re-use case of our infrastructure to enable explicit parallelization and vectorization extensions in our OpenCL compiler to achieve ~35x performance speedup for a well-known autonomous driving workload on a multi-core platform configured with Intel® Xeon® Scalable Processors.

References

C. Lattner and V. Adve. LLVM: A compilation framework for lifelong program analysis & transformation. In CGO '04, pages 75--86, 2004. Google ScholarCross Ref
X. Tian, M. Girkar, A. J.C. Bik, and H. Saito, "Practical Compiler Techniques on Efficient Multithreaded Code Generation for OpenMP Programs," The Computer Journal, Oxford, Vol. 48, Issue 5, pps. 558--601, 2005.Google Scholar
X. Tian, H. Saito, M. Girkar, S. Preis, S. Kozhukhov, A.G. Cherkasov, C. Nelson, N. Panchenko, R. Geva, Compiling C/C++ SIMD Extensions for Function and Loop Vectorization on Multicore-SIMD Processors. In Proc. of IEEE 26th International Parallel and Distributed Processing Symposium - Multicore and GPU Prog. Models, Lang. and Compilers Workshop, pp. 2349--2358, 2012.Google Scholar
OpenMP Architecture Review Board, "OpenMP Application Program Interface," v4.5, Oct. 2015, http://www.openmp.orgGoogle Scholar
J. Zhao, S. Nagarakatte, M. M. Martin, and S. Zdancewic. Formalizing the LLVM intermediate representation for verified program transformations. In POPL '12, pages 427--440, 2012. Google ScholarDigital Library
Intel Corporation, LLVM Intrinsic function and Tag name string interface specitication for directive representation, April 12, 2017Google Scholar
A. Zaks, et..al., "[llvm-dev] RFC: Extending LV to vectorize outerloops", Sept. 21, 2016, Intel Corporation.Google Scholar
H. Finkel and X. Tian "[llvm-dev] RPC: A Proposal for adding an experimental IR-level region-annotation infrastructure, Jan. 11, 2017. http://lists.llvm.org/pipermail/llvm-dev/2017-January/108906.html.Google Scholar
H. Saito, et. al., "Extending LoopVectorizer towards supporting OpenMP4.5 SIMD and outer loop auto-vectorization", LLVM Developer's Conference, Nov. 2016Google Scholar
X. Tian, et.al. "Proposal for function vectorization and loop vectorization with function calls", March 2, 2016. Intel Corp. http://lists.llvm.org/pipermail/cfe-dev/2016-March/047732.html.Google Scholar
F. Homm, N. Kaempchen, J. Ota and D. Burschka, "Efficient Occupancy Grid Computation on GPU with Lidar and Radar for Road Boundary Detection", In Proc. of IEEE Intelligent Vehicle Symposium, pp. 1006--1013 Universiry of California, San Diego, CA, USA, June 21-24, 2010. Google ScholarCross Ref
X. Tian, H. Saito, E. Su, A. Gaba, M. Masten, E. Garcia, A. Zaks, "LLVM Framework and IR Extensions for Parallelization, SIMD Vectorization and Offloading". LLVM-HPC@SC 2016: 21--31.Google Scholar
T.B. Schardl, W.S. Moses, C.E. Leiserson, "Tapir: Embedding Fork-Join Parallelism into LLVM's Intermediate Representation", PPoPP'17, Feburary. 4-7, 2017, Austin, Texas, USA. Google ScholarDigital Library

LLVM Compiler Implementation for Explicit Parallelization and SIMD Vectorization
1. Software and its engineering
  1. Software notations and tools
    1. General programming languages
      1. Language types

Recommendations

LLVM framework and IR extensions for parallelization, SIMD vectorization and offloading
LLVM-HPC '16: Proceedings of the Third Workshop on LLVM Compiler Infrastructure in HPC

LLVM has become an integral part of the software-development ecosystem for developing advanced compilers, high-performance computing software and tools. This paper presents a small set of LLVM IR extensions for explicitly parallel vector, and offloading ...
Read More
SIMD parallel MCMC sampling with applications for big-data Bayesian analytics

Computational intensity and sequential nature of estimation techniques for Bayesian methods in statistics and machine learning, combined with their increasing applications for big data analytics, necessitate both the identification of potential ...
Read More
Support OpenCL 2.0 Compiler on LLVM for PTX Simulators

Heterogeneous systems that consist of multiple CPUs and GPUs for high-performance computing are becoming increasingly popular, and OpenCL (Open Computing Language) provides a framework for writing programs that can be executed across heterogeneous ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in

LLVM-HPC'17: Proceedings of the Fourth Workshop on the LLVM Compiler Infrastructure in HPC
November 2017
106 pages
ISBN:9781450355650
DOI:10.1145/3148173

Copyright © 2017 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 12 November 2017
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
LLVM
Multi- and many-core processors
OpenMP
accelerators
offloading
parallelization
vectorization
Qualifiers
- research-article
- Research
- Refereed limited
Conference

Acceptance Rates
LLVM-HPC'17 Paper Acceptance Rate9of10submissions,90%Overall Acceptance Rate16of22submissions,73%
More
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 14
  Total Citations
  View Citations
- 618
  Total Downloads
- Downloads (Last 12 months)100
- Downloads (Last 6 weeks)14
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

LLVM Compiler Implementation for Explicit Parallelization and SIMD Vectorization

LLVM-HPC'17: Proceedings of the Fourth Workshop on the LLVM Compiler Infrastructure in HPC

ABSTRACT

References

Cited By

Recommendations

LLVM framework and IR extensions for parallelization, SIMD vectorization and offloading

SIMD parallel MCMC sampling with applications for big-data Bayesian analytics

Support OpenCL 2.0 Compiler on LLVM for PTX Simulators

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

LLVM Compiler Implementation for Explicit Parallelization and SIMD Vectorization

LLVM-HPC'17: Proceedings of the Fourth Workshop on the LLVM Compiler Infrastructure in HPC

ABSTRACT

References

Cited By

Recommendations

LLVM framework and IR extensions for parallelization, SIMD vectorization and offloading

SIMD parallel MCMC sampling with applications for big-data Bayesian analytics

Support OpenCL 2.0 Compiler on LLVM for PTX Simulators

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media