The International Workshop for OpenCL (IWOCL, which is pronounced "eye-wok-ul") was conceived in a meeting between Simon McIntosh-Smith and Ben Bergen at the Los Alamos National Laboratory on May 8th 2012. McIntosh-Smith and Bergen lamented that there were no organized workshops or meetings for the rapidly growing OpenCL community. After testing this idea with colleagues over the next few months, they decided to create the kind of OpenCL conference they wanted to go to themselves, and thus IWOCL was born.
Proceeding Downloads
A look at the OpenCL 2.0 execution model
A popular approach to programming manycore GPUs is the Single Instruction Multiple Thread (SIMT) abstraction. SIMT has the benefit of presenting a "single thread" view, alleviating the complexity of explicitly vectorizing the source code. However, due ...
Achieving performance with OpenCL 2.0 on Intel® processor graphics
OpenCL 2.0 is here, supported for the first time on the 5th Generation Intel ® Core Processors with Intel ® Processor Graphics. We are going to talk about the things we have learned in the past year developing workloads for OpenCL 2.0 and speedups ...
Asynchronous OpenCL/MPI numerical simulations of conservation laws
Hyperbolic conservation laws are important mathematical models for describing many phenomena in physics or engineering. The Finite Volume (FV) method and the Discontinuous Galerkin (DG) methods are two popular methods for solving conservation laws on ...
Exploring the features of OpenCL 2.0
- Saoni Mukherjee,
- Xiang Gong,
- Leiming Yu,
- Carter McCardwell,
- Yash Ukidave,
- Tuan Dao,
- Fanny Nina Paravecino,
- David Kaeli
The growth in demand for heterogeneous accelerators has stimulated the development of cutting-edge features in newer accelerators. The heterogeneous programming frameworks such as OpenCL have matured over the years and have introduced new software ...
Mapping C++ AMP to OpenCL / HSA
High-level programming languages and domain-specific languages can often benefit from the increased power efficiency of heterogeneous computing. OpenCL can serve as a compiler target for portable code generation and runtime management. By using OpenCL ...
Performance optimization for a SHA-1 cryptographic workload expressed in OpenCL for FPGA execution
The introduction of Field Programmable Gate Array (FPGA) based devices for OpenCL applications provides an opportunity to develop kernels which are executed on application specific compute units which can be optimized for specific workloads such as ...
The great beyond: higher productivity, parallel processors and the extraordinary search for a theory of expression
Embedded system on a chip (SOC) vendors of today are perpetually challenged with the following goals; provide more compute capability and reduce cost and power. Unfortunately, these goals compete rather than cooperate. In order to address this, SOC ...
Update on the SYCL for OpenCL open standard to enable C++ meta programming on top of OpenCL
SYCL is a royalty-free, open standard, higher-level C++ programming model for OpenCL. C++ developers can produce easy-to-use template libraries for OpenCL devices, as well as easily porting C++ applications to use OpenCL. By providing ease-of-use, high ...
CHO: towards a benchmark suite for OpenCL FPGA accelerators
Programming FPGAs with OpenCL-based high-level synthesis frameworks is gaining attention with a number of commercial and research frameworks announced. However, there are no benchmarks for evaluating these frameworks. To this end, we present CHO ...
Kernel composition in SYCL
Parallel primitives libraries reduce the burden of knowledge required for developers to begin developing parallel applications and accelerating them with OpenCL. Unfortunately some current libraries implement primitives as individual kernels and so ...
Oclgrind: an extensible OpenCL device simulator
We describe Oclgrind, a platform designed to enable the creation of developer tools for analysis and debugging of OpenCL programs. Oclgrind simulates how OpenCL kernels execute with respect to the OpenCL standard, adhering to the execution and memory ...
A compute model for augmented reality with integrated-GPU acceleration
Augmented Reality (AR) is a live view of real-world sequences with enhanced digital information. To enable such enhancement, typical modules of an AR application can be very compute intensive, which is a factor that prevents users from having smooth ...
Nano Simbox: an OpenCL-accelerated framework for interactive molecular dynamics
With increasing computational power and the maturation of new consumer grade human-computer interaction devices, interactive computational experiments are becoming feasible[1].
OpenCL accelerated deep learning for visual understanding
Visual understanding is a broad term used to describe efforts to extract meaning and knowledge from images computationally. Much effort has been spent by researchers to develop methods to allow for information to be gained from images the way humans do. ...
A framework for visualization of OpenCL applications execution: a tutorial
Evaluating parallel and heterogeneous programs written in OpenCL can be challenging. Commonly, simulators can be used to aid the programmer in this regard. One of the fundamental requirements of any simulator is to provide traces, reports, and debugging ...
Khronos SYCL for OpenCL: a tutorial
SYCL ([sikə l] as in sickle) is a royalty-free, cross-platform C++ abstraction layer that builds on the underlying concepts, portability and efficiency of OpenCL, while adding the ease-of-use and flexibility of modern C++11. For example, SYCL enables ...
Propel with OpenCL: a deep dive workshop to create, debug, analyze and optimize OpenCL applications using Intel tools: a tutorial
This workshop and tutorial will allow developers to learn underlying architecture relevant to running OpenCL applications on Intel ®Processor Graphics and use Intel tools such as Intel ®OpenCL ™Code Builder and Intel ®VTune ™Amplifier to create, develop ...
Index Terms
- Proceedings of the 3rd International Workshop on OpenCL