| An integrated simdization framework using virtual vectors |
| Source
|
International Conference on Supercomputing
Proceedings of the 19th annual international conference on Supercomputing
Cambridge, Massachusetts
SESSION: Session 5: compilers II
Pages: 169 - 178
Year of Publication: 2005
ISBN: 1-59593-167-8
Authors
Peng Wu, IBM T.J. Watson Research Center, Yorktown Heights, NY
Alexandre E. Eichenberger, IBM T.J. Watson Research Center, Yorktown Heights, NY
Amy Wang, IBM Toronto Laboratory, Markham, Ontario, Canada
Peng Zhao, IBM Toronto Laboratory, Markham, Ontario, Canada
ABSTRACT
Automatic simdization for multimedia extensions faces several new challenges that are not present in traditional vectorization. Some of the new issues are due to the more restrictive SIMD architectures designed for multimedia extensions. Among them are alignment constraints, lack of memory gather and scatter support, and the short and fixed-length nature of SIMD vectors. Since these constraints affect some very basic components of a program, a compiler must not only provide solid solutions to individual issues, but also take an integrated approach to address these constraints in combination.

In this paper, we propose a simdization framework that addresses several orthogonal aspects of simdization, such as alignment handling, simdization of loops with mixed data lengths, and SIMD parallelism extraction from different program scopes (from basic blocks to inner loops). The novelty of this framework is its ability to facilitate interactions between different techniques based on the simple intermediate representation of virtual vectors. Measurements on a PPC970 with a VMX SIMD unit indicate speedup factors of up to 8.11 for numerical/video/communication kernels and speedup factors of up to 2.16 for benchmarks, when automatic simdization is turned on.
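To make the alignment constraint mentioned in the abstract concrete, the following is a minimal Python sketch that emulates a SIMD unit whose loads can only start at vector boundaries. It is not the framework described in the paper; it only illustrates the general idea that a misaligned access can be assembled from two aligned loads and a shift. The vector length `V` and the helper names `aligned_load`, `shift_pair`, `misaligned_load`, and `simd_add` are all hypothetical and chosen for illustration.

```python
V = 4  # assumed number of elements per vector register (e.g. 4 x 32-bit)


def aligned_load(mem, i):
    """Return the V-element vector that contains index i.

    Models a load that truncates the address to a vector boundary.
    """
    base = (i // V) * V
    return mem[base:base + V]


def shift_pair(lo, hi, offset):
    """Concatenate two vectors and select V elements starting at offset.

    Stands in for a permute or shift-pair style instruction.
    """
    return (lo + hi)[offset:offset + V]


def misaligned_load(mem, i):
    """Assemble mem[i:i+V] from two aligned loads and one shift."""
    offset = i % V
    lo = aligned_load(mem, i)
    if offset == 0:
        return lo  # already aligned, no shift needed
    hi = aligned_load(mem, i + V)
    return shift_pair(lo, hi, offset)


def simd_add(a, b, b_start, n):
    """Compute c[i] = a[i] + b[b_start + i] for i in range(n), V at a time.

    Assumes n is a multiple of V and b is padded to whole vectors.
    """
    c = []
    for i in range(0, n, V):
        va = aligned_load(a, i)               # a is accessed at aligned offsets
        vb = misaligned_load(b, b_start + i)  # b may be misaligned relative to a
        c.extend(x + y for x, y in zip(va, vb))
    return c
```

For example, `simd_add(list(range(8)), list(range(100, 112)), 1, 8)` reads `b` starting one element past a vector boundary, so every vector of `b` is built from two aligned loads.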
CITED BY 5
|
|
A. E. Eichenberger, J. K. O'Brien, K. M. O'Brien, P. Wu, T. Chen, P. H. Oden, D. A. Prener, J. C. Shepherd, B. So, Z. Sura, A. Wang, T. Zhang, P. Zhao, M. K. Gschwind, R. Archambault, Y. Gao, R. Koo, Using advanced compiler technology to exploit the performance of the Cell Broadband Engine™ architecture, IBM Systems Journal, v.45 n.1, p.59-84, January 2006