ACM Home Page
Please provide us with feedback. Feedback
An integrated simdization framework using virtual vectors
Full text PdfPdf (339 KB)
Source International Conference on Supercomputing archive
Proceedings of the 19th annual international conference on Supercomputing table of contents
Cambridge, Massachusetts
SESSION: Session 5: compilers II table of contents
Pages: 169 - 178  
Year of Publication: 2005
ISBN:1-59593-167-8
Authors
Peng Wu  IBM T.J. Watson Research Center, Yorktown Heights, NY
Alexandre E. Eichenberger  IBM T.J. Watson Research Center, Yorktown Heights, NY
Amy Wang  IBM Toronto Laboratory, Markham, Ontario, Canada
Peng Zhao  IBM Toronto Laboratory, Markham, Ontario, Canada
Sponsor
SIGARCH: ACM Special Interest Group on Computer Architecture
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 8,   Downloads (12 Months): 53,   Citation Count: 5
Additional Information:

abstract   references   cited by   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1088149.1088172
What is a DOI?

ABSTRACT

Automatic simdization for multimedia extensions faces several new challenges that are not present in traditional vectorization. Some of the new issues are due to the more restrictive SIMD architectures designed for multimedia extensions. Among them are alignment constraints, lack of memory gather and scatter support, and the short and fixed-length nature of SIMD vectors. Since these constraints affect some very basic components of a program, a compiler must not only provide solid solutions to individual issues, but also take an integrated approach to address these constraints in combination.In this paper, we propose a simdization framework that addresses several orthogonal aspects of simdization, such as alignment handling, simdization of loops with mixed data lengths, and SIMD parallelism extraction from different program scopes (from basic blocks to inner loops). The novelty of this framework is its ability to facilitate interactions between different techniques based on the simple intermediate representation of virtual vectors. Measurements on a PPC970 with a VMX SIMD unit indicate speedup factors of up to 8.11 for numerical/video/communication kernels and speedup factors of up to 2.16 for benchmarks, when automatic simdization is turned on.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
IBM Corporation. PowerPC Microprocessor Family: AltiVec Technology Programming Environments Manual, July 2004.
 
2
IA32 Intel Architecture Software Developer's Manual with Preliminary Intel Pentium 4 Processor Information Volume 1: Basic Architecture. Intel Corporation.
 
3
B. Flachs et al. A Streaming Processing Unit for a CELL Processor. In IEEE International Solid-State Circuits Conference, February 2005.
 
4
5
6
 
7
 
8
 
9
 
10
Gerald Cheong and Monica S. Lam. An Optimizer for Multimedia Instruction Sets. In Second SUIF Compiler Workshop, August 1997.
 
11
12
 
13
Free Software Foundation. http://gcc.gnu.org/projects/tree-ssa.
 
14
 
15
16
17
 
18
Franz Franchetti, Stefan Kral, Huergen Lorenz, and Christoph Ueberhuber. Efficient utilization of SIMD Extensions. In IEEE Proceedings Special Issue on Program Generation, Optimization, and Platform Adaptation, 2005.
 
19
 
20
Crescent Bay Software. VAST-F/AltiVec: Automatic Fortran Vectorizer for PowerPC Vector Unit. http://www.psrv.com/vast_altivec.html, 2004.

Collaborative Colleagues:
Peng Wu: colleagues
Alexandre E. Eichenberger: colleagues
Amy Wang: colleagues
Peng Zhao: colleagues