ABSTRACT
VLIW DSP architectures exhibit heterogeneous connections between functional units and register files for speeding up special tasks. Such architectural characteristics can be effectively exploited through the use of complex instruction set extensions (ISEs). Although VLIWs are increasingly being used for DSP applications to achieve very high performance, such architectures are known to suffer from increased code size. This paper addresses how to generate ISEs that can result in significant code size reduction in VLIW DSPs without degrading performance. Unfortunately, contemporary techniques for instruction set synthesis fail to extract legal ISEs for heterogeneous-connectivity-based architectures. We propose a Heuristic-based algorithm to synthesize ISEs for a generalized heterogeneous-connectivity-based VLIW DSP architecture. We achieve an average code size reduction of 25% on the MiBench suite with no penalty in performance by applying our ISE generation algorithm on the TI TMS320C6xx, a representative VLIW DSP.
- http://www.ti.com. TI TMS320C6xx user manual.Google Scholar
- Silvina Hanono and Srinivas Devadas. Instruction Selection, Resource Allocation and Scheduling in the AVIV Retargetable Code Generator. In Proc. of the Design Automation Conference (DAC), pages 510--515, 1998. Google ScholarDigital Library
- Thomas M. Conte, Sanjeev Banerjia, Sergei Y. Larin, Kishore N. Menezes and Sumedh W. Sathaye. Instruction Fetch Mechanisms for VLIW Architectures with Compressed Encodings. In Proc. 29th Int'l Symposium on Microarchitecture, pages 201--211, 1996. Google ScholarDigital Library
- Shail Aditya, Scott A. Mahlke and B. Ramakrishna Rau. Code Size Minimization and Retargetable Assembly for EPIC and VLIW Instruction Formats. Technical Report, HP Labs PL-2000-141.Google ScholarDigital Library
- M. Kozuch and A. Wolfe. Compression of Embedded System Programs. In Proc. of the Int'l Conference on Computer Design (ICCD), pages 270--277, 1994. Google ScholarDigital Library
- Stan Y. Liao, Srinivas Devadas and Kurt Keutzer. Code Density Optimization for Embedded DSP Processors using Data Compression Techniques. IEEE Transactions on CAD, 17(7):601--608, 1998. Google ScholarDigital Library
- Huiyang Zhou and Thomas M. Conte. Code Size Efficiency in Global Scheduling for ILP Processors. 6th Annual Workshop on ICCA, 2002. Google ScholarDigital Library
- Jong-eun Lee, Kiyoung Choi and Nikil Dutt. Efficient Instruction Encoding for Automatic Instruction Set Design of Configurable ASIPs. In Proc. of the Int'l Conference on Computer Aided Design (ICCAD), 2002. Google ScholarDigital Library
- "Hoon Choi, Jong-Sun Kim, Chi-Won Yoon, In-Cheol Park, Seung Ho Hwang and Chong-Min Kyung. Synthesis of Application Specific Instructions for Embedded DSP Software. IEEE Transactions on Computers, 1999. Google ScholarDigital Library
- Rainer Leupers and Peter Marwedel. Instruction Selection for Embedded DSPs with Complex Instructions. In Proc. of the European Design Automation Conference (EURO-DAC), 1996. Google ScholarDigital Library
- Marnix Arnold and Henk Corporaal. Instruction Set Synthesis Using Operation Pattern Detection. 5th Annual Conference of ASCI, 1999.Google Scholar
- Fei Sun, Srivaths Ravi, Anand Raghunathan and Niraj K. Jha. Synthesis of Custom Processors Based on Extensible Platforms. In Proc. of the Int'l Conference on Computer Aided Design (ICCAD), 2002. Google ScholarDigital Library
- Kubilay Atasu, Laura Pozzi and Paolo Ienne. Automatic Application-Specific Instruction-Set Extensions under Microarchitectural Constraints. In Proc. of the Design Automation Conference (DAC), 2003. Google ScholarDigital Library
- Partha Biswas and Nikil Dutt. Greedy and Heuristic-based Algorithms for Synthesis of Complex Instructions in Heterogeneous-Connectivity-based DSPs. UCI-ICS TR 03-16, 2003.Google Scholar
- R.M. Karp. Reducibility Among Combinatorial Problems. Complexity of Computer Computations, Plenum Press, 1972.Google ScholarCross Ref
- S.S. Muchnick. Advanced Compiler Design and Implementation, Morgan Kaufmann, 1997. Google ScholarDigital Library
- Ashok Halambi, Peter Grun, Vijay Ganesh, Asheesh Khare, Nikil Dutt and Alex Nicolau. EXPRESSION: A language for architecture exploration through compiler/simulator retargetability. In Proc. of the Conference on Design, Automation and Test in Europe (DATE), 1999. Google ScholarDigital Library
- Mathew R. Guthaus, Jeffrey S. Ringenberg, Dan Ernst, Todd M. Austin, Trevor Mudge and Richard B. Brown. MiBench: A Free Commercially Representative Embedded Benchmark Suite. http://www.eecs.umich.edu/jringenb/mibench/.Google Scholar
Index Terms
- Reducing code size for heterogeneous-connectivity-based VLIW DSPs through synthesis of instruction set extensions
Recommendations
Code Size Reduction in Heterogeneous-Connectivity-Based DSPs Using Instruction Set Extensions
Existing trend of processors shows a progress toward customizable and reconfigurable architectures. In this paper, we study the benefit of combining the architectural design of a VLIW DSP and the concepts of modern customizable processors like ASIPs (...
Compiling for automatically generated instruction set extensions
CGO '12: Proceedings of the Tenth International Symposium on Code Generation and OptimizationThe automatic generation of instruction set extensions (ISEs) to provide application-specific acceleration for embedded processors has been a productive area of research in recent years. The use of automatic algorithms, however, results in instructions ...
Block-aware instruction set architecture
Instruction delivery is a critical component for wide-issue, high-frequency processors since its bandwidth and accuracy place an upper limit on performance. The processor front-end accuracy and bandwidth are limited by instruction-cache misses, ...
Comments