|
ABSTRACT
Multimedia applications have become a dominant computing workload for computer systems as well as for wireless-based devices. Due to their repetitive computing and memory intensive nature, they can take effective advantage from Processor-In-Memory (PIM) technology. In this paper, a new low-power PIM-based 32-bit reconfigurable datapath optimized for multimedia applications is presented. The new circuit efficiently performs parallel arithmetic operations on either 8-, 16-, or 32-bit integer data or on 32-bit single precision floating-point data. As a result, high flexibility is provided at a very low hardware cost. When implemented using the UMC 0.18 μm 1.8 V CMOS technology, the proposed datapath exhibits a 285 MHz running frequency, dissipates just 0.12 mW/MHz and occupies a silicon area of only 107,323 μm2. When performing 2D-DCT, proposed architecture consumes 74% less power and is 28% more power efficient compared to top-of-the-line commercial TI DSP
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
Jinwoo Suh , Eun-Gyu Kim , Stephen P. Crago , Lakshmi Srinivasan , Matthew C. French, A performance analysis of PIM, stream processing, and tiled processing on memory-intensive signal processing kernels, Proceedings of the 30th annual international symposium on Computer architecture, June 09-11, 2003, San Diego, California
|
| |
2
|
|
| |
3
|
|
| |
4
|
Jeffrey Draper , Jeff Sondeen , Sumit Mediratta , I. Kim, Implementation of a 32-bit RISC Processor for the Data-Intensive Architecture Processing-In-Memory Chip, Proceedings of the IEEE International Conference on Application-Specific Systems, Architectures, and Processors, p.163, July 17-19, 2002
|
| |
5
|
M. Margala, R. Lin, "Highly efficient digital CMOS accelerator for image and graphics processing", 15th Annual IEEE International ASIC/SOC Conference 2002, pp.127--132, 25-28 September 2002.
|
| |
6
|
|
| |
7
|
B. S.-H. Kwan, B. F. Cockburn, D. G. Elliott, "Implementation of DSP-RAM: an architecture for parallel digital signal processing in memory", Canadian Conference on Electrical and Computer Engineering, 2001, Vol. 1, pp. 341--345, 13-16 May 2001.
|
| |
8
|
J.-S. Moon, T.-J. Kwon, J. Sondeen, J.Draper ,"An area-efficient standard-cell floating-point unit design for a processing-in-memory system", Proc. of the Conference on European Solid-State Circuits, 2003, ESSCIRC '03, pp. 57--60, 16-18 September 2003.
|
| |
9
|
J. Fritts, W. Wolf, B. Liu, "Understanding multimedia application characteristics for designing programmable media processors, "SPIE Photonics West, Media Processors '99, San Jose, CA, pp. 2--13, January 1999.
|
| |
10
|
S. Nazareth, R. Asokan. "Processor Architectures for Multimedia", academics paper, November 2001. http://www.cs.dartmouth.edu/~nazareth/academic/CS107.pdf
|
| |
11
|
The institute of Electrical and Electronics Engineers, Inc., "IEEE Standard for Binary Floating-Point Arithmetic", ANSI/IEEE Standard No. 754-1985, New York, Aug. 1985.
|
| |
12
|
|
| |
13
|
A. Farooqui, V. G. Oklobdzija, "A Programmable Data-Path for MPEG-4 and Natural Hybrid Video Coding", 34th Annual Asilomar Conference on signals, Systems and Computers, Pacific Grove, California, October 29 - November 1, 2000.
|
| |
14
|
|
| |
15
|
W.H. Chen, C.H. Smith, and S.C. Fralick, "A Fast Computational Algorithm for the Discrete Cosine Transform," IEEE Trans. Communications, vol. 25, pp. 1004--1009, 1977.
|
| |
16
|
B. J. Jasionowski, "A Processor-in-memory for Image and Video Compression", MS Thesis, University of Rochester, Rochester, New York, 2004.
|
| |
17
|
I. Daubechies, "Orthonormal Bases of Compactly Supported Wavelets." Comm. Pure Appl. Math.,Vol 41, pp. 909--996, 1988.
|
| |
18
|
M. K.-F. Lay, "A Processor-in-Memory for Image Compression using the Discrete Wavelet Transform", MS Thesis, University of Rochester, Rochester, New York, 2004.
|
| |
19
|
TI TMS320VC5509 Fixed-Point DSP, http://focus.ti.com/docs/prod/folders/tms320vc5509.html
|
| |
20
|
TI TMS320VC5509 Revision D Power Consumption Measurements, January 8, 2002.
|
| |
21
|
TI TMS320C55x Hardware Extensions for Image/Video Applications Programmer's Reference, February 2002, http://focus.ti.com/lit/ug/spru098/spru098.pdf
|
CITED BY
|
Yoonjin Kim , Ilhyun Park , Kiyoung Choi , Yunheung Paek, Power-conscious configuration cache structure and code mapping for coarse-grained reconfigurable architecture, Proceedings of the 2006 international symposium on Low power electronics and design, October 04-06, 2006, Tegernsee, Bavaria, Germany
|
|