ACM Home Page
Please provide us with feedback. Feedback
Cost-effective low-power processor-in-memory-based reconfigurable datapath for multimedia applications
Full text PdfPdf (417 KB)
Source International Symposium on Low Power Electronics and Design archive
Proceedings of the 2005 international symposium on Low power electronics and design table of contents
San Diego, CA, USA
SESSION: Special purpose processing table of contents
Pages: 161 - 166  
Year of Publication: 2005
ISBN:1-59593-137-6
Authors
Marco Lanuzza  University of Calabria, Italy
Martin Margala  University of Rochester, Rochester, NY
Pasquale Corsonello  University of Calabria, Italy
Sponsors
SIGDA: ACM Special Interest Group on Design Automation
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 1,   Downloads (12 Months): 68,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1077603.1077645
What is a DOI?

ABSTRACT

Multimedia applications have become a dominant computing workload for computer systems as well as for wireless-based devices. Due to their repetitive computing and memory intensive nature, they can take effective advantage from Processor-In-Memory (PIM) technology. In this paper, a new low-power PIM-based 32-bit reconfigurable datapath optimized for multimedia applications is presented. The new circuit efficiently performs parallel arithmetic operations on either 8-, 16-, or 32-bit integer data or on 32-bit single precision floating-point data. As a result, high flexibility is provided at a very low hardware cost. When implemented using the UMC 0.18 μm 1.8 V CMOS technology, the proposed datapath exhibits a 285 MHz running frequency, dissipates just 0.12 mW/MHz and occupies a silicon area of only 107,323 μm2. When performing 2D-DCT, proposed architecture consumes 74% less power and is 28% more power efficient compared to top-of-the-line commercial TI DSP


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
 
3
 
4
 
5
M. Margala, R. Lin, "Highly efficient digital CMOS accelerator for image and graphics processing", 15th Annual IEEE International ASIC/SOC Conference 2002, pp.127--132, 25-28 September 2002.
 
6
 
7
B. S.-H. Kwan, B. F. Cockburn, D. G. Elliott, "Implementation of DSP-RAM: an architecture for parallel digital signal processing in memory", Canadian Conference on Electrical and Computer Engineering, 2001, Vol. 1, pp. 341--345, 13-16 May 2001.
 
8
J.-S. Moon, T.-J. Kwon, J. Sondeen, J.Draper ,"An area-efficient standard-cell floating-point unit design for a processing-in-memory system", Proc. of the Conference on European Solid-State Circuits, 2003, ESSCIRC '03, pp. 57--60, 16-18 September 2003.
 
9
J. Fritts, W. Wolf, B. Liu, "Understanding multimedia application characteristics for designing programmable media processors, "SPIE Photonics West, Media Processors '99, San Jose, CA, pp. 2--13, January 1999.
 
10
S. Nazareth, R. Asokan. "Processor Architectures for Multimedia", academics paper, November 2001. http://www.cs.dartmouth.edu/~nazareth/academic/CS107.pdf
 
11
The institute of Electrical and Electronics Engineers, Inc., "IEEE Standard for Binary Floating-Point Arithmetic", ANSI/IEEE Standard No. 754-1985, New York, Aug. 1985.
 
12
 
13
A. Farooqui, V. G. Oklobdzija, "A Programmable Data-Path for MPEG-4 and Natural Hybrid Video Coding", 34th Annual Asilomar Conference on signals, Systems and Computers, Pacific Grove, California, October 29 - November 1, 2000.
 
14
 
15
W.H. Chen, C.H. Smith, and S.C. Fralick, "A Fast Computational Algorithm for the Discrete Cosine Transform," IEEE Trans. Communications, vol. 25, pp. 1004--1009, 1977.
 
16
B. J. Jasionowski, "A Processor-in-memory for Image and Video Compression", MS Thesis, University of Rochester, Rochester, New York, 2004.
 
17
I. Daubechies, "Orthonormal Bases of Compactly Supported Wavelets." Comm. Pure Appl. Math.,Vol 41, pp. 909--996, 1988.
 
18
M. K.-F. Lay, "A Processor-in-Memory for Image Compression using the Discrete Wavelet Transform", MS Thesis, University of Rochester, Rochester, New York, 2004.
 
19
TI TMS320VC5509 Fixed-Point DSP, http://focus.ti.com/docs/prod/folders/tms320vc5509.html
 
20
TI TMS320VC5509 Revision D Power Consumption Measurements, January 8, 2002.
 
21
TI TMS320C55x Hardware Extensions for Image/Video Applications Programmer's Reference, February 2002, http://focus.ti.com/lit/ug/spru098/spru098.pdf


Collaborative Colleagues:
Marco Lanuzza: colleagues
Martin Margala: colleagues
Pasquale Corsonello: colleagues