| Selective code/data migration for reducing communication energy in embedded MpSoC architectures |
| Full text |
Pdf
(258 KB)
|
| Source
|
Great Lakes Symposium on VLSI
archive
Proceedings of the 16th ACM Great Lakes symposium on VLSI
table of contents
Philadelphia, PA, USA
SESSION: System & architectural-level power optimization
table of contents
Pages: 386 - 391
Year of Publication: 2006
ISBN:1-59593-347-6
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 9, Downloads (12 Months): 32, Citation Count: 3
|
|
|
ABSTRACT
Proliferation of embedded on-chip multiprocessor architectures (MpSoC) motivates researchers from both academia and industry to consider optimization techniques for such architectures. While many proposals on code/data partitioning on parallel architectures try to minimize interprocessor communication requirements at runtime, many applications still have significant runtime communication requirements. This paper proposes a novel task/data migration scheme that decides whether to migrate task or data in order to satisfy a given communication requirement. The choice between the two options is made at runtime based on the statistics collected off-line through profiling. An important characteristic of the approach proposed in this paper is that it takes into account future uses of tasks and data and tries to make a decision that is globally optimal (i.e., when considering multiple communications not just the current one). Our results collected so far are very encouraging and indicate that the proposed selective migration strategy is very successful in practice, reducing communication energy and total energy by 38.6% (resp. 18.9%) and 13.8% (resp. 6.8%) on an average, as compared to task (resp. data) migration only, across ten embedded applications tested.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
A. Baghdadi , D. Lyonnard , N. Zergainoh , A. Jerraya, An efficient architecture model for systematic design of application-specific multiprocessor SoC, Proceedings of the conference on Design, automation and test in Europe, p.55-63, March 2001, Munich, Germany
|
| |
2
|
Chip Multiprocessing. http://industry.java.sun.com/javanews/stories/ print/0,1797,32080,00.html
|
| |
3
|
Chip Multiprocessing. ITWorld.Com, http://www.itworld.com/Comp/1092/CWSTO54343/
|
| |
4
|
M. Collin, R. Haukilahti, M. Nikitovic, and J. Adomat, SoCrates - A Multiprocessor SoC in 40 Days, In Proc. Design Automation and Test in Europe, Munich, March 2001.
|
| |
5
|
F. Gharsalli, S. Meftali, F. Rousseau, and A. A. Jerraya, Automatic Generation of Embedded Memory Wrapper for Multiprocessor SoC. In Proc. of the 36th Design Automation Conference, New Orleans, Louisiana, 1999.
|
 |
6
|
M. Kandemir , O. Ozturk , M. Karakoy, Dynamic on-chip memory management for chip multiprocessors, Proceedings of the 2004 international conference on Compilers, architecture, and synthesis for embedded systems, September 22-25, 2004, Washington DC, USA
[doi> 10.1145/1023833.1023838]
|
| |
7
|
|
| |
8
|
|
 |
9
|
Samy Meftali , Ferid Gharsalli , Frederic Rousseau , Ahmed A. Jerraya, An optimal memory allocation for application-specific multiprocessor system-on-chip, Proceedings of the 14th international symposium on Systems synthesis, September 30-October 03, 2001, Montréal, P.Q., Canada
[doi> 10.1145/500001.500006]
|
 |
10
|
Basem A. Nayfeh , Lance Hammond , Kunle Olukotun, Evaluation of design alternatives for a multiprocessor microprocessor, Proceedings of the 23rd annual international symposium on Computer architecture, p.67-77, May 22-24, 1996, Philadelphia, Pennsylvania, United States
|
 |
11
|
Kunle Olukotun , Basem A. Nayfeh , Lance Hammond , Ken Wilson , Kunyung Chang, The case for a single-chip multiprocessor, Proceedings of the seventh international conference on Architectural support for programming languages and operating systems, p.2-11, October 01-04, 1996, Cambridge, Massachusetts, United States
|
| |
12
|
G. Reinman and N. P. Jouppi, CACTI 2.0: An Integrated Cache Timing and Power Model, Compaq, WRL, Research Report 2000/7, February 2000.
|
 |
13
|
Patrick Schaumont , Bo-Cheng Charles Lai , Wei Qin , Ingrid Verbauwhede, Cooperative multithreading on 3mbedded multiprocessor architectures enables energy-scalable design, Proceedings of the 42nd annual conference on Design automation, June 13-17, 2005, San Diego, California, USA
[doi> 10.1145/1065579.1065591]
|
| |
14
|
Scali System Guide version 2.0, white paper. Scali Computer AS, www.scali.com, 2000.
|
 |
15
|
W. Ye , N. Vijaykrishnan , M. Kandemir , M. J. Irwin, The design and use of simplepower: a cycle-accurate energy estimation tool, Proceedings of the 37th conference on Design automation, p.340-345, June 05-09, 2000, Los Angeles, California, United States
[doi> 10.1145/337292.337436]
|
|