|
ABSTRACT
Generating an accurate estimate of the performance of a program on a given system is important to a large number of people. Computer architects, compiler writers, and developers all need insight into a machine's performance. There are a number of performance estimation techniques in use, from profile-based approaches to full machine simulation. This paper discusses a profile-based performance estimation technique that uses a lightweight instrumentation phase that runs in order number of dynamic instructions, followed by an analysis phase that runs in roughly order number of static instructions. This technique accurately predicts the performance of the core pipeline of a detailed out-of-order issue processor model while scheduling far fewer instructions than does full simulation. The difference between the predicted execution time and the time obtained from full simulation is only a few percent.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
D. Burger and T.M. Austin. The SimpleScalar Tool Set, Version 2.0. University of Wisconsin-Madison Computer Sciences Technical Report #1342, June 1997.
|
 |
3
|
|
| |
4
|
|
| |
5
|
A. Chamas, A. Dalai, P. deDood, et el. A 64b microprocessor with multimedia support. In Proceedings of the 1EEE International SolM-State Cirruits Conference, pp. 178-179, February 1995.
|
 |
6
|
|
| |
7
|
R.P. Colwelt and R.L. Steck. A 0.6mu m BiCMOS processor with dynamic execution. In Proceedings of the 1995 IEEE International Solid-State Circuits Conference, pp. 176-177, February 1995.
|
| |
8
|
R.B. Garner, A. Agrawal, F. Briggs, et al. The sealable processor architecture (SPARC). In Proceedings of the 33rd IEEE Computer Society International Conference, pp. 278- 283, March 1988.
|
| |
9
|
B.A. Gieseke, R.L. Allmon, D.W. Bailey, et el. A 600 Mhz supersealar RISC microprocessor with out-of-order execution. In Proceedings of the 1997 1EEE international Solid- State Circuits Conference, pp. 176-177, February 1997.
|
| |
10
|
|
| |
11
|
D.E. Knuth. An empirical study of FORTRAN programs. Software Practice and Experience, vol. 1, pp. 105-133, 1971.
|
| |
12
|
|
| |
13
|
|
| |
14
|
D.E. Knuth and F.R. Stevenson. Optimal measurement points for program frequency counts. BIT, voI. 13, pp. 313-322, 1973.
|
 |
15
|
|
| |
16
|
|
| |
17
|
P.S. Magnusson, E Dahlgren, H. G-rahn, et al. SimlCS/sun4m: A Virtual Workstation, In Proceedings of the Usenix Annual Technical Conference, June 1998.
|
| |
18
|
MIPS Computer Systems. UMIPS- V Reference Manual (pixie and pixstats). MIPS Computer Systems, Sunnyvale, CA. 1990.
|
| |
19
|
|
| |
20
|
V.S. Pai, P. Ranganathan, and S.V. Adve. RSIM: An Execution-Driven Simulator for ILP-Based Shared-Memory Multiprocessors and Uniprocessors. In the Proceedings of the 3rd Wor~hop on Computer Architecture Education, February 1997.
|
| |
21
|
|
 |
22
|
|
 |
23
|
|
 |
24
|
|
| |
25
|
Standard Performance Evaluation Corporation. The SPEC benchmark suite, http://www.~bench.org.
|
 |
26
|
|
| |
27
|
|
Peer to Peer - Readers of this Article have also read:
-
Data structures for quadtree approximation and compression
Communications of the ACM
28, 9
Hanan Samet
-
A hierarchical single-key-lock access control using the Chinese remainder theorem
Proceedings of the 1992 ACM/SIGAPP Symposium on Applied computing
Kim S. Lee
, Huizhu Lu
, D. D. Fisher
-
The GemStone object database management system
Communications of the ACM
34, 10
Paul Butterworth
, Allen Otis
, Jacob Stein
-
An intelligent component database for behavioral synthesis
Proceedings of the 27th ACM/IEEE conference on Design automation
Gwo-Dong Chen
, Daniel D. Gajski
-
Putting innovation to work: adoption strategies for multimedia communication systems
Communications of the ACM
34, 12
Ellen Francik
, Susan Ehrlich Rudman
, Donna Cooper
, Stephen Levine
|