skip to main content
10.5555/1326073.1326133acmconferencesArticle/Chapter ViewAbstractPublication PagesiccadConference Proceedingsconference-collections
research-article

The FAST methodology for high-speed SoC/computer simulation

Published: 05 November 2007 Publication History

Abstract

This paper describes the FAST methodology that enables a single FPGA to accelerate the performance of cycle-accurate computer system simulators modeling modern, realistic SoCs, embedded systems and standard desktop/laptop/server computer systems. The methodology partitions a simulator into (i) a functional model that simulates the functionality of the computer system and (ii) a predictive model that predicts performance and other metrics. The partitioning is crafted to map most of the parallel work onto a hardware-based predictive model, eliminating much of the complexity and difficulty of simulating parallel constructs on a sequential platform.
FAST conventions and libraries have been designed to make creating, modifying, using and measuring such simulators straightforward. We describe a prototype FAST system: a full-system, RTL-level cycle-accurate-capable computer system simulator that executes the x86 ISA, boots unmodified Linux and executes unmodified x86 applications. The prototype runs two to three orders of magnitude faster than the fastest Intel and AMD RTL-level cycle-accurate x86 software-based simulators and about six to seven times faster than RTL simulation.

References

[1]
T. Li, Y. Guo, and S.-K. Li, "Design and Implementation of a parallel Verilog simulator: PVSim," in Proceedings of the 17th International Conference on VLSI Design, 2004, pp. 329--334.
[2]
D. A. Penry, D. Fay, D. Hodgdon, R. Wells, G. Schelle, D. I. August, and D. Connors, "Exploiting Parallelism and Structure to Accelerate the Simulation of Chip Multi-processors," in 12th International Symposium on High-Performance Computer Architecture, Feb 2006, pp. 27--38.
[3]
D. Chiou, "FAST: FPGA-based Acceleration of Simulator Timing models," in Proceedings of the first Workshop on Architecture Research using FPGA Platforms, held in conjunction with HPCA-11, San Francisco, CA, Feb. 2005.
[4]
D. Chiou, H. Sanjeliwala, D. Sunwoo, J. Z. Xu, and N. Patil, "FPGA-based Fast, Cycle-Accurate, Full-System Simulators," in Proceedings of the second Workshop on Architecture Research using FPGA Platforms, held in conjunction with HPCA-12, Austin, TX, Feb. 2006.
[5]
D. Chiou, "A simulation method," United States Patent Application 11/633,657, Dec. 2006.
[6]
"Bluespec webpage," http://www.bluespec.com.
[7]
F. Bellard, "QEMU, a Fast and Portable Dynamic Translator," in USENIX 2005 Annual Technical Conference, FREENIX Track, 2005, pp. 41--46.
[8]
"DRC Computer," http://www.drccomputer.com/.
[9]
"SPEC webpage," http://www.spec.org.
[10]
E. Schnarr and J. R. Larus, "Fast out-of-order processor simulation using memoization," in Proceedings of the Eight International Conference on Architectural Support for Programming Languages and Operating Systems, Oct. 1998, pp. 283--294.
[11]
J. Emer, P. Ahuja, E. Borch, A. Klauser, C.-K. Luk, S. Manne, S. S. Mukherjee, H. Patil, S. Wallace, N. Binkert, R. Espasa, and T. Juan, "Asim: A performance model framework," Computer, vol. 35, no. 2, pp. 68--76, 2002.
[12]
N. L. Binkert, E. G. Hallnor, and S. K. Reinhardt, "Network-Oriented Full-System Simulation using M5," in Sixth Workshop on Computer Architecture Evaluation using Commerical Workloads (CAECW), Feb. 2003.
[13]
J. Emer, "HASim talk at RAMP Retreat, June 2007." {Online}. Available: http://ramp.eecs.berkeley.edu
[14]
R. Bhargava, L. Barnes, and B. Sander, "personal email communication," Aug. 2007.
[15]
L. Zhang, "personal email communication," Aug. 2007.
[16]
J. Holt, "personal email communication," July 2007.
[17]
T. Austin, E. Larson, and D. Ernst, "SimpleScalar: An Infrastructure for Computer System Modeling," IEEE Computer, vol. 35, no. 2, pp. 59--67, Feb. 2002.
[18]
R. Desikan, D. Burger, and S. W. Keckler, "Measuring experimental error in microprocessor simulation," in Proceedings of the 28th Annual International Symposium on Computer Architecture (ISCA), 2001, pp. 266--277. {Online}. Available: citeseer.ist.psu.edu/article/desikan01measuring.html
[19]
M. T. Yourst, "PTLSim: A Cycle Accurate Full System x86-64 Microarchitectural Simulator," in Proceedings of ISPASS, Jan. 2007.
[20]
"VAST Systems," www.vastsystems.com.
[21]
N. Dave, M. Pellauer, Arvind, and J. Emer, "Implementing a Functional/Timing Partitioned Microprocessor Simulator with an FPGA," in Proceedings of the Workshop on Architecture Research using FPGA Platforms, held at HPCA-12, Feb. 2006. {Online}. Available: http://www.cag.csail.mit.edu/warfp2006/submissions/dave-mit.pdf
[22]
J. D. Davis, L. Hammond, and K. Olukotun, "A Flexible Architecture for Simulation and Testing (FAST) Multiprocessor Systems," in Proceedings of the Workshop on Architecture Research using FPGA Platforms, held at HPCA-11, Feb. 2005. {Online}. Available: http://cag.csail.mit.edu/warfp2005/submissions/33-davis.pdf
[23]
S.-L. Lu, E. Nurvitadhi, J. Hong, and S. Larsen, "Memory Subsystem Performance Evaluation with FPGA based Emulators," in Proceedings of the Workshop on Architecture Research using FPGA Platforms, held at HPCA-11, Feb. 2005. {Online}. Available: http://cag.csail.mit.edu/warfp2005/submissions/16-lu.pdf
[24]
C. Kozyrakis and K. Olukotun, "ATLAS: A Scalable Emulator for Transactional Parallel Systems," in Proceedings of the Workshop on Architecture Research using FPGA Platforms, held at HPCA-11, Feb. 2005. {Online}. Available: http://cag.csail.mit.edu/warfp2005/submissions/6-kozyrakis.pdf
[25]
E. Nurvitadhi and J. Hoe, "Full-System Architectural Exploration Sandbox," in Proceedings of the Workshop on Architecture Research using FPGA Platforms, held at HPCA-11, Feb. 2005. {Online}. Available: http://cag.csail.mit.edu/warfp2005/submissions/18-nurvitadhi.pdf
[26]
D. Patterson, Arvind, K. Asanović, D. Chiou, J. C. Hoe, C. Kozyrakis, S.-L. Lu, M. Oskin, J. Rabaey, and J. Wawrzynek, "RAMP: Research Accelerator for Multiple Processors," in Proceedings of Hot Chips 18, Palo Alto, CA, Aug. 2006.
[27]
T. Suh, H.-H. S. Lee, S.-L. Lu, and J. Shen, "Initial Observations of Hardware/Software Co-Simulation using FPGA in Architectural Research," in Proceedings of the Workshop on Architecture Research using FPGA Platforms, held at HPCA-12, Feb. 2006. {Online}. Available: http://www.cag.csail.mit.edu/warfp2006/submissions/suh-git.pdf

Cited By

View all
  • (2016)StroberACM SIGARCH Computer Architecture News10.1145/3007787.300115144:3(128-139)Online publication date: 18-Jun-2016
  • (2016)StroberProceedings of the 43rd International Symposium on Computer Architecture10.1109/ISCA.2016.21(128-139)Online publication date: 18-Jun-2016
  • (2011)Exploiting temporal decoupling to accelerate trace-driven NoC emulationProceedings of the seventh IEEE/ACM/IFIP international conference on Hardware/software codesign and system synthesis10.1145/2039370.2039418(315-324)Online publication date: 9-Oct-2011
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
ICCAD '07: Proceedings of the 2007 IEEE/ACM international conference on Computer-aided design
November 2007
933 pages
ISBN:1424413826
  • General Chair:
  • Georges Gielen

Sponsors

Publisher

IEEE Press

Publication History

Published: 05 November 2007

Check for updates

Qualifiers

  • Research-article

Conference

ICCAD07
Sponsor:

Acceptance Rates

ICCAD '07 Paper Acceptance Rate 139 of 510 submissions, 27%;
Overall Acceptance Rate 457 of 1,762 submissions, 26%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)5
  • Downloads (Last 6 weeks)2
Reflects downloads up to 07 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2016)StroberACM SIGARCH Computer Architecture News10.1145/3007787.300115144:3(128-139)Online publication date: 18-Jun-2016
  • (2016)StroberProceedings of the 43rd International Symposium on Computer Architecture10.1109/ISCA.2016.21(128-139)Online publication date: 18-Jun-2016
  • (2011)Exploiting temporal decoupling to accelerate trace-driven NoC emulationProceedings of the seventh IEEE/ACM/IFIP international conference on Hardware/software codesign and system synthesis10.1145/2039370.2039418(315-324)Online publication date: 9-Oct-2011
  • (2010)FastFwdProceedings of the eighth IEEE/ACM/IFIP international conference on Hardware/software codesign and system synthesis10.1145/1878961.1879006(247-256)Online publication date: 24-Oct-2010
  • (2009)ReSim, a trace-driven, reconfigurable ILP processor simulatorProceedings of the Conference on Design, Automation and Test in Europe10.5555/1874620.1874751(536-541)Online publication date: 20-Apr-2009
  • (2009)Bounded dataflow networks and latency-insensitive circuitsProceedings of the 7th IEEE/ACM international conference on Formal Methods and Models for Codesign10.5555/1715759.1715781(171-180)Online publication date: 13-Jul-2009
  • (2009)Soft connectionsProceedings of the 46th Annual Design Automation Conference10.1145/1629911.1629986(276-281)Online publication date: 26-Jul-2009
  • (2009)A-Port NetworksACM Transactions on Reconfigurable Technology and Systems10.1145/1575774.15757752:3(1-26)Online publication date: 1-Sep-2009
  • (2008)A-PortsProceedings of the 16th international ACM/SIGDA symposium on Field programmable gate arrays10.1145/1344671.1344685(87-96)Online publication date: 24-Feb-2008

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media