ABSTRACT
Three-dimensional stacked Wide I/O DRAMs have been proposed as a promising solution to overcome the pin-limited memory performance growth, the power vs. bandwidth dilemma and the Memory Wall. This new DRAM architecture and organisation requires a new generation of DRAM memory controllers.
In this paper, we present a new methodology using virtual platforms to model the backend of a 3D-DRAM memory subsystem (channel controller and Wide I/O DRAM) with special SystemC TLM2.0 phase extensions. This methodology enables us to explore the complete design space of memory controllers at the system level at very fast simulation speeds with precise timing accuracy. We show simulation speedups of up to 377x with a timing accuracy of 99% compared to an equivalent cycle and pin accurate SystemC based RTL simulation.
- P. Stanley-Marbell, et al. Pinned to the walls; Impact of packaging and application properties on the memory and power walls. In proc. ISLPED 2011, Aug. 2011. Google ScholarDigital Library
- Wm. A. Wulf et al. Hitting the memory wall: implications of the obvious. SIGARCH Comput. Archit. News, March 1995. Google ScholarDigital Library
- C. Weis, et al. An energy efficient DRAM subsystem for 3D integrated SoCs. In proc. DATE 2012, march 2012.Google ScholarCross Ref
- G. Manil D., et al. DRAM selection and configuration for real-time mobile systems. In proc. DATE 2012, March 2012.Google Scholar
- C. Weis, et al. Design space exploration for 3D-stacked DRAMs. In proc. DATE 2011, March 2011.Google ScholarCross Ref
- Cadence Design IP: Wide-I/O Controller. Technical report, Cadence Design Systems, Inc., 2012.Google Scholar
- M. Ghosh et al. Smart Refresh: An Enhanced Memory Controller Design for Reducing Energy in Conventional and 3D Die-Stacked DRAMs. In proc. MICRO 2007, Dec. 2007. Google ScholarDigital Library
- I. Loi et al. An efficient distributed memory interface for many-core platform with 3D stacked DRAM. In proc. DATE 2010, March 2010. Google ScholarDigital Library
- Guangfei Zhang, et al. Heterogeneous multi-channel: fine-grained DRAM control for both system performance and power efficiency. In DAC 12, Jun. 2012. Google ScholarDigital Library
- F. Kesel. Modellierung von digitalen Systemen mit SystemC: Von der RTL- zur Transaction-Level-Modellierung. Oldenbourg Wissenschaftsverlag, 2012.Google ScholarCross Ref
- Lukai Cai et al. Transaction level modeling: an overview. In proc. CODES+ISSS '03, 2003. Google ScholarDigital Library
- IEEE Computer Society. IEEE 1666: SystemC Language Reference Manual, 2012 edition, 2011.Google Scholar
- D. C. Black, et al. SystemC: From the Ground Up, Second Edition. Springer, 2009. Google ScholarDigital Library
- P. Rosenfeld, et al. DRAMSim2: A Cycle Accurate Memory System Simulator. Computer Architecture Letters, 10(1), Jan.-June 2011. Google ScholarDigital Library
- Nathan Binkert, et al. The gem5 simulator. SIGARCH Comput. Archit. News, 39(2): 1--7, August 2011. Google ScholarDigital Library
- Nan Li, et al. Design and Implementation of an Accurate Memory Subsystem Model in SystemC. Technical report, December 2010.Google Scholar
- DesignWare TLM Library. http://www.synopsys.com/Systems/VirtualPrototyping/VPModels/Pages/DW-TLM-Library.aspx, 2012.Google Scholar
- MemMax Scheduler. http://sonicsinc.com/wp-content/uploads/2012/09/Sonics_ProductBrief_MemMax.pdf, 2012.Google Scholar
- Inc Synopsys. Synopsys Virtual Prototyping Solution. http://www.synopsys.com/Systems/VirtualPrototyping/Pages/default.aspx, 2012.Google Scholar
- Tim Kogel. Generating Workload Models from TLM-2.0-based Virtual Prototypes for Efficient Architecture Performance Analysis. http://www.nascug.org/events/13th/tlm20_workload_models.pdf, Jun. 2010.Google Scholar
- Doug Burger et al. The SimpleScalar tool set, version 2.0. SIGARCH Comput. Archit. News, 25(3), June 1997. Google ScholarDigital Library
- Jedec Solid State Technology Association. Wide I/O Single Data Rate JESD 229, Dec. 2011.Google Scholar
- SystemC Modeling Library (SCML). http://www.synopsys.com/cgi-bin/slcw/kits/reg.cgi.Google Scholar
- Micron Technology Inc. Calculating Memory System Power for DDR3. Technical report, 2007.Google Scholar
- Karthik Chandrasekar, et al. Improved Power Modeling of DDR SDRAMs. In proc. DSD'11, 2011. Google ScholarDigital Library
- Yuko Hara, et al. Proposal and Quantitative Analysis of the CHStone Benchmark Program Suite for Practical C-based High-level Synthesis. JIP, 17, 2009.Google Scholar
- Mediabench. http://euler.slu.edu/fritts/mediabench/.Google Scholar
Index Terms
- TLM modelling of 3D stacked wide I/O DRAM subsystems: a virtual platform for memory controller design space exploration
Recommendations
DRAMSys4.0: A Fast and Cycle-Accurate SystemC/TLM-Based DRAM Simulator
Embedded Computer Systems: Architectures, Modeling, and SimulationAbstractThe simulation of DRAMs (Dynamic Random Access Memories) on system level requires highly accurate models due to their complex timing and power behavior. However, conventional cycle-accurate DRAM models often become the bottleneck for the overall ...
Dynamic bandwidth scaling for embedded DSPs with 3D-stacked DRAM and wide I/Os
ICCAD '13: Proceedings of the International Conference on Computer-Aided Design3D main memory is an emerging technology that stacks DRAM dies underneath the processor die using through-silicon vias (TSVs). Prior studies assumed that such technology would decrease main memory access latency by 45% to 60%, while also allowing ...
Resilient die-stacked DRAM caches
ICSA '13Die-stacked DRAM can provide large amounts of in-package, high-bandwidth cache storage. For server and high-performance computing markets, however, such DRAM caches must also provide sufficient support for reliability and fault tolerance. While ...
Comments