ABSTRACT
The significant scaling challenges of conventional memories, i.e., SRAM and DRAM, motivated the research on emerging memory technologies. Many promising memory technology candidates, however, suffer from a common issue in their write operations: the switching processes at different write operations (i.e., 0 → 1 and 1 → 0) are asymmetric. Using a pessimistic design corner to cover the worst case of a write operation incurs large power and performance cost in the existing emerging memory technology designs. In this work, we propose a universal log style write methodology to mitigate this asymmetry issue by operating two switching processes in separate stages. The dedicated design optimizations are allowed on either switching process. The simulation results on the spin-transfer-torque random access memory based last-level cache show that our technique can improve the system performance by 4% while receiving 35% power reduction on average1.
- J. Kim, S. Chung, T. Jang, and et al. Vertical double gate Z-RAM technology with remarkable low voltage operation for DRAM application. pages 163--164, 2010.Google Scholar
- H. Labs. In http://www.hpl.hp.com/research/cacti/.Google Scholar
- B. C. Lee, E. Ipek, O. Mutlu, and D. Burger. Architecting phase change memory as a scalable DRAM alternative. In Proceedings of ISCA, pages 2--13, 2009. Google ScholarDigital Library
- Z. Lu, N. Collaert, M. Aoulaiche, B. De Wachter, A. De Keersgieter, W. Schwarzenbach, and et al. A novel low-voltage biasing scheme for double gate fbc achieving 5s retention and 1016 endurance at 85c. In IEDM, pages 12.3.1--12.3.4, dec. 2010.Google Scholar
- A. Nigam, C. Smullen, V. Mohan, E. Chen, S. Gurumurthi, and M. Stan. Delivering on the promise of universal memory for spin-transfer torque ram (stt-ram). In ISLPED 2011, pages 121--126, aug. 2011. Google ScholarDigital Library
- M. Qureshi, M. Franceschini, and L. Lastras-Montano. Improving read performance of phase change memories via write cancellation and write pausing. In HPCA, pages 1--11, jan. 2010.Google ScholarCross Ref
- M. K. Qureshi, V. Srinivasan, and J. A. Rivers. Scalable high performance main memory system using phase-change memory technology. In Proceedings of ISCA, pages 24--33, 2009. Google ScholarDigital Library
- C. Smullen, V. Mohan, A. Nigam, S. Gurumurthi, and M. Stan. Relaxing non-volatility for fast and energy-efficient stt-ram caches. In High Performance Computer Architecture (HPCA), 2011 IEEE 17th International Symposium on, pages 50--61, feb. 2011. Google ScholarDigital Library
- G. Sun, X. Dong, Y. Xie, J. Li, and Y. Chen. A novel architecture of the 3d stacked mram l2 cache for cmps. pages 239--249, feb. 2009.Google Scholar
- Z. Sun, X. Bi, H. H. Li, W.-F. Wong, Z.-L. Ong, X. Zhu, and W. Wu. Multi retention level stt-ram cache designs with a dynamic refresh scheme. In Proceedings of Micro, pages 329--338, 2011. Google ScholarDigital Library
- X. Wang, Y. Zheng, H. Xi, and D. Dimitrov. Thermal fluctuation effects on spin torque induced switching: Mean and variations. Journal of Applied Physics, 103(3):034507--034507-4, feb 2008.Google ScholarCross Ref
- Y. Zhang, Y. Li, A. Jones, and Y. Chen. Asymmetry of MTJ Switching and Its Implication to STT-RAM Designs. In Proceedings of the DATE, 2012.Google Scholar
- P. Zhou, B. Zhao, J. Yang, and Y. Zhang. A durable and energy efficient main memory using phase change memory technology. In Proceedings of ISCA, pages 14--23, 2009. Google ScholarDigital Library
- P. Zhou, B. Zhao, J. Yang, and Y. Zhang. Energy reduction for stt-ram using early write termination. In ICCAD, pages 264--268, nov. 2009. Google ScholarDigital Library
Index Terms
- Improving energy efficiency of write-asymmetric memories by log style write
Recommendations
Flip-N-Write: a simple deterministic technique to improve PRAM write performance, energy and endurance
MICRO 42: Proceedings of the 42nd Annual IEEE/ACM International Symposium on MicroarchitectureThe phase-change random access memory (PRAM) technology is fast maturing to production levels. Main advantages of PRAM are non-volatility, byte addressability, in-place programmability, low-power operation, and higher write endurance than that of ...
Improving writeback efficiency with decoupled last-write prediction
ISCA '12: Proceedings of the 39th Annual International Symposium on Computer ArchitectureIn modern DDRx memory systems, memory write requests compete with read requests for available memory resources, significantly increasing the average read request service time. Caches are used to mitigate long memory read latency that limits system ...
Improving writeback efficiency with decoupled last-write prediction
ISCA '12In modern DDRx memory systems, memory write requests compete with read requests for available memory resources, significantly increasing the average read request service time. Caches are used to mitigate long memory read latency that limits system ...
Comments