research-article

Public Access

3D nanosystems enable embedded abundant-data computing: special session paper

Authors:
William Hwang

Stanford University

Stanford University
View Profile

,
Mohamed M. Sabry Aly

Stanford University

Stanford University
View Profile

,
Yash H. Malviya

Stanford University

Stanford University
View Profile

,
Mingyu Gao

Stanford University

Stanford University
View Profile

,
Tony F. Wu

Stanford University

Stanford University
View Profile

,
Christos Kozyrakis

Stanford University

Stanford University
View Profile

,
H.-S. Philip Wong

Stanford University

Stanford University
View Profile

,
Subhasish Mitra

Stanford University

Stanford University
View Profile

CODES '17: Proceedings of the Twelfth IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis CompanionOctober 2017Article No.: 29Pages 1–2https://doi.org/10.1145/3125502.3125531

Published:15 October 2017Publication History

CODES '17: Proceedings of the Twelfth IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis Companion

Pages 1–2

ABSTRACT

The world's appetite for abundant-data computing, where a massive amount of structured and unstructured data is analyzed, has increased dramatically. The computational demands of these applications, such as deep learning, far exceed the capabilities of today's systems, especially for energy-constrained embedded systems (e.g., mobile systems with limited battery capacity). These demands are unlikely to be met by isolated improvements in transistor or memory technologies, or integrated circuit (IC) architectures alone. Transformative nanosystems, which leverage the unique properties of emerging nanotechnologies to create new IC architectures, are required to deliver unprecedented functionality, performance, and energy efficiency. We show that the projected energy efficiency benefits of domain-specific 3D nanosystems is in the range of 1,000x (quantified using the product of system-level energy consumption and execution time) over today's domain-specific 2D systems with off-chip DRAM. Such a drastic improvement is key to enabling new capabilities such as deep learning in embedded systems.

References

M.M.S. Aly et al., "Energy-Efficient Abundant-Data Computing: The N3XT 1,000X," IEEE Computer, 2015.Google Scholar
J. Zhang et al., "Carbon Nanotube Robust Digital VLSI," IEEE Trans. CAD, 2012.Google Scholar
H.Y. Chen et al., "HfOx based vertical resistive random-access memory for cost-effective 3D cross-point architecture without cell selector," IEDM, 2012. Google ScholarCross Ref
D.J. Frank and L. Chang, "Technology Optimization for High Energy-Efficiency Computation," IEDM Short Course, 2012.Google Scholar
G. Hills, "Variation-Aware Nanosystem Design Kit", https://nanohub.org/resources/22582Google Scholar
G. Hills et al., "Rapid Co-optimization of Processing and Circuit Design to Overcome Carbon Nanotube Variations," IEEE Trans. CAD, 2015.Google Scholar
M.M. Shulaker et al., "Carbon nanotube computer," Nature, 2013. Google ScholarCross Ref
H.-S.P. Wong and S. Salahuddin, "Memory Leads the way to better computing," Nature, 2015. Google ScholarCross Ref
R. Fackenthal et al., "A 16Gb ReRAM with 200MB/s Write and 1GB/s Read in 27nm Technology," ISSCC, 2014. Google ScholarCross Ref
M.M. Shulaker et al., "Three-dimensional integration of nanotechnologies for computing and data storage on a single chip," Nature, 2017. Google ScholarCross Ref
R. Braojos et al., "Nano-engineered architectures for ultra-low power wireless body sensor nodes," CODES+ISSS, 2016.Google Scholar
N. Jouppi et al. "In-Datacenter Performance Analysis of a Tensor Processing Unit," ISCA, 2017. Google ScholarDigital Library
M. Gao et al., "TETRIS: Scalable and Efficient Neural Network Acceleration with 3D Memory," ASPLOS, 2017. Google ScholarDigital Library
Y.-H. Chen et al., "Eyeriss: An Energy-Efficient Reconfigurable Accelerator for Deep Convolutional Neural Networks," IEEE JSSCC, 2017.Google Scholar
C. De Sa et al., "Understanding and Optimizing Asynchronous Low-precision Stochastic Gradient Descent," ISCA, 2017.Google Scholar
D. Sanchez et al., "ZSim: Fast and Accurate Microarchitectural Simulation of Thousand-Core Systems," ISCA, 2013. Google ScholarDigital Library
V. Sze et al., "Efficient Processing of Deep Neural Networks:A Tutorial and Survey," arXiv preprint, 2017.Google Scholar
A. Sridhar et al., "3D-ICE: A Compact Thermal Model for Early-Stage Design of Liquid-Cooled ICs," IEEE Trans. Computers, 2014.Google Scholar
V. Chiriac et al., "A figure of merit for mobile device thermal management," IEEE ITherm, 2016.Google Scholar
O. Vinyals et al., "Show and Tell: A Neural Image Caption Generator," IEEE CVPR, 2015.Google Scholar
R. Jozefowicz et al., "Exploring the Limits of Language Modeling," arXiv preprint, 2016.Google Scholar
A. Krizhevsky et al., "ImageNet Classification with Deep Convolution Neural Networks," NIPS, 2012.Google Scholar
K. Simoyan et al., "Very Deep Convolutional Networks for Large-Scale Image Recognition," ICLR, 2015.Google Scholar
K. He et al., "Deep Residual Learning for Image Recognition," IEEE CVPR, 2016. Google ScholarCross Ref

Recommendations

Improving Performance under Process and Voltage Variations in Near-Threshold Computing Using 3D ICs

Near-threshold computing (NTC) circuits have been shown to offer significant energy efficiency and power benefits but with a huge performance penalty. This performance loss exacerbates if process and voltage variations are considered. In this article, ...
Read More
Application of high-κ gate dielectrics and metal gate electrodes to enable silicon and non-silicon logic nanotechnology

High- gate dielectrics and metal gate electrodes are required for enabling continued equivalent gate oxide thickness scaling, and hence high performance, and for controlling gate oxide leakage for both future silicon and emerging non-silicon ...
Read More
Embedded Tutorial: Analog Circuit Performance Issues with Aggressively Scaled Gate Oxide CMOS Technologies
VLSID '06: Proceedings of the 19th International Conference on VLSI Design held jointly with 5th International Conference on Embedded Systems Design

MOS Transistors with sub 100 nm channel lengths need a gate oxide thickness in the range of 1 - 2 nm to combat the short channel effects. However at these gate dielectric thicknesses, the gate current is no longer negligible. In this paper, we report ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in

CODES '17: Proceedings of the Twelfth IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis Companion
October 2017
84 pages
ISBN:9781450351850
DOI:10.1145/3125502

Copyright © 2017 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 15 October 2017
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Qualifiers
- research-article
Conference

Acceptance Rates
Overall Acceptance Rate280of864submissions,32%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 8
  Total Citations
  View Citations
- 433
  Total Downloads
- Downloads (Last 12 months)42
- Downloads (Last 6 weeks)6
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

3D nanosystems enable embedded abundant-data computing: special session paper

CODES '17: Proceedings of the Twelfth IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis Companion

ABSTRACT

References

Cited By

Recommendations

Improving Performance under Process and Voltage Variations in Near-Threshold Computing Using 3D ICs

Application of high-κ gate dielectrics and metal gate electrodes to enable silicon and non-silicon logic nanotechnology

Embedded Tutorial: Analog Circuit Performance Issues with Aggressively Scaled Gate Oxide CMOS Technologies

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media