Abstract
We show that continuously monitoring on-chip delays at the LUT-to-LUT link level during operation allows a field-programmable gate array to detect and adapt itself to aging and environmental timing effects. Using a lightweight (<4% added area) mechanism for monitoring transition timing, a Difference Detector with First-Fail Latch, we can estimate the timing margin of a circuit and identify the individual degraded links whose delay determines the worst-case circuit delay. Combining this monitoring with Choose-Your-Own-Adventure precomputed, fine-grained repair alternatives, we introduce a strategy for rapid, in-system incremental repair of links with degraded timing. We show that these techniques allow us to respond to a single aging event in less than 190 ms for the toronto20 benchmarks. The result is a step toward systems where adaptive reconfiguration on a time scale of seconds is viable and beneficial.
Index Terms
- Continuous Online Self-Monitoring Introspection Circuitry for Timing Repair by Incremental Partial-Reconfiguration (COSMIC TRIP)