DOI: 10.5555/996070.1009956
Article

Dynamic Fault-Tolerance and Metrics for Battery Powered, Failure-Prone Systems

Published: 09 November 2003

Abstract

Emerging VLSI technologies and platforms are giving rise to systems with inherently high potential for runtime failure. Such failures range from intermittent electrical and mechanical failures at the system level, to device failures at the chip level. Techniques to provide reliable computation in the presence of failures must do so while maintaining high performance, with an eye toward energy efficiency. When possible, they should maximize battery lifetime in the face of battery discharge non-linearities. This paper introduces the concept of adaptive fault-tolerance management for failure-prone systems, and a classification of local algorithms for achieving system-wide reliability. In order to judge the efficacy of the proposed algorithms for dynamic fault-tolerance management, a set of metrics, for characterizing system behavior in terms of energy efficiency, reliability, computation performance and battery lifetime, is presented. For an example platform employed in a realistic evaluation scenario, it is shown that system configurations with the best performance and lifetime are not necessarily those with the best combination of performance, reliability, battery lifetime and average power consumption.

Published In

ICCAD '03: Proceedings of the 2003 IEEE/ACM international conference on Computer-aided design
November 2003
899 pages
ISBN: 1581137621

Publisher

IEEE Computer Society

United States

Qualifiers

  • Article

Conference

ICCAD03
Acceptance Rates

ICCAD '03 Paper Acceptance Rate: 129 of 490 submissions, 26%
Overall Acceptance Rate: 457 of 1,762 submissions, 26%

Cited By

  • (2013) Improving charging efficiency with workload scheduling in energy harvesting embedded systems. Proceedings of the 50th Annual Design Automation Conference, pages 1-8. DOI: 10.1145/2463209.2488803. Online publication date: 29-May-2013
  • (2010) Infrastructure and reliability analysis of electric networks for e-textiles. IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews, 40(1), pages 36-51. DOI: 10.1109/TSMCC.2009.2031497. Online publication date: 1-Jan-2010
  • (2007) Power and reliability management of SoCs. IEEE Transactions on Very Large Scale Integration (VLSI) Systems, 15(4), pages 391-403. DOI: 10.1109/TVLSI.2007.895245. Online publication date: 1-Apr-2007
  • (2006) A dependable infrastructure of the electric network for E-textiles. Proceedings of the 20th international conference on Parallel and distributed processing, pages 81-81. DOI: 10.5555/1898953.1899015. Online publication date: 25-Apr-2006
  • (2006) Energy-aware computation duplication for improving reliability in embedded chip multiprocessors. Proceedings of the 2006 Asia and South Pacific Design Automation Conference, pages 134-139. DOI: 10.1145/1118299.1118342. Online publication date: 24-Jan-2006
  • (2006) A novel power management scheme for e-textiles. Proceedings of the First international conference on Advances in Grid and Pervasive Computing, pages 654-663. DOI: 10.1007/11745693_64. Online publication date: 3-May-2006
  • (2005) Application/architecture power co-optimization for embedded systems powered by renewable sources. Proceedings of the 42nd annual Design Automation Conference, pages 618-623. DOI: 10.1145/1065579.1065742. Online publication date: 13-Jun-2005
  • (2005) Optimization of reliability and power consumption in systems on a chip. Proceedings of the 15th international conference on Integrated Circuit and System Design: power and Timing Modeling, Optimization and Simulation, pages 237-246. DOI: 10.1007/11556930_25. Online publication date: 21-Sep-2005
  • (2004) Local Decisions and Triggering Mechanisms for Adaptive Fault-Tolerance. Proceedings of the conference on Design, automation and test in Europe - Volume 2. DOI: 10.5555/968879.969164. Online publication date: 16-Feb-2004
