skip to main content
10.1145/2000259.2000274acmconferencesArticle/Chapter ViewAbstractPublication PagescomparchConference Proceedingsconference-collections
research-article

Reliability prediction for fault-tolerant software architectures

Authors Info & Claims
Published:20 June 2011Publication History

ABSTRACT

Software fault tolerance mechanisms aim at improving the reliability of software systems. Their effectiveness (i.e., reliability impact) is highly application-specific and depends on the overall system architecture and usage profile. When examining multiple architecture configurations, such as in software product lines, it is a complex and error-prone task to include fault tolerance mechanisms effectively. Existing approaches for reliability analysis of software architectures either do not support modelling fault tolerance mechanisms or are not designed for an efficient evaluation of multiple architecture variants. We present a novel approach to analyse the effect of software fault tolerance mechanisms in varying architecture configurations. We have validated the approach in multiple case studies, including a large-scale industrial system, demonstrating its ability to support architecture design, and its robustness against imprecise input data.

References

  1. M. Auerswald, M. Herrmann, S. Kowalewski, and V. Schulte-Coerne. Software Product-Family Engineering, volume 2290 of LNCS, chapter Reliability-Oriented Product Line Engineering of Embedded Systems, pages 237--280. Springer, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. S. Becker, H. Koziolek, and R. Reussner. The Palladio Component Model for model-driven performance prediction. Journal of Systems and Software, 82(1):3--22, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. S. Bernardi, J. Merseguer, and D. Petriu. A dependability profile within MARTE. Software and Systems Modeling, pages 1--24, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. F. Brosch, H. Koziolek, B. Buhnova, and R. Reussner. Parameterized reliability prediction for component-based software architectures. In Proc. of QoSA'10, volume 6093 of LNCS, pages 36--51. Springer, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. L. Cheung, R. Roshandel, N. Medvidovic, and L. Golubchik. Early prediction of software component reliability. In Proc. of ICSE'08, pages 111--120. ACM Press, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. R. C. Cheung. A user-oriented software reliability model. IEEE Trans. Softw. Eng., 6(2):118--125, 1980. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. P. Clements and L. Northrop. Software Product Lines: Practices and Patterns. Addison-Wesley, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. V. Cortellessa, H. Singh, and B. Cukic. Early reliability assessment of UML based software models. In Proc. of WOSP'02, pages 302--309. ACM, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. J. Dehlinger and R. R. Lutz. Plfaultcat: A product-line software fault tree analysis tool. Automated Software Engineering, 13(1):169--193, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. A. Filieri, C. Ghezzi, V. Grassi, and R. Mirandola. Reliability analysis of component-based systems with multiple failure modes. In Proc. of CBSE'10, volume 6092 of LNCS, pages 1--20. Springer, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. S. S. Gokhale. Architecture-based software reliability analysis: Overview and limitations. IEEE Trans. on Dependable and Secure Computing, 4(1):32--40, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. K. Goseva-Popstojanova, A. Hassan, A. Guedem, W. Abdelmoez, D. E. M. Nassar, H. Ammar, and A. Mili. Architectural-level risk analysis using UML. IEEE Trans. on Softw. Eng., 29(10):946--960, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. K. Goseva-Popstojanova and K. S. Trivedi. Architecture-based approach to reliability assessment of software systems. Performance Evaluation, 45(2--3):179--204, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. A. Immonen. Software Product Lines, chapter A Method for Predicting Reliability and Availability at the Architecture Level, pages 373--422. Springer, 2006.Google ScholarGoogle Scholar
  15. A. Immonen and E. Niemelä. Survey of reliability and availability prediction methods from the viewpoint of software architecture. Software and Systems Modeling, 7(1):49--65, 2008.Google ScholarGoogle ScholarCross RefCross Ref
  16. K. Kanoun and M. Ortalo-Borrel. Fault-tolerant system dependability-explicit modeling of hardware and software component-interactions. IEEE Transactions on Reliability, 49(4):363--376, 2000.Google ScholarGoogle ScholarCross RefCross Ref
  17. H. Koziolek, B. Schlich, and C. Bilich. A large-scale industrial case study on architecture-based software reliability analysis. In Proc. of ISSRE'10, pages 279--288. IEEE Computer Society, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. H. Muccini and A. Romanovsky. Architecting fault tolerant systems. Technical Report CS-TR-1051, University of Newcastle upon Tyne, 2007.Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. F. G. Olumofin and V. B. Misic. Extending the atam architecture evaluation to product line architectures. In Proc. of WICSA'05, pages 45--56. IEEE Computer Society, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. B. Randell. System structure for software fault tolerance. In Proc. Int. Conf. on Reliable software, pages 437--449. ACM, 1975. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. R. H. Reussner, H. W. Schmidt, and I. H. Poernomo. Reliability prediction for component-based software architectures. Journal of Systems and Software, 66(3):241--252, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. N. Sato and K. S. Trivedi. Accurate and efficient stochastic reliability analysis of composite services using their compact Markov reward model representations. In Proc. of SCC'07, pages 114--121. IEEE Computer Society, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  23. B. Schroeder and G. A. Gibson. Understanding disk failure rates: What does an mttf of 1,000,000 hours mean to you? ACM Trans. Storage, 3(3):8, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. V. Sharma and K. Trivedi. Quantifying software performance, reliability and security: An architecture-based approach. Journal of Systems and Software, 80:493--509, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. V. S. Sharma and K. S. Trivedi. Reliability and performance of component based software systems with restarts, retries, reboots and repairs. In Proc. of ISSRE'06, pages 299--310. IEEE, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. W.-L. Wang, D. Pan, and M.-H. Chen. Architecture-based software reliability modeling. Journal of Systems and Software, 79(1):132--146, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Z. Zheng and M. R. Lyu. Collaborative reliability prediction of service-oriented systems. In Proc. of ICSE'10, pages 35--44. ACM Press, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Reliability prediction for fault-tolerant software architectures

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          QoSA-ISARCS '11: Proceedings of the joint ACM SIGSOFT conference -- QoSA and ACM SIGSOFT symposium -- ISARCS on Quality of software architectures -- QoSA and architecting critical systems -- ISARCS
          June 2011
          206 pages
          ISBN:9781450307246
          DOI:10.1145/2000259

          Copyright © 2011 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 20 June 2011

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          Overall Acceptance Rate46of131submissions,35%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader