skip to main content
10.1145/1958746.1958784acmconferencesArticle/Chapter ViewAbstractPublication PagesicpeConference Proceedingsconference-collections
abstract

A capacity planning process for performance assurance of component-based distributed systems

Published:30 September 2011Publication History

ABSTRACT

For service providers of multi-tiered component-based applications, such as web portals, assuring high performance and availability to their customers without impacting revenue requires effective and careful capacity planning that aims at minimizing the number of resources, and utilizing them efficiently while simultaneously supporting a large customer base and meeting their service level agreements. This paper presents a novel, hybrid capacity planning process that results from a systematic blending of 1) analytical modeling, where traditional modeling techniques are enhanced to overcome their limitations in providing accurate performance estimates; 2) profile-based techniques, which determine performance profiles of individual software components for use in resource allocation and balancing resource usage; and 3) allocation heuristics that determine minimum number of resources to allocate software components. Our results illustrate that using our technique, performance (i.e., bounded response time) can be assured while reducing operating costs by using 25% less resources and increasing revenues by handling 20% more clients compared to traditional approaches.

References

  1. C. Amza, A. Ch, A. Cox, S. Elnikety, R. Gil, K. Rajamani, and W. Zwaenepoel. Specification and Implementation of Dynamic Web Site Benchmarks. In 5th IEEE Workshop on Workload Characterization, pages 3-13, 2002.Google ScholarGoogle Scholar
  2. G. Bolch, S. Greiner, H. de Meer, and K. Trivedi. Queueing networks and Markov chains: modeling and performance evaluation with computer science applications. Wiley-Interscience, NY, USA, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. A. Budhiraja and A. Ghosh. A large deviations approach to asymptotically optimal control of crisscross network in heavy traffic. Annals of Applied Probability, 15(3):1887-1935, 2005.Google ScholarGoogle ScholarCross RefCross Ref
  4. D. Carrera, M. Steinder, I. Whalley, J. Torres, and E. Ayguade. Utility-based placement of dynamic web applications with fairness goals. In Network Operations and Management Symposium, 2008. NOMS 2008. IEEE, pages 9-16, April 2008.Google ScholarGoogle ScholarCross RefCross Ref
  5. E. Coffman Jr, M. Garey, and D. Johnson. Approximation algos for bin packing: a survey. 1996.Google ScholarGoogle Scholar
  6. A. Karve, T. Kimbrel, G. Pacifici, M. Spreitzer, M. Steinder, M. Sviridenko, and A. Tantawi. Dynamic placement for clustered web applications. In Proceedings of the 15th international conference on World Wide Web, pages 595-604. ACM New York, NY, USA, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. T. Kimbrel, M. Steinder, M. Sviridenko, and A. Tantawi. Dynamic Application Placement Under Service and Memory Constraints. In Experimental And Efficient Algorithms: 4th International Workshop, WEA 2005, Greece, May 10-13, 2005, Springer, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. S. Kumar and P. Kumar. Closed Queueing Networks in Heavy Traffic: Fluid Limits and Efficiency. Stochastic networks: stability and rare events, 1996.Google ScholarGoogle Scholar
  9. D. A. Menasce, L. W. Dowdy, and V. A. F. Almeida. Performance by Design: Computer Capacity Planning By Example. Prentice Hall PTR, Upper Saddle River, NJ, USA, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. A. G. Nilabja Roy and L. Dowdy. A Novel Capacity Planning Process for Performance Assurance of Multi-Tiered Web Applications. In Proceedings of MASCOTS 2010, Miami Beach, FL, USA, Aug. 2010.Google ScholarGoogle Scholar
  11. G. Pacifici, W. Segmuller, M. Spreitzer, M. Steinder, A. Tantawi, and A. Youssef. Managing the response time for multi-tiered web applications. IBM TJ Watson Research Center, Yorktown, NY, Tech. Rep. RC23651, 2005.Google ScholarGoogle Scholar
  12. E. Pekoz and J. Blanchet. Heavy Traffic Limits Via Brownian Embeddings. Probability in the Engineering and Informational Sciences, 20(04):595-598, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. N. Roy, J. S. Kinnebrew, N. Shankaran, G. Biswas, and D. C. Schmidt. Toward Effective Multicapacity Resource Allocation in Distributed Real-time and Embedded Systems. In Proceedings of the 11th ISORC), Orlando, Florida, May 2008. IEEE. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. N. Roy, Y. Xue, A. Gokhale, L. Dowdy, and D. C. Schmidt. A Component Assignment Framework for Improved Capacity and Assured Performance in Web Portals. In Proceedings of the DOA 2009, pages 671-689, Nov. 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. C. Stewart and K. Shen. Performance modeling and system management for multi-component online services. In Proceedings of the 2nd conference on Symposium on Networked Systems Design & Implementation-Volume 2 table of contents, pages 71-84. USENIX Association Berkeley, CA, USA, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. C. Stewart, K. Shen, A. Iyengar and J. Yin. EntomoModel: Understanding and Avoiding Performance Anomaly Manifestations. In Proceedings of International Symposium of Modeling, Analysis, and Simulation of Computer Systems, pages 3-13. IEEE Computer Soc, Los Alamitos, CA, USA, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. R. Suri, S. Sahu, and M. Vernon. Approximate Mean Value Analysis for Closed Queuing Networks with Multiple-Server Stations. In Proceedings of the 2007 Industrial Engineering Research Conference. 2007.Google ScholarGoogle Scholar
  18. C. Tang, M. Steinder, M. Spreitzer, and G. Pacifici. A scalable application placement controller for enterprise data centers. In Proceedings of the 16th international conference on World Wide Web, page 340. ACM, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. B. Urgaonkar, G. Pacifici, P. Shenoy, M. Spreitzer, and A. Tantawi. An Analytical Model for Multi-tier Internet Services and its Applications. SIGMETRICS Perform. Eval. Rev., 33(1):291-302, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. B. Urgaonkar, A. Rosenberg, P. Shenoy, and A. Zomaya. Application Placement on a Cluster of Servers. International Journal of Foundations of Computer Science, 18(5):1023-1041, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  21. B. Urgaonkar, P. Shenoy, A. Chandra, and P. Goyal. Dynamic provisioning of multi-tier internet applications. In Autonomic Computing, 2005. International Conference on, pages 217-228, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. B. Urgaonkar, P. Shenoy, and T. Roscoe. Resource overbooking and application profiling in a shared Internet hosting platform. ACM Transactions on Internet Technologies (TOIT), 9(1):1-45, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Q. Zhang, L. Cherkasova, G. Mathews, W. Greene, and E. Smirni. R-capriccio: a capacity planning and anomaly detection tool for enterprise services with live workloads. In Middleware '07: ACM/IFIP/USENIX 2007 International Conference on Middleware, pages 244-265, New York, NY, USA, 2007. Springer-Verlag New York, Inc. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Q. Zhang, L. Cherkasova, N. Mi, and E. Smirni. A regression-based analytic model for capacity planning of multi-tier applications. Cluster Computing, 11(3):197-211, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. A capacity planning process for performance assurance of component-based distributed systems

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader