ABSTRACT
For service providers of multi-tiered component-based applications, such as web portals, assuring high performance and availability to their customers without impacting revenue requires effective and careful capacity planning that aims at minimizing the number of resources, and utilizing them efficiently while simultaneously supporting a large customer base and meeting their service level agreements. This paper presents a novel, hybrid capacity planning process that results from a systematic blending of 1) analytical modeling, where traditional modeling techniques are enhanced to overcome their limitations in providing accurate performance estimates; 2) profile-based techniques, which determine performance profiles of individual software components for use in resource allocation and balancing resource usage; and 3) allocation heuristics that determine minimum number of resources to allocate software components. Our results illustrate that using our technique, performance (i.e., bounded response time) can be assured while reducing operating costs by using 25% less resources and increasing revenues by handling 20% more clients compared to traditional approaches.
- C. Amza, A. Ch, A. Cox, S. Elnikety, R. Gil, K. Rajamani, and W. Zwaenepoel. Specification and Implementation of Dynamic Web Site Benchmarks. In 5th IEEE Workshop on Workload Characterization, pages 3-13, 2002.Google Scholar
- G. Bolch, S. Greiner, H. de Meer, and K. Trivedi. Queueing networks and Markov chains: modeling and performance evaluation with computer science applications. Wiley-Interscience, NY, USA, 1998. Google ScholarDigital Library
- A. Budhiraja and A. Ghosh. A large deviations approach to asymptotically optimal control of crisscross network in heavy traffic. Annals of Applied Probability, 15(3):1887-1935, 2005.Google ScholarCross Ref
- D. Carrera, M. Steinder, I. Whalley, J. Torres, and E. Ayguade. Utility-based placement of dynamic web applications with fairness goals. In Network Operations and Management Symposium, 2008. NOMS 2008. IEEE, pages 9-16, April 2008.Google ScholarCross Ref
- E. Coffman Jr, M. Garey, and D. Johnson. Approximation algos for bin packing: a survey. 1996.Google Scholar
- A. Karve, T. Kimbrel, G. Pacifici, M. Spreitzer, M. Steinder, M. Sviridenko, and A. Tantawi. Dynamic placement for clustered web applications. In Proceedings of the 15th international conference on World Wide Web, pages 595-604. ACM New York, NY, USA, 2006. Google ScholarDigital Library
- T. Kimbrel, M. Steinder, M. Sviridenko, and A. Tantawi. Dynamic Application Placement Under Service and Memory Constraints. In Experimental And Efficient Algorithms: 4th International Workshop, WEA 2005, Greece, May 10-13, 2005, Springer, 2005. Google ScholarDigital Library
- S. Kumar and P. Kumar. Closed Queueing Networks in Heavy Traffic: Fluid Limits and Efficiency. Stochastic networks: stability and rare events, 1996.Google Scholar
- D. A. Menasce, L. W. Dowdy, and V. A. F. Almeida. Performance by Design: Computer Capacity Planning By Example. Prentice Hall PTR, Upper Saddle River, NJ, USA, 2004. Google ScholarDigital Library
- A. G. Nilabja Roy and L. Dowdy. A Novel Capacity Planning Process for Performance Assurance of Multi-Tiered Web Applications. In Proceedings of MASCOTS 2010, Miami Beach, FL, USA, Aug. 2010.Google Scholar
- G. Pacifici, W. Segmuller, M. Spreitzer, M. Steinder, A. Tantawi, and A. Youssef. Managing the response time for multi-tiered web applications. IBM TJ Watson Research Center, Yorktown, NY, Tech. Rep. RC23651, 2005.Google Scholar
- E. Pekoz and J. Blanchet. Heavy Traffic Limits Via Brownian Embeddings. Probability in the Engineering and Informational Sciences, 20(04):595-598, 2006. Google ScholarDigital Library
- N. Roy, J. S. Kinnebrew, N. Shankaran, G. Biswas, and D. C. Schmidt. Toward Effective Multicapacity Resource Allocation in Distributed Real-time and Embedded Systems. In Proceedings of the 11th ISORC), Orlando, Florida, May 2008. IEEE. Google ScholarDigital Library
- N. Roy, Y. Xue, A. Gokhale, L. Dowdy, and D. C. Schmidt. A Component Assignment Framework for Improved Capacity and Assured Performance in Web Portals. In Proceedings of the DOA 2009, pages 671-689, Nov. 2009. Google ScholarDigital Library
- C. Stewart and K. Shen. Performance modeling and system management for multi-component online services. In Proceedings of the 2nd conference on Symposium on Networked Systems Design & Implementation-Volume 2 table of contents, pages 71-84. USENIX Association Berkeley, CA, USA, 2005. Google ScholarDigital Library
- C. Stewart, K. Shen, A. Iyengar and J. Yin. EntomoModel: Understanding and Avoiding Performance Anomaly Manifestations. In Proceedings of International Symposium of Modeling, Analysis, and Simulation of Computer Systems, pages 3-13. IEEE Computer Soc, Los Alamitos, CA, USA, 2010. Google ScholarDigital Library
- R. Suri, S. Sahu, and M. Vernon. Approximate Mean Value Analysis for Closed Queuing Networks with Multiple-Server Stations. In Proceedings of the 2007 Industrial Engineering Research Conference. 2007.Google Scholar
- C. Tang, M. Steinder, M. Spreitzer, and G. Pacifici. A scalable application placement controller for enterprise data centers. In Proceedings of the 16th international conference on World Wide Web, page 340. ACM, 2007. Google ScholarDigital Library
- B. Urgaonkar, G. Pacifici, P. Shenoy, M. Spreitzer, and A. Tantawi. An Analytical Model for Multi-tier Internet Services and its Applications. SIGMETRICS Perform. Eval. Rev., 33(1):291-302, 2005. Google ScholarDigital Library
- B. Urgaonkar, A. Rosenberg, P. Shenoy, and A. Zomaya. Application Placement on a Cluster of Servers. International Journal of Foundations of Computer Science, 18(5):1023-1041, 2007.Google ScholarCross Ref
- B. Urgaonkar, P. Shenoy, A. Chandra, and P. Goyal. Dynamic provisioning of multi-tier internet applications. In Autonomic Computing, 2005. International Conference on, pages 217-228, 2005. Google ScholarDigital Library
- B. Urgaonkar, P. Shenoy, and T. Roscoe. Resource overbooking and application profiling in a shared Internet hosting platform. ACM Transactions on Internet Technologies (TOIT), 9(1):1-45, 2009. Google ScholarDigital Library
- Q. Zhang, L. Cherkasova, G. Mathews, W. Greene, and E. Smirni. R-capriccio: a capacity planning and anomaly detection tool for enterprise services with live workloads. In Middleware '07: ACM/IFIP/USENIX 2007 International Conference on Middleware, pages 244-265, New York, NY, USA, 2007. Springer-Verlag New York, Inc. Google ScholarDigital Library
- Q. Zhang, L. Cherkasova, N. Mi, and E. Smirni. A regression-based analytic model for capacity planning of multi-tier applications. Cluster Computing, 11(3):197-211, 2008. Google ScholarDigital Library
Index Terms
- A capacity planning process for performance assurance of component-based distributed systems
Recommendations
A capacity planning process for performance assurance of component-based distributed systems (abstracts only)
For service providers of multi-tiered component-based applications, such as web portals, assuring high performance and availability to their customers without impacting revenue requires effective and careful capacity planning that aims at minimizing the ...
Profiling services for resource optimization and capacity planning in distributed systems
The capacity needs of online services are mainly determined by the volume of user loads. For large-scale distributed systems running such services, it is quite difficult to match the capacities of various system components. In this paper, a novel and ...
Comments