skip to main content
technical-note

Automatic request categorization in internet services

Published: 31 August 2008 Publication History

Abstract

Modeling system performance and workload characteristics has become essential for efficiently provisioning Internet services and for accurately predicting future resource requirements on anticipated workloads. The accuracy of these models benefits substantially by differentiating among categories of requests based on their resource usage characteristics. However, categorizing requests and their resource demands often requires significantly more monitoring infrastructure. In this paper, we describe a method to automatically differentiate and categorize requests without requiring sophisticated monitoring techniques. Using machine learning, our method requires only aggregate measures such as total number of requests and the total CPU and network demands, and does not assume prior knowledge of request categories or their individual resource demands. We explore the feasibility of our method on the .Net PetShop 4.0 benchmark application, and show that it works well while being lightweight, generic, and easily deployable.

References

[1]
S. Bagchi, E. Hung, A. Iyengar, N. Vogl, and N. Wadia. Capacity planning tools for web and grid environments. In Proceedings of the 1st International Conference on Performance Evaluation Methodolgies and Tools, October 1996.
[2]
P. Barham, A. Donnelly, R. Isaacs, and R. Mortier. Using Magpie for request extraction and workload modelling. In Proceedings of the OSDI'04, December 2004.
[3]
M. Y. Chen, A. Accardi, E. Kiciman, A. Fox, D. Patterson, and E. Brewer. Path-Based Failure and Evolution Management. In Proceedings of the NSDI'04, March 2004.
[4]
A. Cockcroft and B. Walker. Capacity Planning for Internet Services. Sun Press, 2001.
[5]
The FastICA package for Matlab and R. http://www.cis.hut.fi/projects/ica/fastica/.
[6]
O. Friman, M. Borga, P. Lundberg, and H. Knutsson. Exploratory fMRI Analysis by Autocorrelation Maximization. NeuroImage, 16(2):454--464, 2002.
[7]
D. Gmach, J. Rolia, L. Cherkasova, and A. Kemper. Workload Analysis and Demand Prediction of Enterprise Data Center Applications. In Proceedings of the IISWC'07, September 2007.
[8]
M. Goldszmidt, D. Palma, and B. Sabata. On the Quantification of e-Business Capacity. In Proceedings of the Electronic Commerce, 2001.
[9]
X. Huang, F. Peng, A. An, and D. Schuurmans. Dynamic Web Log Session Identification With Statistical Language Models. Journal of the American Society for Information Science and Technology, 55(14):1290--1303, 2004.
[10]
A. Hyvarinen. Gaussian moments for noisy independent component ananlysis. IEEE Signal Processing Letters, 6(6), June 1999.
[11]
A. Hyvarinen, J. Karhunen, and E. Oja. Independent Component Analysis. Wiley-Interscience, 2001.
[12]
J. Judge. A Model for the Marginal Distribution of Aggregate Per Second HTTP Request Rate. In Proceedings of the 10th IEEE Workshop on Local and Metropolitan Area Networks, 1999.
[13]
Z. Liu, L. Wynter, C. X. Xia, and F. Zhang. Performance inference of queueing models for IT systems using end-to-end measurements. Performance Evaluation, 63(2006):36--60.
[14]
N. Mi, Q. Zhang, A. Riska, E. Smirni, and E. Riedel. Performance Impacts of Autocorrelated Flows in Multi-tiered Systems. In Proceedings of the Performance'07, October 2007.
[15]
W. Naanaa and J.-M. Nuzillard. Blind source separation of positive and partially correlated data. Signal Processing, 85(9):1711--1722, 2005.
[16]
Microsoft .NET Pet Shop 4.0. http://msdn2.microsoft.com/.
[17]
C. Stewart, T. Kelly, and A. Zhang. Exploiting Nonstationarity for Performance Prediction. In Proceedings of the EuroSys'07, March 2007.
[18]
C. Stewart and K. Shen. Performance Modeling and System Management for Multi-component Online Services. In Proceedings of the NSDI'05, May 2005.
[19]
TeamQuest model: Capacity planning software with modeling. http://www.teamquest.com/.
[20]
B. Urgaonkar, G. Pacifici, P. Shenoy, M. Spreitzer, and A. Tantawi. An Analytical Model for Multi-tier Internet Services and its Applications. In Proceedings of the SIGMETRICS'05, June 2005.
[21]
Microsoft Visual Studio 2005 Team Suite. http://msdn2.microsoft.com/.
[22]
K. H. Yeung and C. W. Szeto. On the Modeling of WWW Request Arrivals. In Proceedings of the International Conference on Parallel Processing Workshops, 1999.
[23]
Q. Zhang, L. Cherkasova, and E. Smirni. A Regression-Based Analytic Model for Dynamic Resource Provisioning of Multi-Tier Applications. In Proceedings of the ICAC'07, June 2007.

Cited By

View all
  • (2021)SARDEACM Transactions on Autonomous and Adaptive Systems10.1145/346336915:2(1-31)Online publication date: 9-Jun-2021
  • (2021)Learning Queuing Networks via Linear OptimizationProceedings of the ACM/SPEC International Conference on Performance Engineering10.1145/3427921.3450245(51-60)Online publication date: 9-Apr-2021
  • (2021)WITHDRAWN: A review on modeling techniques of quality-of-serviceMaterials Today: Proceedings10.1016/j.matpr.2020.12.536Online publication date: Feb-2021
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM SIGMETRICS Performance Evaluation Review
ACM SIGMETRICS Performance Evaluation Review  Volume 36, Issue 2
September 2008
138 pages
ISSN:0163-5999
DOI:10.1145/1453175
Issue’s Table of Contents
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 31 August 2008
Published in SIGMETRICS Volume 36, Issue 2

Check for updates

Qualifiers

  • Technical-note

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)4
  • Downloads (Last 6 weeks)1
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2021)SARDEACM Transactions on Autonomous and Adaptive Systems10.1145/346336915:2(1-31)Online publication date: 9-Jun-2021
  • (2021)Learning Queuing Networks via Linear OptimizationProceedings of the ACM/SPEC International Conference on Performance Engineering10.1145/3427921.3450245(51-60)Online publication date: 9-Apr-2021
  • (2021)WITHDRAWN: A review on modeling techniques of quality-of-serviceMaterials Today: Proceedings10.1016/j.matpr.2020.12.536Online publication date: Feb-2021
  • (2021)Estimation of Sharing Dependencies in Personal Storage Clouds Using Ensemble Learning ApproachesOperationalizing Multi-Cloud Environments10.1007/978-3-030-74402-1_4(65-85)Online publication date: 18-Sep-2021
  • (2020)Resource Demand EstimationSystems Benchmarking10.1007/978-3-030-41705-5_17(365-388)Online publication date: 29-Aug-2020
  • (2019)Earning and Utility Limits in Fisher MarketsACM Transactions on Economics and Computation10.1145/33402347:2(1-35)Online publication date: 18-Jul-2019
  • (2019)Simple Pricing Schemes for the CloudACM Transactions on Economics and Computation10.1145/33279737:2(1-27)Online publication date: 10-Jun-2019
  • (2019)Bid-Limited TargetingACM Transactions on Economics and Computation10.1145/33279687:2(1-33)Online publication date: 10-Jun-2019
  • (2019)ANCHORACM Transactions on Privacy and Security10.1145/330130522:2(1-36)Online publication date: 26-Feb-2019
  • (2019)Utilizing Clustering to Optimize Resource Demand Estimation Approaches2019 IEEE 4th International Workshops on Foundations and Applications of Self* Systems (FAS*W)10.1109/FAS-W.2019.00043(134-139)Online publication date: Jun-2019
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media