ABSTRACT
In this paper we measure and analyze the workload of Yahoo! Video, the second-largest U.S. video sharing site, to understand its nature and its impact on online video data center design. We find statistical properties along both the static and temporal dimensions of the workload, including file duration and popularity distributions, arrival rate dynamics and predictability, and workload stationarity and burstiness. Using queueing-theoretic techniques, we then extend the measurement study with a virtual design of the workload and capacity management components of a data center that serves the measured workload. The design reveals how Service Level Agreements (SLAs) and workload scheduling schemes affect the design and operation of such large-scale video distribution systems.