Abstract
The unprecedented growth of microblog services poses significant challenges for the underlying infrastructure (i.e., geo-distributed data centers) in terms of network traffic and service latency. Furthermore, the dynamic evolution of microblog status generates a heavy workload for data consistency maintenance. In this article, motivated by insights into propagation patterns obtained through cross-media analysis, we propose a novel cache strategy for microblog service systems that reduces inter-data center traffic and consistency maintenance cost while achieving low service latency. Specifically, we first present a microblog classification method that utilizes external knowledge from correlated domains to categorize microblogs. We then conduct a large-scale measurement on a representative online social network system to study category-based propagation diversity across region and time scales. These insights illustrate common social habits in creating and consuming microblogs and further motivate our architecture design. Finally, we formulate the content cache problem as a constrained optimization problem. By jointly using the Lyapunov optimization framework and the simplex gradient method, we find the optimal online control strategy. Extensive trace-driven experiments further demonstrate that our algorithm reduces the system cost by 24.5% compared with traditional approaches at the same service latency.
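To make the idea of an online control strategy under a latency constraint more concrete, the following is a minimal sketch of the generic Lyapunov drift-plus-penalty rule applied to a single cache decision per time slot. It is not the article's algorithm: the cost model, the parameter names (`v`, `traffic_cost`, `sync_cost`, `miss_latency`, `latency_budget`), and the `SlotObservation` type are all assumptions chosen for illustration, and the simplex gradient step mentioned above is omitted.

```python
from dataclasses import dataclass


@dataclass
class SlotObservation:
    """Hypothetical per-slot workload for one item at one remote data center."""
    requests: int  # reads arriving in this slot
    updates: int   # status changes that a cached copy would need to sync


def drift_plus_penalty_cache(observations, v=10.0, traffic_cost=1.0,
                             sync_cost=0.5, miss_latency=1.0,
                             latency_budget=0.2):
    """Return (decisions, final_queue); decisions[t] is True when caching.

    All cost and latency coefficients are illustrative assumptions.
    """
    queue = 0.0  # virtual queue: backlog of latency above the per-slot budget
    decisions = []
    for obs in observations:
        # Option A: keep a local copy; pay consistency cost, no miss latency.
        cost_cache = sync_cost * obs.updates
        latency_cache = 0.0
        # Option B: fetch remotely; pay inter-data center traffic and latency.
        cost_fetch = traffic_cost * obs.requests
        latency_fetch = miss_latency * obs.requests
        # Drift-plus-penalty: weigh cost by v and latency by the queue backlog.
        score_cache = v * cost_cache + queue * latency_cache
        score_fetch = v * cost_fetch + queue * latency_fetch
        cache = score_cache <= score_fetch
        decisions.append(cache)
        latency = latency_cache if cache else latency_fetch
        # Queue grows when the slot exceeds the budget and drains otherwise.
        queue = max(queue + latency - latency_budget, 0.0)
    return decisions, queue
```

In this sketch a larger `v` favors lower cost at the expense of a longer latency backlog, while a growing virtual queue eventually pushes the decision toward caching even when remote fetching is cheaper in a single slot.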
Index Terms
- Cost-Optimized Microblog Distribution over Geo-Distributed Data Centers: Insights from Cross-Media Analysis