ABSTRACT
Quick recovery from a failure is required essentially for distributed stream processing systems. We focus on single-node fail-stop failures occurred in high availability stream processing systems in this paper. One of high availability mechanisms is to provide a backup node for a processing node in the systems. We propose exploitation of backup nodes for reducing recovery cost in such an environment. We report some simulation results to show the effectiveness of our proposal.
- ]]M. Cherniack, H. Balakrishnan, M. Balazinska, D. Carney, U. Çetintemel, Y. Xing, and S. B. Zdonik. Scalable distributed stream processing. In Proc. CIDR, 2003. Available at http://www-db.cs.wisc.edu/cidr/cidr2003/program/p23.pdf.Google Scholar
- ]]J.-H. Hwang, M. Balazinska, A. Rasin, U. Çetintemel, M. Stonebraker, and S. B. Zdonik. High-availability algorithms for distributed stream processing. In Proc. ICDE, pages 779--790, 2005. Google ScholarDigital Library
- ]]J.-H. Hwang, U. Çetintemel, and S. B. Zdonik. Fast and highly-available stream processing over wide area networks. In Proc. ICDE, pages 804--813, 2008. Google ScholarDigital Library
- ]]J.-H. Hwang, Y. Xing, U. Çetintemel, and S. B. Zdonik. A cooperative, self-configuring high-availability solution for stream processing. In Proc. ICDE, pages 176--185, 2007.Google ScholarCross Ref
- ]]Y. Kwon, M. Balazinska, and A. Greenberg. Fault-tolerant stream processing using a distributed, replicated file system. Proc. VLDB Endow., 1:574--585, August 2008. Google ScholarDigital Library
- ]]J. Nagle. Congestion control in IP/TCP internetworks, RFC 896, Jan. 1984. http://tools.ietf.org/html/rfc896. Google ScholarDigital Library
- ]]The ns-3 network simulator. http://www.nsnam.org/.Google Scholar
Comments