ABSTRACT
The current database market is fairly evenly split between shared nothing and data sharing systems. While shared nothing systems are easier to build and scale, data sharing systems have advantages in load balancing. In this paper we explore adding data sharing functionality as an extension to a shared nothing database system. Our approach isolates the data sharing functionality from the rest of the system and relies on well-studied, robust techniques to provide the data sharing extension. This reduces the difficulty in providing data sharing functionality, yet provides much of the flexibility of a data sharing system. We present the design and implementation of Chimera -- a hybrid database system, targeted at load balancing for many workloads, and scale-out for read-mostly workloads. The results of our experiments demonstrate that we can achieve almost linear scalability and effective load balancing with less than 2% overhead during normal operation.
- T. E. Anderson, M. D. Dahlin, J. M. Neefe, D. A. Patterson, D. S. Roselli, and R. Y. Wang. Serverless network file systems. ACM Transactions on Computer Systems, 14(1):41--79, February 1996. Google ScholarDigital Library
- P. Bruni, R. Cornford, R. Garcia, S. Kaschta, and R. Kumar. DB2 9 for z/OS Technical Overview. IBM Redbooks, 2007. Google ScholarDigital Library
- M. Burrows. The chubby lock service for loosely-coupled distributed systems. In Proceedings of the 7th Symposium on Operating Systems Design and Implementation (OSDI), pages 335--350, September 2006. Google ScholarDigital Library
- F. Chang, J. Dean, S. Ghemawat, W. C. Hsieh, D. A. Wallach, M. Burrows, T. Chandra, A. Fikes, and R. E. Gruber. Bigtable: A distributed storage system for structured data. ACM Transactions on Computer Systems, 26(4):1--4, June 2008. Google ScholarDigital Library
- B. F. Cooper, R. Ramakrishnan, U. Srivastava, A. Silberstein, P. Bohannon, H. A. Jacobsen, N. Puz, D. Weaver, and R. Yerneni. PNUTS: Yahoo!'s hosted data serving platform. In Proceedings of the 34th International Conference on Very Large Data Bases (VLDB), pages 1277--1288, August 2008. Google ScholarDigital Library
- M. Devarakonda, B. Kish, and A. Mohindra. Recovery in the Calypso file system. ACM Transactions on Computer Systems, 14(3):287--310, August 1996. Google ScholarDigital Library
- S. Ghemawat, H. Gobioff, and S.-T. Leung. The Google file system. In Proceedings of the 19th Symposium on Operating Systems Principles (SOSP), pages 29--43, October 2003. Google ScholarDigital Library
- A. C. Goldstein. The design and implementation of a distributed file system. Digital Technical Journal, 1(5):45--55, September 1987.Google Scholar
- C. Gray and D. Cheriton. Leases: An efficient fault-tolerant mechanism for distributed file cache consistency. In Proceedings of the 12th Symposium on Operating Systems Principles (SOSP), pages 202--210, December 1989. Google ScholarDigital Library
- C. R. Hertel. Implementing CIFS: The Common Internet File System, chapter Introduction. Prentice Hall, 2003.Google Scholar
- P. Hunt, M. Konar, F. P. Junqueira, and B. Reed. Zookeeper: wait-free coordination for internet-scale systems. In Proceedings of the USENIX Annual Technical Conference (USENIXATC), pages 11--11, June 2010. Google ScholarDigital Library
- L. Lamport. The part-time parliament. ACM Transactions on Computer Systems, 16(2):133--169, February 1998. Google ScholarDigital Library
- E. K. Lee and C. A. Thekkath. Petal: Distributed virtual disks. In Proceedings of the 7th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), pages 84--92, October 1996. Google ScholarDigital Library
- D. Lomet. Recovery for shared disk systems using multiple redo logs. Technical Report 4, Digital Cambridge Research Lab, 1990.Google Scholar
- D. Lomet. Private locking and distributed cache management. In Proceedings of the 3rd International Conference on Parallel and distributed information systems (PDIS), pages 151--159, September 1994. Google ScholarDigital Library
- D. Lomet, R. Anderson, T. K. Rengarajan, and P. Spiro. How the Rdb/VMS data sharing system became fast. Technical Report 2, Digital Cambridge Research Lab, 1992.Google Scholar
- K. Loney. Oracle Database 11g The Complete Reference. McGraw-Hill, 2008. Google ScholarDigital Library
- J. MacCormick, N. Murphy, M. Najork, C. A. Thekkath, and L. Zhou. Boxwood: Abstractions as the foundation for storage infrastructure. In Proceedings of the 6th Symposium on Operating Systems Design and Implementation (OSDI), pages 8--8, December 2004. Google ScholarDigital Library
- R. B. Melnyk and P. C. Zikopoulos. DB2: The Complete Reference. McGraw-Hill, 2001. Google ScholarDigital Library
- C. Mohan and I. Narang. Efficient locking and caching of data in the multisystem shard disks transaction environment. In Proceedings of the 3rd International Conference on Extending Database Technology (EDBT), pages 453--468, March 1992. Google ScholarDigital Library
- Microsoft SQL Server 2008. Microsoft. {online} http://www.microsoft.com/sqlserver/2008/en/us/default.aspx.Google Scholar
- T. Rengarajan, P. Spiro, and W. Wright. High availability mechanisms of VAX DBMS software. Digital Technical Journal, 1(8):88--98, February 1989.Google Scholar
- R. Sandberg, D. Goldberg, S. Kleiman, D. Walsh, and B. Lyon. Design and implementation of the sun network file system. In Proceedings of Summer UNIX, June 1985.Google Scholar
- F. Schmuck and R. Haskin. GPFS: A shared-disk file system for large computing clusters. In Proceedings of the 1st Conference on File and Storage Technologies(FAST), pages 231--244, January 2002. Google ScholarDigital Library
- M. Stonebraker. The case for shared nothing. Database Engineering Bulletin, 9(1):4--9, 1986.Google Scholar
- C. A. Thekkath, T. P. Mann, and E. K. Lee. Frangipani: A scalable distributed file system. In Proceedings of the 16th Symposium on Operating Systems Principles (SOSP), pages 224--237, October 1997. Google ScholarDigital Library
- TPC-H: An Ad-hoc, Decision Support Benchmark. {online} http://www.tpc.org/tpch/.Google Scholar
Index Terms
- Chimera: data sharing flexibility, shared nothing simplicity
Recommendations
VMCTune: A Load Balancing Scheme for Virtual Machine Cluster Using Dynamic Resource Allocation
GCC '10: Proceedings of the 2010 Ninth International Conference on Grid and Cloud ComputingThis paper designs and implements a load balancing scheme based on dynamic resource allocation policy for virtual machine cluster, which are running under para-virtualization mode on a cluster of physical machines (PM) in shared storage architecture. It ...
Resource allocation algorithms for virtualized service hosting platforms
Commodity clusters are used routinely for deploying service hosting platforms. Due to hardware and operation costs, clusters need to be shared among multiple services. Crucial for enabling such shared hosting platforms is virtual machine (VM) technology,...
Performance of Memory Virtualization Using Global Memory Resource Balancing
Virtualization has become a universal generalization layer in contemporary data centers. By multiplexing hardware resources into multiple virtual machines and facilitating several operating systems to run on the same physical platform at the same time, ...
Comments