skip to main content
10.1145/2076623.2076642acmotherconferencesArticle/Chapter ViewAbstractPublication PagesideasConference Proceedingsconference-collections
research-article

Chimera: data sharing flexibility, shared nothing simplicity

Published:21 September 2011Publication History

ABSTRACT

The current database market is fairly evenly split between shared nothing and data sharing systems. While shared nothing systems are easier to build and scale, data sharing systems have advantages in load balancing. In this paper we explore adding data sharing functionality as an extension to a shared nothing database system. Our approach isolates the data sharing functionality from the rest of the system and relies on well-studied, robust techniques to provide the data sharing extension. This reduces the difficulty in providing data sharing functionality, yet provides much of the flexibility of a data sharing system. We present the design and implementation of Chimera -- a hybrid database system, targeted at load balancing for many workloads, and scale-out for read-mostly workloads. The results of our experiments demonstrate that we can achieve almost linear scalability and effective load balancing with less than 2% overhead during normal operation.

References

  1. T. E. Anderson, M. D. Dahlin, J. M. Neefe, D. A. Patterson, D. S. Roselli, and R. Y. Wang. Serverless network file systems. ACM Transactions on Computer Systems, 14(1):41--79, February 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. P. Bruni, R. Cornford, R. Garcia, S. Kaschta, and R. Kumar. DB2 9 for z/OS Technical Overview. IBM Redbooks, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. M. Burrows. The chubby lock service for loosely-coupled distributed systems. In Proceedings of the 7th Symposium on Operating Systems Design and Implementation (OSDI), pages 335--350, September 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. F. Chang, J. Dean, S. Ghemawat, W. C. Hsieh, D. A. Wallach, M. Burrows, T. Chandra, A. Fikes, and R. E. Gruber. Bigtable: A distributed storage system for structured data. ACM Transactions on Computer Systems, 26(4):1--4, June 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. B. F. Cooper, R. Ramakrishnan, U. Srivastava, A. Silberstein, P. Bohannon, H. A. Jacobsen, N. Puz, D. Weaver, and R. Yerneni. PNUTS: Yahoo!'s hosted data serving platform. In Proceedings of the 34th International Conference on Very Large Data Bases (VLDB), pages 1277--1288, August 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. M. Devarakonda, B. Kish, and A. Mohindra. Recovery in the Calypso file system. ACM Transactions on Computer Systems, 14(3):287--310, August 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. S. Ghemawat, H. Gobioff, and S.-T. Leung. The Google file system. In Proceedings of the 19th Symposium on Operating Systems Principles (SOSP), pages 29--43, October 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. A. C. Goldstein. The design and implementation of a distributed file system. Digital Technical Journal, 1(5):45--55, September 1987.Google ScholarGoogle Scholar
  9. C. Gray and D. Cheriton. Leases: An efficient fault-tolerant mechanism for distributed file cache consistency. In Proceedings of the 12th Symposium on Operating Systems Principles (SOSP), pages 202--210, December 1989. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. C. R. Hertel. Implementing CIFS: The Common Internet File System, chapter Introduction. Prentice Hall, 2003.Google ScholarGoogle Scholar
  11. P. Hunt, M. Konar, F. P. Junqueira, and B. Reed. Zookeeper: wait-free coordination for internet-scale systems. In Proceedings of the USENIX Annual Technical Conference (USENIXATC), pages 11--11, June 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. L. Lamport. The part-time parliament. ACM Transactions on Computer Systems, 16(2):133--169, February 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. E. K. Lee and C. A. Thekkath. Petal: Distributed virtual disks. In Proceedings of the 7th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), pages 84--92, October 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. D. Lomet. Recovery for shared disk systems using multiple redo logs. Technical Report 4, Digital Cambridge Research Lab, 1990.Google ScholarGoogle Scholar
  15. D. Lomet. Private locking and distributed cache management. In Proceedings of the 3rd International Conference on Parallel and distributed information systems (PDIS), pages 151--159, September 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. D. Lomet, R. Anderson, T. K. Rengarajan, and P. Spiro. How the Rdb/VMS data sharing system became fast. Technical Report 2, Digital Cambridge Research Lab, 1992.Google ScholarGoogle Scholar
  17. K. Loney. Oracle Database 11g The Complete Reference. McGraw-Hill, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. J. MacCormick, N. Murphy, M. Najork, C. A. Thekkath, and L. Zhou. Boxwood: Abstractions as the foundation for storage infrastructure. In Proceedings of the 6th Symposium on Operating Systems Design and Implementation (OSDI), pages 8--8, December 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. R. B. Melnyk and P. C. Zikopoulos. DB2: The Complete Reference. McGraw-Hill, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. C. Mohan and I. Narang. Efficient locking and caching of data in the multisystem shard disks transaction environment. In Proceedings of the 3rd International Conference on Extending Database Technology (EDBT), pages 453--468, March 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Microsoft SQL Server 2008. Microsoft. {online} http://www.microsoft.com/sqlserver/2008/en/us/default.aspx.Google ScholarGoogle Scholar
  22. T. Rengarajan, P. Spiro, and W. Wright. High availability mechanisms of VAX DBMS software. Digital Technical Journal, 1(8):88--98, February 1989.Google ScholarGoogle Scholar
  23. R. Sandberg, D. Goldberg, S. Kleiman, D. Walsh, and B. Lyon. Design and implementation of the sun network file system. In Proceedings of Summer UNIX, June 1985.Google ScholarGoogle Scholar
  24. F. Schmuck and R. Haskin. GPFS: A shared-disk file system for large computing clusters. In Proceedings of the 1st Conference on File and Storage Technologies(FAST), pages 231--244, January 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. M. Stonebraker. The case for shared nothing. Database Engineering Bulletin, 9(1):4--9, 1986.Google ScholarGoogle Scholar
  26. C. A. Thekkath, T. P. Mann, and E. K. Lee. Frangipani: A scalable distributed file system. In Proceedings of the 16th Symposium on Operating Systems Principles (SOSP), pages 224--237, October 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. TPC-H: An Ad-hoc, Decision Support Benchmark. {online} http://www.tpc.org/tpch/.Google ScholarGoogle Scholar

Index Terms

  1. Chimera: data sharing flexibility, shared nothing simplicity

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Other conferences
          IDEAS '11: Proceedings of the 15th Symposium on International Database Engineering & Applications
          September 2011
          274 pages
          ISBN:9781450306270
          DOI:10.1145/2076623

          Copyright © 2011 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 21 September 2011

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Author Tags

          Qualifiers

          • research-article

          Acceptance Rates

          Overall Acceptance Rate74of210submissions,35%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader