skip to main content
10.1145/2661829.2661879acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Rubato DB: A Highly Scalable Staged Grid Database System for OLTP and Big Data Applications

Published:03 November 2014Publication History

ABSTRACT

This paper proposes a new formula protocol for distributed concurrency control, and specifies a staged grid architecture for highly scalable database management systems. The paper also describes novel implementation techniques of Rubato DB based on the proposed protocol and architecture. We have conducted extensive experiments which clearly show that Rubato DB is highly scalable with efficient performance under both TPC-C and YCSB benchmarks. Our paper verifies that the formula protocol and the staged grid architecture provide a satisfactory solution to one of the important challenges in the database systems: to develop a highly scalable database management system that supports various consistency levels from ACID to BASE.

References

  1. A. Abouzeid and etc. Hadoopdb: an architectural hybrid of mapreduce and dbms technologies for analytical workloads. In PVLDB, pages 922--933, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. M. K. Aguilera, A. Merchant, and etc. Sinfonia: A new paradigm for building scalable distributed systems. volume 27, pages 5:1--5:48. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. P. Alvaro, N. Conway, J. Hellerstein, and W. R. Marczak. Consistency analysis in bloom: a calm and collected approach. In CIDR, pages 249--260, 2011.Google ScholarGoogle Scholar
  4. J. Baker, C. Bond, J. Corbett, and etc. Megastore: Providing scalable, highly available storage for interactive services. In CIDR, pages 223--234, 2011.Google ScholarGoogle Scholar
  5. S. Blott and H. F. Korth. An almost-serial protocol for transaction execution in main-memory database systems. In PVLDB, pages 706--717, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. M. Brantner, D. Florescu, and etc. Building a database on s3. In SIGMOD, pages 251--264, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. R. Chaiken, B. Jenkins, and etc. Scope: easy and efficient parallel processing of massive data sets. Proc. VLDB Endow., 1(2):1265--1276, Aug. 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. F. Chang, J. Dean, S. Ghemawat, and etc. Bigtable: A distributed storage system for structured data. In ACM TOCS, volume 26, pages 1--26, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. J. Cohen, B. Dolan, and etc. Mad skills: new analysis practices for big data. PVLDB, 2(2):1481--1492, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. B. F. Cooper and etc. Pnuts: Yahoo!'s hosted data serving platform. In VLDB, pages 1277--1288, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. B. F. Cooper and etc. Benchmarking cloud serving systems with ycsb. In SoCC, pages 143--154, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. J. C. Corbett and etc. Spanner: Google's globally-distributed database. In OSDI, pages 251--264, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. J. Dean and S. Ghemawat. Mapreduce: Simplified data processing on large clusters. In Commun. of ACM, volume 51, pages 107--113, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. G. DeCandia, D. Hastorun, and etc. Dynamo: Amazon's highly available key-value store. In SOSP, pages 205--220, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. A. Fekete, D. Liarokapis, and etc. Making snapshot isolation serializable. In TODS, volume 30, pages 492--528, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. S. Harizopoulos and A. Ailamaki. A case for staged database systems. In CIDR, 2003.Google ScholarGoogle Scholar
  17. M. Isard, M. Budiu, and etc. Dryad: distributed data-parallel programs from sequential building blocks. ACM SIGOPS, 41(3):59--72, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. R. Kallman and etc. H-store: a high-performance, distributed main memory transaction processing system. In VLDB, pages 1496--1499, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. A. Lakshman. Cassandra: a decentralized structured storage system. In SIGOPS, pages 35--40, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. P.-A. Larson, S. Blanas, and etc. High-performance concurrency control mechanisms for main-memory databases. In PVLDB, volume 5, pages 298--309, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. K. Manassiev and etc. Exploiting distributed version concurrency in a transactional memory cluster. In ACM SIGPLAN, pages 198--208. ACM, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. S. Melnik and etc. Dremel: interactive analysis of web-scale datasets. PVLDB, 3(1-2):330--339, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. J. Rao and etc. Using paxos to build a scalable, consistent, and highly available datastore. volume 4, pages 243--254. VLDB Endowment, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. M. Stonebraker and R. Cattell. 10 rules for scalable performance in 'simple operation' datastores. Commun. ACM, 54(6):72--80, June 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. M. Stonebraker, S. Madden, and etc. The end of an architectural era: (it's time for a complete rewrite). In VLDB, pages 1150--1160, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. R. Thomas. A solution to the concurrency control problem for multiple copy databases. In Digest of papers IEEE COMPCON spring, pages 56--62, 1984.Google ScholarGoogle Scholar
  27. A. Thomson, T. Diamond, and etc. Calvin: fast distributed transactions for partitioned database systems. In SIGMOD, pages 1--12, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. A. Thusoo, J. S. Sarma, and etc. Hive: a warehousing solution over a map-reduce framework. volume 2, pages 1626--1629. VLDB Endowment, Aug. 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. TPC-C. http://www.tpc.org/tpcc/. 2010.Google ScholarGoogle Scholar
  30. VoltDB. http://voltdb.com/products/technology.Google ScholarGoogle Scholar
  31. M. Welsh, D. Culler, and E. Brewer. Seda: an architecture for well-conditioned, scalable internet services. In SOSP, pages 230--243, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Rubato DB: A Highly Scalable Staged Grid Database System for OLTP and Big Data Applications

              Recommendations

              Comments

              Login options

              Check if you have access through your login credentials or your institution to get full access on this article.

              Sign in
              • Published in

                cover image ACM Conferences
                CIKM '14: Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management
                November 2014
                2152 pages
                ISBN:9781450325981
                DOI:10.1145/2661829

                Copyright © 2014 ACM

                Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

                Publisher

                Association for Computing Machinery

                New York, NY, United States

                Publication History

                • Published: 3 November 2014

                Permissions

                Request permissions about this article.

                Request Permissions

                Check for updates

                Qualifiers

                • research-article

                Acceptance Rates

                CIKM '14 Paper Acceptance Rate175of838submissions,21%Overall Acceptance Rate1,861of8,427submissions,22%

                Upcoming Conference

              PDF Format

              View or Download as a PDF file.

              PDF

              eReader

              View online with eReader.

              eReader