DOI: 10.1145/1247480.1247585
Article

InfiniteDB: a PC-cluster based parallel massive database management system

Published: 11 June 2007

Abstract

This paper describes InfiniteDB, a PC-cluster based parallel DBMS developed by the authors. InfiniteDB aims to store and process massive databases efficiently, in response to the rapid growth in database size and the need for high-performance analysis of massive databases. It supports intra-query, inter-query, intra-operation, inter-operation, and pipelined parallelism. It provides effective strategies for processing massive databases, including multiple data declustering methods, declustering-aware algorithms for executing relational and other database operations, and an adaptive query optimization method. It also provides parallel data warehousing and data mining functions, a coordinator-wrapper mechanism to support the integration of heterogeneous information resources on the Internet, and a fault-tolerant and resilient infrastructure. It has been used in many applications and has proved quite effective for storing and processing massive databases in practice.
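
The abstract does not say which declustering methods InfiniteDB implements. As a generic illustration of what data declustering means, the sketch below spreads the tuples of a relation across cluster nodes using three textbook schemes: round-robin, hash, and range partitioning. All function names, the `crc32`-based hash, and the sample rows are assumptions made for this example and are not taken from the paper.

```python
from bisect import bisect_right
from zlib import crc32


def round_robin(rows, n_nodes):
    """Assign the i-th row to node i mod n_nodes (hypothetical helper)."""
    parts = [[] for _ in range(n_nodes)]
    for i, row in enumerate(rows):
        parts[i % n_nodes].append(row)
    return parts


def hash_decluster(rows, n_nodes, key):
    """Assign each row to a node using a deterministic hash of its key."""
    parts = [[] for _ in range(n_nodes)]
    for row in rows:
        h = crc32(str(row[key]).encode("utf-8"))
        parts[h % n_nodes].append(row)
    return parts


def range_decluster(rows, boundaries, key):
    """Assign each row to the node whose key range contains it.

    `boundaries` is a sorted list of split points; node j holds keys in
    [boundaries[j-1], boundaries[j]), with open-ended first and last ranges.
    """
    parts = [[] for _ in range(len(boundaries) + 1)]
    for row in rows:
        parts[bisect_right(boundaries, row[key])].append(row)
    return parts


# Illustrative data only: eight rows with amounts 0, 10, ..., 70.
rows = [{"id": i, "amount": 10 * i} for i in range(8)]
rr = round_robin(rows, 3)
hp = hash_decluster(rows, 3, "id")
rp = range_decluster(rows, [20, 50], "amount")
```

Under these assumptions, a range query on `amount` in [20, 50) touches only one partition of `rp`, whereas `hp` keeps all tuples with the same `id` on the same node, which is the property an equi-join on `id` could exploit. A declustering-aware operator would choose its execution strategy based on which of these layouts the data is in.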

Cited By

  • (2011) A best-effort approach to an infrastructure for Chinese Web related research. Frontiers of Electrical and Electronic Engineering in China, 6(2):388-396. DOI: 10.1007/s11460-011-0137-z. Online publication date: 17-May-2011
  • (2010) Storage and index support for data intensive web applications. 2010 4th International Universal Communication Symposium, pages 62-68. DOI: 10.1109/IUCS.2010.5666650. Online publication date: Oct-2010

    Published In

    SIGMOD '07: Proceedings of the 2007 ACM SIGMOD international conference on Management of data
    June 2007
    1210 pages
    ISBN:9781595936868
    DOI:10.1145/1247480
    • General Chairs: Lizhu Zhou, Tok Wang Ling
    • Program Chair: Beng Chin Ooi

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. data declustering
    2. parallel algorithm
    3. parallel database
    4. parallel query processing

    Qualifiers

    • Article

    Conference

    SIGMOD/PODS07
    Acceptance Rates

    Overall Acceptance Rate 785 of 4,003 submissions, 20%
