skip to main content
10.1145/1317331.1317343acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Efficient computation of view subsets

Published: 09 November 2007 Publication History

Abstract

Over the past ten to fifteen years, data warehouse platformshave grown enormously, both in terms of their importance and their sheer size. Traditionally, such systems have been based upon a dimensional model known as the Star Schema that consists of a central fact table and a series of related dimension tables. Given the enormous size of the fact table, virtually all current systems augment the primary fact table with a small number of focused summary tables. Previous research has addressed the issue of the selection or identification of the most cost-effective summaries. However, the problem of efficiently computing a given view subset has received far less attention. In this paper, we present a suite of greedy algorithms for the construction of these view subsets. Experimental results demonstrate cost savings of between 20% and 70% relative to the naive alternatives, depending upon the degree of materialization required.

References

[1]
S. Agarwal, R. Agrawal, P. Deshpande, A. Gupta, J. Naughton, R. Ramakrishnan, and S. Sarawagi. On the computation of multidimensional aggregates. pages 506--521, 1996.
[2]
K. Beyer and R. Ramakrishnan. Bottom-up computation of sparse and iceberg cubes. ACM SIGMOD, pages 359--370, 1999.
[3]
F. Dehne, T. Eavis, and A. Rau-Chaplin. Top-down computation of partial ROLAP data cubes. HICSS, page 80223c, 2004.
[4]
W. Feller. An introduction to probability theory and its applications. John Wiley and Sons, 1957.
[5]
J. Gray, A. Bosworth, A. Layman, and H. Pirahesh. Data cube: A relational aggregation operator generalizing group-by, cross-tab, and sub-totals. ICDE, pages 152--159, 1996.
[6]
V. Harinarayan, A. Rajaraman, and J. Ullman. Implementing data cubes. ACM SIGMOD, pages 205--216, 1996.
[7]
P. Kalnis, N. Mamoulis, and D. Papadias. View selection using randomized search. Data Knowledge Engineering, 42(1):89--111, 2002.
[8]
L. Lakshmanan, J. Pei, and Y. Zhao. QC-trees: An efficient summary structure for semantic OLAP. In ACM SIGMOD, pages 64--75, 2003.
[9]
K. Morfonios and Y. Ioannidis. CURE for cubes: Cubing using a ROLAP engine. In VLDB, pages 379--390, 2006.
[10]
K. Ross and D. Srivastava. Fast computation of sparse data cubes. VLDB, pages 116--125, 1997.
[11]
S. Sarawagi, R. Agrawal, and A. Gupta. On computing the data cube. Technical Report RJ10026, IBM Almaden Research Center, San Jose, California, 1996.
[12]
A. Shukla, P. Deshpande, and J. Naughton. Materialized view selection for multidimensional datasets. VLDB, pages 488--499, 1998.
[13]
Y. Sismanis, A. Deligiannakis, N. Roussopoulos, and Y. Kotidis. Dwarf: Shrinking the PetaCube. ACM SIGMOD, pages 464--475, 2002.
[14]
Y. Zhao, P. Deshpande, and J. Naughton. An array-based algorithm for simultaneous multi-dimensional aggregates. ACM SIGMOD, pages 159--170, 1997.

Cited By

View all
  • (2009)High Performance Analytics with the R3-CacheProceedings of the 11th International Conference on Data Warehousing and Knowledge Discovery10.1007/978-3-642-03730-6_22(271-286)Online publication date: 30-Aug-2009
  • (2008)Report on the Tenth ACM International Workshop on Data Warehousing and OLAP (DOLAP'07)ACM SIGMOD Record10.1145/1374780.137479737:1(59-61)Online publication date: 1-Mar-2008

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
DOLAP '07: Proceedings of the ACM tenth international workshop on Data warehousing and OLAP
November 2007
112 pages
ISBN:9781595938275
DOI:10.1145/1317331
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 09 November 2007

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. OLAP
  2. data cube
  3. partial materialization

Qualifiers

  • Research-article

Conference

CIKM07

Acceptance Rates

Overall Acceptance Rate 29 of 79 submissions, 37%

Upcoming Conference

CIKM '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 07 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2009)High Performance Analytics with the R3-CacheProceedings of the 11th International Conference on Data Warehousing and Knowledge Discovery10.1007/978-3-642-03730-6_22(271-286)Online publication date: 30-Aug-2009
  • (2008)Report on the Tenth ACM International Workshop on Data Warehousing and OLAP (DOLAP'07)ACM SIGMOD Record10.1145/1374780.137479737:1(59-61)Online publication date: 1-Mar-2008

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media