ABSTRACT
Recent OLAP systems, which are usually called HOLAP systems, are often developed using both NSM and DSM storages in a single database system for big data analytics in real-time. In HOLAP systems, a method for two types of storages usually depends on the type of SQL queries (e.g., insert, delete, or update). In short, we have the potential to find another approach to improve query processing time if we focus on the characteristics of data that an issued query handles.
In this paper, we propose a method for optimizing query processing in a HOLAP system considering the four types of data characteristics such as those of the data extracted by a correlated subquery and by a join operation using tables that are different in size with an appropriate index construction.
- D. Abadi. Hybrid Row-Column Stores: A General and Flexible Approach. http://blogs.teradata.com/data-points/hybrid-row-column-stores-general-flexible-approach/, March 2015. Retrieved September 2, 2015.Google Scholar
- D. J. Adadi, S. R. Madden, and N. Hachem. Column-stores vs. row-stores: how different are they really? In Proceedings of the 2008 ACM SIGMOD international conference on Management of data, pages 967--980. ACM, June 2008.Google Scholar
- ApacheSoftwareFoundation. Apache HBase. http://hbase.apache.org/, August 2015. Retrieved September 2, 2015.Google Scholar
- F. Chang, J. Dean, S. Ghemawat, W. C. Hsieh, D. A. Wallach, M. Burrows, T. Chandra, A. Fikes, and R. E. Gruber. Bigtable: A Distributed Storage System for Structured Data. ACM Transactions on Computer Systems (TOCS), 26(4), June 2008.Google Scholar
- Y.-M. Choi. Web-enabled OLAP Tutorial. http://www.cis.drexel.edu/faculty/song/courses/info%20607/tutorial_OLAP/index.htm, April 2005. Retrieved May 1, 2015.Google Scholar
- M. Colgan, J. Kamp, and S. Lee. Oracle Database In-Memory. Oracle Corporation, October 2014. An Oracle White Paper.Google Scholar
- G. P. Copeland and S. N. Khoshafian. A decomposition storage model. In Proceedings of the 1985 ACM SIGMOD international conference on Management of data, pages 268--279. ACM, May 1985.Google ScholarDigital Library
- F. Färber, S.-K. Cha, J. Primsch, C. Bornhövd, S. Sigg, and W. Lehner. SAP HANA Database: Data Management for Modern Business Applications. ACM SIGMOD Record, 40(4):45--51, December 2011.Google ScholarDigital Library
- H. Garcia-Moluna, J. D. Ullman, and J. Widom. Database Systems: Pearson New International Edition: The Complete Book. Pearson, August 2013.Google Scholar
- L. George. HBase: The Definitive Guide Random Access to Your Planet-Size Data. O'Reilly Media, April 2015.Google Scholar
- A. Lamb, M. Fuller, R. Varadarajan, N. Tran, B. Vandiver, L. Doshi, and C. Bear. The vertica analytic database: C-store 7 years later. In Proceedings of the 1985 ACM SIGMOD international conference on Management of data, pages 1790--1801. VLDB Endowment, August 2012.Google ScholarDigital Library
- R. Ramamurthy, D. J. DeWitt, and Q. Su. A Case for Fractured Mirrors. In Proceedings of the 28th International Conference on Very Large Data Bases, pages 430--441. VLDB Endowment, August 2002.Google ScholarDigital Library
- R. Seiffert. Online Transactional and Analytics Processing using SPSS, DB2 z/OS, DB2 Analytics Accelerator. http://www-304.ibm.com/connections/blogs/systemz/entry/online_transactional_and_analytics_processing, March 2013. Retrieved April 20, 2015.Google Scholar
- M. Stonebraker, D. J. Asadi, A. Batkin, X. Chen, M. Cherniack, M. Ferreira, E. Lau, A. Lin, S. Madden, E. O'Neil, P. O'Neil, A. Rasin, N. Tran, and S. Zdonik. C-Store: A Column-Oriented DBMS. In Proceedings of the 31st International Conference on Very Large Data Bases, pages 553--564. VLDB Endowment, August 2005.Google ScholarDigital Library
- J. Thomas and S. Dessloch. Near real-time data warehousing using state-of-the-art ETL tools. In Enabling Real-Time Business Intelligence, pages 100--117. Springer Berlin Heidelberg, September 2010.Google ScholarCross Ref
Index Terms
- Query processing optimization using disk-based row-store and column-store
Recommendations
Finding the best between the column store and row store Databases
ICIST '20: Proceedings of the 10th International Conference on Information Systems and TechnologiesRow store databases are unavoidable to manage data in Information Systems. However, Web growth and consumer high-connectivity generate incredible amount of data and change ways to manage it. In a matter of fact, traditional Row Stores hardly satisfy new ...
Hybrid Materialization in a Disk-Based Column-Store
CODS-COMAD '24: Proceedings of the 7th Joint International Conference on Data Science & Management of Data (11th ACM IKDD CODS and 29th COMAD)In column-oriented query processing, a materialization strategy determines when lightweight positions (row IDs) are translated into tuples. It is an important part of column-store architecture, since it defines the class of supported query plans, and, ...
Query Processing Optimization using Two Types of Storage Devices
IDEAS '15: Proceedings of the 19th International Database Engineering & Applications SymposiumCurrently, OLAP systems capable of handling a huge amount of data tend to use two types of storage device based on NSM and DSM, respectively.
Conventional approaches for query optimization in OLAP systems usually classify issued queries according to ...
Comments