skip to main content
10.1145/3141128.3141139acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiccbdcConference Proceedingsconference-collections
research-article

Big Data and New Data Warehousing Approaches

Published:17 September 2017Publication History

ABSTRACT

Big data are a data trend present around us mainly through Internet -- social networks and smart devices and meters -- mostly without us being aware of them. Also they are a fact that both industry and scientific research needs to deal with. They are interesting from analytical point of view, for they contain knowledge that cannot be ignored and left unused. Traditional system that supports the advanced analytics and knowledge extraction -- data warehouse -- is not able to cope with large amounts of fast incoming various and unstructured data, and may be facing a paradigm shift in terms of utilized concepts, technologies and methodologies, which have become a very active research area in the last few years. This paper provides an overview of research trends important for the big data warehousing, concepts and technologies used for data storage and (ETL) processing, and research approaches done in attempts to empower traditional data warehouses for handling big data.

References

  1. Abelló, A., Ferrarons, J. and Romero, O. 2011. Building cubes with MapReduce. In Proceedings of the ACM 14th international workshop on Data Warehousing and OLAP (Glasgow, Scotland, UK, 2011). DOLAP '11. ACM Press, 17--24. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Song, I. 2015. Big data technologies, use cases, and research issuses. Slides. In ACM SAC. (Salamanca, Spain, 2015), Retrieved May 2, 2017, from SIGAPP website: https://www.sigapp.org/sac/sac2015/SAC%202015_Keynote_BDT_IY_Song-2.pdfGoogle ScholarGoogle Scholar
  3. Bondarev A., Zakirov D. and Zakirov D. 2015. Data warehouse on Hadoop platform for decision support systems in education. In 2015 Twelve International Conference on Electronics Computer and Computation (Almaty, Kazakhstan, 2015), ICECCO 2015. IEEE, 1--4.Google ScholarGoogle Scholar
  4. Jukić, N., Sharma, A., Nestorov, S. and Jukić, B. 2015. Augmenting Data Warehouses with Big Data. Information Systems Management, 32, 3 (Jul. 2015), 200--209. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Duda, J. 2012. Business Intelligence and NoSQL Databases. Information Systems in Management, 1, 1 (2012), 25--37.Google ScholarGoogle Scholar
  6. Yuan, L.-Y., Wu, L., You, J.-H. and Chi, Y. 2014. Rubato DB: A Highly Scalable Staged Grid Database System for OLTP and Big Data Applications. In 23rd ACM International Conference on Information and Knowledge Management (Melbourne, Australia, 2014). CIKM 2014. ACM Press, 1--10. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Chang, F. et al. 2008. Bigtable: A Distributed Storage System for Structured Data. ACM Transactions on Computer Systems, 26, 2 (Jun. 2008), 1--26. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Thusoo, A. et al. 2010. Hive - a petabyte scale data warehouse using Hadoop. In IEEE 26th International Conference on Data Engineering (Long Beach, CA, USA, 2010). ICDE 2010. IEEE, 996--1005.Google ScholarGoogle Scholar
  9. Abouzeid, A., Bajda-Pawlikowski, K., Abadi, D., Silberschatz, A. and Rasin, A. 2009. HadoopDB: an architectural hybrid of MapReduce and DBMS technologies for analytical workloads. Proceedings of the VLDB Endowment. 2, 1 (Aug. 2009), 922--933. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Herodotou, H. et al. 2011. Starfish: A Self-tuning System for Big Data Analytics. In 5th Biennial Conference on Innovative Data Systems Research (Asilomar, CA, USA, 2011). CIDR 2011. 261--272.Google ScholarGoogle Scholar
  11. Chen, Y., Xu, C., Rao, W., Min, H. and Su, G. 2015. Octopus: Hybrid Big Data Integration Engine. In IEEE 7th International Conference on Cloud Computing Technology and Science (Vancouver, Canada, 2015). CloudCom 2015. IEEE, 462--466. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. A. Kolyadenko. 2015. olap4cloud user guide. Retrieved January 29, 2017, from GitHub akolyadenko repository: https://github.com/akolyadenko/olap4cloud/Google ScholarGoogle Scholar
  13. Chevalier, M., El Malki, M., Kopliku, A., Teste, O. and Tournier, R. 2016. Document-oriented data warehouses: Models and extended cuboids. In 2016 IEEE Tenth International Conference on Research Challenges in Information Science (Grenoble, France, 2016). RCIS 2016. IEEE, 1--11.Google ScholarGoogle Scholar
  14. Dehdouh, K., Bentayeb, F., Boussaid, O. and Kabachi, N. 2014. Columnar NoSQL CUBE: Agregation operator for columnar NoSQL data warehouse. In 2014 IEEE International Conference on Systems, Man and Cybernetics (San Diego, CA, USA, 2014). SMC 2014. IEEE, 3828--3833.Google ScholarGoogle ScholarCross RefCross Ref
  15. Dehdouh, K. 2016. Building OLAP Cubes from Columnar NoSQL Data Warehouses. In 6th International Conference Model and Data Engineering (Almería, Spain, 2016), MEDI 2016. Springer International Publishing, 166--179.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Big Data and New Data Warehousing Approaches

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Other conferences
        ICCBDC '17: Proceedings of the 2017 International Conference on Cloud and Big Data Computing
        September 2017
        135 pages
        ISBN:9781450353434
        DOI:10.1145/3141128

        Copyright © 2017 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 17 September 2017

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Research
        • Refereed limited

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader