skip to main content
10.1145/2812428.2812429acmotherconferencesArticle/Chapter ViewAbstractPublication PagescompsystechConference Proceedingsconference-collections
research-article

Database technologies in the world of big data

Published: 25 June 2015 Publication History

Abstract

Now we have a number of database technologies called usually NoSQL, like key-value, column-oriented, and document stores as well as search engines and graph databases. Whereas SQL software vendors offer advanced products with the capability to handle highly complex queries and transactions, NoSQL databases share rather characteristics concerning scaling and performance, as e.g. auto-sharding, distributed query support, and integrated caching. Their drawbacks can be a lack of schema or data consistency, difficulty in testing and maintaining, and absence of a higher query language. Complex data modelling and the SQL language as the only access tool to data are missing here. On the other hand, last studies show that both SQL and NoSQL databases have value for both for transactional and analytical Big Data. Top databases providers offer rearchitected database technologies combining row data stores with columnar in-memory compression enabling processing large data sets and analytical querying, often over massive, continuous data streams. The technological progress led to development of massively parallel processing analytic databases. The paper presents some details of current database technologies, their pros and cons in different application environments, and emerging trends in this area.

References

[1]
Abramova, V, Bernardino, J., Furtado, P. Which NoSQL Database? A Performance Overview. Open Journal of Databases (OJDB), Vol. 1, No. 2, 2014, pp. 17--24.
[2]
Brewer, E. A.: Towards robust distributed systems. Invited Talk on PODC 2000, Portland, Oregon, 16-19 July, 2000.
[3]
Brewer, E. A.: CAP twelve years later: how the 'rules' have changed. Computer, Vol. 45, No. 2, 2012, pp. 22--29.
[4]
Bu, Y., Howe, Y., Balazinska, M, Ernstm M. D. The HaLoop approach to large-scale iterative data analysis. The VLDB Journal, Vol. 21, No. 2, 2012, pp. 169--190.
[5]
Chen, C. L. Ph., Zhang, CH.-Y.: Data-intensive applications, challenges, techniques and technologies: A survey on Big Data. Information Sciences 275 (2014), pp. 314--347.
[6]
Corbett, J. C., Dean, J. C., Epstein, M. et al. Spanner: Google's Globally-Distributed Database. In: Proc. of 10th USENIX Symposium on Operation Systems Design and Implementation (OSDI 2012), Hollywood, 2012, pp. 261--264.
[7]
Dean, D., Ghemawat, S. MapReduce: Simplified Data Processing on Large Clusters. Communications the ACM, 51(1), 2008, pp.107--113.
[8]
EnterpriseDB Corporation. Using the NoSQL Capabilities in Postgres. White Paper, 2014.
[9]
Gates, A., Natkovich, O., Chopra, S., Kamath, P., Narayanamurthy, et al. Building a high level dataflow system on top of MapReduce: The pig experience. PVLDB, 2(2), 2009, pp. 1414--1425.
[10]
Grolinger, K., Higashino, W. A., Tiwari, A., and Capretz, M. A. M.: Data management in cloud environments: NoSQL and NewSQL data stores. Journal of Cloud Computing: Advances, Systems and Applications, 2:22, 2013, pp. 1--24.
[11]
Google®: BigQuery Analytics. John Wiley & Sons, Inc., 2014.
[12]
Hellerstein, J., Stonebraker, M. Anatomy of a Database System. Chapter 1 in Reading in Database Systems, 4th Edition, MIT Press Book, 2005, pp. 42--54.
[13]
Lokegaonkar, S., Joshi, A.: Concurrency Control Schemes in NeWSQL Systems. Int. Journal of Computer Engineering and Technology (IJCET), Volume 5, Issue 8, August (2014), pp. 97--104.
[14]
Malewicz, G., Austern, M. H., Bik, A. J. C., Dehnert, J. C., Horn, I., Leiser, N., and Czajkowski, G.: Pregel: a system for large-scale graph processing. In: Proc. of SIGMOD '10 Proc. of the 2010 ACM SIGMOD Int. Conf. on Management of data, 2010, pp. 135--146.
[15]
Melnik, S., Gubarev, A., Long, J. J., Romer, G., Shivakumar, S., Tolton, M. and Vassilakis. T.: Dremel: Interactive analysis of web-scale datasets. In: Proc. of the 36th Int'l Conf on Very Large Data Bases, 2010, pp. 330--339.
[16]
Mohamed, M., A., Altrafi, O. G., Ismail, M. O. Relational vs. NoSQL Databases: A Survey. Int. Journal of Computer and Information Technology, Vol. 03, No. 03, 2014, pp. 598--601.
[17]
O'Neil, P., Cheng, E., Gawlick, D., O'Neil, E.: The log-structured merge-tree (LSM-tree). In: Acta Inf. 33 (1996), No. 4, pp. 351--385.
[18]
Pokorny, J. NoSQL Databases: a step to databases scalability in Web environment. International Journal of Web Information Systems, 9 (1), 2013, pp. 69--82,
[19]
Pokorný, J. New Database Architectures: Steps Towards Big Data Processing. In: Proc. of IADIS European Conference on Data Mining (ECDM'13), António Palma dos Reis and Ajith P. Abraham Eds., IADIS Press, 2013, pp. 3--10.
[20]
Radenski, A. Big Data, High-Performance Computing, and MapReduce. In: Proc. CompSysTech'14, June 27-28, 2014, Ruse, Bulgaria, 2014, pp. 13--24.
[21]
Rosenthal, D.: Next gen NoSQL: The demise of eventual consistency?, 2013 https://gigaom.com/2013/11/02/next-gen-nosql-the-demise-of-eventual-consistency/
[22]
Valiant, L. G.: A bridging model for parallel computation, Communications of the ACM, Volume 33, Issue 8, Aug. 1990, pp. 103--111.
[23]
Shute, J., Vingralek, R., Samwel, B., Handy, B., Whipkey, Ch., et al. F1 A Distributed SQL Database That Scales. PVLDB 6(11), 2013, pp. 1068--1079.

Cited By

View all
  • (2023)SQL and NoSQL Database Software Architecture Performance Analysis and Assessments—A Systematic Literature ReviewBig Data and Cognitive Computing10.3390/bdcc70200977:2(97)Online publication date: 12-May-2023
  • (2021)New Trends in Databases to NonSQL DatabasesEncyclopedia of Information Science and Technology, Fifth Edition10.4018/978-1-7998-3479-3.ch054(791-799)Online publication date: 2021
  • (2021)Experiments on Static Data Summarization Techniques2021 IEEE International Women in Engineering (WIE) Conference on Electrical and Computer Engineering (WIECON-ECE)10.1109/WIECON-ECE54711.2021.9829707(17-20)Online publication date: 4-Dec-2021
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
CompSysTech '15: Proceedings of the 16th International Conference on Computer Systems and Technologies
June 2015
411 pages
ISBN:9781450333573
DOI:10.1145/2812428
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

  • UORB: University of Ruse, Bulgaria
  • Querbie: Querbie
  • TECHUVB: Technical University of Varna, Bulgaria

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 25 June 2015

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. NewSQL databases
  2. NoSQL databases
  3. big analytics
  4. big data
  5. data distribution
  6. database technologies
  7. transaction processing

Qualifiers

  • Research-article

Funding Sources

  • Czech Science Foundation

Conference

CompSysTech '15
Sponsor:
  • UORB
  • Querbie
  • TECHUVB

Acceptance Rates

Overall Acceptance Rate 241 of 492 submissions, 49%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)57
  • Downloads (Last 6 weeks)4
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2023)SQL and NoSQL Database Software Architecture Performance Analysis and Assessments—A Systematic Literature ReviewBig Data and Cognitive Computing10.3390/bdcc70200977:2(97)Online publication date: 12-May-2023
  • (2021)New Trends in Databases to NonSQL DatabasesEncyclopedia of Information Science and Technology, Fifth Edition10.4018/978-1-7998-3479-3.ch054(791-799)Online publication date: 2021
  • (2021)Experiments on Static Data Summarization Techniques2021 IEEE International Women in Engineering (WIE) Conference on Electrical and Computer Engineering (WIECON-ECE)10.1109/WIECON-ECE54711.2021.9829707(17-20)Online publication date: 4-Dec-2021
  • (2019)The Role of NonSQL Databases in Big DataSmart Data10.1201/9780429507670-5(93-112)Online publication date: 19-Mar-2019
  • (2019)Database NewSQL Performance Evaluation for Big Data in the Public CloudCloud Computing and Big Data10.1007/978-3-030-27713-0_10(110-121)Online publication date: 27-Jul-2019
  • (2018)Big Data Storage and Management: Challenges and OpportunitiesEnvironmental Software Systems. Computer Science for Environmental Protection10.1007/978-3-319-89935-0_3(28-38)Online publication date: 25-Apr-2018
  • (2017)Distributed and parallel construction method for equi-width histogram in cloud databaseMultiagent and Grid Systems10.3233/MGS-17027313:3(311-329)Online publication date: 28-Sep-2017
  • (2017)Providing Clarity on Big Data Technologies: A Structured Literature Review2017 IEEE 19th Conference on Business Informatics (CBI)10.1109/CBI.2017.26(388-397)Online publication date: Jul-2017
  • (2016)On the Design of a Simple Network Resolver for DNS MiningProceedings of the 17th International Conference on Computer Systems and Technologies 201610.1145/2983468.2983513(105-112)Online publication date: 23-Jun-2016
  • (2016)Conceptual and Database Modelling of Graph DatabasesProceedings of the 20th International Database Engineering & Applications Symposium10.1145/2938503.2938547(370-377)Online publication date: 11-Jul-2016

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media