short-paper

Constructing a data accessing layer for in-memory data grid

Authors:
Shuping Ji

Technology Center of Software Engineering, Institute of Software, Chinese Academy of Sciences, Beijing, P.R. China

Technology Center of Software Engineering, Institute of Software, Chinese Academy of Sciences, Beijing, P.R. China
View Profile

,
Wei Wang

Technology Center of Software Engineering, Institute of Software, Chinese Academy of Sciences, Beijing, P.R. China

Technology Center of Software Engineering, Institute of Software, Chinese Academy of Sciences, Beijing, P.R. China
View Profile

,
Chunyang Ye

Technology Center of Software Engineering, Institute of Software, Chinese Academy of Sciences, Beijing, P.R. China

Technology Center of Software Engineering, Institute of Software, Chinese Academy of Sciences, Beijing, P.R. China
View Profile

,
Jun Wei

Technology Center of Software Engineering, Institute of Software, Chinese Academy of Sciences, Beijing, P.R. China

Technology Center of Software Engineering, Institute of Software, Chinese Academy of Sciences, Beijing, P.R. China
View Profile

,
Zhaohui Liu

Technology Center of Software Engineering, Institute of Software, Chinese Academy of Sciences, Beijing, P.R. China

Technology Center of Software Engineering, Institute of Software, Chinese Academy of Sciences, Beijing, P.R. China
View Profile

Internetware '12: Proceedings of the Fourth Asia-Pacific Symposium on InternetwareOctober 2012Article No.: 15Pages 1–7https://doi.org/10.1145/2430475.2430490

Published:30 October 2012Publication History

Internetware '12: Proceedings of the Fourth Asia-Pacific Symposium on Internetware

Pages 1–7

ABSTRACT

In-memory data grid (IMDG) is a novel data processing middleware for Internetware. It provides higher scalability and performance compared with traditional rational database. However, because the data stored in IMDG must follow the key/value data model, new challenges have been proposed. One important aspect is that IMDG does not support standard data accessing languages such as JPA and SQL, and application developers must design their programs according to the peculiarities of an IMDG product. This results in complex and error-prone code, especially for the programmers who have no deep understanding of IMDG. In this paper, we propose a data accessing reference architecture for IMDG and a methodology to design and implement its data accessing layer. In this methodology, data accessing engine construction, data model designation and join operation supporting are presented. Moreover, following this methodology, we develop and implement a JPA compatible data accessing engine for Hazelcast as a case study, which proves the feasibility of our approach.

References

Hasso Plattner. 2009. A common database approach for OLTP and OLAP using an in-memory column database. In Proceedings of the 2009 ACM SIGMOD International Conference on Management of data (SIGMOD '09), Carsten Binnig and Benoit Dageville (Eds.). ACM, New York, NY, USA, 1--2. Google ScholarDigital Library
J. Dean and S. Ghemawat. MapReduce: Simplified data processing on large clusters. Communications of the ACM, 51(1): 107--113, 2008. Google ScholarDigital Library
B. Chattopadhyay, L. Lin, W. Liu, S. Mittal, P. Aragonda, V. Lychagina, Y. Kwon, and M. Wong. Tenzing: A SQL Implementation on the MapReduce Framework. PVLDB, 4(12):1318--1327, 2011.Google Scholar
R. Lee, et al., "YSmart: Yet Another SQL-to-MapReduce Translator," 31st International Conference on Distributed Computing Systems (Icdcs 2011), pp. 25--36, 2011. Google ScholarDigital Library
JPA: http://www.oracle.com/technetwork/articles/javaee/jpa-137156.html.Google Scholar
Terence Parr and Russell Quong. ANTLR: A predicated-LL(k) parser generator. Journal of Software Practice and Experience, 25(7), 1995. Google ScholarDigital Library
Oracle Coherence: http://www.oracle.com/technetwork/middleware/coherence/overview/index.html.Google Scholar
GigaSpaces XAP: http://www.gigaspaces.com/datagrid.Google Scholar
VMware GemFire: http://www.vmware.com/products/application-platform/vfabric-gemfire/overview.html.Google Scholar
Hazelcast: http://www.hazelcast.com/.Google Scholar
Infinispan: http://www.jboss.org/infinispan/.Google Scholar
R. Pike, S. Dorward, R. Griesemer, and S. Quinlan. Interpreting the data: Parallel analysis with Sawzall. Scientifc Programming, 13(4):277--298, 2005. Google ScholarDigital Library
C. Olston, B. Reed, U. Srivastava, R. Kumar, and A. Tomkins. Pig Latin: a not-so-foreign language for data processing. In Proceedings of the 2008 ACM SIGMOD international conference on Management of data, pages 1099--1110. ACM, 2008. Google ScholarDigital Library
A. Thusoo, J. S. Sarma, N. Jain, Z. Shao, P. Chakka, S. Anthony, H. Liu, P. Wycko_, and R. Murthy. Hive: a warehousing solution over a Map-Reduce framework. Proceedings of the VLDB Endowment, 2(2):1626--1629, 2009. Google ScholarDigital Library
A. Abouzeid, K. Bajda-Pawlikowski, D. Abadi, A. Silberschatz, and A. Rasin. HadoopDB: an architectural hybrid of MapReduce and DBMS technologies for analytical workloads. Proceedings of the VLDB Endowment, 2:922--933, August 2009. Google ScholarDigital Library
G. L. Sanders and S. K. Shin. Denormalization effects on performance of RDBMS. In Proceedings of the HICSS Conference, January 2001. Google ScholarDigital Library
S. K. Shin and G. L. Sanders. Denormalisation strategies for data retrieval from data warehouses. Decision Support Systems, 42(1):267--282, October 2006. Google ScholarDigital Library
Caching policy: http://en.wikipedia.org/wiki/Cache_(computing).Google Scholar
Json: http://www.json.org/.Google Scholar
P. P. Chen. The Entity-Relationship Model: Towards a unified view of Data. ACM Transactions on Database Systems, 1:9--36, Jan 1976. Google ScholarDigital Library
Z. Wei, G. Pierre, and C. H. Chi. Scalable Join Queries in Cloud Data Stores. 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing. May 2012. Google ScholarDigital Library
TPC-W: http://www.tpc.org/tpcw/default.asp.Google Scholar
Hibernate ORM: http://www.hibernate.org/.Google Scholar
OpenJPA: http://openjpa.apache.org/.Google Scholar
TopLink: http://www.oracle.com/technetwork/middleware/toplink/overview/index.htmlGoogle Scholar
M. Keith and M. Schnicariol, "Introduction Pro JPA 2," ed: Apress, 2010, pp. 1--16.Google Scholar

Index Terms

Constructing a data accessing layer for in-memory data grid
1. Information systems
  1. Data management systems
    1. Information integration
    2. Middleware for databases
      1. Object-relational mapping facilities
2. Software and its engineering
  1. Software notations and tools
    1. Context specific languages
      1. Interface definition languages

Recommendations

Open Source In-Memory Data Grid Systems: Benchmarking Hazelcast and Infinispan
ICPE '17: Proceedings of the 8th ACM/SPEC on International Conference on Performance Engineering

Distributed cache systems are used to store and retrieve frequently used data for faster access by exploiting the memory of more than one machine, but they appear as one logical big cache. In this paper, we studied the performance of two popular open ...
Read More
In-Memory Data Grid System for Real-Time Processing of Machine Sensor Data in a Smart Factory Environment
BigDAS '15: Proceedings of the 2015 International Conference on Big Data Applications and Services

Industry 4.0 is aimed at setting up a smart factory, which focuses on developing base technologies such as Internet Of Things (IOT), sensor, cyber-physical system and etc. The smart factory produces process data in real time through the sensor for each ...
Read More
Big Data Analytics
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
Internetware '12: Proceedings of the Fourth Asia-Pacific Symposium on Internetware
October 2012
204 pages
ISBN:9781450318884
DOI:10.1145/2430475
Conference Chairs:
Hong Mei
Peking University
,
Jian Lv
Nanjing University
,
Program Chairs:
Qianxiang Wang
Peking University
,
Lin Liu
Tsinghua University
Copyright © 2012 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 30 October 2012
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
data accessing
in-memory data grid (IMDG)
key/value data model
Qualifiers
- short-paper
Conference

Acceptance Rates
Overall Acceptance Rate55of111submissions,50%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 1
  Total Citations
  View Citations
- 241
  Total Downloads
- Downloads (Last 12 months)2
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Constructing a data accessing layer for in-memory data grid

Internetware '12: Proceedings of the Fourth Asia-Pacific Symposium on Internetware

ABSTRACT

References

Cited By

Index Terms

Recommendations

Open Source In-Memory Data Grid Systems: Benchmarking Hazelcast and Infinispan

In-Memory Data Grid System for Real-Time Processing of Machine Sensor Data in a Smart Factory Environment

Big Data Analytics

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Constructing a data accessing layer for in-memory data grid

Internetware '12: Proceedings of the Fourth Asia-Pacific Symposium on Internetware

ABSTRACT

References

Cited By

Index Terms

Recommendations

Open Source In-Memory Data Grid Systems: Benchmarking Hazelcast and Infinispan

In-Memory Data Grid System for Real-Time Processing of Machine Sensor Data in a Smart Factory Environment

Big Data Analytics

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media