ABSTRACT
An immense amount of data now comes from various devices, sensors, social networks, and IoT services. Among these data, open data plays an increasingly important role in practice. Many individuals and organizations collect a broad range of data types to perform their analytic tasks. However, current open data platforms still have many limitations. Among these, data management, an important part of analytic service development, needs significant improvement. The main reason is that the explosion of data from many sources has made this process increasingly complicated and costly. We therefore propose a data management system that lets multitenant users easily find and access the data and metadata they need. The system also helps improve the performance of the platform.
Index Terms
- Advanced Multitenant Hadoop in Smart Open Data Platform