skip to main content
10.1145/3093241.3093276acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiccdaConference Proceedingsconference-collections
research-article

On-The-Fly Academic Linked Data Integration

Published: 19 May 2017 Publication History

Abstract

The web of Linked Open Data (LOD) has a prominent and rapid evolution recently. Over the last few years, LOD had developed to involve a wide range of various domains. Due to these facts, and the great interconnections among linked open datasets, linked data integration task had gained a huge attention and became a focal point of research. LOD applications aim to incorporate data from different LOD sources. Unfortunately, these sources of data are heterogeneous in schema and/or in vocabularies. Due to this heterogeneity, numerous challenges are emerging that have to be overcome. In this paper, a LOD integration framework is proposed, which aims to tackle these challenges. It works on integrating academic LOD datasets that reside in different LOD repositories with intrinsic schema and vocabularies heterogeneity. An automatic mapping technique in the integration processes is proposed in this paper. Consequently, an obvious decrease in execution time for the entire integration process, as well as, a great progress in the integrated data quality assessment metrics has been achieved.

References

[1]
Sikos, Leslie. Mastering Structured Data on the Semantic Web: From HTML5 Microdata to Linked Open Data. Apress, 2015.
[2]
Guéret, Christophe, Stephane Boyera, Mike Powell, and Martin Murillo.The Semantic Web for all. Semantic Web, 6(1): 3--4, 2015.
[3]
Latif, Atif, Patrick Hoefler, and Klaus Tochtermann. Interlinking Scientific Authors with the LOD Cloud: A Case Study. International Conference on Networked Digital Technologies. Springer Berlin Heidelberg, pages 99--108, 2012.
[4]
Klyne, Graham, and Jeremy J. Carroll. Resource description framework (RDF), February 2004. W3C Recommendation, 2014.
[5]
Bizer, Christian, Anja Jentzsch, and Richard Cyganiak. State of the LOD Cloud (2011). URL: http://www4.wiwiss.fuberlin. de/lodcloud/state/(last visited June 2012), 2013.
[6]
Mendes, Pablo N., Hannes Mühleisen, and Christian Bizer. Sieve: linked data quality assessment and fusion. In Proceedings of the 2012 Joint EDBT/ICDT Workshops, pages 116--123. ACM, 2012.
[7]
Euzenat, Jérôme, Jérôme David, Angela Locoro, and Armen Inants. Context-based ontology matching and data interlinking. PhD diss., Lindicle,2015.
[8]
Burdick, Douglas, Ronald Fagin, Phokion G. Kolaitis, Lucian Popa, and Wang-Chiew Tan. A declarative framework for linking entities. In LIPIcs-Leibniz International Proceedings in Informatics, vol. 31. Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik, 2015.
[9]
Debattista, Jeremy, Christoph Lange, and Sören Auer. Luzzu Quality Metric Language--A DSL for Linked Data Quality Assessment. arXiv preprint arXiv:1504.07758, 2015.
[10]
Wimalaratne, Sarala M., Jerven Bolleman, Nick Juty, Toshiaki Katayama, Michel Dumontier, Nicole Redaschi, Nicolas Le Novère, Henning Hermjakob, and Camille Laibe. SPARQL-enabled identifier conversion with Identifiers. org. Bioinformatics 31(11): 1875--1877, 2015.
[11]
Schultz, Andreas, Andrea Matteini, Robert Isele, Christian Bizer, and Christian Becker. Ldif-linked data integration framework. In Proceedings of the Second International Conference on Consuming Linked Data-Volume 782, pages 125--130. CEUR-WS. org, 2011.
[12]
Kettouch, Mohamed Salah, Cristina Luca, Mike Hobbs, and Arooj Fatima. Data integration approach for semi-structured and structured data (Linked Data). In 2015 IEEE 13th International Conference on Industrial Informatics (INDIN), pages 820--825, 2015.
[13]
Gupta, Shubham, Pedro Szekely, Craig A. Knoblock, Aman Goel, Mohsen Taheriyan, and Maria Muslea. Karma: A system for mapping structured sources into the Semantic Web. In Extended Semantic Web Conference, pages 430--434,2012.
[14]
Liu, Wenqiang, Jun Liu, Yanan Qian, Bifan Wei, and Qinghua Zheng. Truth Discovery to Resolve Object Conflicts in Linked Data. arXiv preprint arXiv:1509.00104, 2015.
[15]
Krisnadhi, A. A., Yingjie Hu, Krzysztof Janowicz, Pascal Hitzler, Robert Arko, Suzanne Carbotte, Cynthia Chandler et al. The GeoLink framework for pattern-based linked data integration. In Proceedings of the ISWC 2015 Posters & Demonstrations Track a track within the 14th International Semantic Web Conference, ISWC. 2015.
[16]
Bizer, Christian, and Andreas Schultz. The R2R framework: Publishing and discovering mappings on the web. In Proceedings of the First International Conference on Consuming Linked Data-Volume 665, pages 97--108, 2010.
[17]
Binding, Ceri, and Douglas Tudhope. Improving interoperability using vocabulary linked data. International Journal on Digital Libraries 17(1):5--21, 2016.
[18]
Miles, A., and S. Bechhofer. SKOS simple knowledge organization system In: W3C recommendation, 2015.
[19]
Nguyen, Khai, Ryutaro Ichise, and Bac Le. SLINT: a schema-independent linked data interlinking system. In Proceedings of the 7th International Conference on Ontology Matching-Volume 946, pages 1--12, 2012.
[20]
Šír, Michal, Petr Fiedler, and Václav Kaczmarczyk. Interoperability and ontology for heterogeneous systems. In Recent Advances in Computational Intelligence., pages 64--67, 2010.
[21]
Aumueller, David, Hong-Hai Do, Sabine Massmann, and Erhard Rahm. Schema and ontology matching with COMA++. In Proceedings of the 2005 ACM SIGMOD international conference on Management of data, pages 906--908, 2005.
[22]
Lambrix, Patrick, and He Tan. SAMBO-A System for Aligning and Merging Biomedical Ontologies. Web Semantics: Science, Services and Agents on the World Wide Web 4(3), 2011.
[23]
Volz, Julius, Christian Bizer, Martin Gaedke, and Georgi Kobilarov. Silk-A Link Discovery Framework for the Web of Data. 2nd Workshop about Linked Data on the Web (LDOW) 538, 2009.
[24]
Hassanzadeh, Oktie, Anastasios Kementsietsidis, Lipyeow Lim, Renée J. Miller, and Min Wang. Semantic Link Discovery over Relational Data. In Semantic Search over the Web, pages 193--223. Springer Berlin Heidelberg, 2012.
[25]
Raimond, Yves, Christopher Sutton, and Mark B. Sandler. Automatic Interlinking of Music Datasets on the Semantic Web. In Proceedings of the Linked Data on the Web (LDOW) Workshop, Beijing, China, April 22, 369, 2008.
[26]
Bryl, Volha, Christian Bizer, Robert Isele, Mateja Verlic, Soon Gill Hong, Sammy Jang, Mun Yong Yi, and Key-Sun Choi. Interlinking and knowledge fusion. In Linked Open Data--Creating Knowledge Out of Interlinked Data, pages 70--89. Springer International Publishing, 2014.
[27]
Rivero, Carlos R., Inma Hernández, David Ruiz, and Rafael Corchuelo. Generating SPARQL executable mappings to integrate ontologies. In International Conference on Conceptual Modeling, pages 118--131. Springer Berlin Heidelberg, 2011.
[28]
Digital Bibliography and Library Project (2009), http://www.informatik.uni-trier.de/~ley/db/ (Last visited 15/1/2017).
[29]
Semantic Web Dog Food (2009), http://data.semanticweb.org/ (Last visited 15/1/2017).
[30]
Juran, Joseph M., F. M. Gryna, and R. S. Bingham Jr. Quality control handbook. McGraw-Hill Book Company, Chapters 9:22, 1974.
[31]
Knight, Shirlee-ann, and Janice M. Burn. Developing a framework for assessing information quality on the World Wide Web. Informing Science: International Journal of an Emerging Transdiscipline 8(5):159--172, 2005.
[32]
Zaveri, Amrapali, Anisa Rula, Andrea Maurino, Ricardo Pietrobon, Jens Lehmann, and Sören Auer. Quality assessment for linked data: A survey. Semantic Web 7(1): 63--93, 2015.
[33]
Carothers, Gavin, and Andy Seaborne. RDF 1.1 N-Triples: A line-based syntax for an RDF graph. World Wide Web Consortium. http://www.w3.org/TR/n-triples/. Accessed 24, 2014.
[34]
Harris, Steve, Andy Seaborne, and Eric Prud'hommeaux. SPARQL 1.1 query language. W3C Recommendation 21, 2013.
[35]
Buil-Aranda, Carlos, Marcelo Arenas, Oscar Corcho, and Axel Polleres. Federating queries in SPARQL 1.1: Syntax, semantics and evaluation. Web Semantics: Science, Services and Agents on the World Wide Web 18(1): 1--17, 2013.
[36]
Carroll, Jeremy J., Ian Dickinson, Chris Dollin, Dave Reynolds, Andy Seaborne, and Kevin Wilkinson. Jena: implementing the semantic web recommendations. In Proceedings of the 13th international World Wide Web conference on Alternate track papers & posters, pages 74--83. ACM, 2004.
[37]
Horridge, Matthew, Holger Knublauch, Alan Rector, Robert Stevens, and Chris Wroe. A Practical Guide To Building OWL Ontologies Using The Proteǵ-OWL Plugin and CO-ODE ToolsEdition 1.0. 2009.
[38]
Burdick, Douglas, Ronald Fagin, Phokion G. Kolaitis, Lucian Popa, and Wang-Chiew Tan. A declarative framework for linking entities. In LIPIcs-Leibniz International Proceedings in Informatics, vol. 31. Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik, 2015.
[39]
Giannopoulou, Ioanna, Fatiha Saïs, and Rallou Thomopoulos. Linked Data Annotation and Fusion driven by Data Quality Evaluation. In EGC, pages 257--262, 2015.

Cited By

View all
  • (2024)PAPAYA: A library for performance analysis of SQL-based RDF processing systemsSemantic Web10.3233/SW-243582(1-19)Online publication date: 5-Apr-2024
  • (2024)The 1st Workshop on Decentralised Search and RecommendationCompanion Proceedings of the ACM Web Conference 202410.1145/3589335.3641302(1705-1708)Online publication date: 13-May-2024
  • (2024)ESPRESSO: A Framework to Empower Search on the Decentralized WebData Science and Engineering10.1007/s41019-024-00263-wOnline publication date: 26-Nov-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICCDA '17: Proceedings of the International Conference on Compute and Data Analysis
May 2017
307 pages
ISBN:9781450352413
DOI:10.1145/3093241
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

In-Cooperation

  • University of Florida: University of Florida

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 19 May 2017

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Linked Data Schema mapping
  2. Linked Data Vocabulary mapping
  3. Linked Data integration
  4. Linked Open Data
  5. Resource Description Framework (RDF)
  6. SPARQL

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ICCDA '17

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)5
  • Downloads (Last 6 weeks)0
Reflects downloads up to 08 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2024)PAPAYA: A library for performance analysis of SQL-based RDF processing systemsSemantic Web10.3233/SW-243582(1-19)Online publication date: 5-Apr-2024
  • (2024)The 1st Workshop on Decentralised Search and RecommendationCompanion Proceedings of the ACM Web Conference 202410.1145/3589335.3641302(1705-1708)Online publication date: 13-May-2024
  • (2024)ESPRESSO: A Framework to Empower Search on the Decentralized WebData Science and Engineering10.1007/s41019-024-00263-wOnline publication date: 26-Nov-2024
  • (2023)ESPRESSO: A Framework for Empowering Search on Decentralized WebWeb Information Systems Engineering – WISE 202310.1007/978-981-99-7254-8_28(360-375)Online publication date: 21-Oct-2023
  • (2022)Towards Prescriptive Analyses of Querying Large Knowledge GraphsNew Trends in Database and Information Systems10.1007/978-3-031-15743-1_59(639-647)Online publication date: 29-Aug-2022
  • (2021)Bench-Ranking: A First Step Towards Prescriptive Performance Analyses For Big Data Frameworks2021 IEEE International Conference on Big Data (Big Data)10.1109/BigData52589.2021.9671277(241-251)Online publication date: 15-Dec-2021
  • (2021)MOOCs Semantic Interoperability: Towards Unified and Pedagogically Enriched Model for Building a Linked Data RepositoryDigital Technologies and Applications10.1007/978-3-030-73882-2_56(621-631)Online publication date: 26-Jun-2021
  • (2019)Multidimensional Integration of RDF DatasetsBig Data Analytics and Knowledge Discovery10.1007/978-3-030-27520-4_9(119-135)Online publication date: 3-Aug-2019

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media