skip to main content
10.1145/2814864.2814880acmotherconferencesArticle/Chapter ViewAbstractPublication PagessemanticsConference Proceedingsconference-collections
research-article

Streaming transformation of XML to RDF using XPath-based mappings

Published:16 September 2015Publication History

ABSTRACT

The Extensible Markup Language (XML) has become a widely adopted data interchange format. With the rise of Linked Data published using the Resource Description Framework (RDF), a number of tools for transforming XML to RDF have been developed. Specifying XML→RDF mappings for these tools often requires skills in programming languages such as XSLT or XQuery. Moreover, these tools are rarely able to deal with large XML inputs. We introduce an XML to RDF transformation approach, which is based on mappings comprising RDF triple templates that employ simple XPath expressions. Thanks to the restricted XPath expressions, which can be evaluated against a stream of XML data, our implementation can handle extremely large input XML files. To process the XML input efficiently, we employ XML filtering techniques and a strategy for selecting relevant XML nodes to generate RDF triples from. We show that the time complexity of our mapping algorithm is linear in the size of the XML input and also prove its practical efficiency with an evaluation on large real-world data.

References

  1. W. Akhtar et al. XSPARQL: Traveling between the XML and RDF worlds--and avoiding the XSLT pilgrimage. Springer, 2008.Google ScholarGoogle ScholarCross RefCross Ref
  2. N. Bikakis et al. "The XML and Semantic Web Worlds: Technologies, Interoperability and Integration: A Survey of the State of the Art". English. In: Semantic Hyper/Multimedia Adaptation. Ed. by I. E. Anagnostopoulos et al. Vol. 418. Studies in Computational Intelligence. Springer Berlin Heidelberg, 2013, pp. 319--360. url: http://dx.doi.org/10.1007/978-3-642-28977-4_12.Google ScholarGoogle Scholar
  3. S. Bischof et al. "Mapping between RDF and XML with XSPARQL". English. In: Journal on Data Semantics 1.3 (2012), pp. 147--185. url: http://dx.doi.org/10.1007/s13740-012-0008-7.Google ScholarGoogle ScholarCross RefCross Ref
  4. H. Bohring and S. Auer. "Mapping XML to OWL Ontologies". In: Leipziger Informatik-Tage 72 (2005), pp. 147--156.Google ScholarGoogle Scholar
  5. F Breitling. "A standard transformation from XML to RDF via XSLT". In: Astronomische Nachrichten 330.7 (2009), pp. 755--760.Google ScholarGoogle ScholarCross RefCross Ref
  6. D. Connolly. Gleaning Resource Descriptions from Dialects of Languages (GRDDL). W3C, 2007. url: http://www.w3.org/TR/grddl/.Google ScholarGoogle Scholar
  7. Y. Diao et al. "Yfilter: Efficient and scalable filtering of XML documents". In: Data Engineering, 2002. Proceedings. 18th International Conference on. IEEE. 2002, pp. 341--342. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. A. Dimou et al. "Mapping Hierarchical Sources into RDF Using the RML Mapping Language". In: Semantic Computing (ICSC), 2014 IEEE International Conference on. 2014, pp. 151--158. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. S. Z. H. Gillani, M. I. Ali, and A. Mileo. "XSPARQL-Viz: A Mashup-Based Visual Query Editor for XSPARQL". In: The Semantic Web: ESWC 2013 Satellite Events. Springer, 2013, pp. 219--224.Google ScholarGoogle Scholar
  10. H. L. Joel Farrell. Semantic Annotations for WSDL and XML Schema. W3C, 2007.Google ScholarGoogle Scholar
  11. M. Klein. "Interpreting XML documents via an RDF schema ontology". In: Database and Expert Systems Applications, 2002. Proceedings. 13th International Workshop on. 2002, pp. 889--893. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. C. Lange. "Krextor--an extensible XML to RDF extraction framework". In: Scripting and Development for the Semantic Web (SFSW) (2009).Google ScholarGoogle Scholar
  13. T. Rodrigues, P. Rosa, and J. Cardoso. "Moving from syntactic to semantic organizations using JXML2OWL". In: Computers in Industry 59.8 (2008), pp. 808--819. url: http://www.sciencedirect.com/science/article/pii/S016636150800064X. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. S. S. Sahoo et al. "A survey of current approaches for mapping of relational databases to RDF". In: W3C RDB2RDF Incubator Group Report (2009).Google ScholarGoogle Scholar
  15. P. Saxena and R. Kamal. "System architecture and effect of depth of query on XML document filtering using PFilter". In: Contemporary Computing (IC3), 2013 Sixth International Conference on. 2013, pp. 192--195.Google ScholarGoogle ScholarCross RefCross Ref
  16. I. Stavrakantonakis et al. "Sparql2xquery 2.0: Supporting semantic-based queries over xml data". In: Semantic Media Adaptation and Personalization (SMAP), 2010 5th International Workshop on. IEEE. 2010, pp. 76--84.Google ScholarGoogle Scholar
  17. P. T. T. Thuy, Y.-K. Lee, and S. Lee. "DTD2OWL: Automatic Transforming XML Documents into OWL Ontology". In: Proceedings of the 2Nd International Conference on Interaction Sciences: Information Technology, Culture and Human. ICIS '09. Seoul, Korea: ACM, 2009, pp. 125--131. url: http://doi.acm.org/10.1145/1655925.1655949. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. P. T. T. Thuy, Y.-K. Lee, and S. Lee. "XSD2RDFS and XML2RDF Transformation: a Semantic Approach". In: The Second International Conference on Emerging Database (EDB 2010), Jeju, Korea. 2010.Google ScholarGoogle Scholar
  19. S. Vahdati et al. "Mapping Large Scale Research Metadata to Linked Data: A Performance Comparison of HBase, CSV and XML". In: Metadata and Semantics Research. CCIS. Springer, 2015. arXiv: 1506.04006 {cs.DB}.Google ScholarGoogle ScholarCross RefCross Ref
  20. D. Van Deursen et al. "XML to RDF Conversion: A Generic Approach". In: Automated solutions for Cross Media Content and Multi-channel Distribution, 2008. AXMEDIS '08. International Conference on. 2008, pp. 138--144. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. W3C. Extensible Markup Language (XML). World Wide Web. url: http://www.w3.org/XML/.Google ScholarGoogle Scholar
  22. B. Zhang and Z. Zhuang. "Efficient Structural XML Index for Multiple Queries". In: Recent Advances in Computer Science and Information Engineering. Springer, 2012, pp. 423--431.Google ScholarGoogle ScholarCross RefCross Ref
  23. H. Zhao, W. Xia, and J. Zhao. "The Research on XML Filtering Model using Lazy DFA". In: Journal of Software 7.8 (2012), pp. 1759--1766.Google ScholarGoogle ScholarCross RefCross Ref
  24. M. Zhou, H. Hu, and M. Zhou. "Searching XML data by SLCA on a MapReduce cluster". In: Universal Communication Symposium (IUCS), 2010 4th International. IEEE. 2010, pp. 84--89.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Streaming transformation of XML to RDF using XPath-based mappings

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Other conferences
            SEMANTICS '15: Proceedings of the 11th International Conference on Semantic Systems
            September 2015
            220 pages
            ISBN:9781450334624
            DOI:10.1145/2814864

            Copyright © 2015 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 16 September 2015

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Author Tags

            Qualifiers

            • research-article

            Acceptance Rates

            SEMANTICS '15 Paper Acceptance Rate22of97submissions,23%Overall Acceptance Rate40of182submissions,22%

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader