ABSTRACT
The Extensible Markup Language (XML) has become a widely adopted data interchange format. With the rise of Linked Data published using the Resource Description Framework (RDF), a number of tools for transforming XML to RDF have been developed. Specifying XML→RDF mappings for these tools often requires skills in programming languages such as XSLT or XQuery. Moreover, these tools are rarely able to deal with large XML inputs. We introduce an XML to RDF transformation approach, which is based on mappings comprising RDF triple templates that employ simple XPath expressions. Thanks to the restricted XPath expressions, which can be evaluated against a stream of XML data, our implementation can handle extremely large input XML files. To process the XML input efficiently, we employ XML filtering techniques and a strategy for selecting relevant XML nodes to generate RDF triples from. We show that the time complexity of our mapping algorithm is linear in the size of the XML input and also prove its practical efficiency with an evaluation on large real-world data.
- W. Akhtar et al. XSPARQL: Traveling between the XML and RDF worlds--and avoiding the XSLT pilgrimage. Springer, 2008.Google ScholarCross Ref
- N. Bikakis et al. "The XML and Semantic Web Worlds: Technologies, Interoperability and Integration: A Survey of the State of the Art". English. In: Semantic Hyper/Multimedia Adaptation. Ed. by I. E. Anagnostopoulos et al. Vol. 418. Studies in Computational Intelligence. Springer Berlin Heidelberg, 2013, pp. 319--360. url: http://dx.doi.org/10.1007/978-3-642-28977-4_12.Google Scholar
- S. Bischof et al. "Mapping between RDF and XML with XSPARQL". English. In: Journal on Data Semantics 1.3 (2012), pp. 147--185. url: http://dx.doi.org/10.1007/s13740-012-0008-7.Google ScholarCross Ref
- H. Bohring and S. Auer. "Mapping XML to OWL Ontologies". In: Leipziger Informatik-Tage 72 (2005), pp. 147--156.Google Scholar
- F Breitling. "A standard transformation from XML to RDF via XSLT". In: Astronomische Nachrichten 330.7 (2009), pp. 755--760.Google ScholarCross Ref
- D. Connolly. Gleaning Resource Descriptions from Dialects of Languages (GRDDL). W3C, 2007. url: http://www.w3.org/TR/grddl/.Google Scholar
- Y. Diao et al. "Yfilter: Efficient and scalable filtering of XML documents". In: Data Engineering, 2002. Proceedings. 18th International Conference on. IEEE. 2002, pp. 341--342. Google ScholarDigital Library
- A. Dimou et al. "Mapping Hierarchical Sources into RDF Using the RML Mapping Language". In: Semantic Computing (ICSC), 2014 IEEE International Conference on. 2014, pp. 151--158. Google ScholarDigital Library
- S. Z. H. Gillani, M. I. Ali, and A. Mileo. "XSPARQL-Viz: A Mashup-Based Visual Query Editor for XSPARQL". In: The Semantic Web: ESWC 2013 Satellite Events. Springer, 2013, pp. 219--224.Google Scholar
- H. L. Joel Farrell. Semantic Annotations for WSDL and XML Schema. W3C, 2007.Google Scholar
- M. Klein. "Interpreting XML documents via an RDF schema ontology". In: Database and Expert Systems Applications, 2002. Proceedings. 13th International Workshop on. 2002, pp. 889--893. Google ScholarDigital Library
- C. Lange. "Krextor--an extensible XML to RDF extraction framework". In: Scripting and Development for the Semantic Web (SFSW) (2009).Google Scholar
- T. Rodrigues, P. Rosa, and J. Cardoso. "Moving from syntactic to semantic organizations using JXML2OWL". In: Computers in Industry 59.8 (2008), pp. 808--819. url: http://www.sciencedirect.com/science/article/pii/S016636150800064X. Google ScholarDigital Library
- S. S. Sahoo et al. "A survey of current approaches for mapping of relational databases to RDF". In: W3C RDB2RDF Incubator Group Report (2009).Google Scholar
- P. Saxena and R. Kamal. "System architecture and effect of depth of query on XML document filtering using PFilter". In: Contemporary Computing (IC3), 2013 Sixth International Conference on. 2013, pp. 192--195.Google ScholarCross Ref
- I. Stavrakantonakis et al. "Sparql2xquery 2.0: Supporting semantic-based queries over xml data". In: Semantic Media Adaptation and Personalization (SMAP), 2010 5th International Workshop on. IEEE. 2010, pp. 76--84.Google Scholar
- P. T. T. Thuy, Y.-K. Lee, and S. Lee. "DTD2OWL: Automatic Transforming XML Documents into OWL Ontology". In: Proceedings of the 2Nd International Conference on Interaction Sciences: Information Technology, Culture and Human. ICIS '09. Seoul, Korea: ACM, 2009, pp. 125--131. url: http://doi.acm.org/10.1145/1655925.1655949. Google ScholarDigital Library
- P. T. T. Thuy, Y.-K. Lee, and S. Lee. "XSD2RDFS and XML2RDF Transformation: a Semantic Approach". In: The Second International Conference on Emerging Database (EDB 2010), Jeju, Korea. 2010.Google Scholar
- S. Vahdati et al. "Mapping Large Scale Research Metadata to Linked Data: A Performance Comparison of HBase, CSV and XML". In: Metadata and Semantics Research. CCIS. Springer, 2015. arXiv: 1506.04006 {cs.DB}.Google ScholarCross Ref
- D. Van Deursen et al. "XML to RDF Conversion: A Generic Approach". In: Automated solutions for Cross Media Content and Multi-channel Distribution, 2008. AXMEDIS '08. International Conference on. 2008, pp. 138--144. Google ScholarDigital Library
- W3C. Extensible Markup Language (XML). World Wide Web. url: http://www.w3.org/XML/.Google Scholar
- B. Zhang and Z. Zhuang. "Efficient Structural XML Index for Multiple Queries". In: Recent Advances in Computer Science and Information Engineering. Springer, 2012, pp. 423--431.Google ScholarCross Ref
- H. Zhao, W. Xia, and J. Zhao. "The Research on XML Filtering Model using Lazy DFA". In: Journal of Software 7.8 (2012), pp. 1759--1766.Google ScholarCross Ref
- M. Zhou, H. Hu, and M. Zhou. "Searching XML data by SLCA on a MapReduce cluster". In: Universal Communication Symposium (IUCS), 2010 4th International. IEEE. 2010, pp. 84--89.Google ScholarCross Ref
Index Terms
- Streaming transformation of XML to RDF using XPath-based mappings
Recommendations
Using OWL for querying an XML/RDF syntax
WWW '05: Special interest tracks and posters of the 14th international conference on World Wide WebSome recent initiatives try to take profit from RDF to make XML documents interoperate at the semantic level. Ontologies are used to establish semantic connections among XML languages, and some mechanisms have been defined to query them with natural XML ...
XML to RDF Conversion: A Generic Approach
AXMEDIS '08: Proceedings of the 2008 International Conference on Automated solutions for Cross Media Content and Multi-channel DistributionThe role of metadata is gaining importance due to today's growth of multimedia content. Currently, XML is the standard for data interchange. However, as XML Schemas do not express semantics but rather the document structure, there is a lack of semantic ...
Beyond XML and RDF: the versatile web query language xcerpt
WWW '06: Proceedings of the 15th international conference on World Wide WebApplications and services that access Web data are becoming increasingly more useful and wide-spread. Current main-stream Web query languages such as XQuery, XSLT, or SPARQL, however, focus only on one of the different data formats available on the Web. ...
Comments