DOI: 10.1145/1244002.1244122

Building automatic mapping between XML documents using approximate tree matching

Published: 11 March 2007

Abstract

The eXtensible Markup Language (XML) is becoming the standard format for data exchange on the Internet, providing interoperability among Web applications. Because XML documents are ubiquitous on the Web, it is important to provide efficient algorithms and tools to manipulate them.
In this paper, we present a novel system for automating the transformation of XML documents based on structural mapping, with the restriction that the leaf text information is exactly the same in the source and target documents.
First, a tree edit distance algorithm is used to find the mapping between a pair of source and target documents. The introduction of tree partition significantly improves the efficiency of the tree matching algorithm. Second, template rules for transformation are inferred from the mapping using generalization. Third, a template matching component is used to process new documents.
Experimental studies have shown that our methods are very promising and can be widely used for Web document cleaning, information filtering, and other applications.
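
The first step of the abstract, comparing a source and a target document by tree edit distance, can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes unit costs for insert, delete, and relabel, uses a plain memoized forest recursion, and omits the tree partition optimization entirely. The helper names (`from_xml`, `forest_distance`, `tree_edit_distance`) and the sample documents are hypothetical. Leaf text is ignored, in line with the stated restriction that it is identical in both documents.

```python
import xml.etree.ElementTree as ET
from functools import lru_cache


def from_xml(text):
    """Parse XML into nested (tag, children) tuples; leaf text is ignored."""
    def convert(elem):
        return (elem.tag, tuple(convert(child) for child in elem))
    return convert(ET.fromstring(text))


def size(forest):
    """Count the nodes in a forest (a tuple of trees)."""
    return sum(1 + size(children) for _, children in forest)


@lru_cache(maxsize=None)
def forest_distance(f1, f2):
    """Ordered edit distance between two forests, assuming unit costs."""
    if not f1:
        return size(f2)          # insert every remaining target node
    if not f2:
        return size(f1)          # delete every remaining source node
    (label1, kids1), (label2, kids2) = f1[-1], f2[-1]
    # Delete the root of the rightmost source tree; its children move up.
    delete = forest_distance(f1[:-1] + kids1, f2) + 1
    # Insert the root of the rightmost target tree.
    insert = forest_distance(f1, f2[:-1] + kids2) + 1
    # Map the two rightmost roots onto each other (relabel if tags differ).
    match = (forest_distance(kids1, kids2)
             + forest_distance(f1[:-1], f2[:-1])
             + (label1 != label2))
    return min(delete, insert, match)


def tree_edit_distance(t1, t2):
    """Edit distance between two trees, each a (tag, children) tuple."""
    return forest_distance((t1,), (t2,))


# Hypothetical source and target documents with identical leaf text.
source = from_xml(
    "<person><name>Ann</name><contact><email>a@x.org</email></contact></person>")
target = from_xml(
    "<person><name>Ann</name><email>a@x.org</email></person>")
print(tree_edit_distance(source, target))  # prints 1: delete <contact>
```

Backtracking through the same recursion would recover which source nodes map to which target nodes, and that mapping is the kind of input from which the template rules mentioned in the abstract are inferred.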

Published In

SAC '07: Proceedings of the 2007 ACM symposium on Applied computing
March 2007
1688 pages
ISBN: 1595934804
DOI: 10.1145/1244002
Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

  • Article

Conference

SAC07
Acceptance Rates

Overall Acceptance Rate 1,650 of 6,669 submissions, 25%

Cited By

  • (2016) A Research on Improved Table-Based Model-Driven Mapping Strategies. Software Engineering and Applications 05:02, pp. 154-163. DOI: 10.12677/SEA.2016.52017. Online publication date: 2016
  • (2011) Mapping Audiovisual Metadata Formats Using Formal Semantics. Semantic Multimedia, pp. 80-94. DOI: 10.1007/978-3-642-23017-2_6. Online publication date: 2011
  • (2010) Mapping audiovisual metadata formats using formal semantics. Proceedings of the 5th international conference on Semantic and digital media technologies, pp. 80-94. DOI: 10.5555/2032129.2032136. Online publication date: 1-Dec-2010
  • (2010) Standardized interoperable image retrieval. Proceedings of the 2010 ACM Symposium on Applied Computing, pp. 880-886. DOI: 10.1145/1774088.1774272. Online publication date: 22-Mar-2010
