skip to main content
10.1145/1247480.1247562acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
Article

Efficient xml data dissemination with piggybacking

Published: 11 June 2007 Publication History

Abstract

Content-based dissemination of XML data using the publish-subscribe paradigm is an effective means to deliver relevant data to interested data consumers. To meet the performance challenges of content-based filtering and routing, two key optimizations have been developed: the use of efficient indexes to speed up subscription filtering, and the use of effective aggregation algorithms to reduce the number of subscriptions. The effectiveness of both these techniques are, however, limited to locally improving the performance of individual routers. In this paper, we propose a novel and holistic optimization approach that allows a downstream router to leverage the subscription matchings done by upstream routers to reduce its own filtering work. This is achieved by piggybacking useful annotations to the XML document being forwarded. We explore several design options and tradeoffs of this novel optimization approach. Our experimental results demonstrate that our piggyback optimization achieves significant performance improvement under various conditions.

References

[1]
A. L. Diaz, D. Lovell (1999) XML Generator. http://www.alphaworks.ibm.com/tech/xmlgenerator.
[2]
DBLP. http://www.acm.org/sigmod/dblp/db/about/dblp.dtd.
[3]
NS2. http://www.isi.edu/nsnam/ns/.
[4]
Protein. http://pir.georgetown.edu.
[5]
R. Cover (1999) The SGML/XML web page. http://www.oasis.open.org/cover/sgml-ml.html.
[6]
Treebank. http://www.cis.upenn.edu/~treebank/.
[7]
M. K. Aguilera, R. E. Strom, D. C. Sturman, M. Astley, and T. D. Chandra. Matching events in a content-based subscription system. In PODC, 1999.
[8]
M. Altinel and M. Franklin. Efficient filtering of XML documents for selective dissemination of information. In VLDB, 2000.
[9]
N. Bruno, L. Gravano, N. Koudas, and D. Srivastava. Navigation- vs. index-based XML multi-query processing. In ICDE, 2003.
[10]
A. Carzaniga, D. Rosenblum, and A. Wolf. Design and evaluation of a wide-area event notification service. ACM TOCS, 19(3), 2001.
[11]
C. Y. Chan, W. Fan, P. Felber, M. Garofalakis, and R. Rastogi. Tree pattern aggregation for scalable XML data dissemination. In VLDB, 2002.
[12]
C. Y. Chan, P. Felber, M. Garofalakis, and R. Rastogi. Efficient filtering of XML documents with XPath expressions. VLDB Journal, 11(4), 2002.
[13]
R. Chand and P. A. Felber. A scalable protocol for content-based routing in overlay networks. In NCA, 2003.
[14]
Y. Diao, M. Altinel, M. Franklin, H. Zhang, and P. Fischer. Path sharing and predicate evaluation for high-performance XML filtering. ACM TODS, 28(4), 2003.
[15]
P. M. Fischer and D. Kossmann. Batched processing for information filters. In ICDE, 2005.
[16]
X. Gong, W. Qian, Y. Yan, and A. Zhou. Bloom filter-based XML packets filtering for millions of path queries. In ICDE, 2005.
[17]
T. J. Green, G. Miklau, M. Onizuka, and D. Suciu. Processing XML streams with deterministic automata. In ICDT, 2003.
[18]
A. Gupta, A. Halevy, and D. Suciu. View selection for XML stream processing. In WebDB, 2002.
[19]
A. Gupta and D. Suciu. Stream processing of XPath queries with predicates. In SIGMOD, 2003.
[20]
A. Gupta, D. Suciu, and A. Halevy. The view selection problem for XML content based routing. In PODS, 2003.
[21]
S. Hou and H. A. Jacobsen. Predicate-based filtering of XPath expressions. In ICDE, 2006.
[22]
J. Kwon, P. Rao, B. Moon, and S. Lee. Fist: Scalable XML document filtering by sequencing twig patterns. In VLDB, 2005.
[23]
D. S. Rosenblum and A. L. Wolf. A design framework for internet-scale event observation and notification. In ESEC/FSE-5, 1997.
[24]
B. Segall, D. Arnold, J. Boot, M. Henderson, and T. Phelps. Content based routing with Elvin4. In AUUG2K, 2000.
[25]
W3C. XML path language (XPath). 1999. http://www.w3.org/TR/xpath.
[26]
X. Zhang, L. H. Yang, M. L. Lee, and W. Hsu. Scaling SDI systems via query clustring and aggregation. In DASFAA, 2004.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGMOD '07: Proceedings of the 2007 ACM SIGMOD international conference on Management of data
June 2007
1210 pages
ISBN:9781595936868
DOI:10.1145/1247480
  • General Chairs:
  • Lizhu Zhou,
  • Tok Wang Ling,
  • Program Chair:
  • Beng Chin Ooi
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 11 June 2007

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. XML
  2. XPath
  3. annotation
  4. data dissemination
  5. piggybacking
  6. pub/sub system

Qualifiers

  • Article

Conference

SIGMOD/PODS07
Sponsor:

Acceptance Rates

Overall Acceptance Rate 785 of 4,003 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)0
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2012)FoXtrotACM Transactions on the Web10.1145/2344416.23444196:3(1-34)Online publication date: 2-Oct-2012
  • (2011)Scalable XML Filtering for Content SubscriptionsTheoretical and Practical Advances in Information Systems Development10.4018/978-1-60960-521-6.ch007(120-152)Online publication date: 2011
  • (2011)Fine-grained parallel XML filtering for content-based publish/subscribe systemsProceedings of the 5th ACM international conference on Distributed event-based system10.1145/2002259.2002288(219-228)Online publication date: 11-Jul-2011
  • (2011)KEMBIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2010.15923:7(1035-1049)Online publication date: 1-Jul-2011
  • (2010)Distributed structural and value XML filteringProceedings of the Fourth ACM International Conference on Distributed Event-Based Systems10.1145/1827418.1827422(2-13)Online publication date: 12-Jul-2010
  • (2009)Fast XML document filtering by sequencing twig patternsACM Transactions on Internet Technology10.1145/1592446.15924479:4(1-51)Online publication date: 14-Oct-2009
  • (2009)A Subscription Coverage Technique for XML Message DisseminationProceedings of the 2009 Ninth Annual International Symposium on Applications and the Internet10.1109/SAINT.2009.31(137-140)Online publication date: 20-Jul-2009
  • (2008)Xml data dissemination using automata on top of structured overlay networksProceedings of the 17th international conference on World Wide Web10.1145/1367497.1367614(865-874)Online publication date: 21-Apr-2008
  • (2008)Value-based predicate filtering of XML documentsData & Knowledge Engineering10.1016/j.datak.2008.06.01267:1(51-73)Online publication date: 1-Oct-2008
  • (2007)Early profile pruning on XML-aware publish-subscribe systemsProceedings of the 33rd international conference on Very large data bases10.5555/1325851.1325949(866-877)Online publication date: 23-Sep-2007

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media