DOI: 10.1145/1242572.1242615

Consistency-preserving caching of dynamic database content

Published: 08 May 2007

Abstract

With the growing use of dynamic web content generated from relational databases, traditional caching solutions for improving throughput and latency are ineffective. We describe a middleware layer called Ganesh that reduces the volume of data transmitted without semantic interpretation of queries or results. It achieves this reduction by using cryptographic hashing to detect similarities with previous results. These benefits do not require any compromise of the strict consistency semantics provided by the back-end database. Further, Ganesh does not require modifications to applications, web servers, or database servers, and it works with closed-source applications and databases. Using two benchmarks representative of dynamic web sites, measurements of our prototype show that it can increase end-to-end throughput by as much as twofold for non-data-intensive applications and by as much as tenfold for data-intensive ones.
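The hashing idea in the abstract can be illustrated with a minimal sketch. This is not the Ganesh implementation: the abstract does not say at what granularity results are hashed, which hash function is used, or how the two ends coordinate. The sketch assumes row-level hashing, SHA-256, and two hypothetical classes, `Sender` (next to the database) and `Receiver` (next to the web server). The names `digest` and `wire_size` are likewise illustrative.

```python
import hashlib


def digest(row: bytes) -> bytes:
    """Identify a row by its cryptographic hash (SHA-256 is an assumption)."""
    return hashlib.sha256(row).digest()


class Sender:
    """Hypothetical database-side proxy: swaps already-sent rows for hashes."""

    def __init__(self) -> None:
        self.sent = set()  # hashes of rows already shipped to the receiver

    def encode(self, rows):
        message = []
        for row in rows:
            h = digest(row)
            if h in self.sent:
                message.append(("hash", h))    # seen before: send only the hash
            else:
                self.sent.add(h)
                message.append(("data", row))  # first sighting: send the row
        return message


class Receiver:
    """Hypothetical web-server-side proxy: rebuilds the exact result."""

    def __init__(self) -> None:
        self.cache = {}  # hash -> row bytes from earlier results

    def decode(self, message):
        rows = []
        for kind, payload in message:
            if kind == "data":
                self.cache[digest(payload)] = payload
                rows.append(payload)
            else:
                rows.append(self.cache[payload])  # look the row up by hash
        return rows


def wire_size(message) -> int:
    """Payload bytes that would cross the network for one encoded result."""
    return sum(len(payload) for _, payload in message)


sender, receiver = Sender(), Receiver()
first = [b"1|widget|" + b"x" * 200, b"2|gadget|" + b"y" * 200]
second = [first[0], b"3|gizmo|" + b"z" * 200]  # overlaps with the first result

m1 = sender.encode(first)
m2 = sender.encode(second)
assert receiver.decode(m1) == first
assert receiver.decode(m2) == second
print(wire_size(m1), wire_size(m2))
```

In this sketch every query still runs at the database and the receiver reconstructs the result byte for byte, so only the transfer is shortened. Neither side parses the query or the rows, which matches the abstract's claim of working without semantic interpretation.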


Published In

WWW '07: Proceedings of the 16th international conference on World Wide Web
May 2007
1382 pages
ISBN: 9781595936547
DOI: 10.1145/1242572

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. bandwidth optimization
  2. content addressable storage
  3. database caching
  4. relational database systems
  5. wide area networks

Qualifiers

  • Article

Conference

WWW'07: 16th International World Wide Web Conference
May 8 - 12, 2007
Banff, Alberta, Canada

Acceptance Rates

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Cited By

  • (2013) Performance-Enhanced Caching Scheme for Web Clusters for Dynamic Content. Web-Based Multimedia Advancements in Data Communications and Networking Technologies, pp. 185-206. DOI: 10.4018/978-1-4666-2026-1.ch010. Online publication date: 2013
  • (2013) Impact of Reflexive Ontologies in Semantic Clinical Decision Support Systems. Cybernetics and Systems 44:2-3, pp. 187-203. DOI: 10.1080/01969722.2013.762256. Online publication date: 1-Mar-2013
  • (2011) Performance-Enhanced Caching Scheme for Web Clusters for Dynamic Content. International Journal of Business Data Communications and Networking 7:3, pp. 16-36. DOI: 10.4018/jbdcn.2011070102. Online publication date: 1-Jul-2011
  • (2011) Distributed database caching for web applications and web services. Proceedings of the International Conference & Workshop on Emerging Trends in Technology, pp. 510-515. DOI: 10.1145/1980022.1980132. Online publication date: 25-Feb-2011
  • (2011) The SHARC framework for data quality in Web archiving. The VLDB Journal — The International Journal on Very Large Data Bases 20:2, pp. 183-207. DOI: 10.1007/s00778-011-0219-9. Online publication date: 1-Apr-2011
  • (2010) Consistent caching of data objects in database driven websites. Proceedings of the 14th east European conference on Advances in databases and information systems, pp. 363-377. DOI: 10.5555/1885872.1885902. Online publication date: 20-Sep-2010
  • (2010) Sync kit. Proceedings of the 19th international conference on World wide web, pp. 121-130. DOI: 10.1145/1772690.1772704. Online publication date: 26-Apr-2010
  • (2010) Model of Efficient Delivery of Dynamic Web Pages Based on Shared Fragments. 2010 International Conference on Computational Intelligence and Software Engineering, pp. 1-4. DOI: 10.1109/WICOM.2010.5601246. Online publication date: Sep-2010
  • (2010) Tree-Adapting: An Adaptive Data Aggregation Method for Wireless Sensor Networks. 2010 International Conference on Computational Intelligence and Software Engineering, pp. 1-5. DOI: 10.1109/WICOM.2010.5601092. Online publication date: Sep-2010
  • (2010) A Subspace Clustering Algorithm. 2010 International Conference on Computational Intelligence and Software Engineering, pp. 1-4. DOI: 10.1109/WICOM.2010.5600143. Online publication date: Sep-2010
