Abstract
The CLIDE System assists the owners of sources that participate in Web service-based data publishing systems to publish a restricted set of parameterized queries over the schema of their sources and package them as WSDL services. The sources may be relational databases, which naturally have a schema, or ad hoc information/application systems whereas the owner publishes a virtual schema. CLIDE allows information clients to pose queries over the published schema and utilizes prior work on answering queries using views to answer queries that can be processed by combining and processing the results of one or more Web service calls. These queries are called feasible. Contrary to prior work, where infeasible queries are rejected without an explanatory feedback, leading the user into a frustrating trial-and-error cycle, CLIDE features a query formulation interface, which extends the QBE-like query builder of Microsoft's SQL Server with a color scheme that guides the user toward formulating feasible queries. CLIDE guarantees that the suggested query edit actions are complete (i.e., each feasible query can be built by following only suggestions), rapidly convergent (the suggestions are tuned to lead to the closest feasible completions of the query), and suitably summarized (at each interaction step, only a minimal number of actions needed to preserve completeness are suggested). We present the algorithms, implementation, and performance evaluation showing that CLIDE is a viable on-line tool.
- Abiteboul, S., Hull, R., and Vianu, V. 1995. Foundations of Databases. Addison-Wesley, Reading, MA. Google ScholarDigital Library
- Biron, P. V. and Malhotra, A. 2004. XML Schema part 2: Datatypes second edition. W3C Recommendation 28 October 2004. Go online to http://www.w3.org/TR/xmlschema-2/.Google Scholar
- Boag, S., Chamberlin, D., Fernandez, M. F., Florescu, D., Robie, J., and Siméon, J. 2007. XQuery 1.0: An XML query language. W3C Recommendation 23 January 2007. Go online to http://www.w3.org/TR/xquery/.Google Scholar
- Borkar, V. R., Carey, M. J., Lychagin, D., Westmann, T., Engovatov, D., and Onose, N. 2006. Query processing in the aqualogic data services platform. In Proceedings of VLDB. 1037--1048. Google ScholarDigital Library
- Braga, D., Campi, A., and Ceri, S. 2005. QBE (query y xample): A visual interface to the standard xml query language. ACM Trans. Database Syst. 30, 2, 398--443. Google ScholarDigital Library
- Carey, M. J. 2006. Data delivery in a service-oriented world: The Bea aqualogic data services platform. In Proceedings of the SIGMOD Conference. 695--705. Google ScholarDigital Library
- Carey, M. J., Haas, L. M., Maganty, V., and Williams, J. H. 1996. PESTO: An integrated query/browser for object databases. In Proceedings of VLDB. 203--214. Google ScholarDigital Library
- Chandra, A. K. and Merlin, P. M. 1977. Optimal implementation of conjunctive queries in relational data bases. In Proceedings of STOC. 77--90. Google ScholarDigital Library
- Christensen, E., Curbera, F., Meredith, G., and Weerawarana, S. 2001. Web Services Description Language (WSDL) 1.1. W3C Note 15 March 2001. Go online to http://www.w3.org/TR/wsdl.Google Scholar
- Dong, X., Halevy, A. Y., Madhavan, J., Nemes, E., and Zhang, J. 2004. Simlarity search for Web services. In Proceedings of VLDB. 372--383. Google ScholarDigital Library
- Duschka, O. M., Genesereth, M. R., and Levy, A. Y. 2000. Recursive query plans for data integration. J. Log. Program. 43, 1, 49--73.Google ScholarCross Ref
- Fallside, D. C. and Walmsley, P. 2004. XML schema part 0: Primer second edition. W3C Recommendation 28 October 2004. Go online to http://www.w3.org/TR/xmlschema-0/.Google Scholar
- Fan, W., Chan, C. Y., and Garofalakis, M. N. 2004. Secure XML querying with security views. In Proceedings of the SIGMOD Conference. 587--598. Google ScholarDigital Library
- Garcia-Molina, H., Ullman, J. D., and Widom, J. D. 2001. Database Systems: The Complete Book. Prentice Hall, Englewood Cliffs, NJ. Google ScholarDigital Library
- Halevy, A. 2001. Answering queries using views: A survey. VLDB J. 10, 4, 270--294. Google ScholarDigital Library
- Jordan, D. and Evdemon, J. 2006. Web Services Business Process Execution Language Version 2.0. OASIS Public Review Draft, 23th August, 2006. Go online to http://docs.oasis-open.org/wsbpel/2.0/wsbpel-specification-draft.html.Google Scholar
- LeFevre, K., Agrawal, R., Ercegovac, V., Ramakrishnan, R., Xu, Y., and DeWitt, D. J. 2004. Limiting disclosure in hippocratic databases. In Proceedings of VLDB. 108--119. Google ScholarDigital Library
- Levy, A. Y., Rajaraman, A., and Ullman, J. D. 1996. Answering queries using limited external processors. In Proceedings of PODS. 227--237. Google ScholarDigital Library
- Li, C. and Chang, E. Y. 2001. Answering queries with useful bindings. ACM Trans. Database Syst. 26, 3, 313--343. Google ScholarDigital Library
- Melton, J. 2003a. Database languages---SQL---part 14: XML-related specifications (SQL/XML). In Proceedings of ISO/IEC 9075-14:2003.Google Scholar
- Melton, J. 2003b. Database languages---SQL---part 2: Foundation (SQL/foundation). In Proceedings of ISO/IEC 9075-2:2003.Google Scholar
- Microsoft, Inc. Native XML Web services for Microsoft SQL server. 2005. Go online to http://msdn2.microsoft.com/en-us/library/ms345123.aspx.Google Scholar
- Microsoft, Inc. 2004. Microsoft BizTalk Server. Go online to http://www.microsoft.com/biztalk/.Google Scholar
- Nash, A. and Ludaescher, B. 2004. Processing unions of conjunctive queries with negation under limited access patterns. In Proceedings of EDBT.Google Scholar
- Nielsen, J. 2000. Designing Web Usability. New Riders Publishing, Berkeley, CA. Google ScholarDigital Library
- Popa, L., Velegrakis, Y., Miller, R. J., Hernández, M. A., and Fagin, R. 2002. Translating Web data. In Proceedings of VLDB. 598--609. Google ScholarDigital Library
- Pottinger, R. and Halevy, A. Y. 2001. Minicon: A scalable algorithm for answering queries using views. VLDB J. 10, 2-3, 182--198. Google ScholarDigital Library
- Rahm, E. and Bernstein, P. A. 2001. A survey of approaches to automatic schema matching. VLDB J. 10, 4, 334--350. Google ScholarDigital Library
- Rajaraman, A., Sagiv, Y., and Ullman, J. D. 1995. Answering queries using templates with binding patterns. In Proceedings of PODS. 105--112. Google ScholarDigital Library
- Rizvi, S., Mendelzon, A. O., Sudarshan, S., and Roy, P. 2004. Extending query rewriting techniques for fine-grained access control. In Proceedings of SIGMOD Conference. 551--562. Google ScholarDigital Library
- Roth, M. T. and Schwarz, P. M. 1997. Don't scrap it, wrap it! a wrapper architecture for legacy data sources. In Proceedings of VLDB. 266--275. Google ScholarDigital Library
- Shipman, D. W. 1981. The functional data model and the data language daplex. ACM Trans. Database Syst. 6, 1, 140--173. Google ScholarDigital Library
- Tufte, E. R. 1997. Visual Explanations: Images and Quantities, Evidence and Narrative. Graphics Press, Cheshire, CT. Google ScholarDigital Library
- Vassalos, V. and Papakonstantinou, Y. 1997. Describing and using query capabilities of heterogeneous sources. In Proceedings of VLDB. Google ScholarDigital Library
- Yannakakis, M. 1981. Algorithms for acyclic database schemes. In Proceedings of VLDB. 82--94. Google ScholarDigital Library
- Yerneni, R., Li, C., Garcia-Molina, H., and Ullman, J. D. 1999. Computing capabilities of mediators. In Proceedings of SIGMOD Conference. 443--454. Google ScholarDigital Library
- Zloof, M. 1975. Query by example. AFIPS NCC 44, 431--438.Google Scholar
Index Terms
- Exporting and interactively querying Web service-accessed sources: The CLIDE System
Recommendations
Interactive query formulation over web service-accessed sources
SIGMOD '06: Proceedings of the 2006 ACM SIGMOD international conference on Management of dataIntegration systems typically support only a restricted set of queries over the schema they export. The reason is that the participating information sources contribute limited content and limited access methods. In prior work, these limited access ...
View-based query processing: On the relationship between rewriting, answering and losslessness
As a result of the extensive research in view-based query processing, three notions have been identified as fundamental, namely rewriting, answering, and losslessness. Answering amounts to computing the tuples satisfying the query in all databases ...
Querying XML data sources that export very large sets of views
We study the problem of querying XML data sources that accept only a limited set of queries, such as sources accessible by Web services which can implement very large (potentially infinite) families of XPath queries. To compactly specify such families ...
Comments