skip to main content
10.1145/584792.584801acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
Article

Searching web databases by structuring keyword-based queries

Authors Info & Claims
Published:04 November 2002Publication History

ABSTRACT

On-line information services have become widespread in the Web nowadays. However, Web users are non-specialized and have a great variety of interests. Thus, interfaces for Web databases must be simple and uniform. In this paper we present an approach, based on Bayesian networks, for querying Web databases using keywords only. According to this approach, the user inputs a query through a simple search-box interface. From the input query, one or more plausible structured queries are derived and submitted to Web databases. The results are then retrieved and presented to the user as ranked answers. Our approach reduces the complexity of existing on-line interfaces and offers a solution to the problem of querying several distinct Web databases with a single interface. The applicability of the proposed approach was demonstrated by experimental results with 3 databases, obtained with a prototype search system that implements it. We have found that from 77% to 95% of the time, one of the top three resulting structured queries is the proper one. Further, when the user selects one of these three top queries for processing, the ranked answers present average precision figures from 60% to about 100%.

References

  1. Agrawal, S., Chaudhuri, S., and Das, G. DBXplorer: A System For Keyword-Based Search Over Relational Data ases.In In 18th International Conference on Data Engineering (San Jose, California, 2002). Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Baeza-Yates, R., and Ribeiro-Neto, B. Modern Information Retrieval Addison Wesley, New York, NY, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Bruno, N., Gravano, L., and Marian, A. Evaluating top-k queries over web-accessible databases. In 20th International Conference on Data Engineering (San Jose, California, USA, 2002). Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Chaudhuri, S., and Gravano, L. Evaluating top-k selection queries. In Proceedings of 25th International Conference on Very Large Data Bases (Edinburgh, Scotland, UK, 1999), pp.397--410. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Cohen, W. W. Reasoning a out Textual Similarity in a Web-Based Information Access System.Autonomous Agents and Multi-Agent Systems 2 1 (1999), 65--86. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Dar, S., Entin, G., Geva, S., and Palmon, E. DTL's DataSpot: Data ase exploration using plain language. In Proceedings of 24th International Conference on Very Large Data Bases (New York, New York, USA, 1998), pp.645--649. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Florescu, D., Kossmann, D., and Manolescu, I. Integrating Keyword Search into XML Query Processing. WWW9 / Computer Networks 33 1-6 (2000), 119--135. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Goldman, R., Shivakumar, N., Venkatasubramanian, S., and Garcia-Molina, H. Proximity search in databases. In Proceedings of 24th International Conference on Very Large Data Bases (New York, New York, USA, 1998), pp.26--37. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Pearl, J. Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference Morgan Kaufmann Publishers, 1988. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Ribeiro-Neto, B., and Muntz, R. A belief network model for IR. In Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (Zurich, Switzerland, August 1996), pp.253--260. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Ribeiro-Neto, B., Silva, I., and Muntz, R. Soft Computing in Information Retrieval: Techniques and Applications 1st ed. Springer Verlag, 2000, ch.11. Bayesian Network Models for IR, pp.259--291.Google ScholarGoogle Scholar
  12. Salton, G., and McGill, M. J. Introduction to Modern Information Retrieval 1st ed. McGraw-Hill, 1983. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Silva, I., Ribeiro-Neto, B., Calado, P., Moura, E., and Ziviani, N. Link-ased and Content-Based Evidential Information in a Belief Network Model. In Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (Athens, Greece, July 2000), pp.96--103. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Turtle, H., and Croft, W. B. Evaluation of an Inference Network-Based Retrieval Model.ACM Transactions on Information Systems 9, 3 (July 1991), 187--222. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Searching web databases by structuring keyword-based queries

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          CIKM '02: Proceedings of the eleventh international conference on Information and knowledge management
          November 2002
          704 pages
          ISBN:1581134924
          DOI:10.1145/584792

          Copyright © 2002 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 4 November 2002

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • Article

          Acceptance Rates

          Overall Acceptance Rate1,861of8,427submissions,22%

          Upcoming Conference

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader