ACM Home Page
Please provide us with feedback. Feedback
Effective keyword search in relational databases
Full text PdfPdf (336 KB)
Source International Conference on Management of Data archive
Proceedings of the 2006 ACM SIGMOD international conference on Management of data table of contents
Chicago, IL, USA
SESSION: Potpourri table of contents
Pages: 563 - 574  
Year of Publication: 2006
ISBN:1-59593-434-0
Authors
Fang Liu  University of Illinois at Chicago
Clement Yu  University of Illinois at Chicago
Weiyi Meng  SUNY at Binghamton
Abdur Chowdhury  America Online, Inc.
Sponsors
ACM: Association for Computing Machinery
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1142473.1142536
What is a DOI?

ABSTRACT

With the amount of available text data in relational databases growing rapidly, the need for ordinary users to search such information is dramatically increasing. Even though the major RDBMSs have provided full-text search capabilities, they still require users to have knowledge of the database schemas and use a structured query language to search information. This search model is complicated for most ordinary users. Inspired by the big success of information retrieval (IR) style keyword search on the web, keyword search in relational databases has recently emerged as a new research topic. The differences between text databases and relational databases result in three new challenges: (1) Answers needed by users are not limited to individual tuples, but results assembled from joining tuples from multiple tables are used to form answers in the form of tuple trees. (2) A single score for each answer (i.e. a tuple tree) is needed to estimate its relevance to a given query. These scores are used to rank the most relevant answers as high as possible. (3) Relational databases have much richer structures than text databases. Existing IR strategies to rank relational outputs are not adequate. In this paper, we propose a novel IR ranking strategy for effective keyword search. We are the first that conducts comprehensive experiments on search effectiveness using a real world database and a set of keyword queries collected by a major search company. Experimental results show that our strategy is significantly better than existing strategies. Our approach can be used both at the application level and be incorporated into a RDBMS to support keyword-based search in relational databases.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
S Agrawal, S Chaudhuri, G Das: DBXplorer: A system for keyword-based search over relational databases. ICDE 2002
 
2
 
3
A. Balmin, V. Hristidis,Y. Papakonstantinou: Authority Based Keyword Queries in Databases using ObjectRank. VLDB 2004
 
4
 
5
S. Cohen, Jonathan Mamou, Yaron Kanza, Yehoshua Sagiv: XSEarch: A Semantic Search Engine for XML. VLDB 2003
 
6
 
7
8
 
9
 
10
V. Hristidis and Y. Papakonstantinou. DISCOVER: Keyword search in relational databases. VLDB 2002.
 
11
V. Hristidis,L. Gravano,Y. Papakonstantinou:Efficient IR-Style Keyword Search over Relational Databases.VLDB 2003
12
 
13
Y. Li, Cong Yu, H. V. Jagadish: Schema-Free XQuery. VLDB 2004
14
 
15
L. Page, S. Brin, R. Motwani and T. Winograd. The PageRank Citation Ranking: Bringing Order to the Web, Technical Report, 1998
 
16
R. Sacks-Davis, Tuong Dao, James A. Thom, Justin Zobel Indexing documents for queries on structure, content and attributes. ISDM 1997
17
 
18
A. Singhal. Modern information retrieval: A brief overview. IEEE Data Eng. Bull. 24(4), 2001
 
19
E. M. Voorhees. Overview of the TREC-9 Question Answering Track. TREC 2000
 
20
Pew Internet & American Life Project Report: Search Engine Users, 2005. www.pewinternet.org/pdfs/PIP_Searchengine_users.pd
 
21
Google. www.google.com/ 2005
 
22
DB2 Text Information Extender. 2005 http://www.ibm.com/software/data/db2/extenders/textinformation/index.html
 
23
Micorsoft SQL Server 2000. www.microsoft.com/sql/ 2005
 
24
MySQL. dev.mysql.com/doc/mysql/en/Fulltext_Search.html.
25
 
26
 
27


Collaborative Colleagues:
Fang Liu: colleagues
Clement Yu: colleagues
Weiyi Meng: colleagues
Abdur Chowdhury: colleagues