ABSTRACT
Efficiently answering XML keyword queries has attracted much research effort in the last decade. One key factors resulting in the inefficiency of existing methods are the common-ancestor-repetition (CAR) and visiting-useless-nodes (VUN) problems. In this paper, we propose a generic top-down processing strategy to answer a given keyword query w.r.t. LCA/SLCA/ELCA semantics. By top-down, we mean that we visit all common ancestor (CA) nodes in a depth-first, left-to-right order, thus avoid the CAR problem; by generic, we mean that our method is independent of the labeling schemes and query semantics. We show that the satisfiability of a node v w.r.t. the given semantics can be determined by v's child nodes, based on which our methods avoid the VUN problem. We propose two algorithms that are based on either traditional inverted lists or our newly proposed LLists to improve the overall performance. The experimental results verify the benefits of our methods according to various evaluation metrics.
- L. J. Chen and Y. Papakonstantinou. Supporting top-k keyword search in xml databases. In ICDE, pages 689--700, 2010.Google ScholarCross Ref
- Y. Chen, W. Wang, and Z. Liu. Keyword-based search and exploration on databases. In ICDE, pages 1380--1383, 2011. Google ScholarDigital Library
- S. Cohen, J. Mamou, Y. Kanza, and Y. Sagiv. Xsearch: A semantic search engine for xml. In VLDB, pages 45--56, 2003. Google ScholarDigital Library
- L. Guo, F. Shao, C. Botev, and J. Shanmugasundaram. Xrank: Ranked keyword search over xml documents. In SIGMOD Conference, pages 16--27, 2003. Google ScholarDigital Library
- V. Hristidis, N. Koudas, Y. Papakonstantinou, and D. Srivastava. Keyword proximity search in xml trees. IEEE Trans. Knowl. Data Eng., 18(4):525--539, 2006. Google ScholarDigital Library
- L. Kong, R. Gilleron, and A. Lemay. Retrieving meaningful relaxed tightest fragments for xml keyword search. In EDBT, pages 815--826, 2009. Google ScholarDigital Library
- G. Li, J. Feng, J. Wang, and L. Zhou. Effective keyword search for valuable lcas over xml documents. In CIKM, pages 31--40, 2007. Google ScholarDigital Library
- Y. Li, C. Yu, and H. V. Jagadish. Schema-free xquery. In VLDB, pages 72--83, 2004. Google ScholarDigital Library
- Z. Liu and Y. Chen. Reasoning and identifying relevant matches for xml keyword search. PVLDB, 1(1):921--932, 2008. Google ScholarDigital Library
- C. Sun, C. Y. Chan, and A. K. Goenka. Multiway slca-based keyword search in xml data. In WWW, pages 1043--1052, 2007. Google ScholarDigital Library
- I. Tatarinov, S. Viglas, K. S. Beyer, J. Shanmugasundaram, E. J. Shekita, and C. Zhang. Storing and querying ordered xml using a relational database system. In SIGMOD Conference, pages 204--215, 2002. Google ScholarDigital Library
- W. Wang, X. Wang, and A. Zhou. Hash-search: An efficient slca-based keyword search algorithm on xml documents. In DASFAA, pages 496--510, 2009. Google ScholarDigital Library
- Y. Xu and Y. Papakonstantinou. Efficient keyword search for smallest lcas in xml databases. In SIGMOD Conference, pages 537--538, 2005. Google ScholarDigital Library
- Y. Xu and Y. Papakonstantinou. Efficient lca based keyword search in xml data. In EDBT, pages 535--546, 2008. Google ScholarDigital Library
- J. Zhou, Z. Bao, W. Wang, T. W. Ling, Z. Chen, X. Lin, and J. Guo. Fast slca and elca computation for xml keyword queries based on set intersection. In ICDE, pages 905--916, 2012. Google ScholarDigital Library
- J. Zhou, Z. Bao, W. Wang, J. Zhao, and X. Meng. Efficient query processing for xml keyword queries based on the idlist index. The VLDB Journal, DOI: 10.1007/s00778-013-0313-2.Google ScholarDigital Library
- J. Zhou, X. Zhao, W. Wang, Z. Chen, and J. X. Yu. Topdown keyword query processing on xml data. Technical report, University of New South Wales, CSR-TR-2013-21, 2013.Google Scholar
- R. Zhou, C. Liu, and J. Li. Fast elca computation for keyword queries on xml data. In EDBT, pages 549--560, 2010. Google ScholarDigital Library
Index Terms
- Top-down keyword query processing on XML data
Recommendations
Return specification inference and result clustering for keyword search on XML
Keyword search enables Web users to easily access XML data without the need to learn a structured query language and to study possibly complex data schemas. Existing work has addressed the problem of selecting qualified data nodes that match keywords ...
Processing keyword search on XML: a survey
Keyword search is a user-friendly approach for users to retrieve information from XML data. Since an XML document can have a large size and contain a lot of information, an XML keyword search result should be a fragment of an XML document dynamically ...
Using semantics in XML query processing
ICUIMC '08: Proceedings of the 2nd international conference on Ubiquitous information management and communicationEfficient query processing in XML database has attracted extensive research efforts. However, majority of existing approaches for XML query processing do not exploit semantic information in the underlying database. In this paper, we address the ...
Comments