| Efficient processing of complex similarity queries in RDBMS through query rewriting |
| Full text |
Pdf
(761 KB)
|
| Source
|
Conference on Information and Knowledge Management
archive
Proceedings of the 15th ACM international conference on Information and knowledge management
table of contents
Arlington, Virginia, USA
SESSION: Similarity and matching
table of contents
Pages: 4 - 13
Year of Publication: 2006
ISBN:1-59593-433-2
|
|
Authors
|
|
Caetano Traina, Jr.
|
ICMC University of Sao Paulo, Sao Carlos, SP, Brazil
|
|
Agma J. M. Traina
|
ICMC University of Sao Paulo, Sao Carlos, SP, Brazil
|
|
Marcos R. Vieira
|
ICMC University of Sao Paulo, Sao Carlos, SP, Brazil
|
|
Adriano S. Arantes
|
IBM, San Jose, CA
|
|
Christos Faloutsos
|
Carnegie Mellon University, Pittsburgh, PA
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 9, Downloads (12 Months): 96, Citation Count: 1
|
|
|
ABSTRACT
Multimedia and complex data are usually queried by similarity predicates. Whereas there are many works dealing with algorithms to answer basic similarity predicates, there are not generic algorithms able to efficiently handle similarity complex queries combining several basic similarity predicates. In this work we propose a simple and effective set of algorithms that can be combined to answer complex similarity queries, and a set of algebraic rules useful to rewrite similarity query expressions into an adequate format for those algorithms. Those rules and algorithms allow relational database management systems to turn complex queries into efficient query execution plans. We present experiments that highlight interesting scenarios. They show that the proposed algorithms are orders of magnitude faster than the traditional similarity algorithms. Moreover, they are linearly scalable considering the database size.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
Stefan Berchtold , Christian Böhm , Bernhard Braunmüller , Daniel A. Keim , Hans-Peter Kriegel, Fast parallel similarity search in multimedia databases, Proceedings of the 1997 ACM SIGMOD international conference on Management of data, p.1-12, May 11-15, 1997, Tucson, Arizona, United States
|
| |
2
|
C. Böhm, B. Braunmüller, and H.-P. Kriegel. The pruning power: Theory and heuristics for mining databases with multiple k-nearest-neighbor queries. In Int. Conf. on DaWaK, pages 244--257, 2000.
|
| |
3
|
|
| |
4
|
|
 |
5
|
|
 |
6
|
|
 |
7
|
|
 |
8
|
|
| |
9
|
|
| |
10
|
|
 |
11
|
|
 |
12
|
A. Kemper , G. Moerkotte , K. Peithner , M. Steinbrunn, Optimizing disjunctive queries with expensive predicates, Proceedings of the 1994 ACM SIGMOD international conference on Management of data, p.336-347, May 24-27, 1994, Minneapolis, Minnesota, United States
|
| |
13
|
|
| |
14
|
|
| |
15
|
C. Mohan , Donald J. Haderle , Yun Wang , Josephine M. Cheng, Single Table Access Using Multiple Indexes: Optimization, Execution, and Concurrency Control Techniques, Proceedings of the International Conference on Extending Database Technology: Advances in Database Technology, p.29-43, March 26-30, 1990
|
| |
16
|
|
 |
17
|
Nick Roussopoulos , Stephen Kelley , Frédéric Vincent, Nearest neighbor queries, Proceedings of the 1995 ACM SIGMOD international conference on Management of data, p.71-79, May 22-25, 1995, San Jose, California, United States
|
 |
18
|
|
 |
19
|
P. Griffiths Selinger , M. M. Astrahan , D. D. Chamberlin , R. A. Lorie , T. G. Price, Access path selection in a relational database management system, Proceedings of the 1979 ACM SIGMOD international conference on Management of data, May 30-June 01, 1979, Boston, Massachusetts
[doi> 10.1145/582095.582099]
|
| |
20
|
|
| |
21
|
|
| |
22
|
|
|