ACM Home Page
Please provide us with feedback. Feedback
The complexity of XPath query evaluation and XML typing
Full text PdfPdf (448 KB)
Source Journal of the ACM (JACM) archive
Volume 52 ,  Issue 2  (March 2005) table of contents
Pages: 284 - 335  
Year of Publication: 2005
ISSN:0004-5411
Authors
Georg Gottlob  Technische Universität Wien, Wien, Austria
Christoph Koch  Technische Universität Wien, Wien, Austria
Reinhard Pichler  Technische Universität Wien, Wien, Austria
Luc Segoufin  INRIA, France
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 6,   Downloads (12 Months): 126,   Citation Count: 11
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1059513.1059520
What is a DOI?

ABSTRACT

We study the complexity of two central XML processing problems. The first is XPath 1.0 query processing, which has been shown to be in PTIME in previous work. We prove that both the data complexity and the query complexity of XPath 1.0 fall into lower (highly parallelizable) complexity classes, while the combined complexity is PTIME-hard. Subsequently, we study the sources of this hardness and identify a large and practically important fragment of XPath 1.0 for which the combined complexity is LOGCFL-complete and, therefore, in the highly parallelizable complexity class NC2. The second problem is the complexity of validating XML documents against various typing schemes like Document Type Definitions (DTDs), XML Schema Definitions (XSDs), and tree automata, both with respect to data and to combined complexity. For data complexity, we prove that validation is in LOGSPACE and depends crucially on how XML data is represented. For the combined complexity, we show that the complexity ranges from LOGSPACE to LOGCFL, depending on the typing scheme.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
Aho, A. V., and Ullman, J. D. 1971. Translations on a context-free grammar. Inf. Control 19, 439--475.
 
3
Allender, E. 2001. The division breakthroughs. The Computational Complexity Column, EATCS Bull. 74, 61--77.
 
4
 
5
 
6
 
7
Bojanczyk, M., and Colcombet, T. 2004. TWA cannot be determinized. In Proceedings of the 31st International Colloquium on Automata, Languages and Programming (ICALP).
 
8
 
9
Brüggemann-Klein, A., Murata, M., and Wood, D. 2001. Regular tree and regular hedge languages over non-ranked alphabets: Version 1, April 3, 2001. Tech. Rep. HKUST-TCSC-2001-05, Hong Kong University of Science and Technology, Hong Kong SAR, China. (Available at ftp://ftp11.informatik.tu-muenchen.de/pub/misc/caterpillars/.)
10
 
11
12
 
13
Comon, H., Dauchet, M., Gilleron, R., Jacquemard, F., Lugiez, D., Tison, S., and Tommasi, M. 1999. Tree automata techniques and applications. Available at http://www.grappa.univ-lille3.fr/tata.
 
14
 
15
 
16
 
17
 
18
Gottlob, G., Koch, C., and Pichler, R. 2002. Efficient algorithms for processing XPath queries. In Proceedings of the 28th International Conference on Very Large Data Bases (VLDB'02). 95--106.
19
 
20
Gottlob, G., Koch, C., and Pichler, R. 2003b. XPath query evaluation: Improving time and space efficiency. In Proceedings of the 19th IEEE International Conference on Data Engineering (ICDE'03). IEEE Computer Society Press, Los Alamitos, Calif., 379--390.
21
 
22
 
23
 
24
 
25
Jones, N. D. 1975. Space-bounded reducibility among combinatorial problems. J. Comput. Syst. Sci. 11, 68--85.
26
 
27
 
28
Lyons, R. 2001. Turing machine markup language. http://www.unidex.com/turing/.
29
 
30
 
31
 
32
Papadimitriou, C. H. 1994. Computational Complexity. Addison-Wesley, Reading Mass.
33
 
34
 
35
 
36
Ruzzo, W. 1980. Tree-size bounded alternation. J. Comput. Syst. Sci. 21, 218--235.
 
37
SAX Project Collaboration. 2004. Simple API for XML. Available at http://www.saxproject.org/.
38
39
 
40
 
41
Sudborough, I. 1977. Time and tape bounded auxiliary pushdown automata. In Proceedings of the 6th Symposium on Mathematical Foundations of Computer Science (MFCS'77). Lecture Notes in Computer Science, vol. 53. Springer-Verlag, New York, 493--503.
42
43
 
44
 
45
 
46
Wadler, P. 2000. Two semantics for XPath. Draft paper available at http://www.research.avayalabs.com/user/wadler/.
 
47
World Wide Web Consortium. 1999. XML path language (XPath) recommendation. http://www. w3c.org/TR/xpath/.
 
48
World Wide Web Consortium. 2004. Document object model. Available at http://www.w3c.org/dom, level 1 specification available at http://www.w3.org/TR/REC-DOM-Level-1/.

CITED BY  11
 
 
 

Collaborative Colleagues:
Georg Gottlob: colleagues
Christoph Koch: colleagues
Reinhard Pichler: colleagues
Luc Segoufin: colleagues