|
ABSTRACT
We study the complexity of two central XML processing problems. The first is XPath 1.0 query processing, which has been shown to be in PTIME in previous work. We prove that both the data complexity and the query complexity of XPath 1.0 fall into lower (highly parallelizable) complexity classes, while the combined complexity is PTIME-hard. Subsequently, we study the sources of this hardness and identify a large and practically important fragment of XPath 1.0 for which the combined complexity is LOGCFL-complete and, therefore, in the highly parallelizable complexity class NC2. The second problem is the complexity of validating XML documents against various typing schemes like Document Type Definitions (DTDs), XML Schema Definitions (XSDs), and tree automata, both with respect to data and to combined complexity. For data complexity, we prove that validation is in LOGSPACE and depends crucially on how XML data is represented. For the combined complexity, we show that the complexity ranges from LOGSPACE to LOGCFL, depending on the typing scheme.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
Aho, A. V., and Ullman, J. D. 1971. Translations on a context-free grammar. Inf. Control 19, 439--475.
|
| |
3
|
Allender, E. 2001. The division breakthroughs. The Computational Complexity Column, EATCS Bull. 74, 61--77.
|
| |
4
|
|
| |
5
|
|
| |
6
|
|
| |
7
|
Bojanczyk, M., and Colcombet, T. 2004. TWA cannot be determinized. In Proceedings of the 31st International Colloquium on Automata, Languages and Programming (ICALP).
|
| |
8
|
|
| |
9
|
Brüggemann-Klein, A., Murata, M., and Wood, D. 2001. Regular tree and regular hedge languages over non-ranked alphabets: Version 1, April 3, 2001. Tech. Rep. HKUST-TCSC-2001-05, Hong Kong University of Science and Technology, Hong Kong SAR, China. (Available at ftp://ftp11.informatik.tu-muenchen.de/pub/misc/caterpillars/.)
|
 |
10
|
|
| |
11
|
|
 |
12
|
Sophie Cluet , Claude Delobel , Jérǒme Siméon , Katarzyna Smaga, Your mediators need data conversion!, Proceedings of the 1998 ACM SIGMOD international conference on Management of data, p.177-188, June 01-04, 1998, Seattle, Washington, United States
|
| |
13
|
Comon, H., Dauchet, M., Gilleron, R., Jacquemard, F., Lugiez, D., Tison, S., and Tommasi, M. 1999. Tree automata techniques and applications. Available at http://www.grappa.univ-lille3.fr/tata.
|
| |
14
|
|
| |
15
|
|
| |
16
|
|
| |
17
|
|
| |
18
|
Gottlob, G., Koch, C., and Pichler, R. 2002. Efficient algorithms for processing XPath queries. In Proceedings of the 28th International Conference on Very Large Data Bases (VLDB'02). 95--106.
|
 |
19
|
|
| |
20
|
Gottlob, G., Koch, C., and Pichler, R. 2003b. XPath query evaluation: Improving time and space efficiency. In Proceedings of the 19th IEEE International Conference on Data Engineering (ICDE'03). IEEE Computer Society Press, Los Alamitos, Calif., 379--390.
|
 |
21
|
|
| |
22
|
|
| |
23
|
|
| |
24
|
|
| |
25
|
Jones, N. D. 1975. Space-bounded reducibility among combinatorial problems. J. Comput. Syst. Sci. 11, 68--85.
|
 |
26
|
|
| |
27
|
|
| |
28
|
Lyons, R. 2001. Turing machine markup language. http://www.unidex.com/turing/.
|
 |
29
|
|
| |
30
|
|
| |
31
|
Dan Olteanu , Holger Meuss , Tim Furche , François Bry, XPath: Looking Forward, Proceedings of the Worshops XMLDM, MDDE, and YRWS on XML-Based Data Management and Multimedia Engineering-Revised Papers, p.109-127, March 24-28, 2002
|
| |
32
|
Papadimitriou, C. H. 1994. Computational Complexity. Addison-Wesley, Reading Mass.
|
 |
33
|
|
| |
34
|
|
| |
35
|
|
| |
36
|
Ruzzo, W. 1980. Tree-size bounded alternation. J. Comput. Syst. Sci. 21, 218--235.
|
| |
37
|
SAX Project Collaboration. 2004. Simple API for XML. Available at http://www.saxproject.org/.
|
 |
38
|
|
 |
39
|
|
| |
40
|
|
| |
41
|
Sudborough, I. 1977. Time and tape bounded auxiliary pushdown automata. In Proceedings of the 6th Symposium on Mathematical Foundations of Computer Science (MFCS'77). Lecture Notes in Computer Science, vol. 53. Springer-Verlag, New York, 493--503.
|
 |
42
|
|
 |
43
|
|
| |
44
|
|
| |
45
|
|
| |
46
|
Wadler, P. 2000. Two semantics for XPath. Draft paper available at http://www.research.avayalabs.com/user/wadler/.
|
| |
47
|
World Wide Web Consortium. 1999. XML path language (XPath) recommendation. http://www. w3c.org/TR/xpath/.
|
| |
48
|
World Wide Web Consortium. 2004. Document object model. Available at http://www.w3c.org/dom, level 1 specification available at http://www.w3.org/TR/REC-DOM-Level-1/.
|
CITED BY 11
|
François Bry , Tim Furche , Benedikt Linse , Andreas Schroeder, Efficient evaluation of n-ary conjunctive queries over trees and graphs, Proceedings of the eighth ACM international workshop on Web information and data management, November 10-10, 2006, Arlington, Virginia, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
INDEX TERMS
Primary Classification:
F.
Theory of Computation
F.1
COMPUTATION BY ABSTRACT DEVICES
F.1.1
Models of Computation
Subjects:
Automata (e.g., finite, push-down, resource-bounded)
Additional Classification:
F.
Theory of Computation
F.4
MATHEMATICAL LOGIC AND FORMAL LANGUAGES
F.4.1
Mathematical Logic
Subjects:
Computational logic
F.4.3
Formal Languages
Subjects:
Classes defined by grammars or automata (e.g., context-free languages, regular sets, recursive sets)
H.
Information Systems
H.2
DATABASE MANAGEMENT
H.2.3
Languages
Subjects:
Query languages
I.
Computing Methodologies
I.7
DOCUMENT AND TEXT PROCESSING
I.7.2
Document Preparation
Subjects:
Markup languages
General Terms:
Algorithms,
Languages,
Theory
Keywords:
Complexity,
DTD,
LOGCFL,
XML,
XPath
|