ACM Home Page
Please provide us with feedback. Feedback
DTD inference for views of XML data
Full text PdfPdf (348 KB)
Source Symposium on Principles of Database Systems archive
Proceedings of the nineteenth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems table of contents
Dallas, Texas, United States
Pages: 35 - 46  
Year of Publication: 2000
ISBN:1-58113-214-X
Authors
Yannis Papakonstantinou  Computer Science & Engineering, U.C. San Diego, La Jolla, CA
Victor Vianu  Computer Science & Engineering, U.C. San Diego, La Jolla, CA
Sponsor
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 13,   Downloads (12 Months): 67,   Citation Count: 50
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues   peer to peer  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/335168.335173
What is a DOI?

ABSTRACT

We study the inference of Data Type Definitions (DTDs) for views of XML data, using an abstraction that focuses on document content structure. The views are defined by a query language that produces a list of documents selected from one or more input sources. The selection conditions involve vertical and horizontal navigation, thus querying explicitly the order present in input documents. We point several strong limitations in the descriptive ability of current DTDs and the need for extending them with (i) a subtyping mechanism and (ii) a more powerful specification mechanism than regular languages, such as context-free languages. With these extensions, we show that one can always infer tight DTDs, that precisely characterize a selection view on sources satisfying given DTDs. We also show important special cases where one can infer a tight DTD without requiring extension (ii). Finally we consider related problems such as verifying conformance of a view definition with a predefined DTD. Extensions to more powerful views that construct complex documents are also briefly discussed.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
Abi97
 
AM98
 
AQM+97
S. Abiteboul, D. Quass, J. McHugh, J. Widom, and J. Wiener. The LOREL query language for semistructured data. International Journal on Digital Libraries, 1(1), 1997.
B+99
 
BDFS97
BDHS96
 
BKMW98
A. Bruggemann-Klein, M. Murata, and D. Wood. Regular tree languages over nonranked alphabets, 1998. Available at ftp:// ftp I I. inf ormat ik. tu-muenchen, de/pub/mis c /caterpillars/.
 
BM99
 
BPSM
T. Bray, J. Paoli, and C. Sperberg-McQueen. Extensible markup language (XML) 1.0, W3C recommendation. Latest version available at http://www, w3. org. TR/REC-xml.
Bun97
 
CD
J. Clark and S. Deach. Extensible stylesheet language (xsl) 1.0, W3C working draft. http://www, w3. org/TR/WD-xsl.
CDSS98
CM90
 
dBV93
J. Van den Bussche and G. Vossen. An extension of path expressions to simplify navigation in object-oriented queries. In Proc. of Intl. Conf. on Deductive and Object-Oriented Databases (DOOD), 1993.
 
DFF+
A. Deutch, M. Fernandez, D. Florescu, A. Levy, and D. Suciu. XML-QL: A query language for XML. Submission to W3C. Latest version available at http://www, w3. org/TR/NOTE-xml-ql.
FFLS98
FLS98
 
FS98
 
GJ79
 
GS97
 
GW97
 
HU79
 
Inca
Bluestone Inc. Visual XML. http://www, bluestone, com/xml/Visual-XML/.
 
Incb
SoftQuad Inc. XMetal editor. http://www, sq. com/products/xmetal/.
 
KS95
 
LJM+
A. Layman, E. Jung, E. Maler, H. Thompson, J. Paoli, J. Tigue, N. Mikula, and S. De Rose. XML-Data. Available at http: //www. w3. org/TR/1998/NOTE-XML-dat a.
 
MD
E. Maler and S. DeRose. XML pointer language (XPointer). http://www, w3. org /TR/1998/WD-xpt r- 19980303.
 
Mit90
 
Mit96
 
MP
K. Munroe and Y. Papakonstantinou. BBQ: A visual interface for integrated browsing and querying of XML. Available at http ://www. db. ucsd. edu/publications/ BBO. pdf.
MS99
MSV
 
MW95
 
MZ98
NAM98
NdB98
 
Nev99
NS99
 
NUWC97
 
PAGM96
 
PV
Y. Papakonstantinou and P. Velikhov. The use and computation of specialized DTDs in the MIX mediator system. Manuscript, available at http://www, db. ucsd. edu/ publicat ions/UseComputeDTD, pdf.
PV99a
 
PV99b
Y. Papakonstantinou and P. Velikhov. Enhancing semistructured data mediators with document type definitions. In Proc. ICDE Conf., 1999.
 
SPS99
S.Abiteboul, P.Buneman, and D. Suciu. Data on the Web. Morgan Kauffman, 1999.
 
Suc98
D. Suciu. Semistructured data and XML. In Proc. 5th International Conference of Foundations of Data Organization (FODO'98), 1998.
 
VLP00

CITED BY  50
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Collaborative Colleagues:
Yannis Papakonstantinou: colleagues
Victor Vianu: colleagues

Peer to Peer - Readers of this Article have also read: