ACM Home Page
Please provide us with feedback. Feedback
A case study on alternate representations of data structures in XML
Full text PdfPdf (101 KB)
Source Document Engineering archive
Proceedings of the 2005 ACM symposium on Document engineering table of contents
Bristol, United Kingdom
SESSION: Techniques for document management and document engineering table of contents
Pages: 217 - 219  
Year of Publication: 2005
ISBN:1-59593-240-2
Authors
Daniel Gruhl  IBM Almaden Research Center
Daniel Meredith  IBM Almaden Research Center
Jan Pieper  IBM Almaden Research Center
Sponsors
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 3,   Downloads (12 Months): 57,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1096601.1096652
What is a DOI?

ABSTRACT

XML provides a universal and portable format for document and data exchange. While the syntax and specification of XML makes documents both human readable and machine parsable, it is often at the expense of efficiency when representing simple data structures.We investigate the ``costs'' associated with XML serialization from several resource perspectives: storage, transport, processing and human readability. These experiments are done within the context of a large text-centric service oriented architecture -- IBM's WebFountain project.We find that for several applications, human readable formats outperform binary equivalents, especially in the area of data size, and that the costs of processing encoded binary data often exceeds that of processing terse human readable formats.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
D. Box, D. Ehnebuske, G. Kakivaya, A. Layman, N. Mendelsohn, H. F. Nielsen, S. Thatte, and D. Winder. Simple Object Access Protocol. http://www.w3.org/TR/SOAP/, May 2000.
 
3
 
4
P. Deutsch. Gzip file format specification version 4.3. RFC 1952, 1996.
 
5
 
6
S. Josefsson. The base16, base32, and base64 data encodings. RFC 3548, 2003.
 
7
8
 
9
OMG. Xml metadata interchange (xmi). http://www.omg.org/technology/documents/formal/xmi.htm, 2002.
10
 
11
W3C. Wap binary xml content format. http://www.w3.org/TR/wbxml/, 1999.
 
12
W3C. Xml binary characterization. http://www.w3.org/TR/xbc-characterization/, 2005.

Collaborative Colleagues:
Daniel Gruhl: colleagues
Daniel Meredith: colleagues
Jan Pieper: colleagues