ABSTRACT
XML provides a universal and portable format for document and data exchange. While the syntax and specification of XML make documents both human readable and machine parsable, this often comes at the expense of efficiency when representing simple data structures. We investigate the "costs" associated with XML serialization from several resource perspectives: storage, transport, processing, and human readability. These experiments are done within the context of a large text-centric service-oriented architecture, IBM's WebFountain project. We find that for several applications, human readable formats outperform binary equivalents, especially in the area of data size, and that the costs of processing encoded binary data often exceed those of processing terse human readable formats.
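As a rough illustration of the data-size claim, the following sketch compares two ways of carrying a list of small integers inside an XML element: as plain decimal text, and as packed 32-bit binary that is then base64-encoded. It is not the experiment from the paper; the element name `<v>`, the helper names, and the sample values are all assumptions chosen for illustration.

```python
import base64
import struct


def text_payload(values):
    """Serialize integers as space-separated decimal text in a hypothetical <v> element."""
    return "<v>" + " ".join(str(v) for v in values) + "</v>"


def base64_payload(values):
    """Pack integers as 32-bit big-endian binary, then base64-encode them for XML."""
    raw = struct.pack(">%di" % len(values), *values)
    return "<v>" + base64.b64encode(raw).decode("ascii") + "</v>"


def decode_base64_payload(payload):
    """Reverse base64_payload: strip the tags, base64-decode, and unpack the integers."""
    raw = base64.b64decode(payload[len("<v>"):-len("</v>")])
    return list(struct.unpack(">%di" % (len(raw) // 4), raw))


# Illustrative sample only: small values favor the text form.
values = list(range(100))
text_size = len(text_payload(values))
b64_size = len(base64_payload(values))
print("text:", text_size, "characters; base64:", b64_size, "characters")
```

For small values like these, the decimal text is shorter because each number needs only one or two digits, while the binary form always spends four bytes per value and base64 then expands every three bytes to four characters. With large values the comparison can go the other way, so the result depends on the data being serialized.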