ABSTRACT
Text models focus on the manipulation of textual data. They describe texts by their structure, the operations on the texts, and the constraints on both structure and operations. This article first outlines common characteristics of machine-readable texts in general. It then introduces ten text models, each described in terms of the datatypes it supports and the operations defined by these datatypes. Finally, the models are compared.