Recent trends in automatic information retrieval
Substantial successes were achieved in the early years in automatic indexing and retrieval using single term indexing theories with term weight assignments based on frequency considerations. The development of more refined indexing systems using ...
Using structural representation of anomalous states of knowledge for choosing document retrieval strategies
We report on a project which attempts to classify representations of the anomalous states of knowledge (ASKs) of users of document retrieval systems on the basis of structural characteristics of the representations, and which specifies different ...
Document presentation and query formulation in Muse
Several problems of document presentation and query formulation arising in systems dealing with multimedia documents are discussed. Examples from a prototype distributed multimedia document filing system are described.
An approach to multimedia information management
The integrated management of multimedia information, that is, of complex information consisting of conventional data, text, graphics, images and voice, is of great interest not only in fields like Information Retrieval, Office Automation, Computer Aided ...
Methodological issues for the design of an office information server: focal topics for the analysis from an office system perspective
This paper deals with the need to consider organizational and user requirements as the basis for the successful design of future office information servers. Today volumes of the order of 10,000 to 50,000 multimode documents and 1 to 10 ...
IR, NLP, AI and UFOS: or IR-relevance, natural language problems, artful intelligence and user-friendly online systems
User Friendly Online Searching is examined in the context of Natural Language Processing in Information Retrieval and Artificial Intelligence. Opportunities for synergetic R & D are identified as the basis for Intelligent Information Retrieval and ...
The visual display of information in an information retrieval environment
This paper gives an overview of the graphical techniques which have been used in the representation of information in a document collection environment. An assessment of the applicability of existing multivariate data graphical techniques to the vector ...
Improved subject access, browsing and scanning mechanisms in modern on-line IR
Focusing on communication, the paper analyses and proposes practical solutions to key problems in online IR, in particular those concerned with ill-defined and “muddled” information requirements, concept interpretation in searching and text representation. The ...
S-tree: a dynamic balanced signature index for office retrieval
The signature approach is an access method for partial-match retrieval which meets many requirements of an office environment. Signatures are hash coded binary words derived from objects stored in the data base. They serve as a filter for retrieval in ...
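As an illustration of the filtering idea in this abstract, the following is a minimal sketch of a flat signature filter. It assumes superimposed coding (the bit patterns of an object's terms are ORed together), which is one common way to build signatures; the abstract does not say which scheme the paper uses, and this is not the S-tree structure itself. The names `SIGNATURE_BITS`, `BITS_PER_TERM`, `term_signature`, `object_signature`, `may_match` and `search` are hypothetical.

```python
import hashlib

SIGNATURE_BITS = 64   # assumed signature width (illustrative)
BITS_PER_TERM = 3     # assumed number of bit positions drawn per term


def term_signature(term: str) -> int:
    """Hash one term to a bit pattern with at most BITS_PER_TERM bits set."""
    digest = hashlib.sha256(term.lower().encode("utf-8")).digest()
    sig = 0
    for i in range(BITS_PER_TERM):
        # Take two digest bytes per bit position.
        position = int.from_bytes(digest[2 * i:2 * i + 2], "big") % SIGNATURE_BITS
        sig |= 1 << position
    return sig


def object_signature(terms) -> int:
    """Superimpose (bitwise OR) the signatures of all terms of an object."""
    sig = 0
    for term in terms:
        sig |= term_signature(term)
    return sig


def may_match(object_sig: int, query_sig: int) -> bool:
    """Filter test: every bit set in the query must also be set in the object."""
    return object_sig & query_sig == query_sig


def search(documents, query_terms):
    """Return ids of documents that pass the filter and the exact check."""
    query_sig = object_signature(query_terms)
    wanted = {t.lower() for t in query_terms}
    hits = []
    for doc_id, terms in documents.items():
        if not may_match(object_signature(terms), query_sig):
            continue  # the filter rules this object out without reading it
        # Signatures can produce false drops, so verify against the real terms.
        if wanted <= {t.lower() for t in terms}:
            hits.append(doc_id)
    return hits


documents = {
    1: ["office", "memo", "budget"],
    2: ["invoice", "budget"],
    3: ["memo"],
}
print(search(documents, ["budget"]))
print(search(documents, ["memo", "budget"]))
```

The filter can only say "definitely not" or "maybe", so the sketch rechecks each candidate against its actual terms. A sequential scan over all signatures, as here, is the simplest arrangement; an index over the signatures would avoid that scan.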
Improved hierarchical bit-vector compression in document retrieval systems
The “concordance” of an information retrieval system can often be stored in the form of bit-maps, which are usually very sparse and should be compressed. Hierarchical bit-vector compression consists of partitioning a vector v_i into equi-sized blocks, ...
Text compression using prediction
In the compression of text files, the dependencies between successive characters should be exploited to as great an extent as possible. There are two obvious possibilities: either to detect and encode frequently occurring character strings, or to ...
Incorporating syntactic information into a document retrieval strategy: an investigation
This paper deals with mechanisms for performing text retrieval which incorporate a degree of linguistic processing into the overall strategy. We have performed some experiments using parsing of text on a test collection of documents and queries to try ...
CALIN: a user interface based on a simple natural language
In the framework of an application dealing with classified advertisement matching, a dedicated user interface has been designed and implemented. Its major originality lies in the user's language, which is neither an artificial one, nor the usual ...
Solving grammatical ambiguities within a surface syntactical parser for automatic indexing
This paper describes linguistic tools specifically designed for performant automatic indexing of natural language texts. By performant indexing, we mean the ability of the system to extract noun phrases (considered as main conceptual frames regarding ...
A design of a distributed full text retrieval system
This paper describes the design of a distributed information system for full text retrieval. The system is similar in functionality to STAIRS and is being developed on a network of PCs interconnected by PC Network. The implementation is built on a ...
A common architecture for different text processing techniques in an information retrieval environment
This paper gives an overview of a text processing software package called REALIST (Retrieval Aids by Linguistics and Statistics), which integrates different text processing techniques into a common surface. It supports the user by offering the ...
Hierarchic document classification using Ward's clustering method
In this paper, we discuss the application of a recent hierarchic clustering algorithm to the automatic classification of files of documents. Whereas most hierarchic clustering algorithms involve the generation and updating of an inter-object ...
User-oriented document clustering: a framework for learning in information retrieval
In information retrieval, cluster analysis is an important tool employed to enhance both efficiency and effectiveness of the retrieval process. Most clustering algorithms have difficulty in reflecting the closeness of documents as perceived by the user. ...
The efficiency of inverted index and cluster searches
The processing time and disk space requirements of an inverted index and top-down cluster search are compared. The cluster search is shown to use both more time and more disk space, mostly due to the large number of cluster centroids needed by the ...
On extending the vector space model for Boolean query processing
An information retrieval model, named the Generalized Vector Space Model (GVSM), is extended to handle situations where queries are specified as (extended) Boolean expressions. It is shown that this unified model, unlike currently available alternatives, ...
An experimental study of factors important in document ranking
The ability to effectively rank retrieved documents in order of their probable relevance to a query is a critical factor in statistically-based keyword retrieval systems. This paper summarizes a set of experiments with different methods of term ...
(invited paper) A new theoretical framework for information retrieval
A new framework based on a non-classical logic is proposed for investigating IR. The paper motivates the use of a particular conditional logic as the 'right' logic for IR. A new principle, the logical uncertainty principle, is proposed, to deal with the ...
User-specified domain knowledge for document retrieval
The introduction of domain knowledge into a document retrieval system has two important consequences: an increase in the effectiveness of retrieval and a decrease in the efficiency of text processing. In this paper, a method is presented of combining ...
IOTA: a full text information retrieval system
IOTA is a prototype of an Information Retrieval System which can manage a corpus made of highly structured, full text documents. The first version presented here has intelligent capabilities related to heuristic pattern matching procedures for ...
An information retrieval system based on artificial intelligence techniques
This paper describes a possible use of Artificial Intelligence models and techniques in the design of a small Information Retrieval system. In particular, some knowledge representation models, such as semantic networks and frame-like structures, are ...
The using of inference mechanisms to improve the retrieval facilities from large relational databases
This paper describes the development of “intelligent” tools aimed at improving the retrieval facilities from large relational databases. When a natural language query does not correspond directly to the data contained in the base, a class of inferential ...
An automatic and tunable document indexing system
In this article we present interactive software for automatic document indexing, together with various index tuning/optimization strategies. After stems are generated from the raw text, the initial index vocabulary is narrowed down and tuned with the use ...
Performance of self-taught documents: exploiting co-relevance structure in a document collection
In this paper we study the behavior of an information retrieval system in which index terms are assigned at random to both documents and requests. The random indexing is then modified by means of a feedback mechanism derived from a normal probability ...