ABSTRACT
Personal Information Retrieval Systems (PIRs) for microcomputers have not yet had the same kind of impact as have Database Management Systems (DBMSs) for microcomputers. This may be attributed to the differences in the kinds of problems the two systems address. What a retrieval system can do, that a DBMS cannot, is to surmise what a record (document) is about. It can help find things that the user cannot clearly describe - sometimes even by subject matter. It is this capability that developers of PIRs should accentuate, because increasingly, personal databases are becoming sufficiently large so that finding relevant records is a recognized problem.
If PIRs offer little more than DBMSs they will fail to meet that need. Fortunately, twenty-five years of research have produced some techniques (mostly statistical) that can improve the ability of retrieval systems to find relevant items. Even though these techniques have not been implemented by the large-scale retrieval services or mainframe-based retrieval systems, they could be; in fact, it is quite possible to implement them efficiently in a microcomputer-based PIR.
Index Terms
- Use of statistical techniques in Personal Retrieval Systems
Recommendations
Statistical information retrieval modelling: from the probability ranking principle to recent advances in diversity, portfolio theory, and beyond
CIKM '11: Proceedings of the 20th ACM international conference on Information and knowledge managementStatistical modelling of Information Retrieval (IR) systems is a key driving force in the development of the IR field. The goal of this tutorial is to provide a comprehensive and up-to-date introduction to statistical IR modelling. We take a fresh and ...
A statistical correlation model for image retrieval
MULTIMEDIA '01: Proceedings of the 2001 ACM workshops on Multimedia: multimedia information retrievalA bigram correlation model for image retrieval is proposed, which captures the semantic relationship among images in a database from simple statistics of users' relevance feedback information. It is used in the post-processing of image retrieval results ...
Comments