ISR10
Scientific Report No. ISR-10 Information Storage and Retrieval
Evaluation of Document Retrieval Systems
chapter
Joseph John Rocchio
Harvard University
Gerard Salton
Use, reproduction, or publication, in whole or in part, is permitted for any purpose of the United States Government.
½
with imperfect representations of both the' needs of its users and the
information content of documents, and since the notion of relevance is
/10 likely to be quite variable over any realistic user population, any
performance measure must represent a statistical estimate of the
probability distribution of a correct assessment of relevance.
The subjective nature of relevance implies that any realistic
system1s evaluation will require a large amount of data. Since the
collection of such data is costly, and since the notion of evaluation
is a critical element of any design process, the designer of a
retrieval system must rely on analytical tools, local performance
measures, and intuition to select, the most likely set of[OCRerr]functions to
`satisfy his objectives. In this connection, computer based simulation
systems such as SMART can be of significant benefit since they allow
large sm6unts of data to be generated and analyzed. One of the major
-. objectives of the functional model td be considered here (based on the
SMART simulation system) is to `iiaximize[OCRerr]the utility of the data
generating capabilities of the sy'stem by allowing evaluation of the
individual functional elements `as well' as of the overall performance
characteristics.
5-2
2. [OCRerr]valAxation Measures and the Collection of Statistics
A. The Idealized Exper[OCRerr]iment
`Most of the evaluation measures proposed for document
retrieval systems are based on the follQwing idealized characterization