ISR10
Scientific Report No. ISR-10 Information Storage and Retrieval
Evaluation of Document Retrieval Systems
chapter
Joseph John Rocchio
Harvard University
Gerard Salton
Use, reproduction, or publication, in whole or in part, is permitted for any purpose of the United States Government.
5-18
evaluatich experiments for retrieval systems. Conventionally, on an
experimental basis, there is a serious problem in obtaining representa-'
tive search requests with accompanying relevance judgments (witnessed
by the controversy as to the value of the test queries used in the
10
Cranfield study for example). [OCRerr]ather than measuring performance by
the use of test queries, a retrieval system can be evaluated by
obtaining user judgments as to the degree of association between
documents which by their location in the index space are necessarily
associated.
Such an evaluation represents a measure of the ability of the
indexin[OCRerr] [OCRerr]cheme to preserve the associations in the index space which
users can detect from the information content as expressed in the
natural langnage. Since the retrieval performance which a user can
expect is a function of the degree to which input search requests
correspond (or can be, adjusted to correspond) to `their respective
optimal forms, and since the performance of an optimal query is
directly related to the consistency of the associations of documents in
the' index space, a measure of the latter is indicative of a measure of
the former. While such a procedure does not mecessarily reduce the
quantity of subjective data required. for a significant test of a
retrieval system, it offers the potential for providing an increased
measure of experimental control Since the requirements for examining
the implications of such a test program are prohibitive with respect to
the scope of this thesis.(by virtue of requiring a large subjective test
effort), this test de,sign is offered as a suggestion worthy of
additional consideration.