IRS13
Scientific Report No. IRS-13 Information Storage and Retrieval
Test Environment
chapter
E. M. Keen
Harvard University
Gerard Salton
Use, reproduction, or publication, in whole or in part, is permitted for any purpose of the United States Government.
1-12
The test procedure that is followed requires that no more than one single
describable change be made to these procedures at any one time, so that
search results may be obtained each time one system component is altered.
In this way a series of comparisons based on differences in document input
may be made, and then perhaps a second series which compares different
content analysis procedures. The primary use of the different test en-
vironments is to find out whether a conclusion drawn from an experiment in
one envit:onment also holds for another. Thus, if a given content analysis
procedure is found to be very effective for the computer science collection,
a parallel test of content analysis procedures can then be made with the
aerodynamics and documentation collections.
Conclusions about the effectiveness of search results and system
performance generally can be made from different viewpoints using several
criteria £33. For various reasons given in (3], the measurement of re-
trieval performance oriented towards user satisfaction predominates in
the current SMART text experiments, and a discussion of methods and
measures used is to be found in Section II of this report. A step
recently added to the evalu[OCRerr]ation procedures is that of making statis-
tical significance tests of the results, as described in references (4]
and [1]. A further[OCRerr]step in the evaluation, which is not formally built
into the system requires a hand analysis of the search results involving
an examination of individual requests rather than the use only of the
averages for a set of requests. A fast-search analysis of specific
instances of poor retrieval, for example, is necessary in order to
make improvements to the system and to identify areas in which further