IRS13
Scientific Report No. IRS-13 Information Storage and Retrieval
Evaluation Parameters
chapter
E. M. Keen
Harvard University
Gerard Salton
Use, reproduction, or publication, in whole or in part, is permitted for any purpose of the United States Government.
11-29
simply interpreted by noting that a cut-off is established immediately when
a relevant document is encountered in each output list. It very effectively
reflects merit at the high recall end of the plot, since the lowest precision
ratio for any individual request is computed when a recall of 1.0 is reached,
unlike the 1'Pseudot' plot which continues making cut-offs until the last
document in the collection is reached. This technique is quite adequate for
making comparisons with in SMART, but a possible disadvantage in some cir-
cumstances is that the curve is not typical of a real user environment since
it produces too optimistic a result. Figure 16 b) compares a `Pseudo1T and
a 11Quasi1 curve for the same set of averaged results.
A modification to the technique is being tested, in which the cut-off
having the highest precision at each vertical segment is still used, but the
interpolation lines are altered to produce what is believed to represent the
best possible curve that a user could achieve, assuming that almost optimum
choices of cut-off are made. Figure 17 a) gives an example of this, and the
reason for this type of extrapolation line which retains constant precision
resides in the fact that user requirements would ask for the best possible
precision above x recall. Whatever the value of 11x11 is, the best possible
precision is always the next peak in the step curve, so a line of constant
precision leading to that peak is thought to give the required result. A
slight modification, yet to be made, is that sometimes, the next peak encountered
above `IxTT recall is eclipsed by a higher peak at still greater recall (occurring,
for example, when one relevant document is followed by another in the rank
list). The line should thus be connected to the highest peak. This technique
is known as the "Semi-Cranfield'1 method, and an average curve is presented
in Figure 17 b), together with curves of the other two types. The comparison
is slightly affected by a different tied rank procedure used for the "Semi-