CRANV2
Aslib Cranfield Research Project: Factors Determining the Performance of Indexing Systems: Volume 2
Methods for presentation of results
chapter
Cyril Cleverdon
Michael Keen
Cranfield
An investigation supported by a grant to Aslib by the National Science Foundation.
Use, reproduction, or publication, in whole or in part, is permitted for any purpose of the United States Government.
- 32 -
I00
9O
8O
70
60
50
4O
3O
2O
I0
\
0 I0 20 30 40 50 60 70 80 90 I00
0/o PRECISION
x Relevance 2 o Relevance 2 and 3,
FIGURE 3.1P PERFORMANCE CURVE OBTAINED WITH FACET
INDEX IN W.R.U. TEST
While the recall and precision ratios have been generally accepted
as performance measures for information retrieval systems, they have
also aroused some criticism. No serious attempt has been made to
answer this criticism, partly because it was mostly trivial and never
supported by experimental data, but mainly because an intention of Cranfield
II was to investigate the performance measures which could or should be
used. For this, sets of performance data were required and it was known
that for every set of figures in Cranfield I, there would be hundreds of
sets in Cranfield II, and it was obvious that the decisions regarding the
measures to be used and the methods of presenting the test results would
be of major importance. The programme of work which this aspect of
the project has involved has been considerable, with many sets of results
being calculated in a number of different ways. Based on this work, which
has taken up a significant part of the effort during the last eighteen months
of the project, the decision was finally reached that the most satisfactory
method of calculating results involves three measures, namely Recall Ratio
and Precision Ratio with, additionally, the new measure of Fallout Ratio.
For the presentation of results on a plot, it is believed that, in the large